the  classic  work 

EXTENDED  AND  REFINED 


The  Art  of 
Computer 
Programming 

VOLUME  4A 

Combinatorial  Algorithms 

Part  1 


DONALD  E.  KNUTH 


Volume  II  Fundamental  Algorithms 
Third  Edition  (0-201-89683-4) 

This  first  volume  begins  with  basic  programming 
concepts  and  techniques,  then  focuses  on 
information  structures — the  representation 
of  information  inside  a computer,  the  structural 
relationships  between  data  elements  and  how 
to  deal  with  them  efficiently.  Elementary 
applications  are  given  to  simulation,  numerical 
methods,  symbolic  computing,  software  and 
system  design. 


Volume  2/  Seminumerical  Algorithms 
Third  Edition  (0-201-89684-2) 

The  second  volume  offers  a complete 
introduction  to  the  field  of  seminumerical 
algorithms,  with  separate  chapters  on  random 
numbers  and  arithmetic.  The  book  summarizes 
the  major  paradigms  and  basic  theory  of  such 
algorithms,  thereby  providing  a comprehensive 
interface  between  computer  programming 
and  numerical  analysis. 


Volume  3/  Sorting  and  Searching 
Second  Edition  (0-201-89685-0) 

The  third  volume  comprises  the  most 
comprehensive  survey  of  classical  computer 
techniques  for  sorting  and  searching.  It  extends 
the  treatment  of  data  structures  in  Volume  I 
to  consider  both  large  and  small  databases  and 
internal  and  external  memories. 


Volume  4A  / Combinatorial  Algorithms, 

Part  1 (0-201-03804-8) 

This  volume  introduces  techniques  that  allow 
computers  to  deal  efficiently  with  gigantic 
problems.  Its  coverage  begins  with  Boolean 
functions  and  bitwise  tricks  and  techniques, 
then  treats  in  depth  the  generation  of  all 
tuples  and  permutations,  all  combinations 
and  partitions,  and  all  trees. 


DONALD  E.  KNUTH  Stanford  University 


▲ 

▼▼  ADDISON-WESLEY 


Volume  4A  / Combinatorial  Algorithms,  Part  1 


THE  ART  OF 

COMPUTER  PROGRAMMING 


Upper  Saddle  River,  NJ  ■ Boston  • Indianapolis  • San  Francisco 
New  York  • Toronto  • Montreal  • London  • Munich  • Paris  • Madrid 
Capetown  • Sydney  • Tokyo  • Singapore  • Mexico  City 


The  poem  on  page  437  is  quoted  from  The  Golden  Gate  by  Vikram  Seth  (New  York: 
Random  House,  1986),  copyright  © 1986  by  Vikram  Seth. 

The  author  and  publisher  have  taken  care  in  the  preparation  of  this  book,  but  make  no 
expressed  or  implied  warranty  of  any  kind  and  assume  no  responsibility  for  errors  or 
omissions.  No  liability  is  assumed  for  incidental  or  consequential  damages  in  connection 
with  or  arising  out  of  the  use  of  the  information  or  programs  contained  herein. 

The  publisher  offers  excellent  discounts  on  this  book  when  ordered  in  quantity  for  bulk 
purposes  or  special  sales,  which  may  include  electronic  versions  and/or  custom  covers 
and  content  particular  to  your  business,  training  goals,  marketing  focus,  and  branding 
interests.  For  more  information,  please  contact: 

U.S.  Corporate  and  Government  Sales  (800)  382-3419 
corpsalesSpearsontechgroup . com 
For  sales  outside  the  U.S.,  please  contact: 

International  Sales  internatlonal@pearsoned.com 

Visit  us  on  the  Web:  informit.com/aw 

Library  of  Congress  Cataloging-in-Publication  Data 
Knuth,  Donald  Ervin,  1938- 

The  art  of  computer  programming  / Donald  Ervin  Knuth. 
xvi,883  p.  24  cm. 

Includes  bibliographical  references  and  index. 

Contents:  v.  1.  Fundamental  algorithms.  — v.  2.  Seminumerical 
algorithms.  — v.  3.  Sorting  and  searching.  — v.  4a.  Combinatorial 
algorithms , part  1 . 

Contents:  v.  4a.  Combinatorial  algorithms,  part  1. 

ISBN  978-0-201-89683-1  (v.  1,  3rd  ed.) 

ISBN  978-0-201-89684-8  (v.  2,  3rd  ed.) 

ISBN  978-0-201-89685-5  (v.  3,  2nd  ed.) 

ISBN  978-0-201-03804-0  (v.  4a) 

1.  Electronic  digital  computers — Programming.  2.  Computer 

algorithms.  I.  Title. 

QA76.6.K64  1997 

005.1 — DC21  97-2147 

Internet  page  http://www-cs-faculty.stanford.edu/~knuth/taocp.html  contains 
current  information  about  this  book  and  related  books. 

See  also  http://www-cs-faculty.stanford.edu/~knuth/sgb.html  for  information 
about  The  Stanford  GraphBase,  including  downloadable  software  for  dealing  with 
the  graphs  used  in  many  of  the  examples. 

And  see  http://www-cs-faculty.stanford.edu/~knuth/mmix.html  for  basic  infor- 
mation about  the  MMIX  computer. 

Copyright  © 2011  by  Pearson  Education,  Inc. 

All  rights  reserved.  Printed  in  the  United  States  of  America.  This  publication  is 
protected  by  copyright,  and  permission  must  be  obtained  from  the  publisher  prior  to 
any  prohibited  reproduction,  storage  in  a retrieval  system,  or  transmission  in  any  form 
or  by  any  means,  electronic,  mechanical,  photocopying,  recording,  or  likewise.  For 
information  regarding  permissions,  write  to: 

Pearson  Education,  Inc. 

Rights  and  Contracts  Department 
501  Boylston  Street,  Suite  900 
Boston,  MA  02116  Fax:  (617)  671-3447 
ISBN-13  978-0-201-03804-0 
ISBN-10  0-201-03804-8 

Text  printed  in  the  United  States  at  Courier  Westford  in  West.ford,  Massachusetts. 
Second  printing,  March  2011 


PREFACE 


To  put  all  the  good  stuff  into  one  book  is  patently  impossible, 
and  attempting  even  to  be  reasonably  comprehensive 
about  certain  aspects  of  the  subject  is  likely  to  lead  to  runaway  growth. 

— GERALD  B.  FOLLAND,  “Editor’s  Corner"  (2005) 

The  title  of  Volume  4 is  Combinatorial  Algorithms,  and  when  I proposed  it 
I was  strongly  inclined  to  add  a subtitle:  The  Kind  of  Programming  I Like  Best. 
My  editors  have  decided  to  tone  down  such  exuberance,  but  the  fact  remains 
that  programs  with  a combinatorial  flavor  have  always  been  my  favorites. 

On  the  other  hand  I’ve  often  been  surprised  to  find  that,  in  many  people’s 
minds,  the  word  “combinatorial”  is  linked  with  computational  difficulty.  Indeed, 
Samuel  Johnson,  in  his  famous  dictionary  of  the  English  language  (1755),  said 
that  the  corresponding  noun  “is  now  generally  used  in  an  ill  sense.”  Colleagues 
tell  me  tales  of  woe,  in  which  they  report  that  “the  combinatorics  of  the  sit- 
uation defeated  us.”  Why  is  it  that,  for  me,  combinatorics  arouses  feelings  of 
pure  pleasure,  yet  for  many  others  it  evokes  pure  panic? 

It’s  true  that  combinatorial  problems  are  often  associated  with  humongously 
large  numbers.  Johnson’s  dictionary  entry  also  included  a quote  from  Ephraim 
Chambers,  who  had  stated  that  the  total  number  of  words  of  length  24  or  less, 
in  a 24-letter  alphabet,  is  1,391,724,288,887,252,999,425,128,493,402,200.  The 
corresponding  number  for  a 10-letter  alphabet  is  11,111,111,110;  and  it’s  only 
3905  when  the  number  of  letters  is  5.  Thus  a “combinatorial  explosion”  certainly 
does  occur  as  the  size  of  the  problem  grows  from  5 to  10  to  24  and  beyond. 

Computing  machines  have  become  tremendously  more  powerful  throughout 
my  life.  As  I write  these  words,  I know  that  they  are  being  processed  by  a “lap- 
top” whose  speed  is  more  than  100,000  times  faster  than  the  trusty  IBM  Type  650 
computer  to  which  I’ve  dedicated  these  books;  my  current  machine’s  memory 
capacity  is  also  more  than  100,000  times  greater.  Tomorrow’s  computers  will  be 
even  faster  and  more  capacious.  But  these  amazing  advances  have  not  diminished 
people’s  craving  for  answers  to  combinatorial  questions;  quite  the  contrary.  Our 
once-unimaginable  ability  to  compute  so  rapidly  has  raised  our  expectations, 
and  whetted  our  appetite  for  more  — because,  in  fact,  the  size  of  a combinatorial 
problem  can  increase  more  than  100,000-fold  when  n simply  increases  by  1. 

Combinatorial  algorithms  can  be  defined  informally  as  techniques  for  the 
high-speed  manipulation  of  combinatorial  objects  such  as  permutations  or  graphs. 
We  typically  try  to  find  patterns  or  arrangements  that  are  the  best  possible  ways 
to  satisfy  certain  constraints.  The  number  of  such  problems  is  vast,  and  the  art 
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of  writing  such  programs  is  especially  important  and  appealing  because  a single 
good  idea  can  save  years  or  even  centuries  of  computer  time. 

Indeed,  the  fact  that  good  algorithms  for  combinatorial  problems  can  have  a 
terrific  payoff  has  led  to  terrific  advances  in  the  state  of  the  art.  Many  problems 
that  once  were  thought  to  be  intractable  can  now  be  polished  off  with  ease,  and 
many  algorithms  that  once  were  known  to  be  good  have  now  become  better. 
Starting  about  1970,  computer  scientists  began  to  experience  a phenomenon 
that  we  called  “Floyd’s  Lemma”:  Problems  that  seemed  to  need  n3  operations 
could  actually  be  solved  in  ()(n2):  problems  that  seemed  to  require  n2  could  be 
handled  in  O(nlogn);  and  nlogn  was  often  reducible  to  0(n).  More  difficult 
problems  saw  a reduction  in  running  time  from  0(2”)  to  0(1.5”)  to  0(1.3"), 
etc.  Other  problems  remained  difficult  in  general,  but  they  were  found  to  have 
important  special  cases  that  are  much  simpler.  Many  combinatorial  questions 
that  I once  thought  would  never  be  answered  during  my  lifetime  have  now  been 
resolved,  and  those  breakthroughs  have  been  due  mainly  to  improvements  in 
algorithms  rather  than  to  improvements  in  processor  speeds. 

By  1975,  such  research  was  advancing  so  rapidly  that  a substantial  fraction 
of  the  papers  published  in  leading  journals  of  computer  science  were  devoted 
to  combinatorial  algorithms.  And  the  advances  weren’t  being  made  only  by 
people  in  the  core  of  computer  science;  significant  contributions  were  coming 
from  workers  in  electrical  engineering,  artificial  intelligence,  operations  research, 
mathematics,  physics,  statistics,  and  other  fields.  I was  trying  to  complete 
Volume  4 of  The  Art  of  Computer  Programming,  but  instead  I felt  like  I was 
sitting  on  the  lid  of  a boiling  kettle:  I was  confronted  with  a combinatorial 
explosion  of  another  kind,  a prodigious  explosion  of  new  ideas! 

This  series  of  books  was  born  at  the  beginning  of  1962,  when  I naively 
wrote  out  a list  of  tentative  chapter  titles  for  a 12-chapter  book.  At  that  time 
I decided  to  include  a brief  chapter  about  combinatorial  algorithms,  just  for 
fun.  “Hey  look,  most  people  use  computers  to  deal  with  numbers,  but  we  can 
also  write  programs  that  deal  with  patterns.”  In  those  days  it  was  easy  to  give 
a fairly  complete  description  of  just  about  every  combinatorial  algorithm  that 
was  known.  And  even  by  1966,  when  I’d  finished  a first  draft  of  about  3000 
handwritten  pages  for  that  already-overgrown  book,  fewer  than  100  of  those 
pages  belonged  to  Chapter  7.  I had  absolutely  no  idea  that  what  I’d  foreseen  as 
a sort  of  “salad  course”  would  eventually  turn  out  to  be  the  main  dish. 

The  great  combinatorial  fermentation  of  1975  has  continued  to  churn,  as 
more  and  more  people  have  begun  to  participate.  New  ideas  improve  upon  the 
older  ones,  but  rarely  replace  them  or  make  them  obsolete.  So  of  course  I’ve 
had  to  abandon  any  hopes  that  I once  had  of  being  able  to  surround  the  field, 
to  write  a definitive  book  that  sets  everything  in  order  and  provides  one-stop 
shopping  for  everyone  who  has  combinatorial  problems  to  solve.  The  array  of 
applicable  techniques  has  mushroomed  to  the  point  where  I can  almost  never 
discuss  a subtopic  and  say,  “Here’s  the  final  solution:  end  of  story.”  Instead,  I 
must  restrict  myself  to  explaining  the  most  important  principles  that  seem  to 
underlie  all  of  the  efficient  combinatorial  methods  that  I’ve  encountered  so  far. 
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At  present  I’ve  accumulated  more  than  twice  as  much  raw  material  for  Volume  4 
as  for  all  of  Volumes  1 3 combined. 

This  sheer  mass  of  material  implies  that  the  once-planned  “Volume  4”  must 
actually  become  several  physical  volumes.  You  are  now  looking  at  Volume  4A. 
Volumes  4B  and  4C  will  exist  someday,  assuming  that  I’m  able  to  remain  healthy; 
and  (who  knows?)  there  may  also  be  Volumes  4D,  4E,  . . . ; but  surely  not  4Z. 

My  plan  is  to  go  systematically  through  the  files  that  I’ve  amassed  since  1962 
and  to  tell  the  stories  that  I believe  are  still  waiting  to  be  told,  to  the  best  of 
my  ability.  I can’t  aspire  to  completeness,  but  I do  want  to  give  proper  credit  to 
all  of  the  pioneers  who  have  been  responsible  for  key  ideas;  so  I won’t  scrimp  on 
historical  details.  Furthermore,  whenever  I learn  something  that  I think  is  likely 
to  remain  important  50  years  from  now,  something  that  can  also  be  explained 
elegantly  in  a paragraph  or  two,  I can’t  bear  to  leave  it  out.  Conversely,  difficult 
material  that  requires  a lengthy  proof  is  beyond  the  scope  of  these  books,  unless 
the  subject  matter  is  truly  fundamental. 

OK,  it’s  clear  that  the  field  of  Combinatorial  Algorithms  is  vast,  and  I can’t 
cover  it  all.  What  are  the  most  important  things  that  I’m  leaving  out?  My 
biggest  blind  spot,  I think,  is  geometry,  because  I’ve  always  been  much  better  at 
visualizing  and  manipulating  algebraic  formulas  than  objects  in  space.  Therefore 
I don’t  attempt  to  deal  in  these  books  with  combinatorial  problems  that  are  re- 
lated to  computational  geometry,  such  as  close  packing  of  spheres,  or  clustering  of 
data  points  in  n-dimensional  Euclidean  space,  or  even  the  Steiner  tree  problem  in 
the  plane.  More  significantly,  I tend  to  shy  away  from  polyhedral  combinatorics, 
and  from  approaches  that  are  based  primarily  on  linear  programming,  integer 
programming,  or  semidefinite  programming.  Those  topics  are  treated  well  in 
many  other  books  on  the  subject,  and  they  rely  on  geometrical  intuition.  Purely 
combinatorial  developments  are  easier  for  me  to  understand. 

I also  must  confess  a bias  against  algorithms  that  are  efficient  only  in 
an  asymptotic  sense,  algorithms  whose  superior  performance  doesn’t  begin  to 
“kick  in”  until  the  size  of  the  problem  exceeds  the  size  of  the  universe.  A great 
many  publications  nowadays  are  devoted  to  algorithms  of  that  kind.  I can 
understand  why  the  contemplation  of  ultimate  limits  has  intellectual  appeal  and 
carries  an  academic  cachet;  but  in  The  Art  of  Computer  Programming  I tend 
to  give  short  shrift  to  any  methods  that  I would  never  consider  using  myself  in 
an  actual  program.  (There  are,  of  course,  exceptions  to  this  rule,  especially  with 
respect  to  basic  concepts  in  the  core  of  the  subject.  Some  impractical  methods 
are  simply  too  beautiful  and/or  too  insightful  to  be  excluded;  others  provide 
instructive  examples  of  what  not  to  do.) 

Furthermore,  as  in  earlier  volumes  of  this  series,  I’m  intentionally  concen- 
trating almost  entirely  on  sequential  algorithms,  even  though  computers  are 
increasingly  able  to  carry  out  activities  in  parallel.  I’m  unable  to  judge  what 
ideas  about  parallelism  are  likely  to  be  useful  five  or  ten  years  from  now,  let 
alone  fifty,  so  I happily  leave  such  questions  to  others  who  are  wiser  than  I. 
Sequential  methods,  by  themselves,  already  test  the  limits  of  my  own  ability  to 
discern  what  the  artful  programmers  of  tomorrow  will  want  to  know. 
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The  main  decision  that  I needed  to  make  when  planning  how  to  present  this 
material  was  whether  to  organize  it  by  problems  or  by  techniques.  Chapter  5 
in  Volume  3,  for  example,  was  devoted  to  a single  problem,  the  sorting  of  data 
into  order;  more  than  two  dozen  techniques  were  applied  to  different  aspects 
of  that  problem.  Combinatorial  algorithms,  by  contrast,  involve  many  different 
problems,  which  tend  to  be  attacked  with  a smaller  repertoire  of  techniques. 
I finally  decided  that  a mixed  strategy  would  work  better  than  any  pure  ap- 
proach. Thus,  for  example,  these  books  treat  the  problem  of  finding  shortest 
paths  in  Section  7.3,  and  problems  of  connectivity  in  Section  7.4.1;  but  many 
other  sections  are  devoted  to  basic  techniques,  such  as  the  use  of  Boolean 
algebra  (Section  7.1),  backtracking  (Section  7.2),  matroid  theory  (Section  7.6), 
or  dynamic  programming  (Section  7.7).  The  famous  Traveling  Salesrep  Problem, 
and  other  classic  combinatorial  tasks  related  to  covering,  coloring,  and  packing, 
have  no  sections  of  their  own,  but  they  come  up  several  times  in  different  places 
as  they  are  treated  by  different  methods. 

I’ve  mentioned  great  progress  in  the  art  of  combinatorial  computing,  but  I 
don’t  mean  to  imply  that  all  combinatorial  problems  have  actually  been  tamed. 
When  the  running  time  of  a computer  program  goes  ballistic,  its  programmers 
shouldn’t  expect  to  find  a silver  bullet  for  their  needs  in  this  book.  The  methods 
described  here  will  often  work  a great  deal  faster  than  the  first  approaches  that 
a programmer  tries;  but  let’s  face  it:  Combinatorial  problems  get  huge  very 
quickly.  We  can  even  prove  rigorously  that  a certain  small,  natural  problem  will 
never  have  a feasible  solution  in  the  real  world,  although  it  is  solvable  in  principle 
(see  the  theorem  of  Stockmeyer  and  Meyer  in  Section  7.1.2).  In  other  cases  we 
cannot  prove  as  yet  that  no  decent  algorithm  for  a given  problem  exists,  but 
we  know  that  such  methods  are  unlikely,  because  any  efficient  algorithm  would 
yield  a good  way  to  solve  thousands  of  other  problems  that  have  stumped  the 
world’s  greatest  experts  (see  the  discussion  of  NP-completeness  in  Section  7.9). 

Experience  suggests  that  new  combinatorial  algorithms  will  continue  to  be 
invented,  for  new  combinatorial  problems  and  for  newly  identified  variations  or 
special  cases  of  old  ones;  and  that  people’s  appetite  for  such  algorithms  will  also 
continue  to  grow.  The  art  of  computer  programming  continually  reaches  new 
heights  when  programmers  are  faced  with  challenges  such  as  these.  Yet  today’s 
methods  are  also  likely  to  remain  relevant. 

Most  of  this  book  is  self-contained,  although  there  are  frequent  tie-ins  with 
the  topics  discussed  in  Volumes  1-3.  Low-level  details  of  machine  language 
programming  have  been  covered  extensively  in  those  volumes,  so  the  algorithms 
in  the  present  book  are  usually  specified  only  at  an  abstract  level,  independent  of 
any  machine.  However,  some  aspects  of  combinatorial  programming  are  heavily 
dependent  on  low-level  details  that  didn’t  arise  before;  in  such  cases,  all  examples 
in  this  book  are  based  on  the  MMIX  computer,  which  supersedes  the  MIX  machine 
that  was  defined  in  early  editions  of  Volume  1.  Details  about  MMIX  appear  in  a 
paperback  supplement  to  that  volume  called  The  Art  of  Computer  Programming , 
Volume  1,  Fascicle  1,  containing  Sections  1.3. T,  1.3.2',  etc.;  they’re  also  available 
on  the  Internet,  together  with  downloadable  assemblers  and  simulators. 
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Another  downloadable  resource,  a collection  of  programs  and  data  called  The 
Stanford  GraphBase,  is  cited  extensively  in  the  examples  of  this  book.  Readers 
are  encouraged  to  play  with  it,  in  order  to  learn  about  combinatorial  algorithms 
in  what  I think  will  be  the  most  efficient  and  most  enjoyable  way. 

Incidentally,  while  writing  the  introductory  material  at  the  beginning  of 
Chapter  7,  I was  pleased  to  note  that  it  was  natural  to  mention  some  work  of 
my  Ph.D.  thesis  advisor,  Marshall  Hall,  Jr.  (1910-1990),  as  well  as  some  work 
of  his  thesis  advisor,  Oystein  Ore  (1899-1968),  as  well  as  some  work  of  his  thesis 
advisor,  Thoralf  Skolem  (1887  1963).  Skolem’s  advisor,  Axel  Thue  (1863-1922), 
was  already  present  in  Chapter  6. 

I’m  immensely  grateful  to  the  hundreds  of  readers  who  have  helped  me  to 
ferret  out  numerous  mistakes  that  I made  in  the  early  drafts  of  this  volume,  which 
were  originally  posted  on  the  Internet  and  subsequently  printed  in  paperback 
fascicles.  In  particular,  the  extensive  comments  of  Thorsten  Dahlheimer,  Marc 
van  Leeuwen,  and  Udo  Wermuth  have  been  especially  influential.  But  I fear  that 
other  errors  still  lurk  among  the  details  collected  here,  and  I want  to  correct  them 
as  soon  as  possible.  Therefore  I will  cheerfully  award  $2.56  to  the  first  finder  of 
each  technical,  typographical,  or  historical  error.  The  taocp  webpage  cited  on 
page  iv  contains  a current  listing  of  all  corrections  that  have  been  reported  to  ine. 

Stanford,  California  D-  E.  K. 

October  2010 

In  my  preface  to  the  first  edition, 

I begged  the  reader  not  to  draw  attention  to  errors. 

I now  wish  I had  not  done  so 
and  am  grateful  to  the  few  readers  who  ignored  my  request. 

— STUART  SUTHERLAND,  The  International  Dictionary  of  Psychology  (1996) 

Naturally,  I am  responsible  for  the  remaining  errors — 
although,  in  my  opinion,  my  friends  could  have  caught  a few  more. 

— CHRISTOS  H.  PAPADIMITRIOU,  Computational  Complexity  (1994) 

I like  to  work  in  a variety  of  fields 
in  order  to  spread  my  mistakes  more  thinly. 

— VICTOR  KLEE  (1999) 

A note  on  references.  Several  oft-cited  journals  and  conference  proceedings 
have  special  code  names,  which  appear  in  the  Index  and  Glossary  at  the  close  of 
this  book.  But  the  various  kinds  of  IEEE  Transactions  are  cited  by  including  a 
letter  code  for  the  type  of  transactions,  in  boldface  preceding  the  volume  number. 
For  example,  TEEE  Trans.  C-35’  means  the  IEEE  Transactions  on  Computers, 
volume  35.  The  IEEE  no  longer  uses  these  convenient  letter  codes,  but  the  codes 
aren’t  too  hard  to  decipher:  ‘EC’  once  stood  for  “Electronic  Computers,”  ‘IT' 
for  “Information  Theory,”  ‘SE’  for  “Software  Engineering,”  and  ‘SP‘  for  “Signal 
Processing,”  etc.;  ‘CAD’  meant  “Computer-Aided  Design  of  Integrated  Circuits 
and  Systems.” 

A cross-reference  such  as  ‘exercise  7.10-00’  points  to  a future  exercise  in 
Section  7.10  whose  number  is  not  yet  known. 
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A note  on  notations.  Simple  and  intuitive  conventions  for  the  algebraic  rep- 
resentation of  mathematical  concepts  have  always  been  a boon  to  progress,  espe- 
cially when  most  of  the  world’s  researchers  share  a common  symbolic  language. 
The  current  state  of  affairs  in  combinatorial  mathematics  is  unfortunately  a bit 
of  a mess  in  this  regard,  because  the  same  symbols  are  occasionally  used  with 
completely  different  meanings  by  different  groups  of  people;  some  specialists  who 
work  in  comparatively  narrow  subfields  have  unintentionally  spawned  conflicting 
symbolisms.  Computer  science  — which  interacts  with  large  swaths  of  math- 
ematics needs  to  steer  clear  of  this  danger  by  adopting  internally  consistent 
notations  whenever  possible.  Therefore  I’ve  often  had  to  choose  among  a number 
of  competing  schemes,  knowing  that  it  will  be  impossible  to  please  everyone. 
I have  tried  my  best  to  come  up  with  notations  that  I believe  will  be  best  for  the 
future,  often  after  many  years  of  experimentation  and  discussion  with  colleagues, 
often  flip-flopping  between  alternatives  until  finding  something  that  works  well. 
Usually  it  has  been  possible  to  find  convenient  conventions  that  other  people 
have  not  already  coopted  in  contradictory  ways. 

Appendix  B is  a comprehensive  index  to  all  of  the  principal  notations  that 
are  used  in  the  present  book,  inevitably  including  several  that  are  not  (yet?) 
standard.  If  you  run  across  a formula  that  looks  weird  and/or  incomprehensible, 
chances  are  fairly  good  that  Appendix  B will  direct  you  to  a page  where  my 
intentions  are  clarified.  But  I might  as  well  list  here  a few  instances  that  you 
might  wish  to  watch  for  when  you  read  this  book  for  the  first  time: 

• Hexadecimal  constants  are  preceded  by  a number  sign  or  hash  mark.  For 
example,  #123  means  (123) i6. 

• The  monus  operation  x — y,  sometimes  called  dot-minus  or  saturating 
subtraction,  yields  max(0,x  - y ). 

• The  median  of  three  numbers  {x,y,z}  is  denoted  by  ( xyz ). 

• A set  such  as  {x},  which  consists  of  a single  element,  is  often  denoted  simply 
by  x in  contexts  such  as  X U x or  X \ x. 

• If  n is  a nonnegative  integer,  the  number  of  1-bits  in  n’s  binary  representation 
is  vn.  Furthermore,  if  n > 0,  the  leftmost  and  rightmost  1-bits  of  n are 
respectively  2An  and  2pn . For  example,  i/10  = 2,  A10  = 3,  plO  = 1. 

• The  Cartesian  product  of  graphs  G and  H is  denoted  by  GuH . For  example, 
Cm  n Cn  denotes  an  m x n torus,  because  Cn  denotes  a cycle  of  n vertices. 
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The  exercises  in  this  set  of  books  have  been  designed  for  self-study  as  well 
as  for  classroom  study.  It  is  difficult,  if  not  impossible,  for  anyone  to  learn  a 
subject  purely  by  reading  about  it,  without  applying  the  information  to  specific 
problems  and  thereby  being  encouraged  to  think  about  what  has  been  read. 
Furthermore,  we  all  learn  best  the  things  that  we  have  discovered  for  ourselves. 
Therefore  the  exercises  form  a major  part  of  this  work;  a definite  attempt  has 
been  made  to  keep  them  as  informative  as  possible  and  to  select  problems  that 
are  enjoyable  as  well  as  instructive. 

In  many  books,  easy  exercises  are  found  mixed  randomly  among  extremely 
difficult  ones.  A motley  mixture  is,  however,  often  unfortunate  because  readers 
like  to  know  in  advance  how  long  a problem  ought  to  take  — otherwise  they 
may  just  skip  over  all  the  problems.  A classic  example  of  such  a situation  is 
the  book  Dynamic  Programming  by  Richard  Bellman;  this  is  an  important, 
pioneering  work  in  which  a group  of  problems  is  collected  together  at  the  end 
of  some  chapters  under  the  heading  “Exercises  and  Research  Problems,”  with 
extremely  trivial  questions  appearing  in  the  midst  of  deep,  unsolved  problems. 
It  is  rumored  that  someone  once  asked  Dr.  Bellman  how  to  tell  the  exercises 
apart  from  the  research  problems,  and  he  replied,  “If  you  can  solve  it,  it  is  an 
exercise;  otherwise  it’s  a research  problem.” 

Good  arguments  can  be  made  for  including  both  research  problems  and 
very  easy  exercises  in  a book  of  this  kind;  therefore,  to  save  the  reader  from 
the  possible  dilemma  of  determining  which  are  which,  rating  numbers  have  been 
provided  to  indicate  the  level  of  difficulty.  These  numbers  have  the  following 
general  significance: 

Rating  Interpretation 

00  An  extremely  easy  exercise  that  can  be  answered  immediately  if  the 
material  of  the  text  has  been  understood;  such  an  exercise  can  almost 
always  be  worked  “in  your  head,”  unless  you’re  multitasking. 

10  A simple  problem  that  makes  you  think  over  the  material  just  read,  but 
is  by  no  means  difficult.  You  should  be  able  to  do  this  in  one  minute  at 
most;  pencil  and  paper  may  be  useful  in  obtaining  the  solution. 

20  An  average  problem  that  tests  basic  understanding  of  the  text  mate- 
rial, but  you  may  need  about  fifteen  or  twenty  minutes  to  answer  it 
completely.  Maybe  even  twenty-five. 
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30  A problem  of  moderate  difficulty  and/or  complexity;  this  one  may 
involve  more  than  two  hours’  work  to  solve  satisfactorily,  or  even  more 
if  the  TV  is  on. 

40  Quite  a difficult  or  lengthy  problem  that  would  be  suitable  for  a term 
project  in  classroom  situations.  A student  should  be  able  to  solve  the 
problem  in  a reasonable  amount  of  time,  but  the  solution  is  not  trivial. 

50  A research  problem  that  has  not  yet  been  solved  satisfactorily,  as  far 
as  the  author  knew  at  the  time  of  writing,  although  many  people  have 
tried.  If  you  have  found  an  answer  to  such  a problem,  you  ought  to 
write  it  up  for  publication;  furthermore,  the  author  of  this  book  would 
appreciate  hearing  about  the  solution  as  soon  as  possible  (provided  that 
it  is  correct). 

By  interpolation  in  this  “logarithmic”  scale,  the  significance  of  other  rating 
numbers  becomes  clear.  For  example,  a rating  of  1 7 would  indicate  an  exercise 
that  is  a bit  simpler  than  average.  Problems  with  a rating  of  50  that  are 
subsequently  solved  by  some  reader  may  appear  with  a / 5 rating  in  later  editions 
of  the  book,  and  in  the  errata  posted  on  the  Internet  (see  page  iv). 

The  remainder  of  the  rating  number  divided  by  5 indicates  the  amount  of 
detailed  work  required.  Thus,  an  exercise  rated  24  may  take  longer  to  solve  than 
an  exercise  that  is  rated  25,  but  the  latter  will  require  more  creativity. 

The  author  has  tried  earnestly  to  assign  accurate  rating  numbers,  but  it  is 
difficult  for  the  person  who  makes  up  a problem  to  know  just  how  formidable  it 
will  be  for  someone  else  to  find  a solution;  and  everyone  has  more  aptitude  for 
certain  types  of  problems  than  for  others.  It  is  hoped  that  the  rating  numbers 
represent  a good  guess  at  the  level  of  difficulty,  but  they  should  be  taken  as 
general  guidelines,  not  as  absolute  indicators. 

This  book  has  been  written  for  readers  with  varying  degrees  of  mathematical 
training  and  sophistication;  as  a result,  some  of  the  exercises  are  intended  only  for 
the  use  of  more  mathematically  inclined  readers.  The  rating  is  preceded  by  an  M 
if  the  exercise  involves  mathematical  concepts  or  motivation  to  a greater  extent 
than  necessary  for  someone  who  is  primarily  interested  only  in  programming 
the  algorithms  themselves.  An  exercise  is  marked  with  the  letters  “ HM ” if  its 
solution  necessarily  involves  a knowledge  of  calculus  or  other  higher  mathematics 
not  developed  in  this  book.  An  UHM ” designation  does  not  necessarily  imply 
difficulty. 

Some  exercises  are  preceded  by  an  arrowhead,  this  designates  prob- 
lems that  are  especially  instructive  and  especially  recommended.  Of  course,  no 
reader/student  is  expected  to  work  all  of  the  exercises,  so  those  that  seem  to 
be  the  most  valuable  have  been  singled  out.  (This  distinction  is  not  meant  to 
detract  from  the  other  exercises!)  Each  reader  should  at  least  make  an  attempt 
to  solve  all  of  the  problems  whose  rating  is  10  or  less;  and  the  arrows  may  help 
to  indicate  which  of  the  problems  with  a higher  rating  should  be  given  priority. 

Several  sections  have  more  than  100  exercises.  How  can  you  find  your  way 
among  so  many?  In  general  the  sequence  of  exercises  tends  to  follow  the  sequence 
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of  ideas  in  the  main  text.  Adjacent  exercises  build  on  each  other,  as  in  the 
pioneering  problem  books  of  Polya  and  Szego.  The  final  exercises  of  a section 
often  involve  the  section  as  a whole,  or  introduce  supplementary  topics. 

Solutions  to  most  of  the  exercises  appear  in  the  answer  section.  Please  use 
them  wisely;  do  not  turn  to  the  answer  until  you  have  made  a genuine  effort  to 
solve  the  problem  by  yourself,  or  unless  you  absolutely  do  not  have  time  to  work 
this  particular  problem.  After  getting  your  own  solution  or  giving  the  problem  a 
decent  try,  you  may  find  the  answer  instructive  and  helpful.  The  solution  given 
will  often  be  quite  short,  and  it  will  sketch  the  details  under  the  assumption 
that  you  have  earnestly  tried  to  solve  it  by  your  own  means  first.  Sometimes  the 
solution  gives  less  information  than  was  asked;  often  it  gives  more.  It  is  quite 
possible  that  you  may  have  a better  answer  than  the  one  published  here,  or  you 
may  have  found  an  error  in  the  published  solution;  in  such  a case,  the  author 
will  be  pleased  to  know  the  details.  Later  printings  of  this  book  will  give  the 
improved  solutions  together  with  the  solver’s  name  where  appropriate. 

When  working  an  exercise  you  may  generally  use  the  answers  to  previous 
exercises,  unless  specifically  forbidden  from  doing  so.  The  rating  numbers  have 
been  assigned  with  this  in  mind;  thus  it  is  possible  for  exercise  n + 1 to  have  a 
lower  rating  than  exercise  n,  even  though  it  includes  the  result  of  exercise  n as 
a special  case. 


Summary  of  codes: 

00 

Immediate 

10 

Simple  (one  minute) 

20 

Medium  (quarter  hour) 

► 

Recommended 

30 

Moderately  hard 

M 

Mathematically  oriented 

40 

Term  project 

EM 

Requiring  “higher  math” 

50 

Research  problem 

EXERCISES 

► 1.  [00]  What  does  the  rating  “M15”  mean? 

2.  [10]  Of  what  value  can  the  exercises  in  a textbook  be  to  the  reader? 

3.  [HM45]  Prove  that  every  simply  connected,  closed  3-dimensional  manifold  is  topo- 
logically equivalent  to  a 3-dimensional  sphere. 


Art  derives  a considerable  part  of  its  beneficial  exercise 
from  flying  in  the  face  of  presumptions. 
— HENRY  JAMES,  “The  Art  of  Fiction"  (1884) 


/ am  grateful  to  all  my  friends, 
and  record  here  and  now  my  most  especial  appreciation 
to  those  friends  who,  after  a decent  interval, 
stopped  asking  me,  "How’s  the  book  coming?" 

— PETER  J.  GOMES,  The  Good  Book  (1996) 


/ at  last  deliver  to  the  world  a Work  which  I have  long  promised, 
and  of  which,  I am  afraid,  too  high  expectations  have  been  raised. 
The  delay  of  its  publication  must  be  imputed,  in  a considerable  degree, 
to  the  extraordinary  zeal  which  has  been  shown  by  distinguished  persons 
in  all  quarters  to  supply  me  with  additional  information. 
— JAMES  BOSWELL,  The  Life  of  Samuel  Johnson,  LL.D.  (1791) 


The  author  is  especially  grateful  to  the  Addison-Wesley  Publishing  Company 
for  its  patience  in  waiting  a full  decade  for  this  manuscript 
from  the  date  the  contract  was  signed. 
— FRANK  HARARY,  Graph  Theory  (1969) 


The  average  boy  who  abhors  square  root  or  algebra 
finds  delight  in  working  puzzles  which  involve  similar 
principles,  and  may  be  led  into  a course  of  study 
which  would  develop  the  mathematical  and  inventive  bumps 
in  a way  to  astonish  the  family  phrenologist. 

— SAM  LOYD,  The  World  of  Puzzledom  (1896) 


Bitte  ein  Bit! 
— Slogan  of  Bitburger  Brauerei  (1951) 
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CHAPTER  SEVEN 


COMBINATORIAL  SEARCHING 


You  shall  seeke  all  day  ere  you  finde  them, 
& when  you  have  them,  they  are  not  worth  the  search. 

— BASSANIO,  in  The  Merchant  of  Venice  (Act  I,  Scene  1,  Line  117) 

Amid  the  action  and  reaction  of  so  dense  a swarm  of  humanity, 
every  possible  combination  of  events  may  be  expected  to  take  place, 
and  many  a little  problem  will  be  presented  which  may  be  striking  and  bizarre. 

— SHERLOCK  HOLMES,  in  The  Adventure  of  the  Blue  Carbuncle  (1892) 

The  field  of  combinatorial  algorithms  is  too  vast  to  cover 
in  a single  paper  or  even  in  a single  book. 

— ROBERT  E.  TARJAN  (1976) 


While  Jostling  against  all  manner  of  people 
it  has  been  impressed  upon  my  mind  that  the  successful  ones 
are  those  who  have  a natural  faculty  for  solving  puzzles. 

Life  is  full  of  puzzles,  and  we  are  called  upon 
to  solve  such  as  fate  throws  our  way. 

— SAM  LOYD,  JR.  (1927) 


Combinatorics  is  the  study  of  the  ways  in  which  discrete  objects  can  be 
arranged  into  various  kinds  of  patterns.  For  example,  the  objects  might  be  2 n 
numbers  {1, 1, 2, 2 .,n,n},  and  we  might  want  to  place  them  in  a row  so  that 
exactly  k numbers  occur  between  the  two  appearances  of  each  digit  k.  When 
n — 3 there  is  essentially  only  one  way  to  arrange  such  “Langford  pairs,”  namely 
231213  (and  its  left-right  reversal);  similarly,  there’s  also  a unique  solution  when 
n = 4.  Many  other  types  of  combinatorial  patterns  are  discussed  below. 

Five  basic  types  of  questions  typically  arise  when  combinatorial  problems 
are  studied,  some  more  difficult  than  others. 

i)  Existence:  Are  there  any  arrangements  X that  conform  to  the  pattern.'* 

ii)  Construction:  If  so,  can  such  an  X be  found  quickly? 

iii)  Enumeration:  How  many  different  arrangements  X exist? 

iv)  Generation:  Can  all  arrangements  X\,  X2,  ...  be  visited  systematically? 

v)  Optimization:  What  arrangements  maximize  or  minimize  f{X),  given  an 
objective  function  /? 

Each  of  these  questions  turns  out  to  be  interesting  with  respect  to  Langford  pairs. 
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For  example,  consider  the  question  of  existence.  Trial  and  error  quickly 
reveals  that,  when  n = 5,  we  cannot  place  {1, 1,  2,  2, . . . , 5,  5}  properly  into  ten 
positions.  The  two  Is  must  both  go  into  even-numbered  slots,  or  both  into  odd- 
numbered  slots;  similarly,  the  3s  and  5s  must  choose  between  two  evens  or  two 
odds;  but  the  2s  and  4s  use  one  of  each.  Thus  we  can’t  fill  exactly  five  slots  of 
each  parity.  This  reasoning  also  proves  that  the  problem  has  no  solution  when 
n = 6,  or  in  general  whenever  the  number  of  odd  values  in  {1,2,...,  n } is  odd. 

In  other  words,  Langford  pairings  can  exist  only  when  n = 4m-l  or  n = 4m, 
for  some  integer  m.  Conversely,  when  n does  have  this  form,  Roy  O.  Davies  has 
found  an  elegant  way  to  construct  a suitable  placement  (see  exercise  1). 

How  many  essentially  different  pairings,  Ln,  exist?  Lots,  when  n grows: 

L3  = l;  L4  = 1; 

L7  = 26;  l8  = 150; 

Ln  = 17,792;  L12  = 108,144; 

Lis  = 39,809,640;  L16  = 326,721,800;  ^ 

Llg  = 256,814,891,280;  L20  = 2,636,337,861,200; 

L23  = 3,799,455,942,515,488;  L24  = 46,845,158,056,515,936. 

[The  values  of  L23  and  L24  were  determined  by  M.  Krajecki,  C.  Jaillet,  and  A.  Bui 
in  2004  and  2005;  see  Studia  Informatica  Universalis  4 (2005),  151-190.]  A seat- 
of-the-pants  calculation  suggests  that  Ln  might  be  roughly  of  order  (4n/e3)n+1/2 
when  it  is  nonzero  (see  exercise  5);  and  in  fact  this  prediction  turns  out  to  be 
basically  correct  in  all  known  cases.  But  no  simple  formula  is  apparent. 

The  problem  of  Langford  arrangements  is  a simple  special  case  of  a general 
class  of  combinatorial  challenges  called  exact  cover  problems.  In  Section  7.2.2. 1 
we  shall  study  an  algorithm  called  “dancing  links,”  which  is  a convenient  way  to 
generate  all  solutions  to  such  problems.  When  n = 16,  for  example,  that  method 
needs  to  perform  only  about  3200  memory  accesses  for  each  Langford  pair 
arrangement  that  it  finds.  Thus  the  value  of  L16  can  be  computed  in  a reasonable 
amount  of  time  by  simply  generating  all  of  the  pairings  and  counting  them. 

Notice,  however,  that  L24  is  a huge  number  — roughly  5xl016,  or  about  1500 
MIP-years.  (Recall  that  a “MIP-year”  is  the  number  of  instructions  executed 
per  year  by  a machine  that  carries  out  a million  instructions  per  second  namely 
31,556,952,000,000.)  Therefore  it’s  clear  that  the  exact  value  of  L24  was  deter- 
mined by  some  technique  that  did  not  involve  generating  all  of  the  arrangements. 
Indeed,  there  is  a much,  much  faster  way  to  compute  Ln,  using  polynomial 
algebra.  The  instructive  method  described  in  exercise  6 needs  0(4nn)  operations 
which  may  seem  inefficient;  but  it  beats  the  generate-and-count  method  by  a 
whopping  factor  of  order  0((n/e3)"-1/2)i  and  even  when  n = 16  it  runs  about 
20  times  faster.  On  the  other  hand,  the  exact  value  of  L100  will  probably  never 
be  known,  even  as  computers  become  faster  and  faster. 

We  can  also  consider  Langford  pairings  that  are  optimum  in  various  ways. 

For  example,  it’s  possible  to  arrange  sixteen  pairs  of  weights  {1, 1,  2,  2, 16, 16} 

that  satisfy  Langford’s  condition  and  have  the  additional  property  of  being  “well- 
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balanced,”  in  the  sense  that  they  won’t  tip  a balance  beam  when  they  are  placed 
in  the  appropriate  order: 


Q 


an 


jin 


*11 


AM  Inn 


1 1 I L:J 
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— 

— 

h±-\ 

bdd 

— 

— 

u i-|d_| 



u 

— 

— 

= 

— 

— 

E 

— 

— 

H 

_ 

— 

— 

= 

1 

(2) 


In  other  words,  15.5-16  + 14.5-6+-  • - + 0.5-8  — 0.5-11  + - ■ ■ + 14.5 - 4 + 15.5 • 7;  and 

in  this  particular  example  we  also  have  another  kind  of  balance,  16  + 6-1 h8  = 

11  + 16  + • ■ • + 7,  hence  also  16  • 16  + 15  • 6 + • • • + 1 • 8 = 1 • 11  + • • • + 15  • 4 + 16  • 7. 

Moreover,  the  arrangement  in  (2)  has  minimum  width  among  all  Langford 
pairings  of  order  16:  The  connecting  lines  at  the  bottom  of  the  diagram  show 
that  no  more  than  seven  pairs  are  incomplete  at  any  point,  as  we  read  from  left 
to  right;  and  one  can  show  that  a width  of  six  is  impossible.  (See  exercise  7.) 

What  arrangements  aia^  ■ ■ ■ (132  of  {1, 1, . . . , 16, 16}  are  the  least  balanced, 
in  the  sense  that  Y^k= 1 kak  is  maximized?  The  maximum  possible  value  turns 
out  to  be  5268.  One  such  pairing  — there  are  12,016  of  them  — is 


2 3 4 2 1 3 1 4 16  13  15  5 14  7 9 6 11  5 12  10  8 7 6 13  9 16  15  14  11  8 10  12.  (3) 


A more  interesting  question  is  to  ask  for  the  Langford  pairings  that  are 
smallest  and  largest  in  lexicographic  order.  The  answers  for  n = 24  are 

{abacbdecf gdoersf pgqtuwxv j klonhmirps j qkhlt iunmwvx,  . ^ 

xvwsquntkigrdapaodgiknqs vxwutmrpohl j cf becbhmf e j 1 } 

if  we  use  the  letters  a,  b,  . . . , w,  x instead  of  the  numbers  1,2,...,  23,  24. 

We  shall  discuss  many  techniques  for  combinatorial  optimization  in  later  sec- 
tions of  this  chapter.  Our  goal,  of  course,  will  be  to  solve  such  problems  without 
examining  more  than  a tiny  portion  of  the  space  of  all  possible  arrangements. 

Orthogonal  latin  squares.  Let’s  look  back  for  a moment  at  the  early  days  of 
combinatorics.  A posthumous  edition  of  Jacques  Ozanam  s Recreations  math- 
ematiques  et  physiques  (Paris:  1725)  included  an  amusing  puzzle  in  volume  4, 
page  434:  “Take  all  the  aces,  kings,  queens,  and  jacks  from  an  ordinary  deck  of 
playing  cards  and  arrange  them  in  a square  so  that  each  row  and  each  column 
contains  all  four  values  and  all  four  suits.”  Can  you  do  it?  Ozanam’s  solution, 
shown  in  Fig.  1 on  the  next  page,  does  even  more:  It  exhibits  the  full  panoply 
of  values  and  of  suits  also  on  both  main  diagonals.  (Please  don’t  turn  the  page 
until  you’ve  given  this  problem  a try.) 

By  1779  a similar  puzzle  was  making  the  rounds  of  St.  Petersburg,  and  it 
came  to  the  attention  of  the  great  mathematician  Leonhard  Euler.  “Thirty-six 
officers  of  six  different  ranks,  taken  from  six  different  regiments,  want  to  march 
in  a 6 x 6 formation  so  that  each  row  and  each  column  will  contain  one  officer  of 
each  rank  and  one  of  each  regiment.  How  can  they  do  it?”  Nobody  was  able  to 
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Fig.  1.  Disorder  in  the  court  cards: 
No  agreement  in  any  line  of  four. 
(This  configuration  is  one  of  many 
ways  to  solve  a popular  eighteenth- 
century  problem.) 


find  a satisfactory  marching  order.  So  Euler  decided  to  resolve  the  riddle  — even 
though  he  had  become  nearly  blind  in  1771  and  was  dictating  all  of  his  work 
to  assistants.  He  wrote  a major  paper  on  the  subject  [eventually  published  in 
Verhandelingen  uitgegeven  door  het  Zeeuwsch  Genootschap  der  Wetenschappen 
te  Vlissingen  9 (1782),  85-239],  in  which  he  constructed  suitable  arrangements 
for  the  analogous  task  with  n ranks  and  n regiments  when  n = 1,  3.  4,  5 7 8 
9,  11,  12,  13,  15,  16,  . . . ; only  the  cases  with  n mod  4 = 2 eluded  him. 

There’s  obviously  no  solution  when  n = 2.  But  Euler  was  stumped  when  n = 
6,  after  having  examined  a ‘very  considerable  number”  of  square  arrangements 
that  didn  t work.  He  showed  that  any  actual  solution  would  lead  to  many  others 
that  look  different,  and  he  couldn’t  believe  that  all  such  solutions  had  escaped 
his  attention.  Therefore  he  said,  “I  do  not  hesitate  to  conclude  that  one  cannot 
produce  a complete  square  of  36  cells,  and  that  the  same  impossibility  extends 
to  the  cases  n 10,  n = 14  . . . in  general  to  all  oddly  even  numbers.” 

Euler  named  the  36  officers  aa,  a/3,  ay,  aS,  ae,  a(,  ba,  b/3,  67,  bS,  be , b( , 
ca,  c/3,  cy,  cS,  ce,  c(,  da,  d/3,  dy,  dS,  de,  d(,  ea,  e/3,  ey,  eS,  ee,  e(,  fa,  f/3,  fy, 

/e>  /Ci  based  on  their  regiments  and  ranks.  He  observed  that  any  solution 
would  amount  to  having  two  separate  squares,  one  for  Latin  letters  and  another 
for  Greek.  Each  of  those  squares  is  supposed  to  have  distinct  entries  in  rows  and 
columns;  so  he  began  by  studying  the  possible  configurations  for  {a,  b,  c,  d,  e,  /}, 
which  he  called  Latin  squares.  A Latin  square  can  be  paired  up  with  a Greek 
square  to  form  a “Graeco-Latin  square”  only  if  the  squares  are  orthogonal  to  each 
other,  meaning  that  no  (Latin,  Greek)  pair  of  letters  can  be  found  together  in 
more  than  one  place  when  the  squares  are  superimposed.  For  example,  if  we  let 
a = A,  b — K,  c = Q,  d = J,  a = 4,  (3  = 4,  7 = <>,  and  8 = Z>,  Fig.  1 is  equivalent 
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to  the  Latin,  Greek,  and  Graeco-Latin  squares 


( d 

c 

a 

b 

c\ 

(1 

s 

P 

a\ 

(dj 

aS 

bp 

ca  \ 

b 

a 

d 

P 

a 

7 

5 

, and 

cP 

ba 

a'y 

dS 

a 

d 

c 

b 

•) 

a 

P 

5 

7 

aa 

dp 

cS 

by 

\b 

c 

d 

a) 

\S 

7 

a 

p) 
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cy 
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aP  / 

Of  course  we  can  use  any  n distinct  symbols  in  an  nxn  Latin  square;  all  that 
matters  is  that  no  symbol  occurs  twice  in  any  row  or  twice  in  any  column.  So 
we  might  as  well  use  numeric  values  {0, 1, , n—  1}  for  the  entries.  Furthermore 
we’ll  just  refer  to  “latin  squares”  (with  a lowercase  “1”),  instead  of  categorizing 
a square  as  either  Latin  or  Greek,  because  orthogonality  is  a symmetric  relation. 

Euler’s  assertion  that  two  6x6  latin  squares  cannot  be  orthogonal  was 
verified  by  Thomas  Clausen,  who  reduced  the  problem  to  an  examination  of  17 
fundamentally  different  cases,  according  to  a letter  from  H.  C.  Schumacher  to 
C.  F.  Gauss  dated  10  August  1842.  But  Clausen  did  not  publish  his  analysis. 
The  first  demonstration  to  appear  in  print  was  by  G.  Tarry  [Comptes  rendus, 
Association  frangaise  pour  i’avancement  des  sciences  29,  part  2 (1901),  170-203], 
who  discovered  in  his  own  way  that  6x6  latin  squares  can  be  classified  into  17 
different  families.  (In  Section  7.2.3  we  shall  study  how  to  decompose  a problem 
into  combinatorially  inequivalent  classes  of  arrangements.) 

Euler’s  conjecture  about  the  remaining  cases  n = 10,  n = 14,  . . . was 
"‘proved”  three  times,  by  J.  Petersen  [Annuaire  des  mathematiciens  (Paris:  1902), 
413-427],  by  P.  Wernicke  [Jahresbericht  der  Deutschen  Math.-Vereinigung  19 
(1910),  264-267],  and  by  H.  F.  MacNeish  [Annals  of  Math.  (2)  23  (1922),  221- 
227],  Flaws  in  all  three  arguments  became  known,  however;  and  the  question 
was  still  unsettled  when  computers  became  available  many  years  later.  One  of 
the  very  first  combinatorial  problems  to  be  tackled  by  machine  was  therefore  the 
enigma  of  10  x 10  Grasco-Latin  squares:  Do  they  exist  or  not? 

In  1957,  L.  J.  Paige  and  C.  B.  Tompkins  programmed  the  SWAC  computer  to 
search  for  a counterexample  to  Euler’s  prediction.  They  selected  one  particular 
10  x 10  latin  square  “almost  at  random,”  and  their  program  tried  to  find  another 
square  that  would  be  orthogonal  to  it.  But  the  results  were  discouraging,  and 
they  decided  to  shut  the  machine  off  after  five  hours.  Already  the  program 
had  generated  enough  data  for  them  to  predict  that  at  least  4.8  X 1011  hours  of 
computer  time  would  be  needed  to  finish  the  run! 

Shortly  afterwards,  three  mathematicians  made  a breakthrough  that  put 
latin  squares  onto  page  one  of  major  world  newspapers:  R.  C.  Bose,  S.  S.  Shri- 
khande,  and  E.  T.  Parker  found  a remarkable  series  of  constructions  that  yield 
orthogonal  nxn  squares  for  all  n > 6 [Proc.  Nat.  Acad.  Sci.  45  (1959),  734-737, 
859-862;  Canadian  J.  Math.  12  (1960),  189-203].  Thus,  after  resisting  attacks 
for  180  years,  Euler’s  conjecture  turned  out  to  be  almost  entirely  wrong. 

Their  discovery  was  made  without  computer  help.  But  Parker  worked  for 
UNIVAC,  and  he  soon  brought  programming  skills  into  the  picture  by  solving  the 
problem  of  Paige  and  Tompkins  in  less  than  an  hour,  on  a UNIVAC  1206  Military 
Computer.  [See  Proc.  Symp.  Applied  Math.  10  (1960),  71-83;  15  (1963),  73-81.] 
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Let’s  take  a closer  look  at  what  the  earlier  programmers  did,  and  how 
Parker  dramatically  trumped  their  approach.  Paige  and  Tompkins  began  with 
the  following  10  x 10  square  L and  its  unknown  orthogonal  mate(s)  M: 


/0  12345678  9\ 
1832547690 
2956308471 
3709861524 
4675290813 
5094783162 
6547132908 
7418029356 
8360915247 
\9  28167403  5/ 


and  M 


^Ouuuuuuuuu 

luuuuuuuuu 

2l)L)UUUL1L1UU 

3uuuuuuuuu 

4 u 

5 U 

6 u 

7 u u 

8uuuuuuuuu. 
\9  uuuuuuuu  u / 


uuuuuuuu 

uuuuuuuu 

uuuuuuuu 

uuuuuuuu 


(6) 


We  can  assume  without  loss  of  generality  that  the  rows  of  M begin  with  0,  1, 
. . . , 9,  as  shown.  The  problem  is  to  fill  in  the  remaining  90  blank  entries,  and  the 
original  SWAC  program  proceeded  from  top  to  bottom,  left  to  right.  The  top  left 
u can’t  be  filled  with  0,  since  0 has  already  occurred  in  the  top  row  of  M.  And  it 
can’t  be  1 either,  because  the  pair  (1,1)  already  occurs  at  the  left  of  the  next  row 
in  (L,  M).  We  can,  however,  tentatively  insert  a 2.  The  digit  1 can  be  placed 
next;  and  pretty  soon  we  find  the  lexicographically  smallest  top  row  that  might 
work  for  M,  namely  0214365897.  Similarly,  the  smallest  rows  that  fit  below 
0214365897  are  1023456789  and  2108537946;  and  the  smallest  legitimate  row 
below  them  is  3540619278.  Now,  unfortunately,  the  going  gets  tougher:  There’s 
no  way  to  complete  another  row  without  coming  into  conflict  with  a previous 
choice.  So  we  change  3540619278  to  3540629178  (but  that  doesn’t  work  either), 
then  to  3540698172,  and  so  on  for  several  more  steps,  until  finally  3546109278 
can  be  followed  by  4397028651  before  we  get  stuck  again. 

In  Section  7.2.3,  we  11  study  ways  to  estimate  the  behavior  of  such  searches, 
without  actually  performing  them.  Such  estimates  tell  us  in  this  case  that 
the  Paige  Tompkins  method  essentially  traverses  an  implicit  search  tree  that 
contains  about  2.5  x 1018  nodes.  Most  of  those  nodes  belong  to  only  a few  levels 
of  the  tree;  more  than  half  of  them  deal  with  choices  on  the  right  half  of  the 
sixth  row  of  M,  after  about  50  of  the  90  blanks  have  been  tentatively  filled  in. 
A typical  node  of  the  search  tree  probably  requires  about  75  mems  (memory 
accesses)  for  processing,  to  check  validity.  Therefore  the  total  running  time  on  a 
modern  computer  would  be  roughly  the  time  needed  to  perform  2 x 1020  mems. 

Parker,  on  the  other  hand,  went  back  to  the  method  that  Euler  had  originally 
used  to  search  for  orthogonal  mates  in  1779.  First  he  found  all  of  the  so-called 
transversals  of  L,  namely  all  ways  to  choose  some  of  its  elements  so  that  there’s 
exactly  one  element,  in  each  row,  one  in  each  column,  and  one  of  each  value.  For 
example,  one  transversal  is  0859734216,  in  Euler’s  notation,  meaning  that  we 
choose  the  0 in  column  0,  the  8 in  column  1,  . . . , the  6 in  column  9.  Each  transver- 
sal that  includes  the  k in  L's  leftmost  column  represents  a legitimate  way  to  place 
the  ten  k s into  square  M.  The  task  of  finding  transversals  is,  in  fact,  rather 
easy,  and  the  given  matrix  L turns  out  to  have  exactly  808  of  them;  there  are 
respectively  (79, 96,  76, 87,  70,  84,  83,  75,  95,  63)  transversals  for  k = (0, 1, . . . , 9). 
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Once  the  transversals  are  known,  we’re  left  with  an  exact  cover  problem  of 
10  stages,  which  is  much  simpler  than  the  original  90-stage  problem  in  (6).  All  we 
need  to  do  is  cover  the  square  with  ten  transversals  that  don’t  intersect  - - because 
every  such  set  of  ten  is  equivalent  to  a latin  square  M that  is  orthogonal  to  L. 
The  particular  square  L in  (6)  has,  in  fact,  exactly  one  orthogonal  mate: 
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The  dancing  links  algorithm  finds  it,  and  proves  its  uniqueness,  after  doing  only 
about  1.7  x 108  mems  of  computation,  given  the  808  transversals.  Furthermore, 
the  cost  of  the  transversal-finding  phase,  about  5 million  mems,  is  negligible  by 
comparison.  Thus  the  original  running  time  of  2 X 1020  mems  — which  once  was 
regarded  as  the  inevitable  cost  of  solving  a problem  for  which  there  are  1090  ways 
to  fill  in  the  blanks  — has  been  reduced  by  a further  factor  of  more  than  1012(!). 

We  will  see  later  that  advances  have  also  been  made  in  methods  for  solving 
90-level  problems  like  (6).  Indeed,  (6)  turns  out  to  be  representable  directly 
as  an  exact  cover  problem  (see  exercise  17),  which  the  dancing  links  procedure 
of  Section  7.2.2. 1 solves  after  expending  only  1.3  x 1011  mems.  Even  so,  the 
Euler-Parker  approach  remains  about  a thousand  times  better  than  the  Paige- 
Tompkins  approach.  By  “factoring”  the  problem  into  two  separate  phases,  one 
for  transversal-finding  and  one  for  transversal-combining,  Euler  and  Parker  es- 
sentially reduced  the  computational  cost  from  a product,  T\T2,  to  a sum,  T\  +T2. 

The  moral  of  this  story  is  clear:  Combinatorial  problems  might  confront  us 
with  a huge  universe  of  possibilities,  yet  we  shouldn’t  give  up  too  easily.  A single 
good  idea  can  reduce  the  amount  of  computation  by  many  orders  of  magnitude. 

Puzzles  versus  the  real  world.  Many  of  the  combinatorial  problems  we  shall 
study  in  this  chapter,  like  Langford’s  problem  of  pairs  or  Ozanam’s  problem 
of  the  sixteen  honor  cards,  originated  as  amusing  puzzles  or  “brain  twisters.” 
Some  readers  might  be  put  off  by  this  emphasis  on  recreational  topics,  which 
they  regard  as  a frivolous  waste  of  time.  Shouldn’t  computers  really  be  doing 
useful  work?  And  shouldn’t  textbooks  about  computers  be  primarily  concerned 
with  significant  applications  to  industry  and/or  world  progress? 

Well,  the  author  of  the  textbook  you  are  reading  has  absolutely  no  objections 
to  useful  work  and  human  progress.  But  he  believes  strongly  that  a book  such  as 
this  should  stress  methods  of  problem  solving,  together  with  mathematical  ideas 
and  models  that  help  to  solve  many  different  problems,  rather  than  focusing  on 
the  reasons  why  those  methods  and  models  might  be  useful.  We  shall  learn  many 
beautiful  and  powerful  ways  to  attack  combinatorial  problems,  and  the  elegance 
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of  those  methods  will  be  our  main  motivation  for  studying  them.  Combinatorial 
challenges  pop  up  everywhere,  and  new  ways  to  apply  the  techniques  discussed 
in  this  chapter  arise  every  day.  So  let’s  not  limit  our  horizons  by  attempting  to 
catalog  in  advance  what  the  ideas  are  good  for. 

For  example,  it  turns  out  that  orthogonal  lat.in  squares  are  enormously 
useful,  particularly  in  the  design  of  experiments.  Already  in  1788,  Frangois 
Crette  de  Palluel  used  a 4x4  latin  square  to  study  what  happens  when  sixteen 
sheep  — four  each  from  four  different  breeds  — were  fed  four  different  diets  and 
harvested  at  four  different  times.  [Memoires  d’ Agriculture  (Paris:  Societe  Royale 
d Agriculture,  trimestre  d’ete,  1788),  17-23.]  The  latin  square  allowed  him  to  do 
this  with  16  sheep  instead  of  64;  with  a Graeco-Latin  square  he  could  also  have 
varied  another  parameter  by  trying,  say,  four  different  quantities  of  food  or  four 
different  grazing  paradigms. 

But  if  we  had  focused  our  discussion  on  his  approach  to  animal  husbandry, 
we  might  well  have  gotten  bogged  down  in  details  about  breeding,  about  root 
vegetables  versus  grains  and  the  costs  of  growing  them,  etc.  Readers  who  aren’t 
farmers  might  therefore  have  decided  to  skip  the  whole  topic,  even  though  latin 
square  designs  apply  to  a wide  range  of  studies.  (Think  about  testing  five  kinds 
of  pills,  on  patients  in  five  stages  of  some  disease,  five  age  brackets,  and  five 
weight  groups.)  Moreover,  a concentration  on  experimental  design  could  lead 
readers  to  miss  the  fact  that  latin  squares  also  have  important  applications  to 
discrete  geometry  and  error-correcting  codes  (see  exercises  18-24). 

Even  the  topic  of  Langford  pairing,  which  seems  at  first  to  be  purely  recre- 
ational, turns  out  to  have  practical  importance.  T.  Skolem  used  Langford  se- 
quences to  construct  Steiner  triple  systems,  which  we  have  applied  to  database 
queries  in  Section  6.5  [see  Math.  Scandinavica  6 (1958),  273-280];  and  in  the 
1960s,  E.  J.  Groth  of  Motorola  Corporation  applied  Langford  pairs  to  the  design 
of  circuits  for  multiplication.  Furthermore,  the  algorithms  that  efficiently  find 
Langford  pairs  and  latin  square  transversals,  such  as  the  method  of  dancing  links, 
apply  to  exact  cover  problems  in  general;  and  the  problem  of  exact  covering  has 
great  relevance  to  crucial  problems  such  as  the  equitable  apportionment  of  voter 
precincts  to  electoral  districts,  etc. 

The  applications  are  not  the  most  important  thing,  and  neither  are  the 
puzzles.  Our  primary  goal  is  rather  to  get  basic  concepts  into  our  brains,  like 
the  notions  of  latin  squares  and  exact  covering.  Such  notions  give  us  the  building 
blocks,  vocabulary,  and  insights  that  tomorrow’s  problems  will  need. 

Still,  it’s  foolish  to  discuss  problem  solving  without  actually  solving  any 
problems.  We  need  good  problems  to  stimulate  our  creative  juices,  to  light  up 
our  grey  cells  in  a more  or  less  organized  fashion,  and  to  make  the  basic  concepts 
familiar.  Mind-bending  puzzles  are  often  ideal  for  this  purpose,  because  they  can 
be  presented  in  a few  words,  needing  no  complicated  background  knowledge. 

Vaclav  Havel  once  remarked  that  the  complexities  of  life  are  vast:  “There 
is  too  much  to  know. . . We  have  to  abandon  the  arrogant  belief  that  the  world 
is  merely  a puzzle  to  be  solved,  a machine  with  instructions  for  use  waiting  to 
be  discovered,  a body  of  information  to  be  fed  into  a computer.”  He  called 
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for  an  increased  sense  of  justice  and  responsibility;  for  taste,  courage,  and 
compassion.  His  words  were  filled  with  great  wisdom.  Yet  thank  goodness  we 
do  also  have  puzzles  that  can  be  solved!  Puzzles  deserve  to  be  counted  among 
the  great  pleasures  of  life,  to  be  enjoyed  in  moderation  like  all  other  treats. 

Of  course,  Langford  and  Ozanam  directed  their  puzzles  to  human  beings,  not 
to  computers.  Aren’t  we  missing  the  point  if  we  merely  shuffle  such  questions  off 
to  machines,  to  be  solved  by  brute  force  instead  of  by  rational  thought?  George 
Brewster,  writing  to  Martin  Gardner  in  1963,  expressed  a widely  held  view  as 
follows:  “Feeding  a recreational  puzzle  into  a computer  is  no  more  than  a step 
above  dynamiting  a trout  stream.  Succumbing  to  instant  recreation.” 

Yes,  but  that  view  misses  another  important  point:  Simple  puzzles  often 
have  generalizations  that  go  beyond  human  ability  and  arouse  our  curiosity.  The 
study  of  those  generalizations  often  suggests  instructive  methods  that  apply  to 
numerous  other  problems  and  have  surprising  consequences.  Indeed,  many  of  the 
key  techniques  that  we  shall  study  were  born  when  people  were  trying  to  solve 
various  puzzles.  While  writing  this  chapter,  the  author  couldn’t  help  relishing 
the  fact  that  puzzles  are  now  more  fun  than  ever,  as  computers  get  faster  and 
faster,  because  we  keep  getting  more  powerful  dynamite  to  play  with.  [Further 
comments  appear  in  the  author’s  essay,  “Can  toy  problems  be  useful?” , originally 
written  in  1976;  see  Selected  Papers  on  Computer  Science  (1996),  169—183.] 

Puzzles  do  have  the  danger  that  they  can  be  too  elegant.  Good  puzzles  tend 
to  be  mathematically  clean  and  well-structured,  but  we  also  need  to  learn  how 
to  deal  systematically  with  the  messy,  chaotic,  organic  stuff  that  surrounds  us 
every  day.  Indeed,  some  computational  techniques  are  important  chiefly  because 
they  provide  powerful  ways  to  cope  with  such  complexities.  That  is  why,  for 
example,  the  arcane  rules  of  library-card  alphabetization  were  presented  at  the 
beginning  of  Chapter  5,  and  an  actual  elevator  system  was  discussed  at  length 
to  illustrate  simulation  techniques  in  Section  2.2.5. 

A collection  of  programs  and  data  called  the  Stanford  GraphBase  (SGB)  has 
been  prepared  so  that  experiments  with  combinatorial  algorithms  can  readily  be 
performed  on  a variety  of  real-world  examples.  SGB  includes,  for  example,  data 
about  American  highways,  and  an  input-output  model  of  the  U.S.  economy;  it 
records  the  casts  of  characters  in  Homer’s  Iliad,  Tolstoy’s  Anna  Karenina,  and 
several  other  novels;  it  encapsulates  the  structure  of  Roget’s  Thesaurus  of  1879; 
it  documents  hundreds  of  college  football  scores;  it  specifies  the  gray- value  pixels 
of  Leonardo  da  Vinci’s  Giocond a (Mona  Lisa).  And  perhaps  most  importantly, 
SGB  contains  a collection  of  five-letter  words,  which  we  shall  discuss  next. 

The  five-letter  words  of  English.  Many  of  the  examples  in  this  chapter  will 
be  based  on  the  following  list  of  five-letter  words: 

aargh,  abaca,  abaci,  aback,  abaft,  abase,  abash,  ...,  zooms,  zowie.  (8) 

(There  are  5757  words  altogether  —too  many  to  display  here;  but  those  that  are 
missing  can  readily  be  imagined.)  It’s  a personal  list,  collected  by  the  author 
between  1972  and  1992,  beginning  when  he  realized  that  such  words  would  make 
ideal  data  for  testing  many  kinds  of  combinatorial  algorithms. 
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The  list  has  intentionally  been  restricted  to  words  that  are  truly  part  of  the 
English  language,  in  the  sense  that  the  author  has  encountered  them  in  actual 
use.  Unabridged  dictionaries  contain  thousands  of  entries  that  are  much  more 
esoteric,  like  aalii,  abamp,  ...,  zymin,  and  zyxst;  words  like  that  are  useful 
primarily  to  Scrabble®  players.  But  unfamiliar  words  tend  to  spoil  the  fun 
for  anybody  who  doesn't  know  them.  Therefore,  for  twenty  years,  the  author 
systematically  took  note  of  all  words  that  seemed  right  for  the  expository  goals 
of  The  Art  of  Computer  Programming. 

Finally  it  was  necessary  to  freeze  the  collection,  in  order  to  have  a fixed 
point  for  reproducible  experiments.  The  English  language  will  always  be  evolv- 
ing, but  the  5757  SGB  words  will  therefore  always  stay  the  same  — even  though 
the  author  has  been  tempted  at  times  to  add  a few  words  that  he  didn’t  know  in 
1992,  such  as  chads,  stent,  blogs,  ditzy,  phish,  bling,  and  possibly  tetch. 
No;  noway.  The  time  for  any  changes  to  SGB  has  long  since  ended:  finis. 

The  following  Glossary  is  intended  to  contain  all  well-known  English  words 
■ . . which  may  be  used  in  good  society,  and  which  can  serve  as  Links. 

■ . . There  must  be  a stent  to  the  admission  of  spick  words. 

— LEWIS  CARROLL,  Doublets:  A Word-Puzzle  (1879) 

If  there  is  such  a verb  as  to  tetch,  Mr.  Lillywaite  tetched. 

— ROBERT  BARNARD,  Corpse  in  a Gilded  Cage  (1984) 

Proper  names  like  Knuth  are  not  considered  to  be  legitimate  words.  But 
gauss  and  hardy  are  valid,  because  “gauss”  is  a unit  of  magnetic  induction  and 
hardy  is  hardy.  In  fact,  SGB  words  are  composed  entirely  of  ordinary  lowercase 
letters,  the  list  contains  no  hyphenated  words,  contractions,  or  terms  like  blase 
that  require  an  accent.  Thus  each  word  can  also  be  regarded  as  a vector,  which 
has  five  components  in  the  range  [0 . . 26).  In  the  vector  sense,  the  words  yucca 
and  abuzz  are  furthest  apart:  The  Euclidean  distance  between  them  is 


||(24,  20, 2,  2, 0)  - (0, 1,  20,  25,  25)||2  - ^242  + 192  + 182  + 232  + 252  = ^2415. 

The  entire  Stanford  GraphBase,  including  all  of  its  programs  and  data  sets, 
is  easy  to  download  from  the  author’s  website  (see  page  iv).  And  the  list  of  all 
SGB  words  is  even  easier  to  obtain,  because  it  is  in  the  file  ‘sgb-words . txt’  at 
the  same  place.  That  file  contains  5757  lines  with  one  word  per  line,  beginning 
with  ‘which'  and  ending  with  ‘pupal’.  The  words  appear  in  a default  order, 
corresponding  to  frequency  of  usage;  for  example,  the  words  of  rank  1000,  2000, 
3000,  4000,  and  5000  are  respectively  ditch,  galls,  visas,  faker,  and  pismo. 
The  notation  ‘ WORDS  (n)’  will  be  used  in  this  chapter  to  stand  for  the  n most 
common  words,  according  to  this  ranking. 

Incidentally,  five-letter  words  include  many  plurals  of  four-letter  words , and 
it  should  be  noted  that  no  Victorian-style  censorship  was  done.  Potentially  offen- 
sive vocabulary  has  been  expurgated  from  The  Official  Scrabble®  Players  Dic- 
tionary, but  not  from  the  SGB.  One  way  to  ensure  that  semantically  unsuitable 
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terms  will  not  appear  in  a professional  paper  based  on  the  SGB  wordlist  is  to 
restrict  consideration  to  WORDS(n)  where  n is,  say,  3000. 

Exercises  26-37  below  can  be  used  as  warmups  for  initial  explorations  of  the 
SGB  words,  which  we’ll  see  in  many  different  combinatorial  contexts  throughout 
this  chapter.  For  example,  while  covering  problems  are  still  on  our  minds,  we 
might  as  well  note  that  the  four  words  ‘third  flock  began  jumps’  cover  20  of 
the  first  21  letters  of  the  alphabet.  Five  words  can,  however,  cover  at  most  24 
different  letters,  as  in  {becks,  fjord,  glitz,  nymph,  squaw}  — unless  we  resort  to 
a rare  non-SGB  word  like  waqfs  (Islamic  endowments),  which  can  be  combined 
with  {gyved,  bronx,  chimp,  klutz}  to  cover  25. 

Simple  words  from  W0RDS(400)  suffice  to  make  a word  square: 

class 

light 

agree  . (9) 

sheep 

steps 

We  need  to  go  almost  to  W0RDS(3000),  however,  to  obtain  a word  cube, 


types 

yeast 

pasta 

ester 

start 

yeast 

earth 

armor 

stove 

three 

pasta 

armor 

smoke 

token 

arena 

ester 

stove 

token 

event 

rents 

start 

three 

arena 

rents 

tease 

in  which  every  5x5  “slice”  is  a word  square.  With  a simple  extension  of  the 
basic  dancing  links  algorithm  (see  Section  7. 2. 2. 2),  one  can  show  after  performing 
about  390  billion  mems  of  computation  that  W0RDS(3000)  supports  only  three 
symmetric  word  cubes  such  as  (10);  exercise  36  reveals  the  other  two.  Surpris- 
ingly, 83,576  symmetrical  cubes  can  be  made  from  the  full  set,  W0RDS(5757). 

Graphs  from  words.  It’s  interesting  and  important  to  arrange  objects  into 
rows,  squares,  cubes,  and  other  designs;  but  in  practical  applications  another 
kind  of  combinatorial  structure  is  even  more  interesting  and  important,  namely 
a graph.  Recall  from  Section  2.3.4. 1 that  a graph  is  a set  of  points  called 
vertices,  together  with  a set  of  lines  called  edges,  which  connect  certain  pairs 
of  vertices.  Graphs  are  ubiquitous,  and  many  beautiful  graph  algorithms  have 
been  discovered,  so  graphs  will  naturally  be  the  primary  focus  of  many  sections 
in  this  chapter.  In  fact,  the  Stanford  GraphBase  is  primarily  about  graphs,  as 
its  name  implies;  and  the  SGB  words  were  collected  chiefly  because  they  can  be 
used  to  define  interesting  and  instructive  graphs. 

Lewis  Carroll  blazed  the  trail  by  inventing  a game  that  he  called  Word- 
Links  or  Doublets,  at  the  end  of  1877.  [See  Martin  Gardner,  The  Universe  in 
a Handkerchief  (1996),  Chapter  6.]  Carroll’s  idea,  which  soon  became  quite 
popular,  was  to  transform  one  word  to  another  by  changing  a letter  at  a time: 

tears  — sears  — stars  — stare  — stale  — stile  — smile.  (11) 
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The  shortest  such  transformation  is  the  shortest  path  in  a graph,  where  the 
vertices  of  the  graph  are  English  words  and  the  edges  join  pairs  of  words  that 
have  “Hamming  distance  1”  (meaning  that  they  disagree  in  just  one  place). 

When  restricted  to  SGB  words,  Carroll’s  rule  produces  a graph  of  the 
Stanford  GraphBase  whose  official  name  is  words (5757, 0,  0, 0).  Every  graph 
defined  by  SGB  has  a unique  identifier  called  its  id , and  the  graphs  that  are 
derived  in  Carrollian  fashion  from  SGB  words  are  identified  by  ids  of  the  form 
words  (n,  /,  t,  s ).  Here  n is  the  number  of  vertices;  l is  either  0 or  a list  of  weights, 
used  to  emphasize  various  kinds  of  vocabulary;  t is  a threshold  so  that  low-weight 
words  can  be  disallowed;  and  s is  the  seed  for  any  pseudorandom  numbers  that 
might  be  needed  to  break  ties  between  words  of  equal  weight.  The  full  details 
needn’t  concern  us,  but  a few  examples  will  give  the  general  idea: 

• words  (n,  0,0,0)  is  precisely  the  graph  that  arises  when  Carroll’s  idea  is 
applied  to  WORDS(n),  for  1 < n < 5757. 

• words (1000,  {0,  0, 0, 0, 0,  0,  0, 0, 0},  0,  s ) contains  1000  randomly  chosen  SGB 
words,  usually  different  for  different  values  of  s. 

• words(766,  {0,0, 0,0, 0,0, 0,1,0},  1,0)  contains  all  of  the  five-letter  words 
that  appear  in  the  author’s  books  about  and  METRFONT. 

There  are  only  766  words  in  the  latter  graph,  so  we  can’t  form  very  many  long 
paths  like  (n),  although 

basic  — basis  — bases  — based 

baked  — naked  — named  — names  — games  (12) 
is  one  noteworthy  example. 

Of  course  there  are  many  other  ways  to  define  the  edges  of  a graph  when  the 
vertices  represent  five-letter  words.  We  could,  for  example,  require  the  Euclidean 
distance  to  be  small,  instead  of  the  Hamming  distance.  Or  we  could  declare  two 
words  to  be  adjacent  whenever  they  share  a subword  of  length  four;  that  strategy 
would  substantially  enrich  the  graph,  making  it  possible  for  chaos  to  yield  peace, 
even  when  confined  to  the  766  words  that  are  related  to  T£X: 

chaos  chose  whose  — whole  — holes  — hopes  — copes  — scope 

score  store  stare  — spare  — space  — paces  — peace.  (13) 

(In  this  rule  we  remove  a letter,  then  insert  another,  possibly  in  a different,  place.) 
Or  we  might  choose  a totally  different  strategy,  like  putting  an  edge  between  word 
vectors  <11020304(15  and  bib2b3b4b5  if  and  only  if  their  dot  product  a4bi  + a2b2  + 
a3&3  + a464  + o565  is  a multiple  of  some  parameter  m.  Graph  algorithms  thrive 
on  different  kinds  of  data. 

SGB  words  lead  also  to  an  interesting  family  of  directed  graphs,  if  we  write 
0102030405  -A  M2M465  when  {02,03,04,05}  C {b1,b2,b3,b4,b5}  as  multisets. 
(Remove  the  first  letter,  insert  another,  and  rearrange.)  With  this  rule  we  can, 
for  example,  transform  words  to  graph  via  a shortest  oriented  path  of  length  six: 

words  -A  dross  -A  soars  -A  orcas  -A  crash  -A  sharp  -A  graph.  (14) 
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Theory  is  the  first  term  in  the  Taylor  series  of  practice. 

— THOMAS  M.  COVER  (1992) 

The  number  of  systems  of  terminology  presently  used  in  graph  theory 
is  equal,  to  a close  approximation,  to  the  number  of  graph  theorists. 

— RICHARD  P.  STANLEY  (1986) 

Graph  theory:  The  basics.  A graph  G consists  of  a set  V of  vertices  together 
with  a set  E of  edges,  which  are  pairs  of  distinct  vertices.  We  will  assume  that 

V and  E are  finite  sets  unless  otherwise  specified.  We  write  u — v if  u and  v 
are  vertices  with  {u,u}  £ E,  and  u-f-v  if  u and  v are  vertices  with  {u,v}  £ E. 
Vertices  with  u — v are  called  “neighbors,”  and  they’re  also  said  to  be  “adjacent” 
in  G.  One  consequence  of  this  definition  is  that  we  have  u — v if  and  only  if 
v — u.  Another  consequence  is  that  v—f—v , for  all  v £ V ; that  is,  no  vertex  is 
adjacent  to  itself.  (We  shall,  however,  discuss  multigraphs  below,  in  which  loops 
from  a vertex  to  itself  are  permitted.) 

The  graph  G'  — (V',  E')  is  a subgraph  of  G = (V,  E)  if  V'  C V and  E'  C E. 
It’s  a spanning  subgraph  of  G if,  in  fact,  V'  = V.  And  it’s  an  induced  subgraph 
of  G if  E'  has  as  many  edges  as  possible,  when  V'  is  a given  subset  of  the 
vertices.  In  other  words,  when  V'  C V the  subgraph  of  G = (V,E)  induced  by 

V is  G'  = (V',E'),  where 

E'  = { {u,  u}  | u £ V' , v £ V',  and  {u,  v}  £ E }.  (15) 

This  subgraph  G'  is  denoted  by  G \ V1 , and  often  called  “G  restricted  to  V'.”  In 
the  common  case  where  V'  = V\{u},  we  write  simply  G\v  (“G  minus  vertex  v”) 
as  an  abbreviation  for  G | (V  \ {n}).  The  similar  notation  G \ e is  used  when 
e £ E to  denote  the  subgraph  G'  = (V,  E \ {e}),  obtained  by  removing  an  edge 
instead  of  a vertex.  Notice  that  all  of  the  SGB  graphs  known  as  words  (n,  l,  t,  s), 
described  earlier,  are  induced  subgraphs  of  the  main  graph  words (5757, 0,0,0); 
only  the  vocabulary  changes  in  those  graphs,  not  the  rule  for  adjacency. 

A graph  with  n vertices  and  e edges  is  said  to  have  order  n and  size  e.  The 
simplest  and  most  important  graphs  of  order  n are  the  complete  graph  Kn , the 
path  Pn,  and  the  cycle  Cn.  Suppose  the  vertices  are  V = {1,2, . . . , n}.  Then 

• Kn  has  (2)  = \n(n  - 1)  edges  u — v for  1 < u < v < n;  every  n-vertex 
graph  is  a spanning  subgraph  of  Kn. 

• Pn  has  n — 1 edges  v — (u+l)  for  1 < v < n,  when  n > 1;  it  is  a path 
of  length  n—  1 from  1 to  n. 

• Cn  has  n edges  v — ((n  mod  n)+l)  for  1 < v < n,  when  n > 1;  it  is  a graph 
only  when  n > 3 (but  C 1 and  G2  are  multigraphs). 

We  could  actually  have  defined  Kn,  Pn,  and  Cn  on  the  vertices  {0, 1, . . . ,n— 1}, 
or  on  any  n-element  set  V instead  of  {1, 2, ... , n},  because  two  graphs  that  differ 
only  in  the  names  of  their  vertices  but  not  in  the  structure  of  their  edges  are 
combinatorially  equivalent. 

Formally,  we  say  that  graphs  G = (V,E)  and  G'  = (V',E')  are  isomorphic 
if  there  is  a one-to-one  correspondence  <p  from  V to  V'  such  that  u — v in  G if 
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and  only  if  tp{u)  — ip{v)  in  G' . The  notation  G = G'  is  often  used  to  indicate 
that  G and  G’  are  isomorphic;  but  we  shall  often  be  less  precise,  by  treating 
isomorphic  graphs  as  if  they  were  equal,  and  by  occasionally  writing  G = C 
even  when  the  vertex  sets  of  G and  G’  aren’t  strictly  identical. 

Small  graphs  can  be  defined  by  simply  drawing  a diagram,  in  which  the 
vertices  are  small  circles  and  the  edges  are  lines  between  them.  Figure  2 illus- 
trates several  important  examples,  whose  properties  we  will  be  studying  later. 
The  Petersen  graph  in  Figure  2(e)  is  named  after  Julius  Petersen,  an  early 
graph  theorist  who  used  it  to  disprove  a plausible  conjecture  [LTnterme'diaire 
des  Mathematicians  5 (1898),  225-227];  it  is,  in  fact,  a remarkable  configuration 
that  serves  as  a counterexample  to  many  optimistic  predictions  about  what  might 
be  true  for  graphs  in  general.  The  Chvatal  graph,  Figure  2(f),  was  introduced 
by  Vaclav  Chvatal  in  J.  Combinatorial  Theory  9 (1970),  93-94. 


(a)  (b)  (c)  (d)  (e)  (f) 


Fig.  2.  Six  example  graphs,  which  have  respectively  (5,5,5,8,10,12)  vertices  and 
(4,5,10,12,15,24)  edges. 


The  lines  of  a graph  diagram  are  allowed  to  cross  each  other  at  points  that 
aren’t  vertices.  For  example,  the  center  point  of  Fig.  2(f)  is  not  a vertex  of 
Chvatal’s  graph.  A graph  is  called  planar  if  there’s  a way  to  draw  it  without 
any  crossings.  Clearly  Pn  and  Cn  are  always  planar;  Fig.  2(d)  shows  that  the 
3-cube  is  also  planar.  But  K5  has  too  many  edges  to  be  planar  (see  exercise  46). 

The  degree  of  a vertex  is  the  number  of  neighbors  that  it  has.  If  all  vertices 
have  the  same  degree,  the  graph  is  said  to  be  regular.  In  Fig.  2,  for  example,  P5 
is  irregular  because  it  has  two  vertices  of  degree  1 and  three  of  degree  2.  But 
the  other  five  graphs  are  regular,  of  degrees  (2,4,3,  3,4)  respectively.  A regular 
graph  of  degree  3 is  often  called  “cubic”  or  “trivalent.” 

There  are  many  ways  to  draw  a given  graph,  some  of  which  are  much  more 
perspicuous  than  others.  For  example,  each  of  the  six  diagrams 


is  isomorphic  to  the  3-cube,  Fig.  2(d).  The  layout  of  Chvatal’s  graph  that  appears 
in  Fig.  2(f)  was  discovered  by  Adrian  Bondy  many  years  after  Chvatal’s  paper 
was  published,  thereby  revealing  unexpected  symmetries. 

The  symmetries  of  a graph,  also  known  as  its  automorphisms,  are  the  permu- 
tations of  its  vertices  that  preserve  adjacency.  In  other  words,  the  permutation 
tp  is  an  automorphism  of  G if  we  have  pin)  — p(e)  whenever  u — v in  G.  A 
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well-chosen  drawing  like  Fig.  2(f)  can  reveal  underlying  symmetry,  but  a single 
diagram  isn’t  always  able  to  display  all  the  symmetries  that  exist.  For  example, 
the  3-cube  has  48  automorphisms,  and  the  Petersen  graph  has  120.  We’ll  study 
algorithms  that  deal  with  isomorphisms  and  automorphisms  in  Section  7.2.3. 
Symmetries  can  often  be  exploited  to  avoid  unnecessary  computations,  mak- 
ing an  algorithm  almost  k times  faster  when  it  operates  on  a graph  that  has 
k automorphisms. 

Graphs  that  have  evolved  in  the  real  world  tend  to  be  rather  different  from 
the  mathematically  pristine  graphs  of  Figure  2.  For  example,  here’s  a familiar 
graph  that  has  no  symmetry  whatsoever,  although  it  does  have  the  virtue  of 
being  planar: 


It  represents  the  contiguous  United  States  of  America,  and  we’ll  be  using  it  later 
in  several  examples.  The  49  vertices  of  this  diagram  have  been  labeled  with  two- 
letter  postal  codes  for  convenience,  instead  of  being  reduced  to  empty  circles. 

Paths  and  cycles.  A spanning  path  of  a graph  is  called  a Hamiltonian  path, 
and  a spanning  cycle  is  called  a Hamiltonian  cycle,  because  W.  R.  Hamilton 
invented  a puzzle  in  1856  whose  goal  was  to  find  such  paths  and  cycles  on  the 
edges  of  a dodecahedron.  T.  P.  Kirkman  had  independently  studied  the  problem 
for  polyhedra  in  general,  in  Philosophical  Transactions  146  (1856),  413-418;  148 
(1858),  145-161.  [See  Graph  Theory  1736-1936  by  N.  L.  Biggs,  E.  K.  Lloyd,  and 
R.  J.  Wilson  (1998),  Chapter  2.]  The  task  of  finding  a spanning  path  or  cycle  is, 
however,  much  older  — indeed,  we  can  legitimately  consider  it  to  be  the  oldest 
combinatorial  problem  of  all,  because  paths  and  tours  of  a knight  on  a chessboard 
have  a continuous  history  going  back  to  ninth-century  India  (see  Section  7.3.3). 
A graph  is  called  Hamiltonian  if  it  has  a Hamiltonian  cycle.  (The  Petersen 
graph,  incidentally,  is  the  smallest  3-regular  graph  that  is  neither  planar  nor 
Hamiltonian;  see  C.  de  Polignac,  Bull.  Soc.  Math,  de  France  27  (1899),  142-145.) 

The  girth  of  a graph  is  the  length  of  its  shortest  cycle;  the  girth  is  infinite  if 
the  graph  is  acyclic  (containing  no  cycles).  For  example,  the  six  graphs  of  Fig.  2 
have  girths  (oo,  5, 3,4, 5, 4),  respectively.  It’s  not  difficult  to  prove  that  a graph 
of  minimum  degree  k and  girth  5 must  have  at  least  k2  + 1 vertices.  Further 
analysis  shows  in  fact  that  this  minimum  value  is  achievable  only  if  k = 2 (C5), 
k = 3 (Petersen),  k = 7,  or  perhaps  k = 57.  (See  exercises  63  and  65.) 


16 


COMBINATORIAL  SEARCHING 


7 


The  distance  d(u,v ) between  two  vertices  u and  v is  the  minimum  length 
of  a path  from  u to  v in  the  graph;  it  is  infinite  if  there’s  no  such  path.  Clearly 
d(v,v)  — 0,  and  d(u,v)  = d(v,  u).  We  also  have  the  triangle  inequality 

d(u,v)  + d(v,w)  > d(u,w).  (jg) 

For  if  d(u,  v)  = p and  d(v,  w)  = q and  p < oc  and  q < oo,  there  are  paths 


u~u0  Ul  up  = v and  v = v0  — Vl vq=w,  (19) 

and  we  can  find  the  least  subscript  r such  that  ur  = vs  for  some  s.  Then 

U°  Ul  Ur~  1 Vs Us+1 Vq  (20) 

is  a path  of  length  < p + q from  u to  w. 


The  diameter  of  a graph  is  the  maximum  of  d(u,  v),  over  all  vertices  u and  v. 
le  graph  is  connected  if  its  diameter  is  finite.  The  vertices  of  a graph  can  always 
be  partitioned  into  connected  components , where  two  vertices  u and  v belong  to 
the  same  component  if  and  only  if  d(u,  v)  < oo. 

In  the  graph  words (5757,  0,  0,  0),  for  example,  we  have  d(tears.  smile)  = 6, 
because  (n)  is  a shortest  path  from  tears  to  smile.  Also  d(tears, happy)  = 6 
and  d(smile, happy)  = 10,  and  d(world, court)  = 6.  But  d(world.  happy)  = 
oo;  the  graph  isn’t  connected.  In  fact,  it  contains  671  words  like  aloof,  which 
have  no  neighbors  and  form  connected  components  of  order  1 all  by  themselves. 
Word  pairs  such  as  alpha  — aloha,  droid  — druid,  and  opium  — odium 
account  for  103  further  components  of  order  2.  Some  components  of  order  3, 
like  chain  chair  choir,  are  paths;  others,  like  {getup,  letup,  setup}, 
are  cycles.  A few  more  small  components  are  also  present,  like  the  curious  path 

login  logic  — yogic  — yogis  — yogas  — togas , (21) 

whose  words  have  no  other  neighbors.  But  the  vast  majority  of  all  five-letter 
words  belong  to  a giant  component  of  order  4493.  If  you  can  go  two  steps  away 
from  a given  word,  the  odds  are  better  than  15  to  1 that  your  word  is  connected 
to  everything  in  the  giant  component. 

Similarly,  the  graph  words  (n,  0, 0, 0)  has  a giant  component  of  order  (3825. 
2986,2056, 1198,224)  when  n = (5000,4000,3000,2000, 1000),  respectively.  But 
if  n is  small,  there  aren’t  enough  edges  to  provide  much  connectivity.  For  exam- 
ple, words  (500, 0, 0, 0)  has  327  different  components,  none  of  order  15  or  more. 

The  concept  of  distance  can  be  generalized  to  d(v i,v2, . . . , tife)  for  any  value 
of  k,  meaning  the  minimum  number  of  edges  in  a connected  subgraph  that 
contains  the  vertices  {vi,v2, . . .,vk}.  For  example,  d(blood,  sweat,  tears)  turns 
out  be  15,  because  the  subgraph 

blood  brood  broad  — bread  — tread  — treed  — tweed 

tears  teams  trams  — trims  — tries  — trees  tweet  (22) 

sweat  — sweet 

has  15  edges,  and  there’s  no  suitable  14-edge  subgraph. 
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We  noted  in  Section  2.3.4. 1 that  a connected  graph  with  fewest  edges  is 
called  a free  tree.  A subgraph  that  corresponds  to  the  generalized  distance 
d(y i, . . . ,Vk)  will  always  be  a free  tree.  It  is  misleadingly  called  a Steiner  tree, 
because  Jacob  Steiner  once  mentioned  the  case  k = 3 for  points  {h'1,^2,^3}  in 
the  Euclidean  plane  [Crelle  13  (1835),  362-363].  Franz  Heinen  had  solved  that 
problem  in  Uber  Systeme  von  Kraften  (1834);  Gauss  extended  the  analysis  to 
k — 4 in  a letter  to  Schumacher  (21  March  1836). 

Coloring.  A graph  is  said  to  be  k-partite  or  k-colorable  if  its  vertices  can  be 
partitioned  into  k or  fewer  parts,  with  the  endpoints  of  each  edge  belonging  to 
different  parts  — or  equivalently,  if  there’s  a way  to  paint  its  vertices  with  at  most 
k different  colors,  never  assigning  the  same  color  to  two  adjacent  vertices.  The  fa- 
mous Four  Color  Theorem,  conjectured  by  F.  Guthrie  in  1852  and  finally  proved 
with  massive  computer  aid  by  K.  Appel,  W.  Haken,  and  J.  Koch  [Illinois  J.  Math. 
21  (1977),  429-567],  states  that  every  planar  graph  is  4-colorable.  No  simple 
proof  is  known,  but  special  cases  like  (17)  can  be  colored  at  sight  (see  exercise  45); 
and  0(n2)  steps  suffice  to  4-color  a planar  graph  in  general  [N.  Robertson,  D.  P. 
Sanders,  P.  Seymour,  and  R.  Thomas,  STOC  28  (1996),  571-575]. 

The  case  of  2-colorable  graphs  is  especially  important  in  practice.  A 2- 
partite  graph  is  generally  called  bipartite,  or  simply  a “bigraph”;  every  edge  of 
such  a graph  has  one  endpoint  in  each  part. 


Theorem  B.  A graph  is  bipartite  if  and  only  if  it  contains  no  cycle  of  odd  length. 


Proof.  [See  D.  Konig,  Math.  Annalen  77  (1916),  453-454.]  Every  subgraph  of 
a fc-partite  graph  is  A>partite.  Therefore  the  cycle  Cn  can  be  a subgraph  of  a 
bipartite  graph  only  if  Cn  itself  is  a bigraph,  in  which  case  n must  be  even. 

Conversely,  if  a graph  contains  no  odd  cycles  we  can  color  its  vertices  with 
the  two  colors  {0,1}  by  carrying  out  the  following  procedure:  Begin  with  all 
vertices  uncolored.  If  all  neighbors  of  colored  vertices  are  already  colored,  choose 
an  uncolored  vertex  w , and  color  it  0.  Otherwise  choose  a colored  vertex  u that 
has  an  uncolored  neighbor  v\  assign  to  v the  opposite  color.  Exercise  48  proves 
that  a valid  2-coloring  is  eventually  obtained.  | 


The  complete  bipartite  graph  Kmtn  is  the  largest  bipartite  graph  whose 
vertices  have  two  parts  of  sizes  m and  n.  We  can  define  it  on  the  vertex  set 
{1,  2, . . . , m + n}  by  saying  that  u — v whenever  l<u<m<v<m  + n. 
In  other  words,  Km^n  has  mn  edges,  one  for  each  way  to  choose  one  vertex  in 
the  first  part  and  another  in  the  second  part.  Similarly,  the  complete  k-partite 
graph  Kn 1}...,nk  has  N = ni  + ■ ■ ■ + nk  vertices  partitioned  into  parts  of  sizes 
{ni, . . . , n*,},  and  it  has  edges  between  any  two  vertices  that  don’t  belong  to  the 
same  part.  Here  are  some  examples  when  N = 6: 


AA,5 

Notice  that  K\  n 


is  a free  tree;  it  is  popularly  called  the  star  graph  of  order  n + 1. 
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From  now  on  say  "digraph"  instead  of  "directed  graph." 

It  is  clear  and  short  and  it  will  catch  on. 

— GEORGE  POLYA,  letter  to  Frank  Harary  (c.  1954) 

Directed  graphs.  In  Section  2. 3. 4. 2 we  defined  directed  graphs  (or  digraphs ), 
which  are  very  much  like  graphs  except  that  they  have  arcs  instead  of  edges. 
An  arc  u — > v runs  from  one  vertex  to  another,  while  an  edge  u — v joins 
two  vertices  without  distinguishing  between  them.  Furthermore,  digraphs  are 
allowed  to  have  self-loops  v — > v from  a vertex  to  itself,  and  more  than  one  arc 
u — >v  may  be  present  between  the  same  vertices  u and  v. 

Formally,  a digraph  D = (V,  A)  of  order  n and  size  m is  a set  V of  n vertices 
and  a multiset  A of  m ordered  pairs  (u,  v ),  where  u G V and  v e V.  The  ordered 
pairs  are  called  arcs,  and  we  write  u — >v  when  (u,  v)  e A.  The  digraph  is  called 
simple  if  A is  actually  a set  instead  of  a general  multiset—  namely,  if  there’s  at 
most  one  arc  (u,v)  for  all  u and  v.  Each  arc  (u,v)  has  an  initial  vertex  u and  a 
final  vertex  v,  also  called  its  “tip.”  Each  vertex  has  an  out-degree  d+(v),  the  num- 
ber of  arcs  for  which  v is  the  initial  vertex,  and  an  in-degree  d~(v),  the  number  of 
arcs  for  which  v is  the  tip.  A vertex  with  in-degree  0 is  called  a “source” ; a vertex 
with  out-degree  0 is  called  a “sink.”  Notice  that  J2vevd+(V)  = D Vevd~(v)i 
because  both  sums  are  equal  to  to,  the  total  number  of  arcs. 

Most  of  the  notions  we’ve  defined  for  graphs  carry  over  to  digraphs  in  a nat- 
ural way,  if  we  just  insert  the  word  “directed”  or  “oriented”  (or  the  syllable  “di” ) 
when  it’s  necessary  to  distinguish  between  edges  and  arcs.  For  example,  digraphs 
have  subdigraphs,  which  can  be  spanning  or  induced  or  neither.  An  isomorphism 
between  digraphs  D — (V,  A)  and  D'  = ( V',  A')  is  a one-to-one  correspondence  ip 
from  V to  V'  for  which  the  number  of  arcs  u — > v in  D equals  the  number  of 
arcs  ip(u) — >tp(v)  in  D' , for  all  u,v  £ V. 

Diagrams  for  digraphs  use  arrows  between  the  vertices,  instead  of  unadorned 
lines.  The  simplest  and  most  important  digraphs  of  order  n are  directed  variants 
of  the  graphs  Kn,  Pn,  and  Cn,  namely  the  transitive  tournament  K~,  the  oriented 
path  Pn,  and  the  oriented  cycle  Cn-  They  can  be  schematically  indicated  by  the 
following  diagrams  for  n = 5: 


There’s  also  the  complete  digraph  Jn,  which  is  the  largest  simple  digraph  on  n 
vertices;  it  has  n2  arcs  u — >v,  one  for  each  choice  of  u and  v. 

Figure  3 shows  a more  elaborate  diagram,  for  a digraph  of  order  17  that 
we  might  call  “expressly  oriented”:  It  is  the  directed  graph  described  by  Her- 
cule  Poirot  in  Agatha  Christie’s  novel  Murder  on  the  Orient  Express  (1934). 
Vertices  correspond  to  the  berths  of  the  Stamboul -Calais  coach  in  that  story, 
and  an  arc  u — > v means  that  the  occupant  of  berth  u has  corroborated  the 
alibi  of  the  person  in  berth  v.  This  example  has  six  connected  components, 
namely  {0,1,3,6,8,12,13,14,15,16},  {2},  {4,5},  {7},  {9},  and  {10,11},  because 
connectivity  in  a digraph  is  determined  by  treating  arcs  as  edges. 
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LEGEND 

0:  Pierre  Michel,  the  French  conductor 
1:  Hercule  Poirot,  the  Belgian  detective 


2:  Samuel  Edward  Ratchett,  the  deceased  American 
3:  Caroline  Martha  Hubbard,  the  American  matron 
4:  Edward  Henry  Masterman,  the  British  valet 
5:  Antonio  Foscarelli,  the  Italian  automobile  salesman 
6:  Hector  MacQueen,  the  American  secretary 
7:  Harvey  Harris,  the  Englishman  who  didn’t  show  up 
8:  Hildegarde  Schmidt,  the  German  lady’s  maid 
9:  (vacancy) 

10:  Greta  Ohlsson,  the  Swedish  nurse 
11:  Mary  Hermione  Debenham,  the  English  governess 
12:  Helena  Maria  Andrenyi,  the  beautiful  countess 
13:  Rudolph  Andrenyi,  the  Hungarian  count/diplomat 
14:  Natalia  DragomirofF,  the  Russian  princess  dowager 
15:  Colonel  Arbuthnot,  the  British  officer  from  India 
16:  Cyrus  Bettman  Hardman,  the  American  detective 


Fig.  3.  A digraph  of  order  17  and  size  18,  devised  by  Agatha  Christie. 


Two  arcs  are  consecutive  if  the  tip  of  the  first  is  the  initial  vertex  of  the 
second.  A sequence  of  consecutive  arcs  (ai,  02, . . . , ak)  is  called  a walk  of  length  k ; 
it  can  be  symbolized  by  showing  the  vertices  as  well  as  the  arcs: 

ai.  0.2  at  / v 

vo  — l Vi  — > v2  ■■■  vk-i  — > vk.  (25) 

In  a simple  digraph  it’s  sufficient  merely  to  specify  the  vertices;  for  example, 
1 — > 0 — ► 8 — > 14  — > 8 — > 3 is  a walk  in  Fig.  3.  The  walk  in  (25)  is  an  oriented 
path  when  the  vertices  {vo,wi,  • • • , u^}  are  distinct;  it’s  an  oriented  cycle  when 
they  are  distinct  except  that  vk  = vq. 

In  a digraph,  the  directed  distance  d(u,  v)  is  the  number  of  arcs  in  the  short- 
est oriented  path  from  u to  v,  which  is  also  the  length  of  the  shortest  walk  from 
u to  v.  It  may  differ  from  d(v,u);  but  the  triangle  inequality  (18)  remains  valid. 

Every  graph  can  be  regarded  as  a digraph,  because  an  edge  u — v is 
essentially  equivalent  to  a matched  pair  of  arcs,  u — >v  and  v — >u.  The  digraph 
obtained  in  this  way  retains  all  the  properties  of  the  original  graph;  for  example, 
the  degree  of  each  vertex  in  the  graph  becomes  its  out-degree  in  the  digraph, 
and  also  its  in-degree  in  the  digraph.  Furthermore,  distances  remain  the  same. 

A multigraph  (V,  E)  is  like  a graph  except  that  its  edges  E can  be  any 
multiset  of  pairs  {u,w};  edges  v — v that  loop  from  a vertex  to  itself,  which 
correspond  to  “multipairs”  {u,u},  are  also  permitted.  For  example, 

cXD— ®=dD>  (26) 

is  a multigraph  of  order  3 with  six  edges,  {1,1},  {1,2},  {2,3},  {2,3},  {3,  3},  and 
{3,3}.  The  vertex  degrees  in  this  example  are  d(  1)  = d( 2)  = 3 and  d( 3)  = 6, 
because  each  loop  contributes  2 to  the  degree  of  its  vertex.  An  edge  loop  v — v 
becomes  two  arc  loops  v — >v  when  a rnultigraph  is  regarded  as  a digraph. 

Representation  of  graphs  and  digraphs.  Any  digraph,  and  therefore  any 
graph  or  multigraph,  is  completely  described  by  its  adjacency  matrix  A = (auv), 
which  has  n rows  and  n columns  when  there  are  n vertices.  Each  entry  auv  of 
this  matrix  specifies  the  number  of  arcs  from  u to  v.  For  example,  the  adjacency 
matrices  for  Kj.  P3,  C3,  J3,  and  (26)  are  respectively 
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The  powerful  mathematical  tools  of  matrix  theory  make  it  possible  to  prove 
many  nontrivial  results  about  graphs  by  studying  their  adjacency  matrices; 
exercise  65  provides  a particularly  striking  example  of  what  can  be  done.  One 
of  the  main  reasons  is  that  matrix  multiplication  has  a simple  interpretation  in 
the  context  of  digraphs.  Consider  the  square  of  A,  where  the  element  in  row  u 
and  column  v is 


( A2)uv 


^ ' ®UW@WV1 

w£V 


(28) 


by  definition.  Since  auw  is  the  number  of  arcs  from  u to  w,  we  see  that  auwawv 
is  the  number  of  walks  of  the  form  u y w } v.  Therefore  ( A2)uv  is  the  total 
number  of  walks  of  length  2 from  u to  v.  Similarly,  the  entries  of  Ak  tell  us  the 
total  number  of  walks  of  length  k between  any  ordered  pair  of  vertices,  for  all 
fc  > 0.  For  example,  the  matrix  A in  (27)  satisfies 


(29) 


there  are  12  walks  of  length  3 from  the  vertex  1 of  the  multigraph  (26)  to  vertex  3, 
and  18  such  walks  from  vertex  2 to  itself. 

Reordering  of  the  vertices  changes  an  adjacency  matrix  from  A to  P~AP, 
where  P is  a permutation  matrix  (a  0-1  matrix  with  exactly  one  1 in  each  row 
and  column),  and  P~  = PT  is  the  matrix  for  the  inverse  permutation.  Thus 


( 210  \ 

/ 201  \ 

/012\ 

/021X 

/ 402  \ 

/ 420  \ 

1 no^  / ’ 

042  , 

120  ’ 

240  , 

021  , 

and 

( 201  ) 

V024/ 

V 120/ 

V204/ 

V 102/ 

V2IO/ 

V012/ 

are  all  adjacency  matrices  for  (26),  and  there  are  no  others. 

There  are  more  than  2n("“1)/2/n!  graphs  of  order  n,  when  n > 1,  and 
almost  all  of  them  require  fl(n2)  bits  of  data  in  their  most  economical  encoding. 
Consequently  the  best  way  to  represent  the  vast  majority  of  all  possible  graphs 
inside  a computer,  from  the  standpoint  of  memory  usage,  is  essentially  to  work 
with  their  adjacency  matrices. 

But  the  graphs  that  actually  arise  in  practical  problems  have  quite  different 
characteristics  from  graphs  that  are  chosen  at  random  from  the  set  of  all  possi- 
bilities. A real-life  graph  usually  turns  out  to  be  “sparse,”  having  say  O(nlogn) 
edges  instead  of  fl(n2),  unless  n is  rather  small,  because  fl(n2)  bits  of  data  are 
difficult  to  generate.  For  example,  suppose  the  vertices  correspond  to  people, 
and  the  edges  correspond  to  friendships.  If  we  consider  5 billion  people,  few 
of  them  will  have  more  than  10000  friends.  But  even  if  everybody  had  10000 
friends,  on  average,  the  graph  would  still  have  only  2.5  x 1013  edges,  while  almost 
all  graphs  of  order  5 billion  have  approximately  6.25  x 1018  edges. 

Thus  the  best  way  to  represent  a graph  inside  a machine  usually  turns  out 
to  be  rather  different  than  to  record  n2  values  auv  of  adjacency  matrix  elements. 
Instead,  the  algorithms  of  the  Stanford  GraphBase  were  developed  with  a data 
structure  akin  to  the  linked  representation  of  sparse  matrices  discussed  in  Section 
2.2.6,  though  somewhat  simplified.  That  approach  has  proved  to  be  not  only 
versatile  and  efficient,  but  also  easy  to  use. 
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The  SGB  representation  of  a digraph  is  a combination  of  sequential  and 
linked  allocation,  using  nodes  of  two  basic  types.  Some  nodes  represent  vertices, 
other  nodes  represent  arcs.  (There’s  also  a third  type  of  node,  which  represents 
an  entire  graph,  for  algorithms  that  deal  with  several  graphs  at  once.  But  each 
graph  needs  only  one  graph  node,  so  the  vertex  and  arc  nodes  predominate.) 

Here’s  how  it  works:  Every  SGB  digraph  of  order  n and  size  m is  built 
upon  a sequential  array  of  n vertex  nodes,  making  it  easy  to  access  vertex  k 
for  0 < k < n.  The  m arc  nodes,  by  contrast,  are  linked  together  within  a 
general  memory  pool  that  is  essentially  unstructured.  Each  vertex  node  typically 
occupies  32  bytes,  and  each  arc  node  occupies  20  (and  the  graph  node  occupies 
220);  but  the  node  sizes  can  be  modified  without  difficulty.  A few  fields  of  each 
node  have  a fixed,  definite  meaning  in  all  cases;  the  remaining  fields  can  be  used 
for  different  purposes  in  different  algorithms  or  in  different  phases  of  a single 
algorithm.  The  fixed-purpose  parts  of  a node  are  called  its  “standard  fields,” 
and  the  multipurpose  parts  are  called  its  “utility  fields.” 

Every  vertex  node  has  two  standard  fields  called  NAME  and  ARCS.  If  v is  a 
variable  that  points  to  a vertex  node,  we’ll  call  it  a vertex  variable.  Then  NAME  ( v) 
points  to  a string  of  characters  that  can  be  used  to  identify  the  corresponding 
vertex  in  human-oriented  output;  for  example,  the  49  vertices  of  graph  (17)  have 
names  like  CA,  WA,  OR,  . . . , RI.  The  other  standard  field,  ARCS(v),  is  far  more 
important  in  algorithms:  It  points  to  an  arc  node,  the  first  in  a singly  linked  list 
of  length  d+(v),  with  one  node  for  each  arc  that  emanates  from  vertex  v. 

Every  arc  node  has  two  standard  fields  called  TIP  and  NEXT;  a variable  a that 
points  to  an  arc  node  is  called  an  arc  variable.  TIP  (a)  points  to  the  vertex  node 
that  represents  the  tip  of  arc  a;  NEXT  (a)  points  to  the  arc  node  that  represents 
the  next  arc  whose  initial  vertex  agrees  with  that  of  a. 

A vertex  v with  out-degree  0 is  represented  by  letting  ARCS  (u)  = A (the  null 
pointer).  Otherwise  if,  say,  the  out-degree  is  3,  the  data  structure  contains  three 
arc  nodes  with  ARCS(v)  = cq,  NEXT(di)  = Gq,  NEXT(a2)  = 03,  and  NEXT(aa)  = 
A;  and  the  three  arcs  from  v lead  to  TIP(ai),  TlPCcq),  TIP(a3). 

Suppose,  for  example,  that  we  want  to  compute  the  out-degree  of  vertex  v, 
and  store  it  in  a utility  field  called  ODEG.  It’s  easy: 

Set  a <—  ARCS  GO  and  d <—  0. 

While  a / A,  set  d <—  d + 1 and  a <—  NEXT  (a)  . (31) 

Set  ODEG  GO  <-  d. 

When  a graph  or  a multigraph  is  considered  to  be  a digraph,  as  mentioned 
above,  its  edges  u — v are  each  equivalent  to  two  arcs,  u — ► v and  v — > u.  These 
arcs  are  called  “mates”;  and  they  occupy  two  arc  nodes,  say  a and  a',  where  a 
appears  in  the  list  of  arcs  from  u and  a'  appears  in  the  list  of  arcs  from  v.  Then 
TIP  (a)  = v and  TIP  (a')  = u.  We’ll  also  write 

MATE  (a)  = a'  and  MATE  (a')  = a,  (32) 

in  algorithms  that  want  to  move  rapidly  from  one  list  to  another.  However,  we 
usually  won’t  need  to  store  an  explicit  pointer  from  an  arc  to  its  mate,  or  to  have 
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a utility  field  called  MATE  within  each  arc  node,  because  the  necessary  link  can 
be  deduced  implicitly  when  the  data  structure  has  been  constructed  cleverly. 

The  implicit-mate  trick  works  like  this:  While  creating  each  edge  u — v 
of  an  undirected  graph  or  multigraph,  we  introduce  consecutive  arc  nodes  for 
u^v  and  v — >u.  For  example,  if  there  are  20  bytes  per  arc  node,  we’ll  reserve 
40  consecutive  bytes  for  each  new  pair.  We  can  also  make  sure  that  the  memory 
address  of  the  first  byte  is  a multiple  of  8.  Then  if  the  arc  node  a is  in  memory 
location  a,  its  mate  is  in  location 


f a + 20,  if  a mod  8 = 01 
l a — 20,  if  a mod  8 = 4 / 


a - 20  + (40  &((<*&  4) -1)). 


(33) 


Such  tricks  are  valuable  in  combinatorial  problems,  when  operations  might 
be  performed  a trillion  times,  because  every  way  to  save  3.6  nanoseconds  per 
operation  will  make  such  a computation  finish  an  hour  sooner.  But  (33)  isn’t 
directly  “portable”  from  one  implementation  to  another.  If  the  size  of  an  arc 
node  were  changed  from  20  to  24,  for  example,  we  would  have  to  change  the 
numbers  40,  20,  8,  and  4 in  (33)  to  48,  24,  16,  and  8. 

The  algorithms  in  this  book  will  make  no  assumptions  about  node  sizes. 
Instead,  we’ll  adopt  a convention  of  the  C programming  language  and  its  de- 
scendants, so  that  if  a points  to  an  arc  node,  la  + 1’  denotes  a pointer  to  the  arc 
node  that  follows  it  in  memory.  And  in  general 


L0C(N0DE(a  + k))  = L0C(N0DE(a))  + kc,  (34) 

when  there  are  c bytes  in  each  arc  node.  Similarly,  if  v is  a vertex  variable,  ‘v  + k' 
will  stand  for  the  fcth  vertex  node  following  node  v;  the  actual  memory  location 
of  that  node  will  be  v plus  k times  the  size  of  a vertex  node. 

The  standard  fields  of  a graph  node  g include  M(g0 , the  total  number  of  arcs; 
N(g),  the  total  number  of  vertices;  VERTICES  (g) , a pointer  to  the  first  vertex 
node  in  the  sequential  list  of  all  vertex  nodes;  ID  (3),  the  graph’s  identification, 
which  is  a string  like  words (5757 ,0,0,0);  and  some  other  fields  needed  for  the 
allocation  and  recycling  of  memory  when  the  graph  grows  or  shrinks,  or  for 
exporting  a graph  to  external  formats  that  interface  with  other  users  and  other 
graph-manipulation  systems.  But  we  will  rarely  need  to  refer  to  any  of  these 
graph  node  fields,  nor  will  it  be  necessary  to  give  a complete  description  of  SGB 
format  here,  since  we  shall  describe  almost  all  of  the  graph  algorithms  in  this 
chapter  by  sticking  to  an  English-language  description  at  a fairly  abstract  level 
instead  of  descending  to  the  bit  level  of  machine  programs. 

A simple  graph  algorithm.  To  illustrate  a medium-high-level  algorithm  of 
the  kind  that  will  appear  later,  let’s  convert  the  proof  of  Theorem  B into  a 
step-by-step  procedure  that  paints  the  vertices  of  a given  graph  with  two  colors 
whenever  that  graph  is  bipartite. 

Algorithm  B ( Bipartiteness  testing).  Given  a graph  represented  in  SGB  format, 
this  algorithm  either  finds  a 2-coloring  with  COLOR  (v)  € {0, 1}  in  each  vertex  v, 
or  it  terminates  unsuccessfully  when  no  valid  2-coloring  is  possible.  Here  COLOR 
is  a utility  field  in  each  vertex  node.  Another  vertex  utility  field,  LINK(u),  is  a 
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vertex  pointer  used  to  maintain  a stack  of  all  colored  vertices  whose  neighbors 
have  not  yet  been  examined.  An  auxiliary  vertex  variable  s points  to  the  top  of 
this  stack.  The  algorithm  also  uses  variables  u,  v,  w for  vertices  and  a for  arcs. 
The  vertex  nodes  are  assumed  to  be  vq  + k for  0 < k < n. 

Bl.  [Initialize.]  Set  COLORCuo  + k)  < 1 for  0 < k < n.  (Now  all  vertices  are 

uncolored.)  Then  set  w <—  Vo  + n. 

B2.  [Done?]  (At  this  point  all  vertices  > w have  been  colored,  and  so  have  the 
neighbors  of  all  colored  vertices.)  Terminate  the  algorithm  successfully  if 
w = v o-  Otherwise  set  w <—  w — 1,  the  next  lower  vertex  node. 

B3.  [Color  w if  necessary.]  If  COLOR(w)  > 0,  return  to  B2.  Otherwise  set 
C0L0R(u;)  <-  0,  LINK(w)  <-  A,  and  s <-  w. 

B4.  [Stack  =>  u.]  Set  uCs,  sf-  LINK(s),  a <—  ARCS(u).  (We  will  examine  all 
neighbors  of  the  colored  vertex  u.) 

B5.  [Done  with  ul]  If  a — A,  go  to  B8.  Otherwise  set  v TIP  (a). 

B6.  [Process  u.]  If  COLOR(v)  < 0,  set  COLOR(u)  <-  1 - COLOR(u),  LINK(u)  <-  s, 
and  s v.  Otherwise  if  CDLOR(u)  = COLOR(u),  terminate  unsuccessfully. 
B7.  [Loop  on  a.]  Set  a <—  NEXT  (a)  and  return  to  B5. 

B8.  [Stack  nonempty?]  If  s / A,  return  to  B4.  Otherwise  return  to  B2.  | 

This  algorithm  is  a variant  of  a general  graph  traversal  procedure  called  “depth- 
first  search,”  which  we  will  study  in  detail  in  Section  7.4.1.  Its  running  time  is 
0(m  + n)  when  there  are  m arcs  and  n vertices  (see  exercise  70);  therefore  it 
is  well  adapted  to  the  common  case  of  sparse  graphs.  With  small  changes  we 
can  make  it  output  an  odd-length  cycle  whenever  it  terminates  unsuccessfully, 
thereby  proving  the  impossibility  of  a 2-coloring  (see  exercise  72). 

Examples  of  graphs.  The  Stanford  GraphBase  includes  a library  of  more  than 
three  dozen  generator  routines,  capable  of  producing  a great  variety  of  graphs 
and  digraphs  for  use  in  experiments.  We’ve  already  discussed  words ; now  let’s 
look  at  a few  of  the  others,  in  order  to  get  a feeling  for  some  of  the  possibilities. 

• rw/ef  (1022,  0,  0,  0)  is  a directed  graph  with  1022  vertices  and  5075  arcs.  The 
vertices  represent  the  categories  of  words  or  concepts  that  P.  M.  Roget  and  J.  L. 
Roget  included  in  their  famous  19th-century  Thesaurus  (London:  Longmans, 
Green,  1879).  The  arcs  are  the  cross  references  between  categories,  as  found 
in  that  book.  For  example,  typical  arcs  are  water — ► moisture,  discovery — > 
truth,  preparation — learning,  vulgarity — > ugliness,  wit  — ^amusement. 

• book(" jean", 80, 0, 1,356,0,0,0)  is  a graph  with  80  vertices  and  254  edges. 
The  vertices  represent  the  characters  of  Victor  Hugo’s  Les  Miserables;  the  edges 
connect  characters  who  encounter  each  other  in  that  novel.  Typical  edges  are 
Fantine  — Javert,  Cosette  — Thenardier. 

• bLbook("  jean",  80,  0, 1,  356,  0,  0,  0)  is  a bipartite  graph  with  80+356  vertices 
and  727  edges.  The  vertices  represent  characters  or  chapters  in  Les  Miserables' 
the  edges  connect  characters  with  the  chapters  in  which  they  appear  (for  in- 
stance, Napoleon  — 2.1.8,  Marius  — 4.14.4). 
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• plane-miles (128, 0,0, 0, 1,0,0)  is  a planar  graph  with  129  vertices  and  381 
edges.  The  vertices  represent  128  cities  in  the  United  States  or  Canada  plus 
a special  vertex  INF  for  a “point  at  infinity.”  The  edges  define  the  so-called 
Delaunay  triangulation  of  those  cities,  based  on  latitude  and  longitude  in  a 
plane,  this  means  that  u v if  and  only  if  there’s  a circle  passing  through  u 
and  v that  does  not  enclose  any  other  vertex.  Edges  also  run  between  INF  and 
all  vertices  that  lie  on  the  convex  hull  of  all  city  locations.  Typical  edges  are 
Seattle,  WA  — Vancouver,  BC  — INF;  Toronto,  ON  — Rochester , NY. 

, • plane Jisa( 360,  250, 15, 0, 360, 0,  250, 0, 0, 2295000)  is  a planar  graph  that  has 
3027  vertices  and  5967  edges.  It  is  obtained  by  starting  with  a digitized  image  of 
Leonardo  da  Vinci’s  Mon  a Lisa,  having  360  rows  and  250  columns  of  pixels,  then 
rounding  the  pixel  intensities  to  16  levels  of  gray  from  0 (black)  to  15  (white). 
The  resulting  3027  rookwise  connected  regions  of  constant  brightness  are  then 
considered  to  be  neighbors  when  they  share  a pixel  boundary.  (See  Fig.  4.) 


Fig.  4.  A digital  rendition  of  Mona  Lisa,  with  a closeup  detail  (best  viewed  from  afar). 


• biJisa( 360,  250,  0, 360,  0, 250,  8192,  0)  is  a bipartite  graph  with  360  + 250  = 
610  vertices  and  40923  edges.  It’s  another  takeoff  on  Leonardo’s  famous  painting, 
this  time  linking  rows  and  columns  where  the  brightness  level  is  at  least  1/8.  For 
example,  the  edge  rl02  — cll3  occurs  right  in  the  middle  of  Lisa’s  “smile.” 

• ram<m(31,  23,  3, 1)  is  a graph  with  quite  a different  nature  from  the  SGB 

graphs  in  previous  examples.  Instead  of  being  linked  to  language,  literature 
or  other  outgrowths  of  human  culture,  it’s  a so-called  “Ramanujan  expander 
graph,  based  on  strict  mathematical  principles.  Each  of  its  (233  — 23) /2  = 6072 
vertices  has  degree  32;  hence  it  has  97152  edges.  The  vertices  correspond  to 
equivalence  classes  of  2 x 2 matrices  that  are  nonsingular  modulo  23;  a typical 
edge  is  (2, 7;  1,1)  (4, 6;  1,3).  Ramanujan  graphs  are  important  chiefly 

because  they  have  unusually  high  girth  and  low  diameter  for  their  size  and  degree. 
This  one  has  girth  4 and  diameter  4. 
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• raman( 5,  37, 4,1),  similarly,  is  a regular  graph  of  degree  6 with  50616  vertices 
and  151848  edges.  It  has  girth  10,  diameter  10,  and  happens  also  to  be  bipartite. 

• random-graph (1000,  5000,  0, 0,  0,  0, 0,  0,  0,  s)  is  a graph  with  1000  vertices, 
5000  edges,  and  seed  s.  It  “evolved”  by  starting  with  no  edges,  then  by  repeatedly 
choosing  pseudorandom  vertex  numbers  0 < u,  v < 1000  and  adding  the  edge 
u — v,  unless  u = v or  that  edge  was  already  present.  When  s = 0,  all  vertices 
belong  to  a giant  component  of  order  999,  except  for  the  isolated  vertex  908. 

• random-graph(1000,  5000,  0,  0, 1,  0,  0,  0,  0,  0)  is  a digraph  with  1000  vertices 
and  5000  arcs,  obtained  via  a similar  sort  of  evolution.  (In  fact,  each  of  its  arcs 
happens  to  be  part  also  of  random-graph(1000 , 5000,  0,  0,  0,  0,  0,  0,  0,  0).) 

• subsets(5, 1,  — 10, 0, 0, 0,  #1, 0)  is  a graph  with  ) = 462  vertices,  one  for 
every  five-element  subset  of  {0, 1, . . . , 10}.  Two  vertices  are  adjacent  whenever 
the  corresponding  subsets  are  disjoint;  thus,  the  graph  is  regular  of  degree  6, 
and  it  has  1386  edges.  We  can  consider  it  to  be  a generalization  of  the  Petersen 
graph,  which  has  subsets(2, 1,  —4,  0, 0,  0,  #1 , 0)  as  one  of  its  SGB  names. 

• subsets(5, 1,  — 10, 0, 0, 0,  #10, 0)  has  the  same  462  vertices,  but  now  they  are 
adjacent  if  the  corresponding  subsets  have  four  elements  in  common.  This  graph 
is  regular  of  degree  30,  and  it  has  6930  edges. 

• parts(30, 10,30,0)  is  another  SGB  graph  with  a mathematical  basis.  It  has 
3590  vertices,  one  for  each  partition  of  30  into  at  most  10  parts.  Two  partitions 
are  adjacent  when  one  is  obtained  by  subdividing  a part  of  the  other;  this  rule 
defines  31377  edges.  The  digraph  parts( 30, 10,  30, 1)  is  similar,  but  its  31377  arcs 
point  from  shorter  to  longer  partitions  (for  example,  13+7+7+3 — >7+7+7+6+3). 

• simplex(  10, 10, 10, 10, 10,0,0)  is  a graph  with  286  vertices  and  1320  edges. 
Its  vertices  are  the  integer  solutions  to  *i  +X2  +*3  +X4  = 10  with  Xi  > 0,  namely 
the  “compositions  of  10  into  four  nonnegative  parts” ; they  can  also  be  regarded 
as  barycentric  coordinates  for  points  inside  a tetrahedron.  The  edges,  such  as 
3 , 1 , 4 , 2 — 3 , 0 , 4 , 3 , connect  compositions  that  are  as  close  together  as  possible. 

• board( 8,  8,  0,  0,  5,  0,  0)  and  board( 8,  8,  0,  0,  —2,  0,  0)  are  graphs  on  64  vertices 
whose  168  or  280  edges  correspond  to  the  moves  of  a knight  or  bishop  in  chess. 
And  zillions  of  further  examples  are  obtainable  by  varying  the  parameters  to  the 
SGB  graph  generators.  For  example,  Fig.  5 shows  two  simple  variants  of  board 
and  simplex;  the  somewhat  arcane  rules  of  board  are  explained  in  exercise  75. 


board(d>,  9, 0,  0,  5, 0, 0) 

(Knight  moves  on  a 6 X 9 chessboard) 


simplex{  10, 8,  7, 6, 0, 0, 0) 
(A  truncated  triangular  grid) 


Fig.  5.  Samples  of  SGB  graphs  related  to  board  games. 
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Graph  algebra.  We  can  also  obtain  new  graphs  by  operating  on  the  graphs 
that  we  already  have.  For  example,  if  G = (V,E)  is  any  graph,  its  complement 
G = (V,  E ) is  obtained  by  letting 


u — v in  G 


u -£■  v and  u —f-  v in  G. 


(35) 


Thus,  non-edges  become  edges,  and  vice  versa.  Notice  that  G = G,  and  that  K~n 
has  no  edges.  The  corresponding  adjacency  matrices  A and  A satisfy 


A + A = J-I ; 


(36) 


here  J is  the  matrix  of  all  Is,  and  I is  the  identity  matrix,  so  J and  J — I are 
respectively  the  adjacency  matrices  of  Jn  and  Kn  when  G has  order  n. 

Furthermore,  every  graph  G = (V,  E)  leads  to  a Ime  graph  L(G),  whose 
vertices  are  the  edges  E;  two  edges  are  adjacent  in  L(G)  if  they  have  a common 
vertex.  Thus,  for  example,  the  line  graph  L(Kn)  has  (")  vertices,  and  it  is  regular 
of  degree  2n  - 4 when  n > 2 (see  exercise  82).  A graph  is  called  k- edge- colorable 
when  its  line  graph  is  fc-colorable. 

Given  two  graphs  G = (U,E)  and  H = (V,F),  their  union  G U H is  the 
graph  (U U V,  EuF)  obtained  by  combining  the  vertices  and  edges.  For  example, 
suppose  G and  H are  the  graphs  of  rook  and  bishop  moves  in  chess;  then  GUH 
is  the  graph  of  queen  moves,  and  its  official  SGB  name  is 

gunion (board 8, 0, 0,  -1,  0, 0),  board(8,  8, 0, 0,  -2, 0, 0),  0, 0).  (37) 

In  the  special  case  where  the  vertex  sets  U and  V are  disjoint,  the  union 
GUH  doesn’t  require  the  vertices  to  be  identified  in  any  consistent  way  for  cross- 
correlation; we  get  a diagram  for  G U H by  simply  drawing  a diagram  of  G next 
to  a diagram  of  H.  This  special  case  is  called  the  “juxtaposition”  or  direct  sum 
of  G and  H,  and  we  shall  denote  it  by  G ® H.  For  example,  it’s  easy  to  see  that 


Km®Kn  — Km,n , 

and  that  every  graph  is  the  direct  sum  of  its  connected  components. 
Equation  (38)  is  a special  case  of  the  general  formula 


Kn 


1 Kn 


f<ru  = K, 


nitn2,...,nk  , 


(38) 


(39) 


which  holds  for  complete  A-partite  graphs  whenever  k > 2.  But  (39)  fails  when 
A - 1,  because  of  a scandalous  fact:  The  standard  graph-theoretic  notation 
or  complete  graphs  is  inconsistent!  Indeed,  Km^n  denotes  a complete  2-partite 
graph,  but  Kn  does  not  denote  a complete  1-partite  graph.  Somehow  graph  the- 
orists have  been  able  to  live  with  this  anomaly  for  decades  without  going  berserk. 

Another  important  way  to  combine  disjoint  graphs  G and  H is  to  form  their 
join,  G H , which  consists  of  G © H together  with  all  edges  u — v for  u e U 
and  v e V.  [See  A.  A.  Zykov,  Mat.  Sbornik  24  (1949),  163-188,  §1.3.]  And 
if  G and  H are  disjoint  digraphs , their  directed  join  G — ► H is  similar,  but  it 
supplements  G © H by  adding  only  the  one-way  arcs  u — tv  from  U to  V 
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The  direct  sum  of  two  matrices  A and  B is  obtained  by  placing  B diagonally 
below  and  to  the  right  of  A: 


A®B 


(40) 


where  each  O in  this  example  is  a matrix  of  all  zeros,  with  the  proper  number  of 
rows  and  columns  to  make  everything  line  up  correctly.  Our  notation  G © H for 
the  direct  sum  of  graphs  is  easy  to  remember  because  the  adjacency  matrix  for 
G®H  is  precisely  the  direct  sum  of  the  respective  adjacency  matrices  A and  B for 
G and  H.  Similarly,  the  adjacency  matrices  for  G — H,  G — > H.  and  G< — H are 


A — B = 


>B  = 


B = 


A O 
J B 


, (41) 


respectively,  where  J is  an  all- Is  matrix  as  in  (36).  These  operations  are  asso- 
ciative, and  related  by  complementation: 


A©(£©C)  = (A®B)®C,  A — (B  — C)  — (A  — B)  — C;  (42) 

= A^(B^C)  = (A<—  B)^C-  (43) 

A © B - A — B,  A — B = A®B;  (44) 

A — >B  = ~A< — B,  A< — B = A — >B;  (45) 

(A®  B)  + (A  — B)  = (A — >B)  + (A< — B).  (46) 

Notice  that,  by  combining  (39)  with  (42)  and  (44),  we  have 

= Kni  Kn2  *'*“  (47) 

when  k >2.  Also 

Kn  — K 1 — Ki  — • • • — K\  and  K„  — A'i  — > Ki  — > > Ki , (48) 


with  n copies  of  K 1,  showing  that  Kn  — K 11 1 is  a complete  n-partite  graph. 

Direct  sums  and  joins  are  analogous  to  addition,  because  we  have  Km®Kn  = 
Krn+n  and  Krn  — Kn  = Km+n.  We  can  also  combine  graphs  with  algebraic 
operations  that  are  analogous  to  multiplication.  For  example,  the  Cartesian 
product  operation  forms  a graph  GaH  of  order  mn  from  a graph  G = (U,  E)  of 
order  m and  a graph  H — (V,  F)  of  order  n.  The  vertices  of  GaH  are  ordered 
pairs  (u,  v),  where  u £ U and  v G V;  the  edges  are  ( u , v)  — (u',  v)  when  u — u' 
in  G,  together  with  (u,v)  — (u,v')  when  v — v'  in  H.  In  other  words,  GaH 
is  formed  by  replacing  each  vertex  of  G by  a copy  of  f?,  and  replacing  each  edge 
of  G by  edges  between  corresponding  vertices  of  the  appropriate  copies: 
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As  usual,  the  simplest  special  cases  of  this  general  construction  turn  out  to 
be  especially  important  in  practice.  When  both  G and  H are  paths  or  cycles,  we 
get  “graph-paper  graphs,”  namely  the  m x n grid  PrnaPn,  the  mxn  cylinder 
Pm  □ Cn,  and  the  mxn  torus  Cm  □ Cn,  illustrated  here  for  to  = 3 and  n — 4: 


P3nP4 

(3x4  grid) 


?— < 

p 

— J 

L 

p 

Lx 

[ j 

[ j 

P3  nC4 

(3x4  cylinder) 


f 

f 

-4 

1 

c 

zzj 

J-L 

L4 

C3oC4 

(3x4  torus) 


Four  other  noteworthy  ways  to  define  products  of  graphs  have  also  proved  to 
be  useful.  In  each  case  the  vertices  of  the  product  graph  are  ordered  pairs  ( u , v). 

• The  direct  product  G®H,  also  called  the  “conjunction”  of  G and  H , or  their 
“categorical  product,”  has  (u,  v)  — (u\  v')  when  u — u'  in  G and  v — v'  in  H. 

• The  strong  product  G®H  combines  the  edges  of  GaH  with  those  of  G®H. 


• The  odd  product  GaH  has  (u,v)  — (u',v')  when  we  have  either  u — v! 
in  G or  v — v'  in  H,  but  not  both. 

• The  lexicographic  product  Go  H,  also  called  the  “composition”  of  G and  H , 
has  ( u,v ) (u',v')  when  u — u!  in  G,  and  (u,  v)  — (u,  v')  when  v — v'  in  H. 
All  five  of  these  operations  extend  naturally  to  products  of  k > 2 graphs  Gj  = 
(Vi,Ei),  . . . , Gk  = (Vfe,  Ek),  whose  vertices  are  the  ordered  fc-tuples  (ui, . . . ,vk) 
with  Vj  G Vj  for  1 < j < k.  For  example,  when  k = 3,  the  Cartesian  products 
Gj  □ (G2  □ G3)  and  (Gj  □ G2)  □ G3  are  isomorphic,  if  we  consider  the  compound 
vertices  , (v2,  ^3))  and  ({vi,v2),v3)  to  be  the  same  as  (i'i,  v2,  v3).  Therefore 
we  can  write  this  Cartesian  product  without  parentheses,  as  G:  dG2  dG3.  The 
most  important  example  of  a Cartesian  product  with  k factors  is  the  fc-cube, 


P2aP2D---aP2 ; (51) 

its  2k  vertices  (w1( . . . ,vk)  are  adjacent  when  their  Hamming  distance  is  1. 

In  general,  suppose  v = K,...,nfc)  and  v'  = (v[, . . . ,v'k)  are  fc- tuples  of 
vertices,  where  we  have  vj  — in  Gj  for  exactly  a of  the  subscripts  j,  and 
vj  = vj  for  exactly  b of  the  subscripts.  Then  we  have: 

• v v'  in  Gi  □ • ■ • □ Gfc  if  and  only  if  a = 1 and  b = k - 1 ; 


• v v'  in  Gi  ® • • • (gi  Gk  if  and  only  if  a = k and  b = 0; 

• v 7/  in  Gi  B ■ b Gfc  if  and  only  if  a + b ~ k and  a > 0; 

• v v'  in  G\  A • ■ • a Gfc  if  and  only  if  a is  odd. 


The  lexicographic  product  is  somewhat  different,  because  it  isn’t  commutative; 
in  Gi  o " ' o Gfc  we  have  v — v'  for  v ^ v'  if  and  only  if  v-  — ?/ , where  j is  the 
minimum  subscript  with  Vj  ^ v'y 

Exercises  91-102  explore  some  of  the  basic  properties  of  graph  products. 
See  also  the  book  Product  Graphs  by  Wilfried  Imrich  and  Sandi  Klavzar  (2000), 
which  contains  a comprehensive  introduction  to  the  general  theory,  including 
algorithms  for  factorization  of  a given  graph  into  “prime”  subgraphs. 
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^Graphical  degree  sequences.  A sequence  dpi  2 . . . dn  of  nonnegative  integers 
is  called  graphical  if  there’s  at  least  one  graph  on  vertices  {1,2,...,  n}  such  that 
vertex  k has  degree  dk . We  can  assume  that  d\  > d2  > ■ ■ ■ > dn.  Clearly  d\  < n 
in  any  such  graph;  and  the  sum  m — di  + d2  + ■■■  + dn  of  any  graphical  sequence 
is  always  even,  because  it  is  twice  the  number  of  edges.  Furthermore,  it’s  easy  to 
see  that  the  sequence  3311  is  not  graphical;  therefore  graphical  sequences  must 
also  satisfy  additional  conditions.  What  are  they? 

A simple  way  to  decide  if  a given  sequence  d\d2  ■ ■ - dn  is  graphical,  and  to 
construct  such  a graph  if  one  exists,  was  discovered  by  V.  Havel  [Casopis  pro 
Pestovam  Matematiky  80  (1955),  477-479] . We  begin  with  an  empty  tableau, 
having  d cells  in  row  k ; these  cells  represent  “slots”  into  which  we’ll  place  the 
neighbors  of  vertex  k in  the  constructed  graph.  Let  Cj  be  the  number  of  cells  in 
column  j;  thus  c±  > C2  > • • • , and  when  1 < k < n we  have  Cj  > k if  and  only  if 
dk  > j ■ For  example,  suppose  n — 8 and  d\ . . . dg  = 55544322;  then 


is  the  initial  tableau,  and  we  have  c\  . . . C5  = 88653.  Havel’s  idea  is  to  pair  up 
vertex  n with  dn  of  the  highest-degree  vertices.  In  this  case,  for  example,  we 
create  the  two  edges  8 — 3 and  8 — 2,  and  the  tableau  takes  the  following  form: 


(53) 


(We  don’t  want  8 — 1,  because  the  empty  slots  should  continue  to  form  a tableau 
shape;  the  cells  of  each  column  must  be  filled  from  the  bottom  up.)  Next  we  set 
n <—  7 and  create  two  further  edges,  7 — 1 and  7 — 5.  And  then  come  three 
more,  6 — 4,  6 — 3,  6 — 2,  making  the  tableau  almost  half  full: 


1 

7 

2 

6 
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3 

6 

8 

4 
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5 
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6 
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3 

4 

7 

5 

1 

8 

2 

3 

(54) 
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We’ve  reduced  the  problem  to  finding  a graph  with  degree  sequence  dr . . . d5  = 
43333;  at  this  point  we  also  have  cx . . . c4  = 5551.  The  reader  is  encouraged  to 
fill  in  the  remaining  blanks,  before  looking  at  the  answer  in  exercise  103. 

Algorithm  H ( Graph  generator  for  specified  degrees).  Given  dx  > ■ ■ ■ > dn  > 
dn+ 1 = °>  this  algorithm  creates  edges  between  the  vertices  {1, . . . , n}  in  such 
a way  that  exactly  dk  edges  touch  vertex  k,  for  1 < k < n,  unless  the  sequence 
di  . . ,dn  isn’t  graphical.  An  array  cx  . . . cdl  is  used  for  auxiliary  storage. 

HI.  [Set  the  c s.]  Start  with  k <—  dx  and  j «-  0.  Then  while  k > 0 do  the  follow- 
ing operations:  Set  j j + 1;  while  k > dj+1,  set  ck  j and  k «-  k - 1 
Terminate  successfully  if  j = 0 (all  d's  are  zero). 

H2.  [Find  n.]  Set  n Cl.  Terminate  successfully  if  n = 0;  terminate  unsuccess- 
fully if  d1  > n > 0. 


H3.  [Begin  loop  on  j.\  Set  i <-  1,  t <-  dx,  r <-  ct,  and  j <-  dn. 

H4.  [Generate  a new  edge.]  Set  Cj  t-  Cj  - 1 and  m <-  ct.  Create  the  edge 
n — m,  and  set  dm  <-  dm  - 1,  ct  <-  m - 1,  j «-  j - 1.  If  j = 0,  return 
to  step  H2.  Otherwise,  if  m = *,  set  i <-  r + 1,  t <-  dt,  and  r ^ ct  (see 
exercise  104);  repeat  step  H4.  | 


When  Algorithm  H succeeds,  it  certainly  has  constructed  a graph  with  the 

desired  degrees.  But  when  it  fails,  how  can  we  be  sure  that  its  mission  was 

impossible?  The  key  fact  is  based  on  an  important  concept  called  “majorization” : 
If  d1...dn  and  d[  . . . d’n  are  two  partitions  of  the  same  integer  (that  is,  if  d1  > 
> dn  and  dx  > ■ ■ > dn  and  dx  + ■ • • + dn  = d[  + ■ • • + d!n ) , we  say  that 

du  > d\  + ■ • ■ + d'k  for  1 < k < n. 


dl...dn  majorizes  d[  . . . d!n  if  dx  + ■ • • + dk  > d[  + ■ ■ ■ -r 


Lemma  M.  If  d1...dn  is  graphical  and  dx 
d'i  ■ ■ ■ d!n  is  also  graphical. 


■ dn  majorizes  d\  . . . d'n,  then 


Proof.  It  is  sufficient  to  prove  the  claim  when  d1...dn  and  d[  . . . d'  differ  in 
only  two  places, 


dk  ~ dk  — [k  — i]  + [k  = j]  where  i < j,  (55) 

because  any  sequence  majorized  by  d1 . . . dn  can  be  obtained  by  repeatedly 
performing  mini-majorizations  such  as  this.  (Exercise  7.2.1.4-55  discusses  ma- 
jorization in  detail.) 

Condition  (55)  implies  that  dt  > d[  > d'l+1  > d!-  > dr  So  any  graph 
with  degree  sequence  d1...dn  contains  a vertex  v such  that  v — i and  v-f-j. 
Deleting  the  edge  v * and  adding  the  edge  v — j yields  a graph  with  degree 
sequence  d[...d'n,  as  desired.  | 

Corollary  H.  Algorithm  H succeeds  whenever  dj  ...  dn  is  graphical. 

Proof.  We  may  assume  that  n > 1.  Suppose  G is  any  graph  on  {1, . . . , n}  with 
degree  sequence  d\  . . ,dn,  and  let  G'  be  the  subgraph  induced  by  {1, . . . , n — 1}; 
m other  words,  obtain  G'  by  removing  vertex  n and  the  dn  edges  that  it  touches! 
The  degree  sequence  d[  . . . d'n_1  of  G"  is  obtained  from  dx  . . . dn_x  by  reducing 
some  dn  of  the  entries  by  1 and  sorting  them  into  nonincreasing  order.  By 
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definition,  d[  . . . d!n_l  is  graphical.  The  new  degree  sequence  d”  . . . d'^l_1  produced 
by  the  strategy  of  steps  H3  and  H4  is  designed  to  be  majorized  by  every  such 
d[  . . . d'n_l,  because  it  reduces  the  largest  possible  dn  entries  by  1.  Thus  the  new 
d " . . . d"l_1  is  graphical.  Algorithm  H,  which  sets  d\ . . . dn_i  -f-  d'{  . . . d"_l5  will 
therefore  succeed  by  induction  on  n.  | 

The  running  time  of  Algorithm  H is  roughly  proportional  to  the  number 
of  edges  generated,  which  can  be  of  order  n2.  Exercise  105  presents  a faster 
method,  which  decides  in  O(n)  steps  whether  or  not  a given  sequence  di . . . dn 
is  graphical  (without  constructing  any  graph). 

Beyond  graphs.  When  the  vertices  and/or  arcs  of  a graph  or  digraph  are 
decorated  with  additional  data,  we  call  it  a network.  For  example,  every  vertex  of 
words  (5757,  0,  0,  0)  has  an  associated  rank,  which  corresponds  to  the  popularity 
of  the  corresponding  five-letter  word.  Every  vertex  of  planeJisa (360, 250, 15, 
0,360,0,250,0,0,2295000)  has  an  associated  pixel  density,  between  0 and  15. 
Every  arc  of  board  ( 8, 8, 0, 0,  — 2, 0, 0)  has  an  associated  length,  which  reflects 
the  distance  of  a piece’s  motion  on  the  board:  A bishop’s  move  from  corner  to 
corner  has  length  7.  The  Stanford  GraphBase  includes  several  further  generators 
that  were  not  mentioned  above,  because  they  are  primarily  used  to  generate 
interesting  networks,  rather  than  to  generate  graphs  with  interesting  structure: 

• 7mfes(128, 0, 0, 0, 0, 127, 0)  is  a network  with  128  vertices,  corresponding  to 
the  same  North  American  cities  as  the  graph  plane-miles  described  earlier.  But 
miles,  unlike  plane-miles,  is  a complete  graph  with  (128)  edges.  Every  edge  has 
an  integer  length,  which  represents  the  distance  that  a car  or  truck  would  have 
needed  to  travel  in  1949  when  going  from  one  given  city  to  another.  For  example, 
‘Vancouver,  BC’  is  3496  miles  from  ‘West  Palm  Beach,  FL’  in  the  miles  network. 

• econ(81, 0, 0,  0)  is  a network  with  81  vertices  and  4902  arcs.  Its  vertices 
represent  sectors  of  the  United  States  economy,  and  its  arcs  represent  the  flow  of 
money  from  one  sector  to  another  during  the  year  1985,  measured  in  millions  of 
dollars.  For  example,  the  flow  value  from  Apparel  to  Household  furniture  is  44, 
meaning  that  the  furniture  industry  paid  $44,000,000  to  the  apparel  industry  in 
that  year.  The  sum  of  flows  coming  into  each  vertex  is  equal  to  the  sum  of  flows 
going  out.  An  arc  appears  only  when  the  flow  is  nonzero.  A special  vertex  called 
Users  receives  the  flows  that  represent  total  demand  for  a product;  a few  of  these 
end-user  flows  are  negative,  because  of  the  way  imported  goods  are  treated  by 
government  economists. 

• games(120, 0, 0, 0, 0, 0, 128,0)  is  a network  with  120  vertices  and  1276  arcs. 
Its  vertices  represent  football  teams  at  American  colleges  and  universities.  Arcs 
run  between  teams  that  played  each  other  during  the  exciting  1990  season, 
and  they  are  labeled  with  the  number  of  points  scored.  For  example,  the  arc 
Stanford — >■  California  has  value  27,  and  the  arc  California — !•  Stanford 
has  value  25,  because  the  Stanford  Cardinal  defeated  the  U.  C.  Berkeley  Golden 
Bears  by  a score  of  27-  25  on  17  November  1990. 

• risc(16)  is  a network  of  an  entirely  different  kind.  It  has  3240  vertices  and 
7878  arcs,  which  define  a directed  acyclic  graph  or  “dag” — namely,  a digraph 


32 


COMBINATORIAL  SEARCHING 


7 


that  contains  no  oriented  cycles.  The  vertices  represent  gates  that  have  Boolean 
values;  an  arc  such  as  Z45  — » RO : 7~  means  that  the  value  of  gate  Z45  is  an 
input  to  gate  R0:7~.  Each  gate  has  a type  code  (AND,  OR,  XOR,  NOT,  latch, 
or  external  input);  each  arc  has  a length,  denoting  an  amount  of  delay.  The 
network  contains  the  complete  logic  for  a miniature  RISC  chip  that  is  able  to 
obey  simple  commands  governing  sixteen  registers,  each  16  bits  wide. 

Complete  details  about  all  the  SGB  generators  can  be  found  in  the  author’s 
book  The  Stanford  GraphBase  (New  York:  ACM  Press,  1994),  together  with 
dozens  of  short  example  programs  that  explain  how  to  manipulate  the  graphs  and 
networks  that  the  generators  produce.  For  example,  a program  called  LADDERS 
shows  how  to  find  a shortest  path  between  one  five-letter  word  and  another.  A 
program  called  TAKE_RISC  demonstrates  how  to  put  a nanocomputer  through 
its  paces  by  simulating  the  actions  of  a network  built  from  the  gates  of  nsc(16). 

Hypergraphs.  Graphs  and  networks  can  be  utterly  fascinating,  but  they  aren’t 
the  end  of  the  story  by  any  means.  Lots  of  important  combinatorial  algorithms 
are  designed  to  work  with  hypergraphs,  which  are  more  general  than  graphs 
because  their  edges  are  allowed  to  be  arbitrary  subsets  of  the  vertices. 

For  example,  we  might  have  seven  vertices,  identified  by  nonzero  binary 
strings  v = aia2a3,  together  with  seven  edges,  identified  by  bracketed  nonzero 
binary  strings  e = [616263],  with  v £ e if  and  only  if  (0161+0262+0363)  mod  2 = 0. 
Each  of  these  edges  contains  exactly  three  vertices: 

[001]  = {010,100,110};  [010]  = {001,100,101};  [011]  = {Oil,  100,  111}; 

[100]  = {001,010,011};  [101]  = {010,101,111}; 

[110]  = {001,110,  111};  [111]  = {011,101,110}.  (56) 

And  by  symmetry,  each  vertex  belongs  to  exactly  three  edges.  (Edges  that 
contain  three  or  more  vertices  are  sometimes  called  “hyperedges,”  to  distinguish 
them  from  the  edges  of  an  ordinary  graph.  But  it’s  OK  to  call  them  just  “edges.” ) 

A hypergraph  is  said  to  be  r -uniform  if  every  edge  contains  exactly  r vertices. 
Thus  (56)  is  a 3-uniform  hypergraph,  and  a 2-uniform  hypergraph  is  an  ordinary 
graph.  The  complete  r-uniform  hypergraph  K ir)  has  n vertices  and  (")  edges. 

Most  of  the  basic  concepts  of  graph  theory  can  be  extended  to  hypergraphs 
in  a natural  way.  For  example,  if  H = ( V , E)  is  a hypergraph  and  if  V C V,  the 
subhypergraph  H \ U induced  by  U has  the  edges  {e  \ e e E and  e C U}.  The 
complement  H of  an  r-uniform  hypergraph  has  the  edges  of  that  aren’t 
edges  of  H.  A /.'-coloring  of  a hypergraph  is  an  assignment  of  colors  to  the 
vertices  so  that  no  edge  is  monochromatic.  And  so  on. 

Hypergraphs  go  by  many  other  names,  because  the  same  properties  can  be 
formulated  in  many  different  ways.  For  example,  every  hypergraph  H = (V,  E) 
is  essentially  a family  of  sets,  because  each  edge  is  a subset  of  V.  A 3-uniform 
hypergraph  is  also  called  a triple  system.  A hypergraph  is  also  equivalent  to 
a matrix  B of  0s  and  Is,  with  one  row  for  each  vertex  v and  one  column  for 
each  edge  e;  row  v and  column  e of  this  matrix  contains  the  value  bve  = [v  6 e]. 
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Matrix  B is  called  the  incidence  matrix  of  H,  and  we  say  that  “v  is  incident 
with  e”  when  v G e.  Furthermore,  a hypergraph  is  equivalent  to  a bipartite 
graph,  with  vertex  set  V Li  E and  with  the  edge  v — e whenever  v is  incident 
with  e.  The  hypergraph  is  said  to  be  connected  if  and  only  if  the  corresponding 
bipartite  graph  is  connected.  A cycle  of  length  k in  a hypergraph  is  defined  to 
be  a cycle  of  length  2k  in  the  corresponding  bipartite  graph. 

For  example,  the  hypergraph  (56)  can  be  defined  by  an  equivalent  incidence 
matrix  or  an  equivalent  bipartite  graph  as  follows: 


001 

010 

011 

100 

101 

110 

111 


[001]  [010]  [011]  [100]  [101]  [110]  [111] 


/° 

1 

0 

1 

0 

1 

Vo 


0 

1 
0 

0 

1 

0 

1 


1 

0 

1 
1 

0 / 


It  contains  28  cycles  of  length  3,  such  as 


[010]  001 


[101]  — 101  — [010]  — 001  — [100]  — 010  — [101]. 


(57) 


(58) 


The  dual  HT  of  a hypergraph  H is  obtained  by  interchanging  the  roles 
of  vertices  and  edges,  but  retaining  the  incidence  relation.  In  other  words,  it 
corresponds  to  transposing  the  incidence  matrix.  Notice,  for  example,  that  the 
dual  of  an  r-regular  graph  is  an  r-uniform  hypergraph. 

Incidence  matrices  and  bipartite  graphs  might  correspond  to  hypergraphs  in 
which  some  edges  occur  more  than  once,  because  distinct  columns  of  the  matrix 
might  be  equal.  When  a hypergraph  H = (V,  E)  does  not  have  any  repeated 
edges,  it  corresponds  also  to  yet  another  combinatorial  object,  namely  a Boolean 
function.  For  if,  say,  the  vertex  set  V is  {1,2,...,  n},  the  function 

h(x i,x2,...,xn)  = [{j  | xj  = 1}  6 E] 
characterizes  the  edges  of  H . For  example,  the  Boolean  formula 

(£1  ® X2  ® £3)  A (x2  ® X\  ® x6)  A (x3  ® x4  ® x7) 

A (£3  © £5  ® x6)  A (xi  V x2  V £4) 


(59) 

(60) 


is  another  way  to  describe  the  hypergraph  of  (56)  and  (57). 

The  fact  that  combinatorial  objects  can  be  viewed  in  so  many  ways  can 
be  mind-boggling.  But  it’s  also  extremely  helpful,  because  it  suggests  different 
ways  to  solve  equivalent  problems.  When  we  look  at  a problem  from  different 
perspectives,  our  brains  naturally  think  of  different  ways  to  attack  it.  Sometimes 
we  get  the  best  insights  by  thinking  about  how  to  manipulate  rows  and  columns 
in  a matrix.  Sometimes  we  make  progress  by  imagining  vertices  and  paths,  or 
by  visualizing  clusters  of  points  in  space.  Sometimes  Boolean  algebra  is  just  the 
thing.  If  we’re  stuck  in  one  domain,  another  might  come  to  our  rescue. 
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Covering  and  independence.  If  H = ( V , E)  is  a graph  or  hypergraph,  a set 
U of  vertices  is  said  to  cover  H if  every  edge  contains  at  least  one  member  of  U. 
A set  W of  vertices  is  said  to  be  independent  (or  “stable”)  in  H if  no  edge  is 
completely  contained  in  W . 

From  the  standpoint  of  the  incidence  matrix,  a covering  is  a set  of  rows 
whose  sum  is  nonzero  in  every  column.  And  in  the  special  case  that  H is  a 
graph,  every  column  of  the  matrix  contains  just  two  Is;  hence  an  independent 
set  in  a graph  corresponds  to  a set  of  rows  that  are  mutually  orthogonal  — that 
is,  a set  for  which  the  dot  product  of  any  two  different  rows  is  zero. 

These  concepts  are  opposite  sides  of  the  same  coin.  If  U covers  H,  then 
W = V \ U is  independent  in  H;  conversely,  if  W is  independent  in  H,  then 
U = V \ W covers  H . Both  statements  are  equivalent  to  saying  that  the  induced 
hypergraph  H \ W has  no  edges. 

This  dual  relationship  between  covering  and  independence,  which  was  per- 
haps first  noted  by  Claude  Berge  [Proc.  National  Acad.  Sci.  43  (1957),  842-844], 
is  somewhat  paradoxical.  Although  it’s  logically  obvious  and  easy  to  verify,  it’s 
also  intuitively  surprising.  When  we  look  at  a graph  and  try  to  find  a large 
independent  set,  we  tend  to  have  rather  different  thoughts  from  when  we  look  at 
the  same  graph  and  try  to  find  a small  vertex  cover;  yet  both  goals  are  the  same. 

A covering  set  U is  minimal  if  U \ u fails  to  be  a cover  for  all  u € U. 
Similarly,  an  independent  set  W is  maximal  if  W U w fails  to  be  independent  for 
all  w £ W.  Here,  for  example,  is  a minimal  cover  of  the  49-vertex  graph  of  the 
contiguous  United  States,  (17),  and  the  corresponding  maximal  independent  set: 


Minimal  vertex  cover,  Maximal  independent  set, 

with  38  vertices  with  1 1 vertices 


A covering  is  called  minimum  if  it  has  the  smallest  possible  size,  and  an 
independent  set  is  called  maximum  if  it  has  the  largest  possible  size.  For  example, 
with  graph  (17)  we  can  do  much  better  than  (61): 


Minimum  vertex  cover, 
with  30  vertices 


Maximum  independent  set, 
with  19  vertices 


(62) 


Notice  the  subtle  distinction  between  “minimal”  and  “minimum”  here:  In  gen- 
eral (but  in  contrast  to  most  dictionaries  of  English),  people  who  work  with 
combinatorial  algorithms  use  ‘-al’  words  like  “minimal”  or  “optimal”  to  refer 
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to  combinatorial  configurations  that  are  locally  best,  in  the  sense  that  small 
changes  don’t  improve  them.  The  corresponding  ‘-um’  words,  “minimum”  or 
“optimum,”  are  reserved  for  configurations  that  are  globally  best,  considered 
over  all  possibilities.  It’s  easy  to  find  solutions  to  any  optimization  problem 
that  are  merely  optimal,  in  the  weak  local  sense,  by  climbing  repeatedly  until 
reaching  the  top  of  a hill.  But  it’s  usually  much  harder  to  find  solutions  that 
are  truly  optimum.  For  example,  we’ll  see  in  Section  7.9  that  the  problem  of 
finding  a maximum  independent  set  in  a given  graph  belongs  to  a class  of  difficult 
problems  that  are  called  NP-complete. 

Even  when  a problem  is  NP-complete,  we  needn’t  despair.  We’ll  discuss 
techniques  for  finding  minimum  covers  in  several  parts  of  this  chapter,  and  those 
methods  work  fine  on  smallish  problems;  the  optimum  solution  in  (62)  was  found 
in  less  than  a second,  after  examining  only  a tiny  fraction  of  the  249  possibilities. 
Furthermore,  special  cases  of  NP-complete  problems  often  turn  out  to  be  simpler 
than  the  general  case.  In  Section  7.5.1  we’ll  see  that  a minimum  vertex  cover  can 
be  discovered  quickly  in  any  bipartite  graph,  or  in  any  hypergraph  that  is  the  dual 
of  a graph.  And  in  Section  7.5.5  we’ll  study  efficient  ways  to  discover  a maximum 
matching , which  is  a maximum  independent  set  in  the  line  graph  of  a given  graph. 

The  problem  of  maximizing  the  size  of  an  independent  set  occurs  sufficiently 
often  that  it  has  acquired  a special  notation:  If  H is  any  hypergraph,  the  number 

a(H)  — max{|IF|  | W is  an  independent  set  of  vertices  in  id}  (63) 

is  called  the  independence  number  (or  the  stability  number)  of  H.  Similarly, 

x(H)  — min{  k | H is  fc-colorable } (64) 

is  called  the  chromatic  number  of  H . Notice  that  x(H)  is  the  size  of  a mini- 
mum covering  of  H by  independent  sets,  because  the  vertices  that  receive  any 
particular  color  must  be  independent  according  to  our  definitions. 

These  definitions  of  a(H)  and  x(Id)  apply  in  particular  to  the  case  when 
H is  an  ordinary  graph,  but  of  course  we  usually  write  a(G)  and  x(G)  in  such 
situations.  Graphs  have  another  important  number  called  their  clique  number , 

u>(G)  — maxll^l  | X is  a clique  in  G},  (65) 

where  a “clique”  is  a set  of  mutually  adjacent  vertices.  Clearly 

w(G)  = a(G),  (66) 

because  a clique  in  G is  an  independent  set  in  the  complementary  graph.  Sim- 
ilarly we  can  see  that  x(G)  is  the  minimum  size  of  a “clique  cover,”  which  is  a 
set  of  cliques  that  exactly  covers  all  of  the  vertices. 

Several  instances  of  “exact  cover  problems”  were  mentioned  earlier  in  this 
section,  without  an  explanation  of  exactly  what  such  a problem  really  signifies. 
Finally  we’re  ready  for  the  definition:  Given  the  incidence  matrix  of  a hyper- 
graph H,  an  exact  cover  of  Id  is  a set  of  rows  whose  sum  is  (11  ...  1).  In  other 
words,  an  exact  cover  is  a set  of  vertices  that  touches  each  hyperedge  exactly 
once;  an  ordinary  cover  is  only  required  to  touch  each  hyperedge  at  least  once. 
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EXERCISES 

1.  [25]  Suppose  n = 4 to  — 1.  Construct  arrangements  of  Langford  pairs  for  the 
numbers  {1, 1, . . . , w,  n},  with  the  property  that  we  also  obtain  a solution  for  n = 4m 
by  changing  the  first  ‘2m— V to  ‘4m’  and  appending  ‘2m— 1 4m’  at  the  right.  Hint: 
Put  the  m - 1 even  numbers  4m-4,  4m-6,  . . . , 2m  at  the  left. 

2.  [20]  For  which  n can  {0, 0, 1, 1, ... , n— 1,  n— 1}  be  arranged  as  Langford  pairs? 

3.  [22]  Suppose  we  arrange  the  numbers  {0,0, 1, 1, . . . ,n-l,n-l}  in  a circle , instead 
of  a straight  line,  with  distance  k between  the  two  k’s.  Do  we  get  solutions  that  are 
essentially  distinct  from  those  of  exercise  2? 

4.  [ M20 ] (T.  Skolem,  1957.)  Show  that  the  Fibonacci  string  Soo  — babbababbabba. . . . 
of  exercise  1.2.8-36  leads  directly  to  an  infinite  sequence  0012132453674  . . . of  Langford 
pairs  for  the  set  of  all  nonnegative  integers,  if  we  simply  replace  the  a’s  and  b’s 
independently  by  0,  1,  2,  etc.,  from  left  to  right. 

► 5.  [HM22]  If  a permutation  of  {1,1, 2,2,..., n,n}  is  chosen  at  random,  what  is  the 
probability  that  the  two  k’s  are  exactly  k positions  apart,  given  k?  Use  this  formula 
to  guess  the  size  of  the  Langford  numbers  Ln  in  (i). 

► 6.  [M28]  (M.  Godfrey,  2002.)  Let /(aq, . . .,x2n)  = Y\l=l{xkxn+k  XjXj+k+i)  ■ 

a)  Prove  that  Ex1,...,3:2r,e{-i.+1}  /(* l,  • • • ,*2„)  = 2 2n+1Ln. 

b)  Explain  how  to  evaluate  this  sum  in  0(4"n)  steps.  How  many  bits  of  precision 
are  needed  for  the  arithmetic? 

c)  Gain  a factor  of  eight  by  exploiting  the  identities 

f(X  1,  ■ • • ’X2n)  = f(-X l,  . . . , -a ;2n)  = f(x 2„,  . . . ,xi)  = /(xi,  -X2,  . . . ,X2n-l,  ~X2n)- 

7.  [M2 2]  Prove  that  every  Langford  pairing  of  {1, 1, . . . , 16, 16}  must  have  seven 
uncompleted  pairs  at  some  point,  when  read  from  left  to  right. 

8.  [23]  The  simplest  Langford  sequence  is  not  only  well-balanced;  it’s  planar,  in  the 
sense  that  its  pairs  can  be  connected  up  without  crossing  lines  as  in  (2): 

1  1 

2 3 12  13. 


Find  all  of  the  planar  Langford  pairings  for  which  n < 8. 

9.  [24]  [Langford  triples.)  In  how  many  ways  can  {1, 1, 1,  2,  2,  2, ...,  9,  9, 9}  be  ar- 
ranged in  a row  so  that  consecutive  k’s  are  k apart,  for  1 < k < 9? 

10.  [M20]  Explain  how  to  construct  a magic  square  directly  from  Fig.  1.  (Convert 
each  card  into  a number  between  1 and  16,  in  such  a way  that  the  rows,  columns,  and 
main  diagonals  all  sum  to  34.) 

11.  [20]  Extend  (5)  to  a “Hebraic-Graaco-Latin”  square  by  appending  one  of  the 
letters  {N,3,J,  1}  to  the  two-letter  string  in  each  compartment.  No  letter  pair  (Latin, 
Greek),  (Latin,  Hebrew),  or  (Greek,  Hebrew)  should  appear  in  more  than  one  place. 

► 12.  [M21]  (L.  Euler.)  Let  Ll}  = (i+j)  mod  n for  0 < i,  j < n be  the  addition  table  for 
integers  mod  n.  Prove  that  a latin  square  orthogonal  to  L exists  if  and  only  if  n is  odd. 

13.  [M25]  A 10  x 10  square  can  be  divided  into  four  quarters  of  size  5x5.  A 10  X 10 
latin  square  formed  from  the  digits  {0,1,...,  9}  has  k “intruders”  if  its  upper  left 
quarter  has  exactly  k elements  > 5.  (See  exercise  14(e)  for  an  example  with  k = 3.) 
Prove  that  the  square  has  no  orthogonal  mate  unless  there  are  at  least  three  intruders. 
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14.  [29]  Find  all  orthogonal  mates  of  the  following  latin  squares: 


(a) 

(b) 

(c) 

(d) 

(e) 

3145926870 

2718459036 

0572164938 

1680397425 

7823456019 

2819763504 

0287135649 

6051298473 

8346512097 

8234067195 

9452307168 

7524093168 

4867039215 

9805761342 

2340178956 

6208451793 

1435962780 

1439807652 

2754689130 

3401289567 

8364095217. 

6390718425. 

8324756091 . 

0538976214. 

4012395678 

5981274036  ’ 

4069271853 ’ 

7203941586 ’ 

4963820571 ’ 

5678912340 

4627530981 

3102684597 

5610473829 

7192034658 

6789523401 

0576148329 

9871546302 

9148625307 

6219405783 

0195634782 

1730689452 

8956307214 

2795380164 

3471258906 

1956740823 

7093812645 

5643820971 

3986512740 

5027143869 

9567801234 

15.  [50]  Find  three  10  x 10  latin  squares  that  are  mutually  orthogonal  to  each  other. 

16.  [48]  (H.  J.  Ryser,  1967.)  A latin  square  is  said  to  be  of  “order  n”  if  it  has  n rows, 
n columns,  and  n symbols.  Does  every  latin  square  of  odd  order  have  a transversal? 

17.  [25]  Let  L be  a latin  square  with  elements  Lij  for  0 < i,j  < n.  Show  that  the 
problems  of  (a)  finding  all  the  transversals  of  L,  and  (b)  finding  all  the  orthogonal 
mates  of  L,  are  special  cases  of  the  general  exact  cover  problem. 

18.  [M26]  The  string  x\x2  ...xn  is  called  “n-ary”  if  each  element  Xj  belongs  to  the 
set  {0, 1, ... , n—  1}  of  n-ary  digits.  Two  strings  xix2  ■ ■ ■ xn  and  yyy2  ■ ■ .yN  are  said  to 
be  orthogonal  if  the  N pairs  ( Xj,yj ) are  distinct  for  1 < j < N.  (Consequently,  two 
n-ary  strings  cannot  be  orthogonal  if  their  length  N exceeds  n2.)  An  n-ary  matrix 
with  m rows  and  n2  columns  whose  rows  are  orthogonal  to  each  other  is  called  an 
orthogonal  array  of  order  n and  depth  to. 

Find  a correspondence  between  orthogonal  arrays  of  depth  to  and  lists  of  to  — 2 
mutually  orthogonal  latin  squares.  What  orthogonal  array  corresponds  to  exercise  11? 

► 19.  [M25]  Continuing  exercise  18,  prove  that  an  orthogonal  array  of  order  n > 1 and 
depth  to  is  possible  only  if  to  < n + 1.  Show  that  this  upper  limit  is  achievable  when 
n is  a prime  number  p.  Write  out  an  example  when  p = 5. 

20.  [HM20]  Show  that  if  each  element  k in  an  orthogonal  array  is  replaced  by  e27rfc,/,n, 
the  rows  become  orthogonal  vectors  in  the  usual  sense  (their  dot  product  is  zero). 

► 21.  [M21]  A geometric  net  is  a system  of  points  and  lines  that  obeys  three  axioms: 

i)  Each  line  is  a set  of  points. 

ii)  Distinct  lines  have  at  most  one  point  in  common. 

iii)  If  p is  a point  and  L is  a line  with  p ^ L,  then  there  is  exactly  one  line  M such 
that  p € M and  L PI  M — 0. 

If  L n M = 0 we  say  that  L is  parallel  to  A I,  and  write  L ||  M. 

a)  Prove  that  the  lines  of  a geometric  net  can  be  partitioned  into  equivalence  classes, 
with  two  lines  in  the  same  class  if  and  only  if  they  are  equal  or  parallel. 

b)  Show  that  if  there  are  at  least  two  classes  of  parallel  lines,  every  line  contains  the 
same  number  of  points  as  the  other  lines  in  its  class. 

c)  Furthermore,  if  there  are  at  least  three  classes,  there  are  numbers  to  and  n such 
that  all  points  belong  to  exactly  to  lines  and  all  lines  contain  exactly  n points. 

► 22.  [M22]  Show  that  every  orthogonal  array  can  be  regarded  as  a geometric  net.  Is 
the  converse  also  true? 

23.  [M23]  ( Error- correcting  codes.)  The  “Hamming  distance”  d(x,y)  between  two 
strings  x = x\  . . .xn  and  y = yi  . . . yN  is  the  number  of  positions  j where  Xj  yf  yp  A 
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b-ary  code  with  n information  digits  and  r check  digits  is  a set  C(b , n,  r)  of  bn  strings 
x = Xl  ' ’ ’ Xn+r’  where  0 < xj  < b for  1 < j < n + r.  When  a codeword  x is  transmitted 
and  the  message  y is  received,  d(x,  y)  is  the  number  of  transmission  errors.  The  code 
is  called  t-error  correcting  if  we  can  reconstruct  the  value  of  z whenever  a message  y 
is  received  with  d(x,  y)  < t.  The  distance  of  the  code  is  the  minimum  value  of  d(x,  x'), 
taken  over  all  pairs  of  codewords  x / x' . 

a)  Prove  that  a code  is  t-error  correcting  if  and  only  if  its  distance  exceeds  2 1. 

b)  Prove  that  a single-error  correcting  b- ary  code  with  2 information  digits  and  2 check 
digits  is  equivalent  to  a pair  of  orthogonal  latin  squares  of  order  b. 

c)  Furthermore,  a code  C(b,  2,  r)  with  distance  r+1  is  equivalent  to  a set  of  r mutually 
orthogonal  latin  squares  of  order  b. 

► 24.  [MSO]  A geometric  net  with  N points  and  R lines  leads  naturally  to  the  binary 
code  C( 2,  N,  R)  with  codewords  Xl  . . . xNxN+1  . . . xN+R  defined  by  the  parity  bits 

XN+k  = fk{x  1, . . . ,xN)  = I point  j lies  on  line  k})  mod  2. 

a)  If  the  net  has  m classes  of  parallel  lines,  prove  that  this  code  has  distance  m + 1. 

b)  Find  an  efficient  way  to  correct  up  to  t errors  with  this  code,  assuming  that  m = 2i. 
Illustrate  the  decoding  process  in  the  case  N = 25,  R = 30,  t = 3. 

25.  [27]  Find  a latin  square  whose  rows  and  columns  are  five-letter  words.  (For  this 
exercise  you’ll  need  to  dig  out  the  big  dictionaries.) 

► 26.  [25]  Compose  a meaningful  English  sentence  that  contains  only  five-letter  words. 

27.  [20]  How  many  SGB  words  contain  exactly  k distinct  letters,  for  1 < k < 5? 

28.  [20]  Are  there  any  pairs  of  SGB  word  vectors  that  differ  by  ±1  in  each  component? 

29.  [20]  Find  all  SGB  words  that  are  palindromes  (equal  to  their  reflection),  or  mirror 
pairs  (like  regal  lager). 

► 30.  [20]  The  letters  of  first  are  in  alphabetic  order  from  left  to  right.  What  is  the 
lexicographically  first  such  five-letter  word?  What  is  the  last? 

31.  [21]  (C.  McManus.)  Find  all  sets  of  three  SGB  words  that  are  in  arithmetic 
progression  but  have  no  common  letters  in  any  fixed  position.  (One  such  example  is 
{power,  slugs,  visit}.) 

32.  [23]  Does  the  English  language  contain  any  10-letter  words  a0a i . . . a9  for  which 
both  an C2U4 Ocas  and  1/10311511709  are  SGB  words? 

33.  [20]  (Scot  Morris.)  Complete  the  following  list  of  26  interesting  SGB  words: 

about,  bacon,  faced,  under,  chief,  ...,  pizza. 

► 34.  [21]  For  each  SGB  word  that  doesn’t  include  the  letter  y,  obtain  a 5-bit  binary 
number  by  changing  the  vowels  {a,  e,  i,  o,  u}  to  1 and  the  other  letters  to  0.  What  are 
the  most  common  words  for  each  of  the  32  binary  outcomes? 

► 35.  [26]  Sixteen  well-chosen  elements  of  WORDS(IOOO)  lead  to  the  branching  pattern 
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which  is  a complete  binary  trie  of  words  that  begin  with  the  letter  s . But  there’s  no  such 
pattern  of  words  beginning  with  a,  even  if  we  consider  the  full  collection  W0RDS(5757). 

What  letters  of  the  alphabet  can  be  used  as  the  starting  letter  of  sixteen  words 
that  form  a complete  binary  trie  within  WORDS  (n),  given  n? 

36.  [ M17 ] Explain  the  symmetries  that  appear  in  the  word  cube  (10).  Also  show  that 
two  more  such  cubes  can  be  obtained  by  changing  only  the  two  words  {stove, event}. 

37.  [20]  Which  vertices  of  the  graph  words  (5757, 0, 0, 0)  have  maximum  degree? 

38.  [22]  Using  the  digraph  rule  in  (14),  change  tears  to  smile  in  just  three  steps, 
without  computer  assistance. 

39.  [MOO]  Is  G \ e an  induced  subgraph  of  G?  Is  it  a spanning  subgraph? 

40.  [M15]  How  many  (a)  spanning  (b)  induced  subgraphs  does  a graph  G = (V,E) 
have,  when  \V\  = n and  |£}  = e? 

41.  [M10]  For  which  integers  n do  we  have  (a)  Kn  = Pn?  (b)  Kn  = Gn? 

42.  [15]  (D.  H.  Lehrner.)  Let  G be  a graph  with  13  vertices,  in  which  every  vertex 
has  degree  5.  Make  a nontrivial  statement  about  G. 

43.  [23]  Are  any  of  the  following  graphs  the  same  as  the  Petersen  graph? 


44.  [M23]  How  many  symmetries  does  Chvatal’s  graph  have?  (See  Fig.  2(f).) 

45.  [20]  Find  an  easy  way  to  4-color  the  planar  graph  (17).  Would  3 colors  suffice? 

46.  [M25]  Let  G be  a graph  with  n > 3 vertices,  defined  by  a planar  diagram  that 
is  “maximal,”  in  the  sense  that  no  additional  lines  can  be  drawn  between  nonadjacent 
vertices  without  crossing  an  existing  edge. 

a)  Prove  that  the  diagram  partitions  the  plane  into  regions  that  each  have  exactly 
three  vertices  on  their  boundary.  (One  of  these  regions  is  the  set  of  all  points  that 
lie  outside  the  diagram.) 

b)  Therefore  G has  exactly  3n  — 6 edges. 

47.  [M22]  Prove  that  the  complete  bigraph  K3, 3 isn’t  planar. 

48.  [M25]  Complete  the  proof  of  Theorem  B by  showing  that  the  stated  procedure 
never  gives  the  same  color  to  two  adjacent  vertices. 

49.  [1#]  Draw  diagrams  of  all  the  cubic  graphs  with  at  most  6 vertices. 

50.  [M24]  Find  all  bipartite  graphs  that  can  be  3-colored  in  exactly  24  ways. 

► 51.  [M22]  Given  a geometric  net  as  described  in  exercise  21,  construct  the  bipartite 
graph  whose  vertices  are  the  points  p and  the  lines  L of  the  net,  with  p — L if  and 
only  if  p € L.  What  is  the  girth  of  this  graph? 

52.  [Ml 6]  Find  a simple  inequality  that  relates  the  diameter  of  a graph  to  its  girth. 
(How  small  can  the  diameter  be,  if  the  girth  is  large?) 

53.  [15]  Which  of  the  words  world  and  happy  belongs  to  the  giant  component  of  the 
graph  words  (5757, 0, 0, 0)? 
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► 54.  [21]  The  49  postal  codes  in  graph  (17)  are  AL,  AR,  AZ,  CA,  CO,  CT,  DC,  DE.  FL,  GA. 
IA,  ID,  IL,  IN,  KS,  KY,  LA,  MA,  MD,  ME,  MI,  MN,  MO,  MS,  MT,  NC,  ND,  NE,  NH,  NJ  NM  NV 
NY,  OH,  OK,  OR,  PA,  RI,  SC,  SD,  TN,  TX,  UT,  VA,  VT,  WA,  WI,  WV,  WY,  in  alphabetical  order! 

a)  Suppose  we  consider  two  states  to  be  adjacent  if  their  postal  codes  agree  in  one 

place  (namely  AL  AR  — OR  — OH,  etc.).  What  are  the  components  of  this  graph? 

b)  Now  form  a directed  graph  with  XY  — > YZ  (for  example,  AL  — > LA  — > AR.  etc.). 
What  are  the  strongly  connected  components  of  this  digraph?  (See  Section  2. 3. 4. 2.) 

c)  The  United  States  has  additional  postal  codes  AA,  AE,  AK,  AP,  AS,  FM,  GU,  HI.  MH. 
MP,  PW,  PR,  VI,  besides  those  in  (17).  Reconsider  question  (b),  using  all  62  codes. 

55.  [ M20 ] How  many  edges  are  in  the  complete  fc-partite  graph  Kni  ^ ? 

► 56.  [. M10 ] True  or  false:  A multigraph  is  a graph  if  and  only  if  the  corresponding 
digraph  is  simple. 

57.  [M10]  True  or  false:  Vertices  u and  v are  in  the  same  connected  component  of  a 
directed  graph  if  and  only  if  either  d(u,  v)  < 00  or  d{v,  u ) < 00. 

58.  [Ml  7]  Describe  all  (a)  graphs  (b)  multigraphs  that  are  regular  of  degree  2. 

► 59.  [M23]  A tournament  of  order  n is  a digraph  on  n vertices  that  has  exactly  (") 
arcs,  either  u — >v  or  v — >u  for  every  pair  of  distinct  vertices  {u,  u}. 

a)  Prove  that  every  tournament  contains  an  oriented  spanning  path  n t >vn. 

b)  Consider  the  tournament  on  vertices  {0,1,  2,  3, 4}  for  which  u— tu  if  and  only  "if 
(u  -v)  mod  5 > 3.  How  many  oriented  spanning  paths  does  it  have? 

c)  Is  Kn  the  only  tournament  of  order  n that  has  a unique  oriented  spanning  path? 

► 60.  [M22]  Let  u be  a vertex  of  greatest  out-degree  in  a tournament,  and  let  v be  any 
other  vertex.  Prove  that  d(u,  v)  < 2. 

61.  [M16]  Construct  a digraph  that  has  k walks  of  length  k from  vertex  1 to  vertex  2. 

62.  [M21]  A permutation  digraph  is  a directed  graph  in  which  every  vertex  has  out- 
degree  1 and  m-degree  1;  therefore  its  components  are  oriented  cycles.  If  it  has 
n vertices  and  k components,  we  call  it  even  if  n - k is  even,  odd  if  n-  k is  odd. 

a)  Let  G be  a directed  graph  with  adjacency  matrix  A.  Prove  that  the  number  of 
spanning  permutation  digraphs  of  G is  per  A,  the  permanent  of  A. 

b)  Interpret  the  determinant,  det  A,  in  terms  of  spanning  permutation  digraphs. 

63.  [M23]  Let  G be  a graph  of  girth  g in  which  every  vertex  has  at  least  d neighbors. 
Prove  that  G has  at  least  N vertices,  where 

N=  f 1 + £o<fc<td(d-  1)\  if  3 = 21+1; 

l 1 + {d  - 1)*  + Eo<jt<i  d(d  ~ l)fc,  if  g = 2t  + 2. 

► 64.  [M21]  Continuing  exercise  63,  show  that  there’s  a unique  graph  of  girth  4,  mini- 
mum degree  d,  and  order  2d,  for  each  d > 2. 

► 65.  [ HM31 ] Suppose  graph  G has  girth  5,  minimum  degree  d,  and  N = d2+ 1 vertices. 

a)  Prove  that  the  adjacency  matrix  A of  G satisfies  the  equation  A2  + A = [d—l)I+J. 

b)  Since  A is  a symmetric  matrix,  it  has  N orthogonal  eigenvectors  Xj,  with  corre- 
sponding eigenvalues  A,  such  that  Axj  = \]Xj  for  1 < j < N.  Prove  that  each 
\j  is  either  rf  or  (-  1_±  y/4d  — 3)/2. 

c)  Show  that  if  sj 4 d - 3 is  irrational,  then  d = 2.  Hint:  Aid h \N  = trace(A)  = 0 

d)  And  if  \/4 d - 3 is  rational,  d e {3,  7,57}. 

66.  [M30]  Continuing  exercise  65,  construct  such  a graph  when  d = 7. 
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67.  [MJ8]  Is  there  a regular  graph  of  degree  57,  order  3250,  and  girth  5? 

68.  [MZO]  How  many  different  adjacency  matrices  does  a graph  G on  n vertices  have? 

► 69.  [20]  Extending  (31),  explain  how  to  calculate  both  out-degree  ODEG(u)  and  in- 
degree IDEG(u)  for  all  vertices  v in  a graph  that  has  been  represented  in  SGB  format. 

► 70.  [M20]  How  often  is  each  step  of  Algorithm  B performed,  when  that  algorithm 
successfully  2-colors  a graph  with  m arcs  and  n vertices? 

71.  [26]  Implement  Algorithm  B for  the  MMIX  computer,  using  the  MMIXAL  assembly 
language.  Assume  that,  when  your  program  begins,  register  vO  points  to  the  first  vertex 
node  and  register  n contains  the  number  of  vertices. 

► 72.  [M22]  When  COLOR(u)  is  set  in  step  B6,  call  u the  parent  of  v,  but  when  COLOR(ui) 
is  set  in  step  B3,  say  that  w has  no  parent.  Define  the  (inclusive)  ancestors  of  vertex  v, 
recursively,  to  be  v together  with  the  ancestors  of  v’s  parent  (if  any). 

a)  Prove  that  if  v is  below  u in  the  stack  during  Algorithm  B,  the  parent  of  v is  an 
ancestor  of  u. 

b)  Furthermore,  if  COLOR(u)  = COLOR(u)  in  step  B6,  v is  currently  in  the  stack. 

c)  Use  these  facts  to  extend  Algorithm  B so  that,  if  the  given  graph  is  not  bipartite, 
the  names  of  vertices  in  a cycle  of  odd  length  are  output. 

73.  [15]  What’s  another  name  for  random-graph(W,  45, 0, 0, 0, 0, 0, 0, 0, 0)? 

74.  [21]  What  vertex  of  roget(\022, 0, 0, 0)  has  the  largest  out-degree? 

75.  [22]  The  SGB  graph  generator  board(n\,n2,n3,n4,p,w,o)  creates  a graph  whose 
vertices  are  the  t-dimensional  integer  vectors  (xi, . . . ,xt)  for  0 < Xi  < bi,  determined 
by  the  first  four  parameters  (ni,  n2,  ns,  714)  as  follows:  Set  ns  t—  0 and  let  j > 0 be  min- 
imum such  that  nj  + 1 < 0.  If  j = 0,  set  fci  «—  1)2  «—  8 and  t 2;  this  is  the  default  8x8 
board.  Otherwise  if  nj+i  = 0,  set  bi  e—  rii  for  1 < i < j and  t t—  j.  Finally,  if  nj+i  < 0, 
set  t t—  |rij+i|,  and  set  bi  to  the  ith  element  of  the  periodic  sequence  (m, , . . ,nj , 
ni, . . . , nj,  ni, . . . ).  (For  example,  the  specification  (m,  n2,  ri3,  714)  = (2,3,5, —7)  is 
about  as  tricky  as  you  can  get;  it  produces  a 7-dimensional  board  with  (bi, . . . ,67)  = 
(2, 3,  5,  2, 3, 5, 2),  hence  a graph  with  2-3-5-2-3-5-2  = 1800  vertices.) 

The  remaining  parameters  (p,w,o),  for  “piece,  wrap,  and  orientation,”  determine 
the  arcs  of  the  graph.  Suppose  first  that  w = o = 0.  If  p > 0,  we  have  (xi, . . . , xt)  — > 
(j/i,...,  yt)  if  and  only  if  yi  = Xi  + Si  for  1 < i < t,  where  (<5X, . . . , 5t)  is  an  integer 
solution  to  the  equation  5?  + • ■ ■ + <5*  = |p|.  And  if  p < 0,  we  allow  also  yt  = Xi  + k5i 
for  k > 1,  corresponding  to  k moves  in  the  same  direction. 

If  w -£■  0,  let  w = ( wt  ■ ■ • tni)2  in  binary  notation.  Then  we  allow  “wraparound,” 
Vi  = ( Xi  + 5i)  mod  bi  or  yi  = (xt  + k8i)  mod  bi,  in  each  coordinate  i for  which  Wi  = 1. 

If  o / 0,  the  graph  is  directed;  offsets  (<5i, . . . , St)  produce  arcs  only  when  they  are 
lexicographically  greater  than  (0, ...  ,0).  But  if  o = 0,  the  graph  is  undirected. 

Find  settings  of  (ni,n2,n3,n4,p,w,o)  for  which  board  will  produce  the  following 
fundamental  graphs:  (a)  the  complete  graph  K„;  (b)  the  path  Pn ; (c)  the  cycle  C„; 
(d)  the  transitive  tournament  Kn\  (e)  the  oriented  path  Pn',  (f)  the  oriented  cycle  Cn\ 
(g)  the  mxn  grid  PmnPn;  (h)  the  mxn  cylinder  PmnCn',  (i)  the  mxn  torus  CmnCn\ 
(j)  the  mxn  rook  graph  Km  nKn\  (k)  the  mxn  directed  torus  Cm  □CjT;  (1)  the  null 
graph  A'n;  (m)  the  n-cube  P2  □ • • • C1P2  with  2n  vertices. 

76.  [20]  Can  board(n\,n2,n3,n4,p,w,o)  produce  loops,  or  parallel  (repeated)  edges? 

77.  [M20]  If  graph  G has  diameter  > 3,  prove  that  G has  diameter  < 3. 
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78.  [M27]  Let  G = (V,E)  be  a graph  with  |V|  = n and  G = G.  (In  other  words,  G 
is  self-complementary.  There’s  a permutation  p of  V such  that  u — v if  and  only  if 
<p(u) -f- <p(v)  and  u / v.  We  can  imagine  that  the  edges  of  Kn  have  been  painted  black 
or  white;  the  white  edges  define  a graph  that’s  isomorphic  to  the  graph  of  black  edges.) 

a)  Prove  that  n mod  4 = 0 or  1.  Draw  diagrams  for  all  such  graphs  with  n < 8. 

b)  Prove  that  if  n mod  4 = 0,  every  cycle  of  the  permutation  has  a length  that  is  a 
multiple  of  4. 

c)  Conversely,  every  permutation  p with  such  cycles  arises  in  some  such  graph  G. 

d)  Extend  these  results  to  the  case  n mod  4 = 1. 


► 79.  [M22]  Given  k > 0,  construct  a graph  on  the  vertices  {0, 1, ... , 4 k}  that  is  both 
regular  and  self-complementary. 


► 80.  [M22]  A self-complementary  graph  must  have  diameter  2 or  3,  by  exercise  77. 
Given  k > 2,  construct  self-complementary  graphs  of  both  possible  diameters,  when 
(a)  V = {1,  2, . . . , 4fc};  (b)  V = {0, 1, 2, . . . , 4k}. 


81.  [20]  The  complement  of  a simple  digraph  without  loops  is  defined  by  extending 
(35)  and  (36),  so  that  we  have  u ->•  v in  D if  and  only  if  u ± v and  u -fr  v in  D.  What 
are  the  self-complementary  digraphs  of  order  3? 

82.  [M21]  Are  the  following  statements  about  line  graphs  true  or  false? 

a)  If  G is  contained  in  G' , then  L(G)  is  an  induced  subgraph  of  L(G'). 

b)  If  G is  a regular  graph,  so  is  L(G). 

c)  L(Km.,n ) is  regular,  for  all  m,n>  0. 

d)  L(Km:n ,r)  is  regular,  for  all  m,n,r  > 0. 

e)  L[Km,n)  ~ Km  □ ATn. 

f)  A(AT4)  Si  tf2,2,2. 

g)  L(Pn+ 1)  £*  Pn. 

h)  The  graphs  G and  L(G ) both  have  the  same  number  of  components. 

83.  [16]  Draw  the  graph  L(  7f5). 

► 84.  [ M21 ] Is  L(K3,3)  self-complementary? 

85.  [M22]  (O.  Ore,  1962.)  For  which  graphs  G do  we  have  G = L(G)1 

86.  [M20]  (R.  J.  Wilson.)  Find  a graph  G of  order  6 for  which  G = L(G). 

87.  [20]  Is  the  Petersen  graph  (a)  3-colorable?  (b)  3-edge-colorable? 

88.  [M20]  The  graph  Wn  = Tfi  — Cn-i  is  called  the  wheel  of  order  n, 
when  n > 4.  How  many  cycles  does  it  contain  as  subgraphs? 

89.  [M20]  Prove  the  associative  laws,  (42)  and  (43).  Wfi 


► 90.  [M24]  A graph  is  called  a cograph  if  it  can  be  constructed  algebraically  from 
1-element  graphs  by  means  of  complementation  and/or  direct  sum  operations.  For 
example,  there  are  four  nonisomorphic  graphs  of  order  3,  and  they  all  are  cographs: 
K3  = K±  ® A/  © Kx  and  its  complement,  K3;  #1,2  = A',  ® K2  and  its  complement, 

where  K2  = A'i  © AL . 

Exhaustive  enumeration  shows  that  there  are  1 1 nonisomorphic  graphs  of  order  4. 
Give  algebraic  formulas  to  prove  that  10  of  them  are  cographs.  Which  one  isn’t? 

► 91.  [20]  Draw  diagrams  for  the_4- vertex  graphs_(a)  AT2dA2;  (b)  A'2®AT2;  (c)  K2mK2; 
(d)  K2  a K2-,  (e)  K2  o K2\  (f)  K2  o AT2;  (g)  K2  o K2. 


92.  [21]  The  five  types  of  graph  products  defined  in  the  text  work  fine  for  simple 
digraphs  as  well  as  for  ordinary  graphs.  Draw  diagrams  for  the  4-vertex  digraphs 
(a)  K2dK2;  (b)  K2®K2-  (c)  K2®K2-  (d)  K2  a K2  : (e)  K2oK2. 
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93.  [15]  Which  of  the  five  graph  products  takes  Km  and  Kn  into  7Cmn? 

94.  [10]  Are  the  SGB  words  graphs  induced  subgraphs  of  P26  0^26  □ I'm  0P2&  C1P26? 

95.  [M20]  If  vertex  u of  G has  degree  du  and  vertex  v of  77  has  degree  dv,  what  is 

the  degree  of  vertex  (u,v)  in  (a)  GnH?  (b)  G®77?  (c)  GbH?  (d)  GaH?  (e)  Go  HI 

► 96.  [M22]  Let  A be  an  m x m'  matrix  with  auu/  in  row  u and  column  u;  let  B be 

an  n X n matrix  with  bvv>  in  row  v and  column  v' . The  direct  product  A ® B is  an 
mn  x m'n  matrix  with  auu'bvv'  in  row  (u,v)  and  column  ( u ,v ').  Thus  A <g>  B is  the 

adjacency  matrix  of  G <8>  H,  if  A and  B are  the  adjacency  matrices  of  G and  77. 

Find  analogous  formulas  for  the  adjacency  matrices  of  (a)  GnH;  (b)  G b 77; 
(c)  GaF;  (d)  GoH. 

97.  [M25]  Find  as  many  interesting  algebraic  relations  between  graph  sums  and  prod- 
ucts as  you  can.  (For  example,  the  distributive  law  (A®B)  <g>  C = (A®  C)  © [B®  C)  for 
direct  sums  and  products  of  matrices  implies  that  ( G@G')®H  = (G  ® H)  © (G'  ® H). 
We  also  have  Km  nH  = H © • • • © H , with  m copies  of  H,  etc.) 

98.  [M20]  If  the  graph  G has  k components  and  the  graph  H has  l components,  how 
many  components  are  in  the  graphs  GnH  and  G H 77? 

99.  [M20]  Let  da(u,u)  be  the  distance  from  vertex  u to  vertex  u'  in  graph  G. 
Prove  that  dGnH((u,v),  (u1  ,v'))  = da(u,u)  + d[{{v,v'),  and  find  a similar  formula 
for  dasH{{u,  v),  («',«')). 

100.  [M21]  For  which  connected  graphs  is  G ® 77  connected? 

► 101.  [M25]  Find  all  connected  graphs  G and  77  such  that  Go  77  = G ® 77. 

102.  [ M20 ] What’s  a simple  algebraic  formula  for  the  graph  of  king  moves  (which 
take  one  step  horizontally,  vertically,  or  diagonally)  on  an  m x n board? 

103.  [20]  Complete  tableau  (54).  Also  apply  Algorithm  H to  the  sequence  866444444. 

104.  [7S]  Explain  the  manipulation  of  variables  i,  t,  and  r in  steps  H3  and  H4. 

105.  [M38]  Suppose  d\  > • • • > dn  > 0,  and  let  ci  > • • • > c ^ be  its  conjugate  as  in 
Algorithm  H.  Prove  that  d\  . . ,dn  is  graphical  if  and  only  if  d\  + • ■ • + dn  is  even  and 
di  + ■ • • + dk  < ci  + • ■ • + Ck  — k for  1 < k < s,  where  s is  maximal  such  that  ds  > s. 

106.  [20]  True  or  false:  If  di  = • ■ ■ = dn  = d < n and  nd  is  even,  Algorithm  FI 
constructs  a connected  graph. 

107.  [M21]  Prove  that  the  degree  sequence  d\...dn  of  a self-complementary  graph 
satisfies  dj  + dn+i~j  = n — 1 and  ebj-i  = £^24  for  1 < j < n/2. 

► 108.  [M23]  Design  an  algorithm  analogous  to  Algorithm  H that  constructs  a simple 
directed  graph  on  vertices  {1, . . . , n},  having  specified  values  d ^ and  d £ for  the  in-degree 
and  out-degree  of  each  vertex  k , whenever  at  least  one  such  graph  exists. 

109.  [M20]  Design  an  algorithm  analogous  to  Algorithm  H that  constructs  a bipartite 
graph  on  vertices  {1 , . . . ,m  + n},  having  specified  degrees  dk  for  each  vertex  k when 
possible;  all  edges  j — k should  have  j < m and  k > m. 

110.  [M22]  Without  using  Algorithm  H,  show  by  a direct  construction  that  the  se- 
quence d\  ...  dn  is  graphical  when  n > d\  > ■ ■ ■ > dn  > d\  — 1 and  d\  + • • • + dn  is  even. 

► 111.  [25]  Let  G be  a graph  on  vertices  V = {1, . . . ,n},  with  dk  the  degree  of  k and 
max(di, . . . , dn)  = d.  Prove  that  there’s  an  integer  N with  n < N < 2n  and  a graph  77 
on  vertices  (1, . . . , Af},  such  that  77  is  regular  of  degree  d and  77  | V — G.  Explain  how 
to  construct  such  a regular  graph  with  N as  small  as  possible. 
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► 112.  [20]  Does  the  network  miles (128, 0, 0, 0, 0, 127, 0)  have  three  equidistant  cities? 
If  not,  what  three  cities  come  closest  to  an  equilateral  triangle? 

113.  [05]  When  H is  a hypergraph  with  m edges  and  n vertices,  how  many  rows  and 
columns  does  its  incidence  matrix  have? 


114.  [M20]  Suppose  the  multigraph  (26)  is  regarded  as  a hypergraph.  What  is  the 
corresponding  incidence  matrix?  What  is  the  corresponding  bipartite  graph? 


► 115.  [M20]  When  B is  the  incidence  matrix  of  a graph  G,  explain  the  significance  of 
the  symmetric  matrices  BTB  and  BBT. 

116.  [Ml 7]  Describe  the  edges  of  the  complete  bipartite  r-uniform  hypergraph  K{„\]n . 

117.  [M22]  How  many  nonisomorphic  1-uniform  hypergraphs  have  m edges  and  n ver- 
tices? (Edges  may  be  repeated.)  List  them  all  when  m = 4 and  n = 3. 

118.  [M20]  A “hyperforest”  is  a hypergraph  that  contains  no  cycles.  If  a hyperforest 
has  171  edges,  n vertices,  and  p components,  what’s  the  sum  of  the  degrees  of  its  vertices? 

119.  [Ml 8]  What  hypergraph  corresponds  to  (60)  without  the  final  term  (xi  Vi2Vz4)? 

120.  [M20]  Define  directed  hypergraphs , by  generalizing  the  concept  of  directed  graphs. 

121.  [Ml 9]  Given  a hypergraph  H = (' V , E),  let  1(H)  = (V,  F),  where  F is  the  family 
of  all  maximal  independent  sets  of  H.  Express  X(H)  in  terms  of  |V|,  |F|,  and  a(I(H)T). 

► 122.  [M24]  Find  a maximum  independent  set  and  a minimum  coloring  of  the  following 

triple  systems:  (a)  the  hypergraph  (56);  (b)  the  dual  of  the  Petersen  graph. 

123.  [1 7]  Show  that  the  optimum  colorings  of  K„nKn  are  equivalent  to  the  solutions 
of  a famous  combinatorial  problem. 

124.  [M22]  What  is  the  chromatic  number  of  the  Chvatal  graph,  Fig.  2(f)? 

125.  [M48]  For  what  values  of  g is  there  a 4-regular,  4-chromatic  graph  of  girth  g ? 

► 126.  [M22]  Find  optimum  colorings  of  the  “kingwise  torus,”  Cm®Cn,  when  m,n>  3. 
127.  [M22]  Prove  that  (a)  X(G)  + X(G)  < n + 1 and  (b)  X(G)X(G)  > n when  G is  a 


graph  of  order  n,  and  find  graphs  for  which  equality  holds. 

128.  [Ml 8]  Express  X(GnH)  in  terms  of  X(G)  and  X(H),  when  G and  H are  graphs. 

129.  [23]  Describe  the  maximal  cliques  of  the  8x8  queen  graph  (37). 

130.  [M20]  How  many  maximal  cliques  are  in  a complete  fc-partite  graph? 

131.  [M30]  Let  N(n)  be  the  largest  number  of  maximal  cliques  that  an  n- vertex  graph 
can  have.  Prove  that  3 1”/3!  < JV(n)  < 3 T”/3! . 

► 132.  [M20]  We  call  G tightly  colorable  if  X(G)  = w(G).  Prove  that  X(G  a H)  = 
x(G)x(H)  whenever  G and  H are  tightly  colorable. 


133.  [21]  The  “musical  graph”  illustrated  here  pro- 
vides a nice  way  to  review  numerous  definitions 
that  were  given  in  this  section,  because  its  proper- 
ties are  easily  analyzed.  Determine  its  (a)  order; 

(b)  size;  (c)  girth;  (d)  diameter;  (e)  independ- 
ence number,  a(G);  (f)  chromatic  number,  x(G); 

(g)  edge-chromatic  number,  X(L(G));  (h)  clique 
number,  u;(G);  (i)  algebraic  formula  as  a product 
of  well-known  smaller  graphs.  What  is  the  size 
of  (j)  a minimum  vertex  cover?  (k)  a maximum 
matching?  Is  G (1)  regular?  (m)  planar?  (n)  con- 
nected? (o)  directed?  (p)  a free  tree?  (q)  Hamiltonian? 
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134.  [M22]  How  many  automorphisms  does  the  musical  graph  have? 

► 135.  [HM26]  Suppose  a composer  takes  a random  walk  in  the  musical  graph,  starting 
at  vertex  C and  then  making  five  equally  likely  choices  at  each  step.  Show  that  after 
an  even  number  of  steps,  the  walk  is  more  likely  to  end  at  vertex  C than  at  any  other 
vertex.  What  is  the  exact  probability  of  going  from  C to  C in  a 12-step  walk? 

136.  [HM23]  A Cayley  digraph  is  a directed  graph  whose  vertices  V are  the  elements 
of  a group  and  whose  arcs  are  v — > votj  for  1 < j < d and  all  vertices  v,  where 
(qi,  . . . , ad)  are  fixed  elements  of  the  group.  A Cayley  graph  is  a Cayley  digraph  that 
is  also  a graph.  Is  the  Petersen  graph  a Cayley  graph? 
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► 137.  [M25\  ( Generalized  toruses.)  An  m x n torus  can  be  regarded  as  a tiling  of  the 
plane.  For  example,  we  can  imagine  that  infinitely  many  copies  of  the  3x4  torus 
in  (50)  have  been  placed  together  gridwise,  as  indicated  in  the  left-hand  illustration 
above;  from  each  vertex  we  can  move  north,  south,  east,  or  west  to  another  vertex  of  the 
torus.  The  vertices  have  been  numbered  here  so  that  a northward  move  from  v goes  to 
(u  + 4)  mod  12,  and  an  eastward  move  to  (t>  + 3)  mod  12,  etc.  The  right-hand  illustration 
shows  the  same  torus,  but  with  a differently  shaped  tile;  any  way  to  choose  twelve  cells 
numbered  {0, 1, . . . , 11}  will  tile  the  plane,  with  exactly  the  same  underlying  graph. 

Shifted  copies  of  a single  shape  will  also  tile  the  plane  if  they  form  a generalized 
torus,  in  which  cell  (x,  y)  corresponds  to  the  same  vertex  as  cells  (x  + a,  y + b)  and 
(x  + c,y  + d),  where  (a,  b)  and  ( c,d ) are  integer  vectors  and  n = ad  — be  > 0.  The 
generalized  torus  will  then  have  n points.  These  vectors  (a,  b)  and  (c,  d)  are  (4, 0)  and 
(0, 3)  in  the  3x4  example  above;  and  when  they  are  respectively  (5,  2)  and  (1,3)  we  get 
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Here  n = 13,  and  a northward  move  from  v goes  to  (v  + 4)  mod  13;  an  eastward  move 
goes  to  (v  + 1)  mod  13. 

Prove  that  if  gcd(o,  b,  c,  d)  = 1,  the  vertices  of  such  a generalized  torus  can  always 
be  assigned  integer  labels  {0, 1, . . . , n— 1}  in  such  a way  that  the  neighbors  of  v are 
(v  ± p)  mod  n and  ( v ± q)  mod  n,  for  some  integers  p and  q. 
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138.  [HM27]  Continuing  exercise  137,  what  is  a good  way  to  label  fc-dimensional 
vertices  x = (xi,...,xk),  when  integer  vectors  aj  are  given  such  that  each  vector  x 
is  equivalent  to  x + a3  for  1 < j < kl  Illustrate  your  method  in  the  case  k = 3, 
«i  = (3,1,1),  Q2  = (1,3,1),  a3  = (1,1,3). 

► 139.  [M22]  Let  H be  a fixed  graph  of  order  h,  and  let  #(H:G)  be  the  number  of  times 
that  H occurs  as  an  induced  subgraph  of  a given  graph  G.  If  G is  chosen  at  random 
from  the  set  of  all  2n(n~1)/ 2 graphs  on  the  vertices  V = {1,2, ...  ,nj,  what  is  the  average 
value  of  #(H:G)  when  H is  (a)  Kh-  (b)  Ph,  for  h > 1;  (c)  Ch,  for  h > 2;  (d)  arbitrary? 

140.  [M30]  A graph  G is  called  proportional  if  its  induced  subgraph  counts  #(A'3:G), 
#{K3:G),  and  #(P3:G)  each  agree  with  the  expected  values  derived  in  exercise  139. 

a)  Show  that  the  wheel  graph  Wg  of  exercise  88  is  proportional  in  this  sense. 

b)  Prove  that  G is  proportional  if  and  only  if  jj=(K3:G)  = ^(3)  and  the  degree 
sequence  d\  . . ,dn  of  its  vertices  satisfies  the  identities 

d1  + ---+dn=  (”),  dl  + ---  + d2n  = ^(^y  (*) 

141.  [26]  The  conditions  of  exercise  140(b)  can  hold  only  if  n mod  16  € {0,1,8}. 
Write  a program  to  find  all  of  the  proportional  graphs  that  have  n = 8 vertices. 

142.  [M30]  (S.  Janson  and  J.  Kratochvil,  1991.)  Prove  that  no  graph  G on  4 or  more 
vertices  can  be  “extraproportional,”  in  the  sense  that  its  subgraph  counts  #{H:G)  agree 
with  the  expected  values  in  exercise  139  for  each  of  the  eleven  nonisomorphic  graphs  H 
of  order  4.  Hint:  (n  - 3)#(AT3:G)  = 4 #(AT4:G)  + 2#(A'M,2:G)  + #(AT1©7f3:G). 

► 143.  [M25]  Let  A be  any  matrix  with  m > 1 distinct  rows,  and  n>m  columns.  Prove 
that  at  least  one  column  of  A can  be  deleted,  without  making  any  two  rows  equal. 

► 144.  [21]  Let  X be  an  m x n matrix  whose  entries  xi3  are  either  0,  1,  or  *.  A 
“completion”  of  X is  a matrix  A*  in  which  every  * has  been  replaced  by  either  0 or  1. 
Show  that  the  problem  of  finding  a completion  with  fewest  distinct  rows  is  equivalent 
to  the  problem  of  finding  the  chromatic  number  of  a graph. 

► 145.  [25]  (R.  S.  Boyer  and  J.  S.  Moore,  1980.)  Suppose  the  array  a\  . . .a„  contains  a 
majority  element,  namely  a value  that  occurs  more  than  n/2  times.  Design  an  algorithm 
that  finds  it  after  making  fewer  than  n comparisons.  Hint:  If  n > 3 and  a„_i  / an, 
the  majority  element  of  a3  . . . an  is  also  the  majority  element  of  a3  . . . a„-2. 
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Yet  now  and  then  your  men  of  wit 
Will  condescend  to  take  a bit. 
— JONATHAN  SWIFT,  Cadenus  and  Vanessa  (1713) 

If  the  base  2 is  used  the  resulting  units  may  be  called  binary  digits, 
or  more  briefly  bits,  a word  suggested  by  J.  W.  Tukey. 

— CLAUDE  E.  SHANNON,  in  Bell  System  Technical  Journal  (1948) 

bit  (bit),  n . . . [A]  boring  tool  . . . 

— Random  House  Dictionary  of  the  English  Language  (1987) 

7.1.  ZEROS  AND  ONES 

Combinatorial  algorithms  often  require  special  attention  to  efficiency,  and 
the  proper  representation  of  data  is  an  important  way  to  gain  the  necessary 
speed.  It  is  therefore  wise  to  beef  up  our  knowledge  of  elementary  representation 
techniques  before  we  set  out  to  study  combinatorial  algorithms  in  detail. 

Most  of  today’s  computers  are  based  on  the  binary  number  system,  instead 
of  working  directly  with  the  decimal  numbers  that  human  beings  prefer,  because 
machines  are  especially  good  at  dealing  with  the  two-state  on-off  quantities  that 
we  usually  denote  by  the  digits  0 and  1.  But  in  Chapters  1 to  6 we  haven’t  made 
much  use  of  the  fact  that  binary  computers  can  do  several  things  quickly  that 
decimal  computers  cannot.  A binary  machine  can  usually  perform  “logical”  or 
"bitwise”  operations  just  as  easily  as  it  can  add  or  subtract;  yet  we  have  seldom 
capitalized  on  that  capability.  We’ve  seen  that  binary  and  decimal  computers  are 
not  significantly  different,  for  many  purposes,  but  in  a sense  we’ve  been  asking 
a binary  computer  to  operate  with  one  hand  tied  behind  its  back. 

The  amazing  ability  of  Os  and  Is  to  encode  information  as  well  as  to  encode 
the  logical  relations  between  items,  and  even  to  encode  algorithms  for  processing 
information,  makes  the  study  of  binary  digits  especially  rich.  Indeed,  we  not  only 
use  bitwise  operations  to  enhance  combinatorial  algorithms,  we  also  find  that  the 
properties  of  binary  logic  lead  naturally  to  new  combinatorial  problems  that  are 
of  great  interest  in  their  own  right. 

Computer  scientists  have  gradually  become  better  and  better  at  taming  the 
wild  Os  and  Is  of  the  universe  and  making  them  do  useful  tricks.  But  as  bit 
players  on  the  world’s  stage,  we’d  better  have  a thorough  understanding  of  the 
low-level  properties  of  binary  quantities  before  we  launch  into  a study  of  higher- 
level  concepts  and  techniques.  Therefore  we  shall  start  by  investigating  basic 
ways  to  combine  individual  bits  and  sequences  of  bits. 

7.1.1.  Boolean  Basics 

There  are  16  possible  functions  f(x,y)  that  transform  two  given  bits  x and  y 
into  a third  bit  z = /(x,  y),  since  there  are  two  choices  for  each  of  /( 0, 0),  /( 0, 1), 
/(1,0),  and  /( 1, 1).  Table  1 indicates  the  names  and  notations  that  have  tradi- 
tionally been  associated  with  these  functions  in  studies  of  formal  logic,  assuming 
that  1 corresponds  to  “true”  and  0 to  “false.”  The  sequence  of  four  values 
/(0,0)/(0, 1)/(1,  0)/(l,  1)  is  customarily  called  the  truth  table  of  the  function  /. 
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Let  us  conceive,  then,  of  an  Algebra 
in  which  the  symbols  x,  y,  z,  Szc.  admit  indifferently  of 
the  values  0 and  1,  and  of  these  values  alone. 
— GEORGE  BOOLE,  An  Investigation  of  the  Laws  of  Thought  (1854) 

‘Contrariwise,’  continued  Tweedledee,  ‘if  it  was  so,  it  might  be; 

and  if  it  were  so,  it  would  be; 
but  as  it  isn’t,  it  ain’t.  That's  logic.' 
— LEWIS  CARROLL,  Through  the  Looking  Glass  (1871) 

Such  functions  are  often  called  “Boolean  operations”  in  honor  of  George 
Boole,  who  first  discovered  that  algebraic  operations  on  Os  and  Is  could  be  used 
to  construct  a calculus  for  logical  reasoning  [The  Mathematical  Analysis  of  Logic 
(Cambridge:  1847);  An  Investigation  of  the  Laws  of  Thought  (London:  1854)]. 
But  Boole  never  actually  dealt  with  the  “logical  or”  operation  V;  he  confined 
himself  strictly  to  ordinary  arithmetic  operations  on  Os  and  Is.  Thus  he  would 
write  x + y to  stand  for  disjunction,  but  he  took  pains  never  to  use  this  notation 
unless  x and  y were  mutually  exclusive  (not  both  1).  If  necessary,  he  wrote 
x + (1  ~x)y  to  ensure  that  the  result  of  a disjunction  would  never  be  equal  to  2. 

When  rendering  the  + operation  in  English,  Boole  sometimes  called  it  “and,” 
sometimes  “or.”  This  practice  may  seem  strange  to  modern  mathematicians  until 
we  realize  that  his  usage  was  in  fact  normal  English;  we  say,  for  example,  that 
“boys  and  girls  are  children,”  but  “children  are  boys  or  girls.” 

Boole’s  calculus  was  extended  to  include  the  unconventional  rule  x + x - x 
by  W.  Stanley  Jevons  [Pure  Logic  (London:  Edward  Stanford,  1864),  §69],  who 
pointed  out  that  (x  + y)z  was  equal  to  xz  + yz  using  his  new  + operation.  But 
Jevons  did  not  know  the  other  distributive  law  xy+z  = (x+z)(y+z).  Presumably 
he  missed  this  because  of  the  notation  he  was  using,  since  the  second  distributive 
law  has  no  familiar  counterpart  in  arithmetic;  the  more  symmetrical  notations 
x A y,  x V y in  Table  1 make  it  easier  for  us  to  remember  both  distributive  laws 

(xVy)Az  = (x  A z)  V (y  A z);  (i) 

(xA  y)Vz  — (x  V z)  A (y  V z).  (2) 

The  second  law  (2)  was  introduced  by  C.  S.  Peirce,  who  had  discovered  indepen- 
dently how  to  extend  Boole’s  calculus  [Proc.  Amer.  Acad.  Arts  and  Sciences  7 
(1867),  250-261].  Incidentally,  when  Peirce  discussed  these  early  developments 
several  years  later  [Amer.  J.  Math.  3 (1880),  32],  he  referred  to  “the  Boolian 
algebra,  with  Jevons’s  addition”;  his  now-unfamiliar  spelling  of  “Boolean”  was 
in  use  for  many  years,  appearing  in  the  Funk  and  Wagnalls  unabridged  dictionary 
as  late  as  1963. 

The  notion  of  truth- value  combination  is  actually  much  older  than  Boolean 
algebra.  Indeed,  propositional  logic  had  been  developed  by  Greek  philosophers 
already  in  the  fourth  century  B.C.  There  was  considerable  debate  in  those  days 
about  how  to  assign  an  appropriate  true-or-false  value  to  the  proposition  “if  x 
then  y ” when  x and  y are  propositions;  Philo  of  Megara,  about  300  B.C.,  defined 
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Truth 

table 

New  and  old  notation (s)  Operator 
v ’ symbol  o 

Name(s) 

oooo 

0 

1 

Contradiction;  falsehood;  antilogy;  constant  0 

0001 

xy,  xAy,  x Si  y 

A 

Conjunction;  and 

0010 

xAy,  xj)y,  [x>y],  x-y 

3 

Nonimplication;  difference;  but  not 

0011 

X 

L 

Left  projection;  first  dictator 

0100 

xAy,  x<£y,  [x<y],  y-x 

C 

Converse  nonimplication;  not  . . . but 

0101 

y 

R 

Right  projection;  second  dictator 

0110 

x®y,  x^y,  x~y 

© 

Exclusive  disjunction;  nonequivalence;  “xor” 

0111 

x V y,  x | y 

V 

(Inclusive)  disjunction;  or;  and/or 

1000 

x Ay,  xWy,  xVy,  xly 

V 

Nondisjunction;  joint  denial;  neither  . . . nor 

1001 

x = y,  x-^y,  x<&y 

= 

Equivalence;  if  and  only  if;  “iff” 

1010 

y,  -u/,  \y,  ~y 

R 

Right  complementation 

1011 

xVy,  xCy,  x<=y,  [x  > y],  xy 

C 

Converse  implication;  if 

1100 

x,  -ix,  !x,  ~x 

L 

Left  complementation 

1101 

x V y,  xDy,  x =>■  y,  [x<y],  yx 

3 

Implication;  only  if;  if  . . . then 

1110 

x V y,  x Ay,  x Ay,  x\y 

A 

Nonconjunction;  not  both  . . . and;  “nand” 

ini 

1 

T 

Affirmation;  validity;  tautology;  constant  1 

it 

by  the  truth  table  shown  in 

Table 

■ 1,  which  states  in  particular  that  the 

implication  is  true  when  both  x and  y are  false.  Much  of  this  early  work  has  been 
lost,  but  there  are  passages  in  the  works  of  Galen  (2nd  century  A.D.)  that  refer 
to  both  inclusive  and  exclusive  disjunction  of  propositions.  [See  I.  M.  Bochenski, 
Formale  Logik  (1956),  English  translation  by  Ivo  Thomas  (1961),  for  an  excellent 
survey  of  the  development  of  logic  from  ancient  times  up  to  the  20th  century.] 

A function  of  two  variables  is  often  written  xoy  instead  of  f(x , y),  using  some 
appropriate  operator  symbol  o.  Table  1 shows  the  sixteen  operator  symbols  that 
we  shall  adopt  for  Boolean  functions  of  two  variables;  for  example,  _L  symbolizes 
the  function  whose  truth  table  is  0000,  A is  the  symbol  for  0001,  D is  the  symbol 
for  0010,  and  so  on.  We  have  x _L  y = 0,  x A y = xy,  xD  y — x — y,  x \_y  = x, 
...,xAy  = xVy,xTy=l. 

Of  course  the  operations  in  Table  1 aren’t  all  of  equal  importance.  For 
example,  the  first  and  last  cases  are  trivial,  since  they  have  a constant  value 
independent  of  x and  y.  Four  of  them  are  functions  of  x alone  or  y alone.  We 
write  x for  1 — x,  the  complement  of  x. 

The  four  operations  whose  truth  table  contains  just  a single  1 are  easily 
expressed  in  terms  of  the  AND  operator  A,  namely  x A y,  x Ay,  x Ay,  x Ay. 
Those  with  three  Is  are  easily  written  in  terms  of  the  OR  operator  V,  namely 
x\/y,x\/y,x\Jy,x\Jy.  The  basic  functions  x Ay  and  x V y have  proved  to  be 
more  useful  in  practice  than  their  complemented  or  half-complemented  cousins, 
although  the  NOR  and  NAND  operations  x V y — x A y and  x A y — x V y are  also 
of  interest  because  they  are  easily  implemented  in  transistor  circuits. 
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In  1913,  H.  M.  Sheffer  showed  that  all  16  of  the  functions  can  be  expressed 
in  terms  of  just  one,  starting  with  either  V or  A as  the  given  operation  (see 
exercise  4).  Actually  C.  S.  Peirce  had  made  the  same  discovery  about  1880,  but 
his  work  on  the  subject  remained  unpublished  until  after  his  death  [Collected 
Papers  of  Charles  Sanders  Peirce  4 (1933),  §§12-20,  264],  Table  1 indicates  that 
NAND  and  NOR  have  occasionally  been  written  x \ y and  x \.y\  sometimes  they 
have  been  called  “Sheffer’s  stroke”  and  the  “Peirce  arrow.”  Nowadays  it  is  best 
not  to  use  Sheffer’s  vertical  line  for  NAND,  because  x | y denotes  bitwise  x V y in 
programming  languages  like  C. 

So  far  we  have  discussed  all  but  two  of  the  functions  in  Table  1.  The 
remaining  two  are  x = y and  x ® y,  “equivalence”  and  “exclusive-or,”  which 
are  related  by  the  identities 


x — V — x ® y = x ® y = 1 © x © «/;  (3) 

x®y  = x = y = x = y = 0 = x = y.  (4) 

Both  operations  are  associative  (see  exercise  6).  In  propositional  logic,  the  notion 

of  equivalence  is  more  important  than  the  notion  of  exclusive-or,  which  means 

inequivalence;  but  when  we  consider  bitwise  operations  on  full  computer  words, 
we  shall  see  in  Section  7.1.3  that  the  situation  is  reversed:  Exclusive-or  turns 
out  to  be  more  useful  than  equivalence,  in  typical  programs.  The  chief  reason 
why  x © y has  significant  applications,  even  in  the  one-bit  case,  is  the  fact  that 

x®y  — (x  + y)  mod  2.  (5) 

Therefore  x®y  and  xAy  denote  addition  and  multiplication  in  the  field  of  two  el- 
ements (see  Section  4.6),  and  x@y  naturally  inherits  many  “clean”  mathematical 
properties. 

Basic  identities.  Now  let’s  take  a look  at  interactions  between  the  fundamental 
operators  A,  V,  ®,  and  , since  the  other  operations  are  easily  expressed  in  terms 
of  these  four.  Each  of  A,  V,  © is  associative  and  commutative.  Besides  the 
distributive  laws  (1)  and  (2),  we  also  have 

(x®y)Az  = [x  A 2)®  (y  A z),  (6) 

as  well  as  the  absorption  laws 

(x  A y)  V x = (xVy)Ax  = x.  (7) 

One  of  the  simplest,  yet  most  useful,  identities  is 


a;©*  = 0,  (8) 

since  it  implies  among  other  things  that 

[x®y)®x  = y,  (x®y)®y  = x,  (9) 

when  we  use  the  obvious  fact  that  x © 0 = x.  In  other  words,  given  x © y and 
either  x or  y,  it  is  easy  to  determine  the  other.  And  let  us  not  overlook  the 
simple  complementation  law 

x = x 0 1.  (10) 
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Another  important  pair  of  identities  is  known  as  De  Morgan’s  laws  in  honor 
of  Augustus  De  Morgan,  who  stated  that  “The  contrary  of  an  aggregate  is  the 
compound  of  the  contraries  of  the  aggregants;  the  contrary  of  a compound  is 
the  aggregate  of  the  contraries  of  the  components.  Thus  ( A,B ) and  AB  have 
ab  and  (a,b)  for  contraries.”  [Ihans.  Cambridge  Philos.  Soc.  10  (1858),  208.]  In 
more  modern  notation,  these  are  the  rules  we  have  implicitly  derived  via  truth 
tables  in  connection  with  the  operations  NAND  and  NOR  in  Table  1,  namely 

x A y = xVy;  (n) 

xVy  = x Ay.  (12) 

Incidentally,  W.  S.  Jevons  knew  (12)  but  not  (11);  he  consistently  wrote  AB  + 
BA  + AB  instead  of  A + B for  the  complement  of  AB.  Yet  De  Morgan  was 
not  the  first  Englishman  who  enunciated  the  laws  above.  Both  (11)  and  (12) 
can  be  found  in  the  early  14th  century  writings  of  two  scholastic  philosophers, 
William  of  Ockham  [Sunima  Logics;  2 (1323)]  and  Walter  Burley  [De  Puritate 
Artis  Logics  (c.  1330)]. 

De  Morgan’s  laws  and  a few  other  identities  can  be  used  to  express  A,  V, 
and  © in  terms  of  each  other: 

xAy  = xVy  = x®y®(xVy);  (13) 

xVy  = xAy  = x ® y ® (x  A y);  (14) 

= (xVy)  Ax  Ay  = (x  A y)  V (x  A y).  (15) 

According  to  exercise  7.1.2-77,  all  computations  of  xi  © x2  © • • • © xn  that  use 
only  the  operations  A,  V,  and  “ must  be  at  least  4(n  - 1)  steps  long;  thus,  the 
other  three  operations  are  not  an  especially  good  substitute  for  ©. 

Functions  of  n variables.  A Boolean  function  f(x,  y.  z)  of  three  Boolean  vari- 
ables x,  y , 2 can  be  defined  by  its  8-bit  truth  table  /( 0, 0, 0)/(0, 0, 1) . . . /( 1, 1, 1); 
and  in  general,  every  n-ary  Boolean  function  f(x  1, . . . ,xn)  corresponds  to  a 2n- 
bit  truth  table  that  lists  the  successive  values  of  /( 0, . . . , 0, 0),  /( 0,  ...,0  1) 
/(0 1,0),  ...,  /(1 1,1). 

We  needn’t  devise  special  names  and  notations  for  all  these  functions,  since 
they  can  all  be  expressed  in  terms  of  the  binary  functions  that  we’ve  already 
learned.  For  example,  as  observed  by  I.  I.  Zhegalkin  [Matematicheskii  Sbornik 
35  (1928),  311-369],  we  can  always  write 

f (x  1 , . . . , xn)  y(x  1 , . . . , xn—i ) © h(x\ , . . . , xn—i ) A xn 

when  n > 0,  for  appropriate  functions  g and  h,  by  letting 

g(x  il,..,a;n_1)  = /(xj,...,  £„_!,()); 

h{x  1, ... , xn_i ) = f(x  1, . . . , xn_l5  0)  © f(xi , . . . , xn—i,  1). 

(The  operation  A conventionally  takes  precedence  over  ©,  so  we  need  not  use 
parentheses  to  enclose  the  subformula  ‘h(x  1, . . . ,xn_x)  A xn ’ on  the  right-hand 
side  of  (16).)  Repeating  this  process  recursively  on  g and  h until  we’re  down  to 


(16) 

(17) 
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0-ary  functions  leaves  us  with  an  expression  that  involves  only  the  operators  ®, 
A,  and  a sequence  of  2"  constants.  Furthermore,  those  constants  can  usually  be 
simplified  away,  because  we  have 

a;  A 0 = 0 and  xAUi®0  = i.  (18) 

After  applying  the  associative  and  distributive  laws,  we  end  up  needing  the 
constant  0 only  if  f(x1,...,xn)  is  identically  zero,  and  the  constant  1 only  if 

/(0 0)  = 1. 

We  might  have,  for  instance, 

f(x,y,z)  = ((l®0Aa;)®  (0®  lAx)A  y)  ffi  ((0  ® 1 Ax)  ® (1  ® 1 Aar)  Ay)  Az 
= (l®a;Aj/)  ® (x  ® y ® xAy)Az 
= 1 ® xAy  ® xAz  ® yAz  ® xAyAz. 

And  by  rule  (5),  we  see  that  we’re  simply  left  with  the  polynomial 

f(x,V,z)  = (1  + xy  + xz  + yz  + xyz)  mod  2,  (ig) 

because  xAy  = xy.  Notice  that  this  polynomial  is  linear  (of  degree  < 1)  in  each  of 
its  variables.  In  general,  a similar  calculation  will  show  that  any  Boolean  function 
f(x  1, . . . , x„)  has  a unique  representation  such  as  this,  called  its  multilinear  rep- 
resentation or  exclusive  normal  form,  which  is  a sum  (modulo  2)  of  zero  or  more 
of  the  2n  possible  terms  1,  aq,  x2,  aqaq,  x3,  aqaq,  x2x3,  xxx2x3,  . . . , aqaq  . . . xn. 

George  Boole  decomposed  Boolean  functions  in  a different  way,  which  is 
often  simpler  for  the  kinds  of  functions  that  arise  in  practice.  Instead  of  (16),  he 
essentially  wrote 

f(x  i,...,xn)  = (g(xi, . . . , xn-i)  A V (/i(xi,...,a:n_1)  Arn)  (20) 
and  called  it  the  “law  of  development,”  where  we  now  have  simply 

g(x\i . . . ,xn—\)  — /(aq , . . . , xn_ i,0), 

/i(aq, . . . , xn_i)  = f(xi, . . . , xn_j,  1),  ^ ^ 

instead  of  (17).  Repeatedly  iterating  Boole’s  procedure,  using  the  distributive 
law  (1),  and  eliminating  constants,  leaves  us  with  a formula  that  is  a disjunc- 
tion of  zero  or  more  minterms,  where  each  minterm  is  a conjunction  such  as 
xtAx2Ax3Ax4A x5  in  which  every  variable  or  its  complement  is  present.  Notice 
that  a minterm  is  a Boolean  function  that  is  true  at  exactly  one  point. 

For  example,  let’s  consider  the  more-or-less  random  function  f(w,x,y,z) 
whose  truth  table  is 

1100100100001111.  (22) 

When  this  function  is  expanded  by  repeatedly  applying  Boole’s  law  (20),  we  get 
a disjunction  of  eight  minterms,  one  for  each  of  the  Is  in  the  truth  table: 

f(w,x,  y,  z)  = (wAxAyAz)  V (wAxAyAz)  V (wAxAyAz)  V (wAxAyAz) 

V (wAxAyAz)  V (wAxAyAz)  V (wAxAyAz)  V (wAxAyAz).  (23) 
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In  general,  a disjunction  of  minterms  is  called  a full  disjunctive  normal 
form.  Every  Boolean  function  can  be  expressed  in  this  way,  and  the  result 
is  unique  — except,  of  course,  for  the  order  of  the  minterms.  Nitpick:  A special 
case  arises  when  f(x  1, . . . , xn)  is  identically  zero.  We  consider  ‘O’  to  be  an  empty 
disjunction,  with  no  terms,  and  we  also  consider  ‘1’  to  be  an  empty  conjunction, 
for  the  same  reasons  as  we  defined  Yl°k=i  “fe  = 0 and  n!t=i  ak  — 1 in  Section  1.2.3. 

C.  S.  Peirce  observed,  in  Amer.  J.  Math.  3 (1880),  37-39,  that  every  Boolean 
function  also  has  a full  conjunctive  normal  form,  which  is  a conjunction  of  “min- 
clauses”  like  xi  V Xi  V x%  V X4  V X5.  A minclause  is  0 at  only  one  point;  so  each 
clause  in  such  a conjunction  accounts  for  a place  where  the  truth  table  has  a 0. 
For  example,  the  full  conjunctive  normal  form  of  our  function  in  (22)  and  (23)  is 

f(w,x,y,z ) = (wVxVyVz)  A (wVxVyVz)  A (wVxVyVz)  A (wVxVyVz) 

A (wVxVyVz)  A (wVa:VyVz)  A (wVxVyW z)  A (wVxVyVz).  (24) 

Not  surprisingly,  however,  we  often  want  to  work  with  disjunctions  and  con- 
junctions that  don't  necessarily  involve  full  minterms  or  minclauses.  Therefore, 
following  nomenclature  introduced  by  Paul  Bernays  in  his  Habilitationsschrift 
(1918),  we  speak  in  general  of  a disjunctive  normal  form  or  “DNF”  as  any 
disjunction  of  conjunctions, 

m Sj 

V A utk  — (un  A • • • A uiSl)  V • • • V (uml  A • • • A umSrn),  (25) 

j=l  fc=l 

where  each  Ujk  is  a literal,  namely  a variable  xt  or  its  complement.  Similarly, 
any  conjunction  of  disjunctions  of  literals, 

m 

A V uik  = (Ml1  v • ' • v ui«i)  A • ■ • A (um  1 V • • • V umSm ),  (26) 

j= 1 fc=i 

is  called  a conjunctive  normal  form,  or  “CNF”  for  short. 

A great  many  electrical  circuits  embedded  inside  today’s  computer  chips  are 
composed  of  “programmable  logic  arrays”  (PL As),  which  are  ORs  of  ANDs  of 
possibly  complemented  input  signals.  In  other  words,  a PLA  basically  computes 
one  or  more  disjunctive  normal  forms.  Such  building  blocks  are  fast,  versatile, 
and  relatively  inexpensive;  and  indeed,  DNFs  have  played  a prominent  role  in 
electrical  engineering  ever  since  the  1950s,  when  switching  circuits  were  imple- 
mented with  comparatively  old-fashioned  devices  like  relays  or  vacuum  tubes. 
Therefore  people  have  long  been  interested  in  finding  the  simplest  DNFs  for 
classes  of  Boolean  functions,  and  we  can  expect  that  an  understanding  of  disjunc- 
tive normal  forms  will  continue  to  be  important  as  technology  continues  to  evolve. 

The  terms  of  a DNF  are  often  called  implicants,  because  the  truth  of  any 
term  in  a disjunction  implies  the  truth  of  the  whole  formula.  In  a formula  like 

f(x,y,z)  — (x  A y A z)  V (y  A z)  V (x  A y A z), 

for  example,  we  know  that  / is  true  when  xAyAz  is  true,  namely  when  (x,  y,  z)  = 
(1,  0, 1).  But  notice  that  in  this  example  the  shorter  term  x Az  also  turns  out  to 
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be  an  implicant  of  /,  even  though  not  written  explicitly,  because  the  additional 
term  y Az  makes  the  function  true  whenever  x = z = 1,  regardless  of  the  value 
of  y.  Similarly,  x Ay  is  an  implicant  of  this  particular  function.  So  we  might  as 
well  work  with  the  simpler  formula 

f(x,y,z)  = (iAz)V(i/A2)V(iA!/).  (27) 

At  this  point  no  more  deletions  are  possible  within  the  implicants,  because 
neither  x nor  y nor  z nor  x is  a strong  enough  condition  to  imply  the  truth  of  /. 

An  implicant  that  can’t  be  factored  further  by  removing  any  of  its  literals 
without  making  it  too  weak  is  called  a prime  implicant,  following  the  terminology 
of  W.  V.  Quine  in  AMM  59  (1952),  521-531. 

These  basic  concepts  can  perhaps  be  understood  most  easily  if  we  simplify 
the  notation  and  adopt  a more  geometric  viewpoint.  We  can  write  simply  ‘/(x)’ 
instead  of  f(x1, . . . , xn),  and  regard  a:  as  a vector,  or  as  a binary  string  x1  . . . xn 
of  length  n.  For  example,  the  strings  wxyz  where  the  function  of  (22)  is  true  are 

{0000,  0001,  0100,  0111,  1100,  1101,  1110,  1111},  (28) 

and  we  can  think  of  them  as  eight  points  in  the  4-dimensional  hypercube  2 x 
2x2x2.  The  eight  points  in  (28)  correspond  to  the  minterm  implicants  that 
are  explicitly  present  in  the  full  disjunctive  normal  form  (23);  but  none  of  those 
implicants  is  actually  prime.  For  example,  the  first  two  points  of  (28)  make  the 
subcube  000*,  and  the  last  four  points  constitute  the  subcube  11**,  if  we  use 
asterisks  to  denote  wild  cards  as  we  did  when  discussing  database  queries  in 
Section  6.5;  therefore  w A x A y is  an  implicant  of  /,  and  so  is  w A x.  Similarly, 
we  can  see  that  the  subcube  0*00  accounts  for  two  of  the  eight  points  in  (28), 
making  w A y A z an  implicant. 

In  general,  each  prime  implicant  corresponds  in  this  way  to  a maximal 
subcube  that  stays  within  the  set  of  points  that  make  / true.  (The  subcube 
is  maximal  in  the  sense  that  it  isn’t  contained  in  any  larger  subcube  with  the 
same  property;  we  can’t  replace  any  of  its  explicit  bits  by  an  asterisk.  A maximal 
subcube  has  a maximal  number  of  asterisks,  hence  a minimal  number  of  con- 
strained coordinates,  hence  a minimal  number  of  variables  in  the  corresponding 
implicant.)  The  maximal  subcubes  of  the  eight  points  in  (28)  are 

000*,  0*00,  *100,  *111,  11**;  (29) 

so  the  prime  implicants  of  the  function  f(w,x,y,z)  in  (23)  are 

(w  A x A y)  V (w  A y A z)  V (x  A y A z)  V (x  A y A z)  V (w  A x).  (30) 

The  disjunctive  prime  form  of  a Boolean  function  is  the  disjunction  of  all  its 
prime  implicants.  Exercise  30  contains  an  algorithm  to  find  all  the  prime  impli- 
cants of  a given  function,  based  on  a list  of  the  points  where  the  function  is  true. 

We  can  define  a prime  clause  in  an  exactly  similar  way:  It  is  a disjunctive 
clause  that  is  implied  by  /,  having  no  subclause  with  the  same  property.  And 
the  conjunctive  prime  form  of  / is  the  conjunction  of  all  its  prime  clauses.  (An 
example  appears  in  exercise  19.) 
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Iii  many  simple  cases,  the  disjunctive  prime  form  is  the  shortest  possible 
disjunctive  normal  form  that  a function  can  have.  But  we  can  often  do  better, 
because  we  might  be  able  to  cover  all  the  necessary  points  with  only  a few  of 
the  maximal  subcubes.  For  example,  the  prime  implicant  ( y A z)  is  unnecessary 
in  (27).  And  in  expression  (30)  we  don’t  need  both  (w  Ay  Az)  and  (x  A y A z); 
either  one  is  sufficient,  in  the  presence  of  the  other  terms. 

Unfortunately,  we  will  see  in  Section  7.9  that  the  task  of  finding  a shortest 
disjunctive  normal  form  is  NP-hard,  thus  quite  difficult  in  general.  But  many 
useful  shortcuts  have  been  developed  for  sufficiently  small  problems,  and  they 
are  well  explained  in  the  book  Introduction  to  the  Theory  of  Switching  Circuits 
by  E.  J.  McCluskey  (New  York:  McGraw-Hill,  1965).  For  later  developments, 
see  Petr  Fiser  and  Jan  Hlavicka,  Computing  and  Informatics  22  (2003),  19-51. 

There’s  an  important  special  case  for  which  the  shortest  DNF  is,  however, 
easily  characterized.  A Boolean  function  is  said  to  be  monotone  or  positive  if 
its  value  does  not  change  from  1 to  0 when  any  of  its  variables  changes  from  0 
to  1.  In  other  words,  / is  monotone  if  and  only  if  f(x)  < f(y)  whenever  x C y, 
where  the  bit  string  x = X\  . . . xn  is  regarded  as  contained  in  or  equal  to  the  bit 
string  y = yi  ■■■  yn  if  and  only  if  Xj  < yj  for  all  j.  An  equivalent  condition  (see 
exercise  21)  is  that  the  function  / either  is  constant  or  can  be  expressed  entirely 
in  terms  of  A and  V,  without  complementation. 

Theorem  Q.  The  shortest  disjunctive  normal  form  of  a monotone  Boolean 
function  is  its  disjunctive  prime  form. 

Proof.  [W.  V.  Quine,  Boletfn  de  la  Sociedad  Matematica  Mexicana  10  (1953), 
64-70.]  Let  f(x i,...,xn)  be  monotone,  and  let  u\  A • • • A us  be  one  of  its  prime 
implicants.  We  cannot  have,  say,  ti\  — ah,  because  in  that  case  the  shorter  term 
U2  A • • • A us  would  also  be  an  implicant,  by  monotonicity.  Therefore  no  prime 
implicant  has  a complemented  literal. 

Now  if  we  set  iq  «—•••«—  us  <—  1 and  all  other  variables  to  0,  the  value  of  / 
will  be  1,  but  all  of  /’ s other  prime  implicants  will  vanish.  Thus  u±  A ■■■  A u3 
must  be  in  every  shortest  DNF,  because  every  implicant  of  a shortest  DNF  is 
clearly  prime.  | 

Corollary  Q.  A disjunctive  normal  form  is  the  disjunctive  prime  form  of  a 
monotone  Boolean  function  if  and  only  if  it  has  no  complemented  literals  and 
none  of  its  implicants  is  contained  in  another.  | 

Satisfiability.  A Boolean  function  is  said  to  be  satisfiable  if  it  is  not  identically 
zero  — that  is,  if  it  has  at  least  one  implicant.  The  most  famous  unsolved  problem 
in  all  of  computer  science  is  to  find  an  efficient  way  to  decide  whether  a given 
Boolean  function  is  satisfiable  or  unsatisfiable.  More  precisely,  we  ask:  Is  there  an 
algorithm  that  inputs  a Boolean  formula  of  length  N and  tests  it  for  satisfiability, 
always  giving  the  correct  answer  after  performing  at  most  N°^  steps? 

When  you  hear  about  this  problem  for  the  first  time,  you  might  be  tempted 
to  ask  a question  of  your  own  in  return:  “What?  Are  you  serious  that  computer 
scientists  still  haven’t  figured  out  how  to  do  such  a simple  thing?” 
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Well,  if  you  think  satisfiability  testing  is  trivial,  please  tell  us  your  method. 
We  agree  that  the  problem  isn’t  always  difficult;  if,  for  example,  the  given  formula 
involves  only  30  Boolean  variables,  a brute-force  trial  of  230  cases  — that’s  about 
a billion  will  indeed  settle  the  matter.  But  an  enormous  number  of  practical 
problems  that  still  await  solution  can  be  formulated  as  Boolean  functions  with, 
say,  100  variables,  because  mathematical  logic  is  a very  powerful  way  to  express 
concepts.  And  the  solutions  to  those  problems  correspond  to  the  vectors  x — 
x\  . . .2:100  for  which  f(x ) = 1.  So  a truly  efficient  solution  to  the  satisfiability 
problem  would  be  a wonderful  achievement. 

There  is  at  least  one  sense  in  which  satisfiability  testing  is  a no-brainer:  If 
the  function  f(x i,...,xn)  has  been  chosen  at  random,  so  that  all  2"-bit  truth 
tables  are  equally  likely,  then  / is  almost  surely  satisfiable,  and  we  can  find  an  x 
with  /( x)  = l after  making  fewer  than  2 trials  (on  the  average).  It’s  like  flipping 
a com  until  it  comes  up  heads;  we  rarely  need  to  wait  long.  But  the  catch,  of 
course,  is  that  practical  problems  do  not  have  random  truth  tables. 

Okay,  let  s grant  that  satisfiability  testing  does  seem  to  be  tough,  in  general. 
In  fact,  satisfiability  turns  out  to  be  difficult  even  when  we  try  to  simplify  it  by 
requiring  that  the  Boolean  function  be  presented  as  a “formula  in  3CNF”  — 
namely  as  a conjunctive  normal  form  that  has  only  three  literals  in  each  clause: 

f(x  1>  ■ • • > xn)  ~ (^1  v V 1>i)  A (t2  v U2  V V2)  A • • • A ( tm  V um  V vm ).  (31) 

Here  each  tj , uj,  and  vj  is  Xk  or  x^,  for  some  k.  The  problem  of  deciding 
satisfiability  for  formulas  in  3CNF  is  called  “3SAT,”  and  exercise  39  explains 
why  it  is  not  really  easier  than  satisfiability  in  general. 

We  will  be  seeing  many  examples  of  hard-to-crack  3SAT  problems,  especially 
m Section  7.9,  where  satisfiability  testing  will  be  discussed  in  great  detail.  The 
situation  is  a little  peculiar,  however,  because  a formula  needs  to  be  fairly  long 
before  we  need  to  think  twice  about  its  satisfiability.  For  example,  the  shortest 
unsatisfiable  formula  in  3CNF  is  (x  V x V x)  A (x  V x V x);  but  it  is  obviously 
no  challenge  to  the  intellect.  We  don’t  get  into  rough  waters  unless  the  three 
literals  tj , Uj , vj  of  a clause  correspond  to  three  different  variables.  And  in 
that  case,  each  clause  rules  out  exactly  1/8  of  the  possibilities,  because  seven 
different  settings  of  (tj,Uj,Vj)  will  make  it  true.  Consequently  every  such  3CNF 
with  at  most  seven  clauses  is  automatically  satisfiable,  and  a random  setting  of 
its  variables  will  succeed  with  probability  > 1 — 7/8  = 1/8. 

The  shortest  interesting  formula  in  3CNF  therefore  has  at  least  eight  clauses. 
And  in  fact,  an  interesting  8-clause  formula  does  exist,  based  on  the  associative 
block  design  by  R.  L.  Rivest  that  we  considered  in  6.5-(i3): 

(x2  Vx3Vx4)  A (aq Vx3 Vx4)  A (xxVx2Vx4)  A (xxVx2Vx3) 

A (x2Vx3  Vx4)  A (xx  Vx3Vx4)  A (xxVx2Vx4)  A (xxVx2Vx3).  (32) 

Any  seven  of  these  eight  clauses  are  satisfiable,  in  exactly  two  ways,  and  they 
force  the  values  of  three  variables;  for  example,  the  first  seven  imply  that  we  have 
xix2x3  = 001.  But  the  complete  set  of  eight  cannot  be  satisfied  simultaneously. 
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Simple  special  cases.  Two  important  classes  of  Boolean  formulas  have  been 
identified  for  which  the  satisfiability  problem  does  turn  out  to  be  pretty  easy. 
These  special  cases  arise  when  the  conjunctive  normal  form  being  tested  consists 
entirely  of  “Horn  clauses”  or  entirely  of  “Krom  clauses.”  A Horn  clause  is  an 
OR  of  literals  in  which  all  or  nearly  all  of  the  literals  are  complemented  — at 
most  one  of  its  literals  is  a pure,  unbarred  variable.  A Krom  clause  is  an  OR  of 
exactly  two  literals.  Thus,  for  example, 

iVy,  wVj/Vz,  MViVwViVi/Vz,  and  x 

are  examples  of  Horn  clauses;  and 

1V1,  iVi,  x V y,  x V y,  x V y,  and  x V y 

are  examples  of  Krom  clauses,  only  the  last  of  which  is  not  also  a Horn  clause. 
(The  first  example  qualifies  because  x\l  x = x.)  Notice  that  a Horn  clause 
is  allowed  to  contain  any  number  of  literals,  but  when  we  restrict  ourselves  to 
Krom  clauses  we  are  essentially  considering  the  2SAT  problem.  In  both  cases 
we  will  see  that  satisfiability  can  be  decided  in  linear  time  — that  is,  by  carrying 
out  only  O(N)  simple  steps,  when  given  a formula  of  length  N. 

Let’s  consider  Horn  clauses  first.  Why  are  they  so  easy  to  handle?  The 
main  reason  is  that  a clause  like  MViVtiViV  can  be  recast  in  the  form 
->(u  /\vAwAx/\y)\/z,  which  is  the  same  as 

uAvAwAxAy  =>■  2. 

In  other  words,  if  u,  v,  w,  x,  and  y are  all  true,  then  z must  also  be  true.  For 
this  reason,  parameterized  Horn  clauses  were  chosen  to  be  the  basic  underlying 
mechanism  of  the  programming  language  called  Prolog.  Furthermore  there  is 
an  easy  way  to  characterize  exactly  which  Boolean  functions  can  be  represented 
entirely  with  Horn  clauses: 

Theorem  H.  The  Boolean  function  f (xi , . . . , xn)  is  expressible  as  a conjunction 
of  Horn  clauses  if  and  only  if 

f(xi, . . . ,xn)  = / (2/1 , . . . ,yn)  = 1 implies  f(xi  A 2/1, . . . , xn  A yn)  = 1 (33) 
for  all  Boolean  values  Xj  and  yj. 

Proof.  [Alfred  Horn,  J.  Symbolic  Logic  16  (1951),  14-21,  Lemma  7.]  If  we  have 
Xq  V Xi  V • ■ • V Xk  — 1 and  2/0  V j/i  V • • • V yk  = 1,  then 

(xo  A yo)  V xi  A 2/1  V • ■ • V xk  A 2/fc 

= (x0  V X!  V 2/1  V • • ■ V xk  V yk)  A (y0  V Si  V yx  V ■ • • V xk  V yk) 

> (x0  V X]_  V • ■ • V xk)  A (2/0  V 2/1  V • • • V 2/fc)  = 1; 

and  a similar  (but  simpler)  calculation  applies  when  the  unbarred  literals  Xo 
and  2/0  are  not  present.  Therefore  every  conjunction  of  Horn  clauses  satisfies  (33). 

Conversely,  condition  (33)  implies  that  every  prime  clause  of  / is  a Horn 
clause  (see  exercise  44).  | 
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Let’s  say  that  a Horn  function  is  a Boolean  function  that  satisfies  con- 
ition  (33),  and  let’s  also  call  it  definite  if  it  satisfies  the  further  condition 
/(l,  • ■ • , 1)  = 1-  It’s  easy  to  see  that  a conjunction  of  Horn  clauses  is  definite  if 
and  only  if  each  clause  has  exactly  one  unbarred  literal,  because  only  an  entirely 
negative  clause  like  xVy  will  fail  if  all  variables  are  true.  Definite  Horn  functions 
are  slightly  simpler  to  work  with  than  Horn  functions  in  general,  because  they 
are  obviously  always  satisfiable.  Thus,  by  Theorem  H,  they  have  a unique  least 
vector  x such  that  f(x)  = 1,  namely  the  bitwise  AND  of  all  vectors  that  satisfy 
all  clauses.  The  core  of  a definite  Horn  function  is  the  set  of  all  variables  Xj  that 
are  true  in  this  minimum  vector  z.  Notice  that  the  variables  in  the  core  must 
be  true  whenever  / is  true,  so  we  can  essentially  factor  them  out. 

Definite  Horn  functions  arise  in  many  ways,  for  example  in  the  analysis 
o games  (see  exercises  51  and  52).  Another  nice  example  comes  from  compiler 
technology.  Consider  the  following  typical  (but  simplified)  grammar  for  algebraic 
expressions  in  a programming  language: 

( expression ) ->  ( term ) | ( expression ) + ( term ) | ( expression ) - ( term ) 
term  ) -»  ( factor ) | - ( factor ) | ( term  ) * ( factor ) | ( term  ) / ( factor ) 

( factor ) — > ( variable ) | ( constant ) | ( ( expression ) ) 

( variable ) ->  ( letter ) | ( variable ) ( letter ) | ( variable )( digit ) (o  D 

( letter ) ->•  a | b | c 

( constant ) — » ( digit ) | ( constant ) ( digit ) 

( digit ) ->•  0 | 1 


For  example,  the  string  a/ (-b0-10)+cc*cc  meets  the  syntax  for  (expression) 
and  uses  each  of  the  grammatical  rules  at  least  once. 

Suppose  we  want  to  know  what  pairs  of  characters  can  appear  next  to  each 
other  in  such  expressions.  Definite  Horn  clauses  provide  the  answer,  because 
we  can  set  the  problem  up  as  follows:  Let  the  quantities  Xx,  xX,  and  xy  denote 
Boolean  “propositions,”  where  X is  one  of  the  symbols  {E,  T,  F,  V,  L,  C.  D}  standing 
respectively  for  ( expression ),  ( term ),...,( digit ),  and  where  x and  y are  sym- 
o s m the  set  a,  b,c,  0,1}.  The  proposition  Xx  means,  “X  can 

end  with  x ; similarly,  xX  means,  “X  can  start  with  x”;  and  xy  means,  “The 
character  x can  be  followed  immediately  by  y in  an  expression.”  (There  are 
7x  11  + 11  x 7+11  x 11  = 275  propositions  altogether.)  Then  we  can  write 


xT 

=k 

xE 

-T 

Tx 

Ex 

xF 

-x 

Ex 

x+ 

Tx 

X* 

xT 

+x 

xF 

=> 

*x 

Ex 

x- 

Tx 

x/ 

xT 

=t- 

-x 

xF 

/x 

xF 

xT 

xV 

xF 

Fx 

Tx 

Vx 

Fx 

xC  =>  xF 

Vx  A yL  =t-  xy 

Cx  Fx 

Vx  A yD  =>  xy 

=>  (F 

Dx  =>  Vx 

xE  =>■  (x 

=>  aL 

Ex  =£-  x) 

=f>  La 

=>  F) 

=>  bL 

xL  =>  xV 

=>  Lb 

Lx  Vx 

=>  cL 

=>  Lc 
xD  =>  xC 
Dx  =>■  Cx 
Cx  A yD  xy 
=>  0D 
=>■  DO 
=>  ID 
^ D1 


(35) 


where  x and  y run  through  the  eleven  terminal  symbols  {+,...,  l}.  This  sche- 
matic specification  gives  us  a total  of  24  x 11  + 3 x 11  x 11  + 13  x 1 = 640  definite 
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Horn  clauses,  which  we  could  write  out  formally  as 

(+TV+E)  A pf  V -E)  A • • • A (V+  V OL  V +0)  A • • • A (Dl) 

if  we  prefer  the  cryptic  notation  of  Boolean  algebra  to  the  =>  convention  of  (35). 

Why  did  we  do  this?  Because  the  core  of  all  these  clauses  is  the  set  of  all 
propositions  that  are  true  in  this  particular  grammar.  For  example,  one  can 
verify  that  -E  is  true,  hence  the  symbols  (-  can  occur  next  to  each  other  within 
an  expression;  but  the  symbol  pairs  ++  and  *-  cannot  (see  exercise  46). 

Furthermore,  we  can  find  the  core  of  any  given  set  of  definite  Horn  clauses 
without  great  difficulty.  We  just  start  out  with  the  propositions  that  appear 
alone,  on  the  right-hand  side  of  when  the  left-hand  side  is  empty;  thirteen 
clauses  of  that  kind  appear  in  (35).  And  once  we  assert  the  truth  of  those 
propositions,  we  might  find  one  or  more  clauses  whose  left-hand  sides  are  now 
known  to  be  true.  Hence  their  right-hand  sides  also  belong  to  the  core,  and 
we  can  keep  going  in  the  same  way.  The  whole  procedure  is  pretty  much  like 
letting  water  run  downhill  until  it  has  found  its  proper  level.  In  fact,  when 
we  choose  appropriate  data  structures,  this  downhill  process  goes  quite  fast, 
requiring  only  0(N+n)  steps,  when  N denotes  the  total  length  of  the  clauses  and 
n is  the  number  of  propositional  variables.  (We  assume  here  that  all  clauses  have 
been  expanded  out,  not  abbreviated  in  terms  of  parameters  like  x and  y above. 
More  sophisticated  techniques  of  theorem  proving  are  available  to  deal  with 
parameterized  clauses,  but  they  are  beyond  the  scope  of  our  present  discussion.) 

Algorithm  C ( Core  computation  for  definite  Horn  clauses).  Given  a set  P of 
propositional  variables  and  a set  C of  clauses,  each  having  the  form 

u\  A • • • A uk  =>■  v where  k > 0 and  {iq, ...,  Uk,v}  C P,  (36) 

this  algorithm  finds  the  set  Q C P of  all  propositional  variables  that  are  neces- 
sarily true  whenever  all  of  the  clauses  are  true. 

We  use  the  following  data  structures  for  clauses  c,  propositions  p,  and 
hypotheses  h,  where  a “hypothesis”  is  the  appearance  of  a proposition  on  the 
left-hand  side  of  a clause: 

CONCLUSION (c)  is  the  proposition  on  the  right  of  clause  c; 

COUNT  (c)  is  the  number  of  hypotheses  of  c not  yet  asserted; 

TRUTH (p)  is  1 if  p is  known  to  be  true,  otherwise  0; 

LAST(p)  is  the  last  hypothesis  in  which  p appears; 

CLAUSE  (/?,)  is  the  clause  for  which  h appears  on  the  left; 

PREV (ft)  is  the  previous  hypothesis  containing  the  proposition  of  h. 

We  also  maintain  a stack  So,  Si,  . . . , Ss- 1 of  all  propositions  that  are  known  to 
be  true  but  not  yet  asserted. 

Cl.  [Initialize.]  Set  LAST(p)  A and  TRUTH (p)  A-  0 for  each  proposition  p. 
Also  set  s A-  0,  so  that  the  stack  is  empty.  Then  for  each  clause  c,  having 
the  form  (36),  set  CONCLUSION(c)  A-  v and  COUNT(c)  A-  k.  If  k = 0 and 
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TRUTH  GO  = 0,  set  TRUTH (u)  <-  1,  5S  <-  v,  and  * <-  « + 1.  Otherwise,  for 
1 < j < k,  create  a hypothesis  record  h and  set  CLAUSED?)  c PREV(h)  <- 
LAST(wj) , LAST(nj)  y-  h. 

C2.  [Prepare  to  assert  p.]  Terminate  the  algorithm  if  s = 0;  the  desired  core 
now  consists  of  all  propositions  whose  TRUTH  has  been  set  to  1.  Otherwise 
set  s <r-  s - 1,  p Ss,  and  h <-  LAST(p). 

C3.  [Done  with  hypotheses?]  If  h = A,  return  to  C2. 

C4.  [Validate  h .]  Set  c «—  CLAUSE(h)  and  COUNT(c)  COUNT(c)  - 1.  If  the  new 
value  of  COUNT  (c)  is  still  nonzero,  go  to  step  C6. 

C5.  [Deduce  CONCLUSION  (c)  .]  Set  p <r-  CONCLUSION  (c)  . If  TRUTH  (»)  = 0 set 
TRUTH(p)  <-l,  Ss^p,  si-s  + l. 

C6.  [Loop  on  h.]  Set  h t—  PREV(h)  and  return  to  C3.  | 

Notice  how  smoothly  the  data  structures  work  together,  avoiding  any  need  to 
search  for  a place  to  make  progress  in  the  calculation.  Algorithm  C is  similar 
m many  respects  to  Algorithm  2.2.3T  (topological  sorting),  which  was  the  first 
example  of  multilinked  data  structures  that  we  discussed  long  ago  in  Chapter  2; 
in  fact,  we  can  regard  Algorithm  2.2.3T  as  the  special  case  of  Algorithm  C in 
which  every  proposition  appears  on  the  right-hand  side  of  exactly  one  clause. 
(See  exercise  47.) 

Exercise  48  shows  that  a slight  modification  of  Algorithm  C solves  the 
satisfiability  problem  for  Horn  clauses  in  general.  Further  discussion  can  be 
found  m papers  by  W.  F.  Dowling  and  J.  H.  Gallier,  J.  Logic  Programming  1 
(1984),  267-284;  M.  G.  Scutella,  J.  Logic  Programming  8 (1990),  265-273. 

We  turn  now  to  Krom  functions  and  the  2SAT  problem.  Again  there’s  a 
linear-time  algorithm;  but  again,  we  can  probably  appreciate  it  best  if  we  look 
first  at  a simplified-but-practical  application.  Let’s  suppose  that  seven  comedians 
have  each  agreed  to  do  one-night  standup  gigs  at  two  of  five  hotels  during  a three- 
day  festival,  but  each  of  them  is  available  for  only  two  of  those  days  because  of 
other  commitments: 

Tomlin  should  do  Aladdin  and  Caesars  on  days  1 and  2; 

Unwin  should  do  Bellagio  and  Excalibur  on  days  1 and  2; 

Vegas  should  do  Desert  and  Excalibur  on  days  2 and  3; 

Williams  should  do  Aladdin  and  Desert  on  days  1 and  3:  (37) 

Xie  should  do  Caesars  and  Excalibur  on  days  1 and  3; 

Yankovic  should  do  Bellagio  and  Desert  on  days  2 and  3; 

Zany  should  do  Bellagio  and  Caesars  on  days  1 and  2. 

Is  it  possible  to  schedule  them  all  without  conflict? 

To  solve  this  problem,  we  can  introduce  seven  Boolean  variables  {t , u,  v,  w, 
xiV,z},  where  t (for  example)  means  that  Tomlin  does  Aladdin  on  day  1 and 
Caesars  on  day  2 while  t means  that  the  days  booked  for  those  hotels  occur  in  the 
opposite  order.  Then  we  can  set  up  constraints  to  ensure  that  no  two  comedians 
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are  booked  in  the  same  hotel  on  the  same  day: 


-<(t  A w)  [Al] 
-i(w  A z)  [Bl] 
~'(uAy)  [B2] 
-i  (u  A z)  [B2] 


“’( V_Az ) [B2] 
i(tAi)  [Cl] 
-i(tAz)  [Cl] 
-i(x  A z)  [Cl] 


-i(t  A z)  [C2] 
>Ay)  [D2] 
-i(v  A w)  [D3] 
~'{vA  y)  [D3] 


-|(w  A y)  [D3] 
-i(u  A x ) [El] 
-<(u  A v)  [E2] 
->(v  A x ) [E3] 


Each  of  these  constraints  is,  of  course,  a Krom  clause;  we  must  satisfy 


(38) 


(t\/w)  A (uVz)  A (uVy)  A (uVz)  A (yVz)  A (tVx)  A (fVz)  A (xVz) 

A (tVf)  A (vVy)  A (vVw)  A (vVy)  A (wVy)  A («Vi)  A (uVv)  A (vVx).  (39) 


Furthermore,  Krom  clauses  (like  Horn  clauses)  can  be  written  as  implications: 


t=>w,  u^z,  u^y,  u=>z,  y=>z,  t=$>x,  t=>z,  x=5>z, 

v=5>w,  v=>y,  w^>y,  u=>x,  u=>v,  v=>x.  (40) 

And  every  such  implication  also  has  an  alternative,  “contrapositive”  form: 


w=>t , z=>u,  y=>u,  z =r*  Uj  z=>y,  x=>t,  z=>t,  z=>x, 

z=>t,  y^>v,  y=>w,  x=f>u,  v =>  u,  x=>v.  (41) 

But  oops  — alas—  there  is  a vicious  cycle, 


u=^z=>y^v^>u^z=>t=^x=>u.  (42) 

[Bl]  [B2]  [D2]  [E2]  [B2]  [C2]  [Cl]  [El] 


This  cycle  tells  that  u and  u must  both  have  the  same  value;  so  there  is  no  way 
to  accommodate  all  of  the  conditions  in  (37).  The  festival  organizers  will  have  to 
renegotiate  their  agreement  with  at  least  one  of  the  six  comedians  {t,  u,  v,  x,  y,  z}, 
if  a viable  schedule  is  to  be  achieved.  (See  exercise  53.) 


Fig.  6.  The  digraph  corresponding 
to  all  implications  of  (40)  and  (41) 
that  do  not  involve  either  v or  v. 
Assigning  appropriate  values  to  the 
literals  in  each  strong  component 
will  solve  a binary  scheduling  prob- 
lem that  is  an  instance  of  2SAT. 


The  organizers  might,  for  instance,  try  to  leave  v out  of  the  picture  tem- 
porarily. Then  five  of  the  sixteen  constraints  in  (38)  would  go  away  and  only  22 
of  the  implications  from  (40)  and  (41)  would  remain,  leaving  the  directed  graph 
illustrated  in  Fig.  6.  This  digraph  does  contain  cycles,  like  2 =>■  u =>  x =>  z and 
t =>  z =>  t;  but  no  cycle  contains  both  a variable  and  its  complement.  Indeed, 
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we  can  see  from  Fig.  6 that  the  values  tuwxyz  = 110000  do  satisfy  every  clause 
°f  (39)  that  doesn’t  involve  v or  v.  These  values  give  us  a schedule  that  satisfies 
six  of  the  seven  original  stipulations  in  (37),  starting  with  (Tomlin,  Unwin,  Zany, 
'W  illiams,  Xie)  at  the  (Aladdin,  Bellagio,  Caesars,  Desert,  Excalibur)  on  day  1. 

In  general,  given  any  2SAT  problem  with  m Krom  clauses  that  involve 
n Boolean  variables,  we  can  form  a directed  graph  in  the  same  way.  There 
are  2 n veitices  {aR,  Xi, . . . , xn,  xn},  one  for  each  possible  literal;  and  there  are 
2m  arcs  of  the  form  u -A  v and  v ->•  u,  two  for  each  clause  uVv.  Two  literals 
u and  v belong  to  the  same  strong  component  of  this  digraph  if  and  only  if 
there  are  oriented  paths  from  u to  v and  from  v to  u.  For  example,  the  six 
strong  components  of  the  digraph  in  Fig.  6 are  indicated  by  dotted  contours. 
All  literals  in  a strong  component  must  have  the  same  Boolean  value,  in  any 
solution  to  the  corresponding  2SAT  problem. 

Theorem  K.  A conjunctive  normal  form  with  two  literals  per  clause  is  satishable 
if  and  only  if  no  strong  component  of  the  associated  digraph  contains  both  a 
variable  and  its  complement. 

Proof.  [Melven  Krom,  Zeitschrift  fur  mathematische  Logik  und  Grundlagen  der 
Mathematik  13  (1967),  15—20,  Corollary  2.2.]  If  there  are  paths  from  x to  x and 
from  x to  x,  the  formula  is  certainly  unsatisfiable. 

Conversely,  assume  that  no  such  paths  exist.  Any  digraph  has  at  least 
one  strong  component  S that  is  a “source,”  having  no  incoming  arcs  from 
vertices  in  any  other  strong  component.  Moreover,  our  digraph  always  has  an 
attractive  antisymmetry,  illustrated  in  Fig.  6:  We  have  u ->  v if  and  only  if 
v -y  u.  Therefore  the  complements  of  the  literals  in  S form  another  strong 
component  S ^ S that  is  a “sink,”  having  no  outgoing  arcs  to  other  strong 
components.  _Hence  we  can  assign  the  value  0 to  all  literals  in  S and  1 to 
all  literals  in  S,  then  remove  them  from  the  digraph  and  proceed  in  the  same 
way  until  all  literals  have  received  a value.  The  resulting  values  satisfy  u < v 
whenever  u ->  v in  the  digraph;  hence  they  satisfy  u V v whenever  u V v is  a 
clause  of  the  formula.  | 

Theorem  K leads  immediately  to  an  efficient  solution  of  the  2SAT  problem, 
thanks  to  an  algorithm  by  R.  E.  Tarjan  that  finds  strong  components  in  linear 
time.  [See  SICOMP  1 (1972),  146-160;  D.  E.  Knuth,  The  Stanford  GraphBase 
(1994),  512-519.]  We  shall  study  Tarjan’s  algorithm  in  detail  in  Section  7.4.1. 
Exercise  54  shows  that  the  condition  of  Theorem  K is  readily  checked  whenever 
the  algorithm  detects  a new  strong  component.  Furthermore,  the  algorithm 
detects  sinks  first;  thus,  as  a simple  byproduct  of  Tarjan’s  procedure,  we  can 
assign  values  that  establish  satisfiability  by  choosing  the  value  1 for  each  literal 
in  a strong  component  that  occurs  before  its  complement. 

Medians.  We’ve  been  focusing  on  Boolean  binary  operations  like  xVy  or  x®y. 
But  there’s  also  a significant  ternary  operation  (xyz),  called  the  median  of  x,  y, 
and  z: 


(xyz)  = (xAy)  V (yAz)  V (xAz)  = (xVy)  A (yVz)  A (xVz).  (43) 
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In  fact,  ( xyz ) is  probably  the  most  important  ternary  operation  in  the  entire 
universe,  because  it  has  amazing  properties  that  are  continually  being  discovered 
and  rediscovered. 

In  the  first  place,  we  can  see  easily  that  this  formula  for  {xyz)  describes  the 
majority  value  of  any  three  Boolean  quantities  x,  y,  and  2:  (000)  = (001)  = 0 
and  (011)  = (111)  = 1.  We  call  {xyz)  the  “median”  instead  of  the  “majority” 
because,  if  x,  y,  and  2 are  arbitrary  real  numbers,  and  if  the  operations  A and  V 
denote  min  and  max  in  (43),  then 

{xyz)  = y when  x < y < z.  (44) 

Secondly,  the  basic  binary  operations  A and  V are  special  cases  of  medians: 

x/\y  = {xOy);  x V y = (xly).  (45) 

Thus  any  monotone  Boolean  function  can  be  expressed  entirely  in  terms  of  the 
ternary  median  operator  and  the  constants  0 and  1.  In  fact,  if  we  lived  in  a 
median-only  world,  we  could  let  A stand  for  falsehood  and  V for  truth;  then 
x Ay  = {xA  y)  and  x V y = {xVy)  would  be  perfectly  natural  expressions,  and  we 
could  even  use  Polish  notation  like  (A xy)  and  (Va :y)  if  we  wanted  to!  The  same 
idea  applies  to  extended  real  numbers  under  the  min-max  interpretation  of  A 
and  V,  if  we  take  medians  with  respect  to  the  constants  A = —00  and  V = +00. 

A Boolean  function  f(x i,X2, . . . ,xn)  is  called  self-dual  when  it  satisfies 

f{xu  X2,  • • • , xn)  — f{xi , X2 , ■ ■ • , xn) . (46) 

We’ve  noted  that  a Boolean  function  is  monotone  if  and  only  if  it  can  be  expressed 
in  terms  of  A and  V;  by  De  Morgan’s  laws  (11)  and  (12),  a monotone  formula  is 
self-dual  if  and  only  if  the  symbols  A and  V can  be  interchanged  without  changing 
the  formula’s  value.  Thus  the  median  operation  defined  in  (43)  is  both  monotone 
and  self-dual.  In  fact,  it  is  the  simplest  nontrivial  function  of  that  kind,  since 
none  of  the  binary  operations  in  Table  1 are  both  monotone  and  self-dual  except 
the  projections  L and  R. 

Furthermore,  any  expression  that  has  been  formed  entirely  with  the  median 
operator,  without  using  constants,  is  both  monotone  and  self-dual.  For  example, 
the  function  ( w{xyz){w{uvw)x ))  is  self-dual  because 

(■ w{xyz){w{uvw)x ))  = {w  {xyz)  {w{uvw)x)) 

— {w{xyz){w{uvw)x))  = {w{xyz){w{uvw)x)). 

Emil  Post,  while  working  on  his  Ph.D.  thesis  (Columbia  University,  1920),  proved 
that  the  converse  statement  is  also  true: 

Theorem  P.  Every  monotone,  self-dual  Boolean  function  f(x  . . . , xn)  can  be 
expressed  entirely  in  terms  of  the  median  operation  {xyz) . 
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Proof.  [Annals  of  Mathematics  Studies  5 (1941),  74-75.]  Observe  first  that 
{xiy{x2y . . .y{xa-iyxs) . . .)) 

= ((2fi  v x2  V • • ■ V a;s_i  V xs)  A y)  V (xi  A x2  A • ■ • A xs~i  A xs);  (47) 

this  formula  for  repeated  medianing  is  easily  proved  by  induction  on  s. 

Now  suppose  f(xi, . . . ,xn)  is  monotone,  self-dual,  and  has  the  disjunctive 
prime  form 

f (A  1 ’ * * * > xn)  f 1 V • • • V , tj  — 1 Xji  A * • • A XjSj  , 

where  no  prime  implicant  tj  is  contained  in  another  (Corollary  Q).  Any  two  prime 
implicants  must  have  at  least  one  variable  in  common.  For  if  we  had,  say,  t\  = 
x Ay  and  t2  = u A v A w,  the  value  of  / would  be  1 when  x = y = 1 and  u — v = 
w — 0,  as  well  as  when  x = y — 0 and  u = v — w = 1,  contradicting  self-duality. 
Therefore  if  any  tj  consists  of  a single  variable  x,  it  must  be  the  only  prime 
implicant  — in  which  case  / is  the  trivial  function  f(x  1, . . . ,x„)  — x = ( xxx ). 
Define  the  functions  go,  gi,  . . . , gm  by  composing  medians  as  follows: 

go(x\, . . . , xn ) x\ , 

( aQ\ 

9j(,x  1 > • • • > xn)  h(xj  1 , . . . , XjSj ; gj—i  (xj , . . . , xn)),  for  1 < j < to; 

here  h(xi, . . . , xs\ y)  denotes  the  function  on  the  top  line  of  (47).  By  induction 
on  j , we  can  prove  from  (47)  and  (48)  that  gj(x  1, . . . , xn)  = 1 whenever  we  have 
ti  V • • • V tj  — 1,  because  (xji  V • • • V xjSj)  A tk  = tk  when  k < j. 

Finally,  f(x  1, . . . , xn)  must  equal  gm(x  1, . . . , xn ),  because  both  functions  are 
monotone  and  self-dual,  and  we  have  shown  that  f(x\, . . . ,xn)  < gm(x  1, . . . ,xn) 
for  all  combinations  of  Os  and  Is.  This  inequality  suffices  to  prove  equality, 
because  a self-dual  function  equals  1 in  exactly  half  of  the  2n  possible  cases.  | 

One  consequence  of  Theorem  P is  that  we  can  express  the  median  of  five 
elements  via  medians  of  three,  because  the  median  of  any  odd  number  of  Boolean 
variables  is  obviously  a monotone  and  self-dual  Boolean  function.  Let’s  write 
(*!■••  x2k—i)  for  such  a median.  Then  the  disjunctive  prime  form  of  ( vwxyz ) is 

(vAwAx)  V ( vAwAy ) V (vAwAz)  V (vAxAy)  V (vAxAz) 

V ( vAyAz ) V ( wAxAy ) V (wAxAz)  V [wAyAz]  V ( xAyAz ); 

so  the  construction  in  the  proof  of  Theorem  P expresses  {vwxyz)  as  a huge 
formula  gw{v,w,x,y,z)  involving  2,046  median-of-3  operations.  Of  course  this 
expression  isn’t  the  shortest  possible  one;  we  actually  have 

(vwxyz)  = (v(xyz)(wx(wyz))).  (49) 

[See  H.  S.  Miiller  and  R.  O.  Winder,  IRE  Transactions  EC-11  (1962),  89-90.] 

*Median  algebras  and  median  graphs.  We  noted  earlier  that  the  ternary 
operation  (xyz)  is  useful  when  x,  y,  and  2 belong  to  any  ordered  set  like  the  real 
numbers,  when  A and  V are  regarded  as  the  operators  min  and  max.  In  fact, 
the  operation  (xyz)  also  plays  a useful  role  in  far  more  general  circumstances. 
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A median  algebra  is  any  set  M on  which  a ternary  operation  (xyz)  is  defined  that 
takes  elements  of  M into  elements  of  M and  obeys  the  following  three  axioms: 

(. xxy ) — x (majority  law);  (50) 

{xyz)  = { xzy ) — { yxz ) = (yzx)  = { zxy ) = { zyx ) (commutative  law);  (51) 
( xw(ywz ))  = {(xwy)wz)  (associative  law).  (52) 

In  the  Boolean  case,  for  example,  the  associative  law  (52)  holds  for  u)  = 0 and 
w = 1 because  A and  V are  associative.  Exercises  75  and  76  prove  that  these  three 
axioms  imply  also  a distributive  law  for  medians,  which  has  both  a short  form 

(( xyz)uv ) = ( x{yuv)(zuv ))  (53) 

and  a more  symmetrical  long  form 

(( xyz)uv ) = {(xuv)  (yuv)  (zuv)') . (54) 

No  simple  proof  of  this  fact  is  known,  but  we  can  at  least  verify  the  special  case 
of  (53)  and  (54)  when  y = u and  z — v:  We  have 

(( xyz)yz ) = {xyz)  (55) 

because  both  sides  equal  ( xy{zyz )).  In  fact,  the  associative  law  (52)  is  just  the 
special  case  y — u of  (53).  And  with  (55)  and  (52)  we  can  also  verify  the  case 
x — u:  {{uyz)uv)  = ( vu{yuz ))  = (( vuy)uz ) = {{ yuv)uz ) = {{{yuv)uv)uz)  = 
{{ yuv)u{vuz ))  = {u{yuv){zuv)). 

An  ideal  in  a median  algebra  M is  a set  C C M for  which  we  have 

{xyz)  £ C whenever  a : £ C,  y £ C,  and  z £ M.  (56) 

If  u and  v are  any  elements  of  M,  the  interval  [u . . v\  is  defined  as  follows: 

[w  . . u]  = { {xuv)  | X£M}.  (57) 

We  say  that  “x  is  between  u and  v"  if  and  only  if  x £ [w  . . u].  According  to  these 
definitions,  u and  v themselves  always  belong  to  the  interval  [u . . v] . 

Lemma  M.  Every  interval  [u  . .v]  is  an  ideal,  and  x £ [u  . ,v\  4=4-  x = {uxv) . 

Proof.  Let  {xuv)  and  {yuv)  be  arbitrary  elements  of  [u  . . t>] . Then 

{{xuv) {yuv) z)  = {{ xyz)uv ) £ [u..u] 

for  all  z £ M,  by  (51)  and  (53),  so  [u . . u]  is  an  ideal.  Furthermore  every  element 
{xuv)  £ [m..w]  satisfies  {xuv)  — { u{xuv)v ) by  (51)  and  (55).  I 

Our  intervals  [u  . . w]  have  nice  properties,  because  of  the  median  laws: 

v £ [it . . u]  =>  u — w. \ (58) 

x £ [u  . . u]  and  y £ [u  . . x)  =>  y£[u..v\;  (59) 

x £ [u  . . u]  and  y £ [u . . z]  and  y £ [v  . . z]  y £ [x  . . z] . (60) 

Equivalently,  [u  . . u]  — {«};  if  x £ [w  . . v]  then  [u  . . x]  C [u  . . u];  and  x £ [u  . . u] 

also  implies  that  [u . . z]  ft  [v  . . z]  C [x  . . z]  for  all  z.  (See  exercise  72.) 
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Now  let’s  define  a graph  on  the  vertex  set  M,  with  the  following  edges: 

u — v u -£  v and  (xuv)  G {u,  v}  for  all  x G M.  (61) 

In  other  words,  u and  v are  adjacent  if  and  only  if  the  interval  [u . . u]  consists  of 
just  the  two  points  u and  v. 

Theorem  G.  If  M is  any  finite  median  algebra,  the  graph  defined  by  (61)  is 
connected.  Moreover,  vertex  x belongs  to  the  interval  [u  . . v } if  and  only  if  x lies 
on  a shortest  path  from  u to  v. 

Proof.  If  M isn’t  connected,  choose  u and  v so  that  there  is  no  path  from  u 
to  v and  the  interval  [u . . u]  has  as  few  elements  as  possible.  Let  x G [u . . v\  be 
distinct  from  u and  v.  Then  (xuv)  — x ^ v,  so  v £ [u . . x]\  similarly  u$l[x..v]. 
But  [u..x]  and  [x..v]  are  contained  in  [«..«],  by  (59).  So  they  are  smaller 
intervals,  and  there  must  be  a path  from  u to  x and  from  x to  v.  Contradiction. 

The  other  half  of  the  theorem  is  proved  in  exercise  73.  | 

Our  definition  of  intervals  implies  that  ( xyz ) e [x  . . y]  n [x  . . z]  n [?/ . . z] , 
because  (xyz)  = ((xyz)xy)  = (( xyz)xz ) = (( xyz)yz ) by  (55).  Conversely, 
if  w G [x..y]C\[x..z]r\[y..z],  exercise  74  proves  that  w = (xyz).  In  other 
words,  the  intersection  [x  . . y]  fl  [x  . . z]  n [y  . . z]  always  contains  exactly  one  point, 
whenever  x,  y , and  z are  points  of  M. 

Figure  7 illustrates  this  principle  in  a 4 x 4 x 4 cube,  where  each  point  x has 
coordinates  (xi,x2,x3)  with  0 < xi,x2,x3  < 4.  The  vertices  of  this  cube  form  a 
median  algebra  because  (xyz)  = (( xiyizi ),  (x2y2z2),  ( X3J/3Z3));  furthermore,  the 
edges  of  the  graph  in  Fig.  7 are  those  defined  in  (61),  running  between  vertices 
whose  coordinates  agree  except  that  one  coordinate  changes  by  ±1.  Three  typical 
intervals  [x  . . y\,  [x  . . z],  and  [y  . . z]  are  shown;  the  only  point  common  to  all  three 
intervals  is  the  vertex  (xyz)  = (2,  2, 1). 


Fig.  7.  Intervals  between  the  vertices  x = (0,2, 1), 
V ~ 3, 3),  and  z = (2, 0,  0)  in  a 4 x 4 x 4 cube. 
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So  far  we’ve  started  with  a median  algebra  and  used  it  to  define  a graph  with 
certain  properties.  But  we  can  also  start  with  a graph  that  has  those  properties 
and  use  it  to  define  a median  algebra.  If  u and  v are  vertices  of  any  graph,  let  us 
define  the  interval  [u  . .v]  to  be  the  set  of  all  points  on  shortest  paths  between  u 
and  v.  A finite  graph  is  said  to  be  a median  graph  if  exactly  one  vertex  lies  in  the 
intersection  [x  . . y)  fl  [x  . . z]  fl  [y  . . z]  of  the  three  intervals  that  tie  any  three  given 
vertices  x,  y.  and  z together;  and  we  denote  that  vertex  by  ( xyz ).  Exercise  75 
proves  that  the  resulting  ternary  operation  satisfies  the  median  axioms. 

Many  important  graphs  turn  out  to  be  median  graphs  according  to  this 
definition.  For  example,  any  free  tree  is  easily  seen  to  be  a median  graph;  and  a 
graph  like  the  ni  x 712  X • • • x nm  hyperrectangle  provides  another  simple  example. 
Cartesian  products  of  arbitrary  median  graphs  also  satisfy  the  required  condition. 


*Median  labels.  If  u and  v are  any  elements  of  a median  algebra,  the  mapping 
f(x)  that  takes  x H > (xuv)  is  a homomorphism-,  that  is,  it  satisfies 

f((xyz))  = ( f(x)f(y)f(z )),  (62) 

because  of  the  long  distributive  law  (54).  This  function  (xuv)  “projects”  any 
given  point  x into  the  interval  [u.  .v],  by  (57).  And  it  is  particularly  interesting 
in  the  case  when  u — v is  an  edge  of  the  corresponding  graph,  because  f(x)  is 
then  two-valued,  essentially  a Boolean  mapping. 

For  example,  consider  the  typical  free  tree  shown  below,  with  eight  vertices 
and  seven  edges.  We  can  project  each  vertex  x onto  each  of  the  edge  intervals 
[u . . v\  by  deciding  whether  x is  closer  to  u or  to  v: 
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(63) 


On  the  right  we’ve  reduced  the  projections  to  Os  and  Is,  arbitrarily  deciding  that 
a ha  0000000.  The  resulting  bit  strings  are  called  labels  of  the  vertices,  and  we 
write,  for  example,  1(b)  = 1100000.  Since  each  projection  is  a homomorphism, 
we  can  calculate  the  median  of  any  three  points  by  simply  taking  Boolean 
medians  in  each  component  of  their  labels.  For  example,  to  compute  (bgh)  we 
find  the  bitwise  median  of  Z(i>)  = 1100000,  1(g)  = 1011010,  and  1(h)  = 1010001, 
namely  1010000  = 1(d). 

When  we  project  onto  all  the  edges  of  a median  graph,  we  might  find  that 
two  columns  of  the  binary  labels  are  identical.  This  situation  cannot  occur  with 
a free  tree,  but  let’s  consider  what  would  happen  if  the  edge  g — h were  added 
to  the  tree  in  (63):  The  resulting  graph  would  still  be  a median  graph,  but  the 
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columns  for  eg  and  dh  would  become  identical  (except  with  e f>  d and  g h). 
Furthermore,  the  new  column  for  gh  would  turn  out  to  be  equivalent  to  the 
column  for  de.  Redundant  components  should  be  omitted  from  the  labels  in 
such  cases;  therefore  the  vertices  of  the  augmented  graph  would  have  six-bit 
labels,  like  1(g)  = 101101  and  1(h)  = 101001,  instead  of  seven-bit  labels. 

The  elements  of  any  median  algebra  can  always  be  represented  by  labels  in 
this  way.  Therefore  any  identity  that  holds  in  the  Boolean  case  will  be  true  in 
all  median  algebras.  This  “zero-one  principle”  makes  it  possible  to  test  whether 
any  two  given  expressions  built  from  the  ternary  operation  (xyz)  can  be  shown 
to  be  equal  as  a consequence  of  axioms  (50),  (51),  and  (52) — although  we  do 
have  to  check  2”  1 — 1 cases  when  we  test  n-variable  expressions  by  this  method. 

For  example,  the  associative  law  (xw(ywz)')  = ((xwy)wz)  suggests  that 
there  should  be  a symmetrical  interpretation  of  both  sides  that  does  not  involve 
nested  brackets.  And  indeed,  there  is  such  a formula: 

( xw(ywz ))  = ((xwy)wz)  = (xwywz),  (64) 

where  ( xwywz ) denotes  the  median  of  the  five-element  multiset  {x,  w,  y , w,  z}  = 
{w,  w,  x,  y,  z}.  We  can  prove  this  formula  by  using  the  zero-one  principle,  noting 
also  that  median  is  the  same  thing  as  majority  in  the  Boolean  case.  In  a similar 
way  we  can  prove  (49),  and  we  can  show  that  the  function  used  by  Post  in  (47) 
can  be  simplified  to 

( xiy(x2V  ■ ■ -yixs-iyXs )...))  = (xxyx2y  ...yx^yxs)-,  (65) 

it’s  a median  of  2s  - 1 quantities,  where  nearly  half  of  them  are  equal  to  y. 

A set  C of  vertices  in  a graph  is  called  convex  if  [u . . v\  C C whenever 
11  £ C and  v £ C.  In  other  words,  whenever  the  endpoints  of  a shortest  path 
belong  to  C,  all  vertices  of  that  path  must  also  be  present  in  C.  (A  convex 
set  is  therefore  identical  to  what  we  called  an  “ideal,”  a few  pages  ago;  now 
our  language  has  become  geometric  instead  of  algebraic.)  The  convex  hull  of 
{vi, . . . ,vm}  is  defined  to  be  the  smallest  convex  set  that  contains  each  of  the 
vertices  vx,  . . . , vm.  Our  theoretical  results  above  have  shown  that  every  interval 
\u  • ■ vl  is  convex;  hence  [u  . . v]  is  the  convex  hull  of  the  two-point  set  {u,  v}.  But 
in  fact  much  more  is  true: 

Theorem  C.  The  convex  hull  of  {iq , v2, . . . , vrn } in  a median  graph  is  the  set 
of  all  points 

C = {(vxxv2x...xvm)  | x £ Mj.  (66) 

Furthermore,  x is  in  C if  and  only  if  x = (vixv2x . . . xvm). 

Proof.  Clearly  vj  £ C for  1 < j < m.  Every  point  of  C must  belong  to  the 
convex  hull,  because  the  point  x’  = (v2x...xvm)  is  in  the  hull  (by  induction 
on  to),  and  because  (v\x  . . . xvm)  £ [iq  . . x'].  The  zero-one  principle  proves  that 

(x(Vlyv2y. . . yvm)(v1zv2z. . . zvm))  = (tq  (xyz)v2  (xyz)  . . . (xyz)vm)-  (67) 

hence  C is  convex.  Setting  x = y in  this  formula  proves  that  (vixv2x  . . . xvm)  is 
the  closest  point  of  C to  x,  and  that  (vxxv2x  . . . xvm)  £ [x . . z\  for  all  z £ C.  | 
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Corollary  C.  Let  the  label  of  Vj  be  Vj\  . . . Vjt  for  1 < j < m.  Then  the  convex 
hull  of  {iq , . . . , vm  } is  the  set  of  all  x G M whose  label  x± . . . xt  satisfies  xj  = Cj 
whenever  Vij  — V2j  = ■ ■ ■ = vmj  — cj.  | 

For  example,  the  convex  hull  of  {c,g,h}  in  (63)  consists  of  all  elements  whose 
label  matches  the  pattern  10**0**,  namely  {c,  d,  e,  g,  h}. 

When  a median  graph  contains  a 4-cycle  u — x — v — y — u,  the  edges 
u — x and  v — y are  equivalent,  in  the  sense  that  projection  onto  [u . .x\  and 
projection  onto  [v . ,y\  both  yield  the  same  label  coordinates.  The  reason  is  that, 
for  any  z with  ( zux ) — u,  we  have 

V = ( uvy ) = ({ zux)vy ) 

- ({ zvy)(uvy)(xvy )} 

= ((zvy)yv), 

hence  (zvy)  = y;  similarly  (zux)  = x implies  ( zvy ) = v.  The  edges  x — v and 
y — u are  equivalent  for  the  same  reasons.  Exercise  77  shows,  among  other 
things,  that  two  edges  yield  equivalent  projections  if  and  only  if  they  can  be 
proved  equivalent  by  a chain  of  equivalences  obtained  from  4-cycles  in  this  way. 
Therefore  the  number  of  bits  in  each  vertex  label  is  the  number  of  equivalence 
classes  of  edges  induced  by  the  4-cycles;  and  it  follows  that  the  reduced  labels  for 
vertices  are  uniquely  determined,  once  we  specify  a vertex  whose  label  is  00  ...  0. 

A nice  way  to  find  the  vertex  labels  of  any  median  graph  was  discovered 
by  P.  K.  Jha  and  G.  Slutzki  [Ars  Combin.  34  (1992),  75-92]  and  improved  by 
J.  Hagauer,  W.  Imrich,  and  S.  Klavzar  [ Theor . Comp.  Sci.  215  (1999),  123-136]: 

Algorithm  H ( Median  labels).  Given  a median  graph  G and  a source  vertex  a, 
this  algorithm  determines  the  equivalence  classes  defined  by  the  4-cycles  of  G, 
and  computes  the  labels  l(y)  = Vi . . ,vt  of  each  vertex,  where  t is  the  number  of 
classes  and  1(a)  = 0 . . . 0. 

HI.  [Initialize.]  Preprocess  G by  visiting  all  vertices  in  order  of  their  distance 
from  a.  For  each  edge  u — v,  we  say  that  u is  an  early  neighbor  of  v if  a is 
closer  to  u than  to  v,  otherwise  u is  a late  neighbor ; in  other  words,  the  early 
neighbors  of  v will  already  have  been  visited  when  v is  encountered,  but  the 
late  neighbors  will  still  be  awaiting  their  turn.  Rearrange  all  adjacency  lists 
so  that  early  neighbors  are  listed  first.  Place  each  edge  initially  in  its  own 
equivalence  class;  a “union-find  algorithm”  like  Algorithm  2.3.3E  will  be 
used  to  merge  classes  when  the  algorithm  learns  that  they’re  equivalent. 
H2.  [Call  the  subroutine.]  Set  j 0 and  invoke  Subroutine  I with  parameter  a. 
(Subroutine  I appears  below.  The  global  variable  j will  be  used  to  create  a 
master  list  of  edges  Tj  — Sj  for  1 < j < n,  where  n is  the  total  number  of 
vertices;  there  will  be  one  entry  with  Sj  = v,  for  each  vertex  v ^ a.) 

H3.  [Assign  the  labels.]  Number  the  equivalence  classes  from  1 to  t.  Then  set 
1(a)  to  the  t-bit.  string  0 ...  0.  For  j = 1,  2,  . . . , n — 1 (in  this  order),  set 
l(sj)  to  l(rj)  with  bit  k changed  from  0 to  1,  where  k is  the  equivalence 
class  of  edge  r}  — sj . | 
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Subroutine  I ( Process  descendants  of  r ).  This  recursive  subroutine,  with 
parameter  r and  global  variable  j,  does  the  main  work  of  Algorithm  H on 
the  graph  of  all  vertices  currently  reachable  from  vertex  r.  In  the  course  of 
processing,  all  such  vertices  will  be  recorded  on  the  master  list,  except  r itself, 
and  all  edges  between  them  will  be  removed  from  the  current  graph.  Each  vertex 
has  four  fields  called  its  LINK,  MARK,  RANK,  and  MATE,  initially  null. 

11.  [Loop  over  s.]  Choose  a vertex  s with  r — s.  If  there  is  no  such  vertex, 
return  from  the  subroutine. 

12.  [Record  the  edge.]  Set  j 4-  j + 1,  rj  4-  r,  and  Sj  4-  s. 

13.  [Begin  breadth-first  search.]  (Now  we  want  to  find  and  delete  all  edges 
of  the  current  graph  that  are  equivalent  to  r — s.)  Set  MARK(s)  4-  s, 
RANK(s)  4-  1,  LINK(s)  4-  A,  and  v 4-  q 4-  s. 

14.  [Find  the  mate  of  v.]  Find  the  early  neighbor  u of  v for  which  MARK(u)  ^ s. 
(There  will  be  exactly  one  such  vertex  u.  Recall  that  early  neighbors  have 
been  placed  first,  in  step  HI.)  Set  MATE(u)  4-  u. 

15.  [Delete  u — u.]  Make  the  edges  u — v and  r — s equivalent  by  merging 
their  equivalence  classes.  Remove  u and  v from  each  other’s  adjacency  lists. 

16.  [Classify  the  neighbors  of  u.]  For  each  early  neighbor  u of  v,  do  step  17;  for 
each  late  neighbor  u of  v,  do  step  18.  Then  go  to  step  19. 

17.  [Note  a possible  equivalence.]  If  MARK(u)  = s and  RANK(u)  = 1,  make  the 
edge  u — v equivalent  to  the  edge  MATE  (it)  — MATE(u).  Return  to  16. 

18.  [Rank  u.]  If  MARK  (it)  = s and  RANK  (it)  = 1,  return  to  16.  Otherwise  set 
MARK  (it)  4—  s and  RANK  (it)  4—  2.  Set  w to  the  first  neighbor  of  u (it  will 
be  early).  If  w — v,  reset  w to  ids  second  early  neighbor;  but  return  to  16 
if  u has  only  one  early  neighbor.  If  MARK  (w)  / s or  RANK(w)  / 2,  set 
RANK(u)  4—  1,  LINK(it)  4—  A,  LINK(g)  4—  u,  and  q 4—  u.  Return  to  16. 

19.  [Continue  breadth-first  search.]  Set  v 4-  LINK(d).  Return  to  14  if  v yf  A. 

110.  [Process  subgraph  s.]  Call  Subroutine  I recursively  with  parameter  s.  Then 
return  to  II.  | 

This  algorithm  and  subroutine  have  been  described  in  terms  of  relatively  high- 
level  data  structures;  further  details  are  left  to  the  reader.  For  example,  adja- 
cency lists  should  be  doubly  linked,  so  that  edges  can  readily  be  deleted  in  step  15. 
Any  convenient  method  for  merging  equivalence  classes  can  be  used  in  that  step. 

Exercise  77  explains  the  theory  that  makes  this  algorithm  work,  and  ex- 
ercise 78  proves  that  each  vertex  is  encountered  at  most  lgn  times  in  step  14. 
Furthermore,  exercise  79  shows  that  a median  graph  has  at  most  O(nlogn) 
edges.  Therefore  the  total  running  time  of  Algorithm  H is  0(n(logn)2),  except 
perhaps  for  the  bit-setting  in  step  H3. 

The  reader  may  wish  to  play  through  Algorithm  H by  hand  on  the  median 
graph  in  Table  2,  whose  vertices  represent  the  twelve  monotone  self-dual  Boolean 
functions  of  four  variables  {w,x,y,z}.  All  such  functions  that  actually  involve 
all  four  variables  can  be  expressed  as  a median  of  five  things,  like  (64).  With 
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Table  2 

LABELS  FOR  THE  FREE  MEDIAN  ALGEBRA  ON  FOUR  GENERATORS 


z 

{xyz) 

( wxyzz ) 

( wxxyz ) 
( wxz ) 

X 


3 

ri 

Sj 

Ksi) 

w 

0000000 

1 

w 

(wwxyz) 

0000001 

2 

(wwxyz) 

(wyz) 

0010001 

3 

(wyz) 

(wxyzz) 

0010101 

4 

(wxyzz) 

(xyz) 

0010111 

5 

(wxyzz) 

z 

1010101 

6 

(wyz) 

(wxyyz) 

0010011 

7 

( wxyyz ) 

y 

0110011 

8 

(wwxyz) 

(wxz) 

0000101 

9 

(wxz) 

(wxxyz) 

0000111 

10 

(wxxyz) 

X 

0001111 

11 

(wwxyz) 

(wxy) 

0000011 

starting  vertex  o = w,  the  algorithm  computes  the  master  list  of  edges  rj  — sj 
and  the  binary  labels  shown  in  the  table.  (The  actual  order  of  processing  depends 
on  the  order  in  which  vertices  appear  in  adjacency  lists.  But  the  final  labels  will 
be  the  same  under  any  ordering,  except  for  permutations  of  the  columns.) 

Notice  that  the  number  of  1-bits  in  each  label  l(v)  is  the  distance  of  v from 
the  starting  vertex  a.  In  fact,  the  uniqueness  of  labels  tells  us  that  the  distance 
between  any  two  vertices  is  the  number  of  bit  positions  in  which  their  labels 
differ,  because  we  could  have  started  at  any  particular  vertex. 

The  special  median  graph  in  Table  2 could  actually  have  been  handled  in  a 
completely  different  way,  without  using  Algorithm  H at  all,  because  the  labels 
in  this  case  are  essentially  the  same  as  the  truth  tables  of  the  corresponding 
functions.  Here’s  why:  We  can  say  that  the  simple  functions  w,  x,  y , z have 
the  respective  truth  tables  t(w)  = 0000000011111111,  t(x)  = 0000111100001111, 
t(y)  = 0011001100110011,  t(z)  = 0101010101010101.  Then  the  truth  table  of 
(■ wwxyz ) is  the  bitwise  majority  function  (t(w)t(w)t(x)t(y)t(z)),  namely  the 
string  0000000101111111;  and  a similar  computation  gives  the  truth  tables  of  all 
the  other  vertices. 

The  last  half  of  any  self-dual  function’s  truth  table  is  the  same  as  the  first 
half,  but  complemented  and  reversed,  so  we  can  eliminate  it.  Furthermore  the 
leftmost  bit  in  each  of  our  truth  tables  is  always  zero.  We  are  left  with  the 
seven-bit  labels  shown  in  Table  2;  and  the  uniqueness  property  guarantees  that 
Algorithm  H will  produce  the  same  result,  except  for  possible  permutation  of 
columns,  when  it  is  presented  with  this  particular  graph. 

This  reasoning  tells  us  that  the  edges  of  the  graph  in  Table  2 correspond  to 
pairs  of  functions  whose  truth  tables  are  almost  the  same.  We  move  between 
neighboring  vertices  by  switching  only  two  complementary  bits  of  their  truth 
tables.  In  fact,  the  degree  of  each  vertex  turns  out  to  be  exactly  the  number  of 
prime  implicants  in  the  disjunctive  prime  form  of  the  monotone  self-dual  function 
represented  by  that  vertex  (see  exercises  70  and  84). 
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Median  sets.  A median  set  is  a collection  X of  binary  vectors  with  the  property 
that  (xyz)  6 X whenever  x £ X,  y E X,  and  z £ X,  where  the  medians  are 
computed  componentwise  as  we’ve  done  with  median  labels.  Thomas  Schaefer 
noticed  in  1978  that  median  sets  provide  us  with  an  attractive  counterpoint  to 
the  characterization  of  Horn  functions  in  Theorem  H: 

Theorem  S.  The  Boolean  function  f(Xl,  ...,xn)  is  expressible  as  a conjunction 
ol  Krom  clauses  if  and  only  if 

f(x  i,...,xn)  = f(yi,-..iyn)  = f(zi,...,zn)  = 1 

implies  f({xiyiz i), . . . , ( xnynzn ))  = 1 (68) 

for  all  Boolean  values  Xj,  y:) , and  Zj. 

Proof.  [STOC  10  (1978),  216-226,  Lemma  3.1B.]  If  we  have  V z2  = yx  V y2  = 
z}Vz2  = 1,  say,  with  a:,  < yx  < z1;  then  (xxyxzx)  V (x2y2z2)  = yxV  (x2y2z2)  = 1. 
smcf  iji  — 0 implies  that  x2  = y2  = \.  Thus  (68)  is  necessary. 

Conversely,  if  (68)  holds,  let  ux  V • • • V uk  be  a prime  clause  of  /,  where  each 
uj  is  a literal.  Then,  for  1 < j < k,  the  clause  ux  V • • • V uj_1  V uj+1  V ■ ■ • V uk  is 
not  a clause  of  /;  so  there’s  a vector  x(P  with  f{x^)  = 1 but  with  u j)  = 0 for 
all  * / j.  If  k > 3,  the  median  (x^x^x^)  has  ut  = 0 for  1 < i < k;  but  that’s 
impossible,  because  Mi  V • • • V uk  was  supposedly  a clause.  Hence  k < 2.  | 

Thus  median  sets  are  the  same  as  “2SAT  instances,”  the  sets  of  points  that  satisfy 
some  formula  / in  2CNF. 

A median  set  is  said  to  be  reduced  if  its  vectors  x = Xl . . . xt  contain  no 
redundant  components.  In  other  words,  for  each  coordinate  position  k,  a reduced 
median  set  has  at  least  two  vectors  x ^ and  y^  with  the  property  that  x[k)  = 0 
and  yk  - 1 but  x{  = yt  for  all  i ± k.  We’ve  seen  that  the  labels  of  a median 

graph  satisfy  this  condition;  in  fact,  if  coordinate  k corresponds  to  the  edge  u v 

m the  graph,  we  can  let  x^  and  y ^ be  the  labels  of  u and  v.  Conversely,  any 
reduced  median  set  X defines  a median  graph,  with  one  vertex  for  each  element 
of  X and  with  adjacency  defined  by  all-but-one  equality  of  coordinates.  The 
median  labels  of  these  vertices  must  be  identical  to  the  original  vectors  in  X. 
because  we  know  that  median  labels  are  essentially  unique. 

Median  labels  and  reduced  median  sets  can  also  be  characterized  in  yet 
another  instructive  way,  which  harks  back  to  the  networks  of  comparator  modules 
that  we  studied  in  Section  5.3.4.  We  noted  in  that  section  that  such  networks 
are  useful  for  “oblivious  sorting”  of  numbers,  and  we  noted  in  Theorem  5.3.4Z 
that  a network  of  comparators  will  sort  all  n!  possible  input  permutations  if  and 
only  if  it  correctly  sorts  all  2"  combinations  of  Os  and  Is.  When  a comparator 
module  is  attached  to  two  horizontal  lines,  with  inputs  x and  y entering  from 
the  left,  it  outputs  the  same  two  values  on  the  right,  but  with  min(ar, y)  = xAy 
on  the  upper  line  and  max(ai,  y)  = x V y on  the  lower  line.  Let’s  now  extend 
the  concept  slightly  by  also  allowing  inverter  modules , which  change  0 to  1 and 
vice  versa.  Here,  for  example,  is  a comparator-inverter  network  (or  Cl-net,  for 
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short),  which  transforms  the  binary  value  0010  into  0111: 


0 0 0 

0 —  • — 1 1 

1 —  — o — • — 1 

o — — l 1 


0 • — 1 

1 • 0 

1 • 0 

1— • 0 


0 • 1 — 0 

0 • 1 1 

1 1 1 

0 0 1 


(69) 


(A  single  dot  denotes  an  inverter.)  Indeed,  this  network  transforms 


0000  ha 0110; 

0001  1-+  0111; 

0010  ha  0111; 

0011  • y 0110; 


0100  1 — y 0111; 

0101  ha  1111; 

0110  ha  1111; 

0111  i — y 0111; 


1000  ha  0111; 

1001  ha  0101; 

1010  ha  0101; 

1011  ha  0111; 


1100  ha  0110; 

1101  ha  0111; 

1110  ha  0111; 

1111  ha  0110. 


(70) 


Suppose  a Cl-net  transforms  the  bit  string  x = xx  . . . xt  into  the  bit  string 
x\  . . . x't  = f(x).  This  function  /,  which  maps  the  /-cube  into  itself,  is  in  fact  a 
graph  homomorphism.  In  other  words,  we  have  f(x)  — f(y)  whenever  x — y in 
the  t-cube:  Changing  one  bit  of  x always  causes  exactly  one  bit  of  f(x)  to  change, 
because  every  module  in  the  network  has  this  behavior.  Moreover,  Cl-nets  have 
a remarkable  connection  with  median  labels: 


Theorem  F.  Every  set  X of  t-bit  median  labels  can  be  represented  by  a 
comparator-inverter  network  that  computes  a Boolean  function  f(x)  with  the 
property  that  f(x)  £ X for  all  bit  vectors  X\  . . . xt,  and  f(x)  = x for  all  x £ X. 

Proof.  [Tomas  Feder,  Memoirs  Amer.  Math.  Soc.  555  (1995),  1-223,  Lemma  3.37; 
see  also  the  Ph.  D.  thesis  of  D.  H.  Wiedemann  (University  of  Waterloo,  1986).] 
Consider  columns  i and  j of  the  median  labels,  where  1 < i < j < t.  Any  such 
pair  of  columns  contains  at  least  three  of  the  four  possibilities  {00,01, 10, 11},  if 
we  look  through  the  entire  set  of  labels,  because  median  labels  have  no  redundant 
columns.  Let  us  write  j -A  i,  j — > i,  i -A  j,  or  i -A  j if  the  value  00,  01,  10,  or  11 
(respectively)  is  missing  from  those  two  columns;  we  can  also  note  the  equivalent 
relations  1 — > j,  1 -A  J,  j — > I,  or  j -A  7,  respectively,  which  involve  1 instead  of  i. 
For  example,  the  labels  in  Table  2 give  us  the  relations 

1 ->2,3,4, 5, 6,  7 2, 3, 4,  5, 6, 7 -A  I; 

2 — > 3, 4, 5, 6,  7 3, 4,  5, 6,  7 — » 2; 

3 -A  4,7 

4 ->  5,6,7 
5a7 
6 — > 7 

(There  is  no  relation  between  3 and  5 because  all  four  possibilities  occur  in  those 
columns.  But  we  have  3 -A  4 because  11  doesn’t  appear  in  columns  3 and  4. 
The  vertices  whose  label  has  a 1 in  column  3 are  those  closer  to  ( wyz ) than  to 
(■ wwxyz } in  Table  2;  they  form  a convex  set  in  which  column  4 of  the  labels  is 
always  0,  because  they  are  also  closer  to  ( wxxyz ) than  to  x.) 

These  relations  between  the  literals  {1,1,  2,2  contain  no  cycles, 

so  they  can  always  be  topologically  sorted  into  an  anti-symmetrical  sequence 


_4,7^3; 
5, 6,  7 -A  4; 
7 -A  5; 
7 -A  6. 


(71 
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u\u2  ...  u2t  in  which  uj  is  the  complement  of  u2t+i-j.  For  example, 

17423566532471  (72) 

is  one  such  way  to  sort  the  relations  in  (71)  topologically. 

Now  we  proceed  to  construct  the  network,  by  starting  with  t empty  lines 
and  successively  examining  elements  uk  and  uk+d  in  the  topological  sequence, 
for  d = 2t  - 2,  2t  - 3,  . . . , 1 (in  this  order),  and  for  k = 1,  2,  . . . , t - \dj 2],  If 
uk  < uk+d  is  a relation  between  columns  i and  j,  where  i < j,  we  append  new 
modules  to  lines  i and  j of  the  network  as  follows: 

If  * j If  i -t  J If  i -*  j If  i -A  j 


For  example,  from  (71)  and  (72)  we  first  enforce  1 ->•  7,  then  1 -A  4,  then  1 2, 

then  7 — >■  4 (that  is,  4 — > 7),  etc.,  obtaining  the  following  network: 


’ 4 

rrl.  1 

a 

— 1 

r — : : — 

1 

I 

L 

□ 

I 

— 

_ , 

> 

- I 1 

' ' „ 

^ 

, 

*4 

— I 

--i- 

(Go  figure.  No  modules  are  contributed  when,  say,  uk  is  7 and  uk+d  is  3,  because 
the  relation  3 -A  7 does  not  appear  in  (71).) 

Exercise  89  proves  that  each  new  cluster  of  modules  (73)  preserves  all  of  the 
previous  inequalities  and  enforces  a new  one.  Therefore,  if  x is  any  input  vector, 
f{x)  satisfies  all  of  the  inequalities;  so  f (x)  £ X by  Theorem  S.  Conversely,  if 
x £ X,  every  cluster  of  modules  in  the  network  leaves  x unchanged.  | 

Corollary  F.  Suppose  the  median  labels  in  Theorem  F are  closed  under  the 
operations  of  bitwise  AND  and  OR,  so  that  x & y £ X and  x\y  £ X whenever 
x £ X and  y £ X.  Then  there  is  a permutation  of  coordinates  under  which  the 
labels  are  representable  by  a network  of  comparator  modules  only. 

Proof.  The  bitwise  AND  of  all  labels  is  0 ...  0,  and  the  bitwise  OR  is  1 ...  1.  So  the 
only  possible  relations  between  columns  are  i -A  j and  j -a  i.  By  topologically 
sorting  and  renaming  the  columns,  we  can  ensure  that  only  i — > j occurs  when 
i < j;  and  in  this  case  the  construction  in  the  proof  never  uses  an  inverter.  | 

In  general,  if  G is  any  graph,  a homomorphism  / that  maps  the  vertices  of  G 
onto  a subset  X of  those  vertices  is  called  a retraction  if  it  satisfies  f(x)  = x for  all 
x £ X\  and  we  call  X a retract  of  G when  such  an  / exists.  The  importance  of  this 
concept  in  the  theory  of  graphs  was  first  pointed  out  by  Pavol  Hell  [§ee  Lecture 
Notes  in  Math.  406  (1974),  291-301].  One  consequence,  for  example,  is  that 
the  distance  between  vertices  in  X — the  number  of  edges  on  a shortest  path  — 
remains  the  same  even  if  we  restrict  consideration  to  paths  that  lie  entirely  in  X. 
(See  exercise  93.) 

Theorem  F demonstrates  that  every  t-dimensional  set  of  median  labels  is 
a retract  of  the  f-dimensional  hypercube.  Conversely,  exercise  94  shows  that 
hypercube  retracts  are  always  median  graphs. 
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Threshold  functions.  A particularly  appealing  and  important  class  of  Boolean 
functions  f(x i,x2, . . . ,xn)  arises  when  / can  be  defined  by  the  formula 

f(x  i,x2,...,xn)  = [wiXi  + w2x2-\ \-wnxn>t],  (75) 

where  the  constants  uq,  w2,  ...,  wn  are  integer  “weights”  and  t is  an  integer 
“threshold”  value.  For  example,  threshold  functions  are  important  even  when 
all  the  weights  are  unity:  We  have 

x\  A x2  A ■ • • A xn  — [xi  + x2  -\ + xn  > n}\  (76) 

iiVi2v--Vi„  = [ii+i2l fin>l];  (77) 

and  (xix2  ■ ■ .x2t-i)  = [x1  + x2  H f ar2t-i  >t],  (78) 

where  (2:12:2  • • • X2t_i)  stands  for  the  median  (or  majority)  value  of  a multiset  that 
consists  of  any  odd  number  of  Boolean  values  {xi,x2, . . . , x2t-i}.  In  particular, 
the  basic  mappings  x A y,  x V y,  and  (2 :yz)  are  all  threshold  functions,  and  so  is 

x = [— x > 0].  (79) 

With  more  general  weights  we  get  many  other  functions  of  interest,  such  as 

[2"  1xi+2n  2X2  + • • • + x„  > (t\t2  . . . tn)2\,  (80) 

which  is  true  if  and  only  if  the  binary  string  X1X2  . . . xn  is  lexicographically 
greater  than  or  equal  to  a given  binary  string  t^t2  . . . tn.  Given  a set  of  n objects 
having  sizes  uq,  w2,  . . . , wn,  a subset  of  those  objects  will  fit  into  a knapsack 
of  size  i — 1 if  and  only  if  f(x i,x2, . . . ,xn)  = 0,  where  Xj  = 1 represents  the 
presence  of  object  j in  the  subset.  Simple  models  of  neurons,  originally  proposed 
by  W.  McCulloch  and  W.  Pitts  in  Bull.  Math.  Biophysics  5 (1943),  115-133,  have 
led  to  thousands  of  research  papers  about  “neural  networks”  built  from  threshold 
functions. 

We  can  get  rid  of  any  negative  weight  Wj  by  setting  Xj  A-  Xj,  Wj  < wj, 

and  t <—  t + \voj\.  Thus  a general  threshold  function  can  be  reduced  to  a 
positive  threshold  function  in  which  all  weights  are  nonnegative.  Furthermore, 
any  positive  threshold  function  (75)  can  be  expressed  as  a special  case  of  the 
inedian/majority-of-odd  function,  because  we  have 

(0albx™1xlf2  . . .x„n)  — [b  + wix  1 + w2x2  H f wnxn  > b + £],  (81) 

where  xm  stands  for  m copies  of  x,  and  where  a and  b are  defined  by  the  rules 

a — max(0,2t  — 1 — w),  b ~ max(0,u;  + l — 2t),  w = wi+w2  + - ■ - + wn.  (82) 

For  example,  when  all  weights  are  1,  we  have 

(0n~1xi  . . . x„)  = Xi  A ■ • • A xn  and  (l"_1Xi  . . . x„)  = xx  V ■ • ■ V x„;  (83) 

we’ve  already  seen  these  formulas  in  (45)  when  n = 2.  In  general,  either  a or  6 is 
zero,  and  the  left-hand  side  of  (81)  specifies  a median  of  2T  — 1 elements,  where 

T — b + t.  = max(t,  u>i  + w2  + ■ ■ ■ + wn  + 1 - t).  (84) 
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There  would  be  no  point  in  letting  both  a and  b be  greater  than  zero,  because 
the  majority  function  clearly  satisfies  the  cancellation  law 

(OIX1X2  ...  X2t-l)  — (X1X2  . . . X2i  — 1).  (85) 

One  important  consequence  of  (81)  is  that  every  positive  threshold  function 
comes  from  the  pure  majority  function 

g(x  0,xx,x2,...,xn)  = (xq+6x“1x“2  . . . x“")  (86) 

by  setting  x0  = 0 or  1.  In  other  words,  we  know  all  threshold  functions  of  n vari- 
ables if  and  only  if  we  know  all  of  the  distinct  median-of-odd  functions  of  n + 1 or 
fewer  variables  (containing  no  constants).  Every  pure  majority  function  is  mono- 
tone and  self-dual;  thus  we’ve  seen  the  pure  majority  functions  of  four  variables 
{m,x,y,  z)  in  column  Sj  of  Table  2 on  page  71,  namely  (in),  ( wwxyz ),  ( wyz ), 
(wxyzz),  ( xyz ),  ( z ),  ( wxyyz ),  (y),  ( wxz ),  ( wxxyz ),  (x),  (wxy).  By  setting  w = 0 
or  1,  we  obtain  all  the  positive  threshold  functions  f(x,y,z)  of  three  variables: 

(0),  (1),  (00 xyz),  (11  xyz),  (0 yz),  (1  yz),  (0 xyzz),  (1  xyzz),  {xyz),  (z), 

(Oxyyz),  { lxyyz ),  (y),  (0 xz),  (lxz),  (0 xxyz),  (1  xxyz),  (x),  (Oxy),  (lary).  (87) 

All  150  positive  threshold  functions  of  four  variables  can  be  obtained  in  a similar 
fashion  from  the  self-dual  majority  functions  in  the  answer  to  exercise  84. 

There  are  infinitely  many  sequences  of  weights  (w1,w2, . . . ,wn),  but  only 
finitely  many  threshold  functions  for  any  given  value  of  n.  So  it  is  clear  that 
many  different  weight  sequences  are  equivalent.  For  example,  consider  the  pure 
majority  function 

(X1X2X3X4X5  Xg  ), 

in  which  prime  numbers  have  been  used  as  weights.  A brute-force  examination 
of  26  cases  shows  that 


[xlxlxlxlxl'x]?)  = {x^xlxlxlxtxl)- 


(88) 


thus  we  can  express  the  same  function  with  substantially  smaller  weights.  Simi- 
larly, the  threshold  function 


[(XXX2  . . . X20)2  > (01100100100001111110)2]  = (1225028^524288^262144  _ _ ^ 

a special  case  of  (80),  turns  out  to  be  simply 


/ 1 323^164  323  323  118  118  87  31  31  25  6 6 6 6 , /0  , 

\ 1 x2  x3  x4  x5  x6  XT  x8  x9  a'10a'lla'12a;13a'14a:r15:I;16a::172:18a;19) • (89) 


Exercise  103  explains  how  to  find  a minimum  set  of  weights  without  resorting  to 
a huge  brute-force  search,  using  linear  programming. 

A nice  indexing  scheme  by  which  a unique  identifier  can  be  assigned  to 
any  threshold  function  was  discovered  by  C.  K.  Chow  [FOCS  2 (1961),  34-38]. 
Given  any  Boolean  function  f(x1,...,xn),  let  N(f)  be  the  number  of  vectors 
x = (xi,...,x„)  for  which  f(x)  = 1,  and  let  E(/)  be  the  sum  of  all  those 
vectors.  For  example,  if  /(xj,x2)  = xx  V x2,  we  have  Nlf)  = 3 and  X(f)  = 
(0,1)  + (1,0)  + (1,1)  = (2,2). 
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Theorem  T.  Let  f(x x, . . . , xn)  and  g{x\ , . . . , xn)  be  Boolean  functions  with 
N(f)  = N(g)  and  £(/)  = £(y),  where  f is  a threshold  function.  Then  f — g. 

Proof.  Suppose  there  are  exactly  k vectors  x^\  . . . , x ^ such  that  /(x^)  = 1 
and  g{x^)  — 0.  Since  N(f)  = N(g ),  there  must  be  exactly  k vectors  y^\  . . . , 

such  that  f(y^)  = 0 and  g(y^)  — 1.  And  since  £(/)  = £(y),  we  must  also 
have  x^1)  + • • • + x ^ = t/1)  + • • • + 

Now  suppose  / is  the  threshold  function  (75);  then  we  have  w ■ x W > t and 
w ■ yd)  < t for  1 < j < k.  But  if  / ^ g we  have  k > 0,  and  w ■ (x*^  + • • • + x^)  > 
kt  > w ■ (y1'1'1  + • • ■ + y a contradiction.  | 

Threshold  functions  have  many  curious  properties,  some  of  which  are  ex- 
plored in  the  exercises  below.  Their  classical  theory  is  well  summarized  in  Saburo 
Muroga’s  book  Threshold  Logic  and  its  Applications  (Wiley,  1971). 

Symmetric  Boolean  functions.  A function  /(x1; . . . , xn)  is  called  symmetric 
if  /(x  1, . . . , xn)  is  equal  to  f(xp(ip  . . . ,xp(nj)  for  all  permutations  p(l) . . .p(n)  of 

{1 n}.  When  all  the  Xj  are  0 or  1,  this  condition  means  that  / depends  only 

on  the  number  of  Is  that  are  present  in  the  arguments,  namely  the  “sideways 

sum”  vx  — n(xx, . . . ,x„)  — x\-\ bxn.  The  notation  Skltk2,...,kr(xi, . . . ,xn)  is 

commonly  used  to  stand  for  the  Boolean  function  that  is  true  if  and  only  if  vx  is 
either  fci  or  k2  or  • • • or  kr.  For  example,  Sit3<5(v,w,x,y,z)  = w®w®x©y®x; 
S3t4t5{v,w,x,y,z)  = ( vwxyz );  S4i5(v,w,x,y,  z)  = (00 vwxyz). 

Many  applications  of  symmetry  involve  the  basic  functions  5*,(xi, . . . , x„) 
that  are  true  only  when  vx  — k.  For  example,  S3(xi,  x2,  X3,  X4,  x5,  x6)  is  true 
if  and  only  if  exactly  half  of  the  arguments  {xi, . . . , x6}  are  true  and  the  other 
half  are  false.  In  such  cases  we  obviously  have 

(x  1 , . . . , Xn)  S>k  (xi , • . . , Xn  ) A (xj  , . . . , Xn) , (90) 

where  5>fc(x1; . . . ,xn)  is  an  abbreviation  for  Sk,k+i n(xi, . . . ,x„).  The  func- 

tions 5>fc(x  1, . . . , x„)  are,  of  course,  the  threshold  functions  [xi  + • • • + x„  > k] 
that  we  have  already  studied. 

More  complicated  cases  can  be  treated  as  threshold  functions  of  threshold 
functions.  For  example,  we  have 

52,3,6,8,9 (xi,  • • • , X12)  = [vx  > 2 + 4[vx  > 4]  + 2[vx  > 7]  + 5[vx  > 10]] 

= (00xi . . ,x12(05xi . . ,x12)4(lxi  . . .xi2)2(l7xi  . . .X12)5),  (91) 

because  the  number  of  Is  in  the  outermost  majority-of-25  turns  out  to  be  re- 
spectively (11, 12, 13, 14, 11, 12, 13, 12, 13, 14, 10, 11, 12)  when  xi  + • ■ • + x12  = 
(0, 1, ... , 12).  A similar  two-level  scheme  works  in  general  [R.  C.  Minnick,  IRE 
Tfans.  EC-10  (1961),  6-16];  and  with  three  or  more  levels  of  logic  we  can  reduce 
the  number  of  thresholding  operations  even  further.  (See  exercise  113.) 

A variety  of  ingenious  tricks  have  been  discovered  for  evaluating  symmetric 
Boolean  functions.  For  example,  S.  Muroga  attributes  the  following  remarkable 
sequence  of  formulas  to  F.  Sasaki: 

Xq  © Xi  © ' * * © X2m  (xo^l^2  • * • ^2 m)j 

where  Sj  (xq XjXjjr\ . . . Xjjt-rn—\Xj4-nlXjjrrnjr.\ . . .Xj_|_2m_x),  (9^) 
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if  rn  > 0 and  if  we  consider  x2 m+k  to  be  the  same  as  for  k > 1.  In  particular, 
when  to  = 1 and  to  = 2 we  have  the  identities 

x0  ® an  © x2  — ( x0(x0xix2){xoX2Xi ));  (g3) 

X0  © • • • © X4  - ( XO(X0XIX2X3X4)(XOX2X3X4X1)(XOX3X4X1X2)(XOX4X1X2X3 )).  (94) 

The  right-hand  sides  are  fully  symmetric,  but  not  obviously  so!  (See  exercise  115.) 

Canalizing  functions.  A Boolean  function  f(x  1, . . . , xn)  is  said  to  be  canalizing 
or  forcing  if  we  might  be  able  to  deduce  its  value  by  examining  at  most  one  of 
its  variables.  More  precisely,  / is  canalizing  if  n — 0 or  if  there’s  a subscript  j for 
which  /(x)  either  has  a constant  value  when  we  set  Xj  - 0 or  a constant  value 
when  we  set  Xj  = 1.  For  example,  f(x,  y,z)  = (x  0 z)  V y is  canalizing  because  it 
always  equals  1 when  y = 0.  (When  y = 1 we  don’t  know  the  value  of  / without 
examining  also  x and  z;  but  half  a loaf  is  better  than  none.)  Such  functions, 
introduced  by  Stuart  Kauffman  [Lectures  on  Mathematics  in  the  Life  Sciences 
3 (1972),  63-116;  J.  Theoretical  Biology  44  (1974),  167-190],  have  proved  to  be 
important  in  many  applications,  especially  in  chemistry  and  biology.  Some  of 
their  properties  are  examined  in  exercises  125-129. 

Quantitative  considerations.  We’ve  been  studying  many  different  kinds  of 
Boolean  functions,  so  it’s  natural  to  ask:  How  many  n-variable  functions  of  each 
type  actually  exist?  Tables  3,  4,  and  5 provide  the  answers,  at  least  for  small 
values  of  n. 

All  functions  are  counted  in  Table  3.  There  are  2r‘  possibilities  for  each  n. 
since  there  are  22"  possible  truth  tables.  Some  of  these  functions  are  self-dual, 
some  are  monotone;  some  are  both  monotone  and  self-dual,  as  in  Theorem  P. 
Some  are  Horn  functions  as  in  Theorem  H;  some  are  Krom  functions  as  in 
Theorem  S;  and  so  on. 

But  in  Table  4,  two  functions  are  considered  identical  if  they  differ  only 
because  the  names  of  variables  have  changed.  Thus  only  12  different  cases  arise 
when  n — 2,  because  (for  example)  x V y and  x V y are  essentially  the  same. 

Table  5 goes  a step  further:  It  allows  us  to  complement  individual  variables, 
and  even  to  complement  the  entire  function,  without  essentially  changing  it. 
From  this  perspective  the  256  Boolean  functions  of  (x,  y , z)  fall  into  only  14 
different  equivalence  classes: 


Representative 

Class  size 

Representative 

Class  size 

0 

2 

x A (y  ® z) 

24 

X 

6 

x © (y  A z) 

24 

x Ay 

24 

(iAj/)V(iA  z) 

24 

a ’®y 

6 

(x  V y)  A (x  0'  z) 

48 

x /\y  A z 

16 

(x  0 y)  V (x  0 z) 

8 

x © y © z 

2 

(xyz) 

8 

x /\(yV  z) 

48 

Si(x,y,z) 

16 

We  shall  study  ways  to  count  and  to  list  inequivalent  combinatorial  objects  in 
Section  7.2.3. 
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n = 0 

n = 1 

n — 2 

n = 3 

n = 4 

n = 5 

n = 6 

arbitrary 

2 

4 

16 

256 

65,536  4,294,967,296 

18,446,744,073,709,551,616 

self-dual 

0 

2 

4 

16 

256 

65,536 

4,294,967,296 

monotone 

2 

3 

6 

20 

168 

7,581 

7,828,354 

both 

0 

1 

2 

4 

12 

81 

2,646 

Horn 

2 

4 

14 

122 

4,960 

2,771,104 

151,947,502,948 

Krom 

2 

4 

16 

166 

4,170 

224,716 

24,445,368 

threshold 

2 

4 

14 

104 

1,882 

94,572 

15,028,134 

symmetric 

2 

4 

8 

16 

32 

64 

128 

canalizing 

2 

4 

14 

120 

3,514 

1,292,276 

103,071,426,294 

Table  4 

BOOLEAN  FUNCTIONS 

DISTINCT  UNDER 

PERMUTATION  OF  VARIABLES 

n = 0 

n = 1 

n = 2 

n = 3 

n = 4 

n = 5 

n = 6 

arbitrary 

2 

4 

12 

80 

3,984  37,333,248 

25,626,412,338,274,304 

self-dual 

0 

2 

2 

8 

32 

1,088 

6,385,408 

monotone 

2 

3 

5 

10 

30 

210 

16,353 

both 

0 

1 

1 

2 

3 

7 

30 

Horn 

2 

4 

10 

38 

368 

29,328 

216,591,692 

Krom 

2 

4 

12 

48 

308 

3,028 

49,490 

threshold 

2 

4 

10 

34 

178 

1,720 

590,440 

canalizing 

2 

4 

10 

38 

294 

15,774 

149,325,022 

Table  5 

BOOLEAN 

FUNCTIONS  DISTINCT 

UNDER  COMPLEMENTATION /PERMUTATION 

n = 0 

n = 1 

n = 2 

n = 3 

n = 4 

n = 5 

n = 6 

arbitrary 

1 

2 

4 

14 

222 

616,126 

200,253,952,527,184 

self-dual 

0 

1 

1 

3 

7 

83 

109,950 

threshold 

1 

2 

3 

6 

15 

63 

567 

both 

0 

1 

1 

2 

3 

7 

21 

canalizing 

I 

2 

3 

6 

22 

402 

1,228,158 

EXERCISES 

1.  [15]  (Lewis  Carroll.)  Make  sense  of  Tweedledee’s  comment,  quoted  near  the 
beginning  of  this  section.  [Hint:  See  Table  1.] 

2.  [17]  Logicians  on  the  remote  planet  Pincus  use  the  symbol  1 to  represent  “false” 
and  0 to  represent  “true.”  Thus,  for  example,  they  have  a binary  operation  called  “or” 
whose  properties 

1 or  1 = 1,  lor  0 = 0,  0 or  1 = 0,  0or0  = 0 

we  associate  with  A.  What  operations  would  we  associate  with  the  16  logical  opera- 
tors that  Pincusians  respectively  call  “falsehood,”  “and,”  . . . , “nand,”  “validity”  (see 
Table  1)? 
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► 3.  [13]  Suppose  logical  values  were  respectively -1  for  falsehood  and +1  for  truth,  in- 
stead of  0 and  1.  What  operations  o in  Table  1 would  then  correspond  to  (a)  max(i,  y )? 
(b)  min(z,  y)l  (c)  -z?  (d)  x ■ y7 

4.  [24]  (H.  M.  Sheffer.)  The  purpose  of  this  exercise  is  to  show  that  all  of  the 
operations  in  Table  1 can  be  expressed  in  terms  of  NAND.  (a)  For  each  of  the  16 
operators  o in  that  table,  find  a formula  equivalent  to  x o y that  uses  only  A as  an 
operator.  Your  formula  should  be  as  short  as  possible.  For  example,  the  answer  for 
operation  L is  simply  “x”,  but  the  answer  for  L is  “z  A x”.  Do  not  use  the  constants 
0 or  1 in  your  formulas,  (b)  Similarly,  find  16  short  formulas  when  constants  are 
allowed.  For  example,  z [ y can  now  be  expressed  also  as  “z  A 1”. 

5.  [24]  Consider  exercise  4 with  C as  the  basic  operation  instead  of  A. 

6.  [21]  (E.  Schroder.)  (a)  Which  of  the  16  operations  in  Table  1 are  associative  — in 
other  words,  which  of  them  satisfy  x o (y  o z)  = (x  o y)  o z?  (b)  Which  of  them  satisfy 
the  identity  (z  o y)  o [y  o z)  = x o z? 

7.  [20]  Which  operations  in  Table  1 have  the  property  that  x o y = z if  and  only  if 
y o z = z? 

8.  [24]  Which  of  the  162  pairs  of  operations  (o,p)  satisfy  the  left-distributive  law 
X o (yaz)  = (loj)n(zoz)? 

9.  [16]  True  or  false?  (a)  (z  © y)  V 2 = (z  V z)  © (y  V z);  (b)  (w  ® z © y)  V 2 = 
(w  V z)  © (z  V z)  ® (y  V z);  (c)  (z  ® y)  V {y  © z)  = (z  © z)  V (y  ® z). 

10.  [17]  What  is  the  multilinear  representation  of  the  “random”  function  (22)? 

11.  [M25]  Is  there  an  intuitive  way  to  understand  exactly  when  the  multilinear  rep- 
resentation of  f(x  1, . . . ,zn)  contains,  say,  the  term  Z2Z3Z6Z8?  (See  (19).) 

► 12.  [ M23 ] The  integer  multilinear  representation  of  a Boolean  function  extends  rep- 
resentations like  (19)  to  a polynomial  f(x i,...,z„)  with  integer  coefficients,  in  such 
a way  that  f(x  1, . . . ,z„)  has  the  correct  value  (0  or  1)  for  all  2n  possible  0-1  vectors 
(zx, . . ,,x„),  without  taking  a remainder  mod  2.  For  example,  the  integer  multilinear 
representation  corresponding  to  (19)  is  1 - xy  - xz  — yz  + 3xyz. 

a)  What  is  the  integer  multilinear  representation  of  the  “random”  function  (22)? 

b)  How  large  can  the  coefficients  of  such  a representation  /(z  1, . . . , xn)  be? 

c)  Show  that,  in  every  integer  multilinear  representation,  0 < /(z i,...,z„)  < 1 
whenever  xi,  xn  are  real  numbers  with  0 < zi, . . . , z„  < 1. 

d)  Similarly,  if  /(zi, . . . , x„)  < g(x  i,...,x„)  whenever  {zi,...,z„}  C {0,1},  then 
/(z  1,  • • • ,zn)  < g(xi, . . . ,xn)  whenever  {xll...,xn}  C [0  . . 1]. 

e)  If  / is  monotone  and  0 < xj  < y3  < 1 for  1 < j < n,  prove  that  /(z)  < f(y). 

► 13.  [20]  Consider  a system  that  consists  of  n units,  each  of  which  may  be  “working” 
or  “failing.”  If  Xj  represents  the  condition  “unit  j is  working,”  then  a Boolean  function 
like  xi  A (z2  V z3)  represents  the  statement  “unit  1 is  working,  but  either  unit  2 or 
unit  3 is  failing”;  and  S3(z  1, . . . ,xn)  means  “exactly  three  units  are  working.” 

Suppose  each  unit  j is  in  working  order  with  probability  pj,  independent  of  the 
other  units.  Show  that  the  Boolean  function  /(z i,...,z„)  is  true  with  probability 
F(pi, . . . ,pn),  where  F is  a polynomial  in  the  variables  pi,  . . . , pn. 

14.  [20]  The  probability  function  F(pi, . . . ,pn)  in  exercise  13  is  often  called  the 
availability  of  the  system.  Find  the  self-dual  function  /(z i,Z2,z3)  of  maximum  avail- 
ability when  the  probabilities  (pi,p2,p3)  are  (a)  (.9,  .8,  .7);  (b)  (.8,  .6,  .4);  (c)  (.8,  .6,  .1). 
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► 15.  [M20  ] If  /(ari,.  ..,xn)  is  any  Boolean  function,  show  that  there  is  a polynomial 
F(x)  with  the  property  that  F(x)  is  an  integer  when  x is  an  integer,  and  f(x i, . . . , xn)  = 
F((xn  ...  2:1)2)  mod  2.  Hint:  Consider  (*)  mod  2. 

16.  [13]  Can  we  replace  each  V by  ® in  a full  disjunctive  normal  form? 

17.  [10]  By  De  Morgan’s  laws,  a general  disjunctive  normal  form  such  as  (25)  is  not 
only  an  OR  of  ANDs,  it  is  a NAND  of  NANDs: 


KA-Aui,,)  A---A  (uml  A • • • A umSm). 

Both  levels  of  logic  can  therefore  be  considered  to  be  identical. 

A student  named  J.  H.  Quick  rewrote  this  expression  in  the  form 

(un  A • • • A «iSl)  A •••A  («mi  A ■ ■ ■ A wmSm). 

Was  that  a good  idea? 

► 18.  [20]  Let  Bi  A ■ • 1 A tts  be  an  implicant  in  a disjunctive  normal  form  for  a Boolean 
function  /,  and  let  vi  V • • ■ V vt  be  a clause  in  a conjunctive  normal  form  for  the  same 
function.  Prove  that  Ui  = Vj  for  some  i and  j. 

19.  [20]  What  is  the  conjunctive  prime  form  of  the  “random”  function  in  (22)? 

20.  [M21]  True  or  false:  Every  prime  implicant  of  / A g can  be  written  /'A  g\  where 
/ is  a prime  implicant  of  / and  g is  a prime  implicant  of  g. 

21.  [M20]  Prove  that  a nonconstant  Boolean  function  is  monotone  if  and  only  if  it 
can  be  expressed  entirely  in  terms  of  the  operations  A and  V. 

22.  [20]  Suppose  f(xi, . . . ,xn)  = g(xi,. . . ,x„-i)  ® h(x1} . . . ,a;n_i)Aa:n  as  in  (16). 
What  conditions  on  the  functions  g and  h are  necessary  and  sufficient  for  / to  be 
monotone? 

23.  [15]  What  is  the  conjunctive  prime  form  of  (vAwAx)  V (vAxAz)  V ( xAyAz )? 

24.  [M20]  Consider  the  complete  binary  tree  with 
2k  leaves,  illustrated  here  for  k = 3.  Operate  al- 
ternately with  A or  V on  each  level,  using  A at  the 
root,  obtaining  for  example  ((i0Aii)  V (i2Ai3))  A 
((2:4  Ai5)  V(a:6  A 2:7)).  How  many  prime  implicants  does  the  resulting  function  contain? 

25.  [M21]  How  many  prime  implicants  does  (2^2:2)  A(2:2Vi3)  A- ■ • A(x„_1  V2:„)  have? 

26.  [M23]  Let  _F  and  Q be  the  families  of  index  sets  for  the  prime  clauses  and  the 
prime  implicants  of  a monotone  CNF  and  a monotone  DNF: 

f(x)=  A 9(x)=  \J  f\xj. 

ieTiei  Jee  jeJ 

Exhibit  an  x such  that  f(x)  ^ g(x)  if  any  of  the  following  conditions  hold: 

a)  There  is  an  I £ F and  a J € Q with  I C J = 0. 

b)  U/<=.f  I / Ujgg  J- 

c)  There’s  an  I £ T with  |/|  > \Q\,  or  a J € G with  |J|  > |Jj. 

d)  2"h/|  + 2n-|J|  < 2n,  where  n = | U/6Jr  /|. 

27.  [MSI]  Continuing  the  previous  exercise,  consider  the  following  algorithm  X(F,  Q), 
which  either  returns  a vector  x with  /( x)  / g(x),  or  returns  A if  f = g: 
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XI.  [Check  necessary  conditions.]  Return  an  appropriate  value  x if  condition  (a), 
(b),  (c),  or  (d)  in  exercise  26  applies. 

X2.  [Done?]  If  |.7r||C?|  < 1,  return  A. 

X3.  [Recurse.]  Compute  the  following  reduced  families,  for  a “best”  index  k: 

■Fi  = {/  | / G T,  k i /},  T0  = Tx  u {/  | k i I,  I U {k}  e T}; 

Go  = {J\J  eg,  k£  J},  gx  = e0u{J  I k j,  Ju{k}e  G}. 

Delete  any  member  of  To  or  Q\  that  contains  another  member  of  the  same  fam- 
ily. The  index  k should  be  chosen  so  that  the  ratio  p = mind-Fi \/\T\,  \Q0\/\g\) 
is  as  small  as  possible.  If  X(To,  Go)  returns  a vector  x,  return  the  same  vector 
extended  with  xk  = 0.  Otherwise  if  X(Jri,(7i)  returns  a vector  x,  return  the 
same  vector  extended  with  xk  = 1.  Otherwise  return  A.  | 

If  N = l-F|  + \G\,  prove  that  step  XI  is  executed  at  most  N°^°&n)2  times.  Hint:  Show 
that  we  always  have  p < 1 - 1/lgJV  in  step  X3. 

28.  [21]  (W.  V.  Quine,  1952.)  If  f(x i,...,xn)  is  a Boolean  function  with  prime 
implicants  pi , . . . , pq,  let  5(2/1,  • • . , yq)  = A/(x)=i  V { 2/j  I Pj(x)  = 1}-  For  example,  the 

random  function  (22)  is  true  at  the  eight  points  (28),  and  it  has  five  prime  implicants 
given  by  (29)  and  (30);  so  g(yi,. . . ,y5)  is 

(2/1V2/2)  A (2/1)  A (2/2V1/3)  A (1/4)  A (2/3 V 2/5)  A (1/5)  A (2/5)  A (2/4V2/5) 

= (l/i A 2/2 A 1/4 A 2/5)  V (2/1 A 1/3 A 2/4 A 2/5) 

in  this  case.  Prove  that  every  shortest  DNF  expression  for  / corresponds  to  a prime 
implicant  of  the  monotone  function  g. 

29.  [22]  (The  next  several  exercises  are  devoted  to  algorithms  that  deal  with  the 
implicants  of  Boolean  functions  by  representing  points  of  the  n-cube  as  n-bit  numbers 
(bn- ! • • • 61^0)2,  rather  than  as  bit  strings  x1...xn.)  Given  a bit  position  j,  and  given 
n-bit  values  v0  < vx  < • ■ ■ < vm-\,  explain  how  to  find  all  pairs  ( k,k ')  such  that 
0 < k < k'  < m and  vy  = vk  © 2j , in  increasing  order  of  k.  The  running  time  of  your 
procedure  should  be  O(m),  if  bitwise  operations  on  n-bit  words  take  constant  time. 

► 30.  [27]  The  text  points  out  that  an  implicant  of  a Boolean  function  can  be  regarded 
as  a subcube  such  as  01*0*,  contained  in  the  set  V of  all  points  for  which  the  function  is 
true.  Every  subcube  can  be  represented  as  a pair  of  binary  numbers  a = (<zn- 1 . . .a0)2 
and  b = (6„_  1 . . . 60)2,  where  a records  the  positions  of  the  asterisks  and  b records  the 
bits  in  non-*  positions.  For  example,  the  numbers  a = (00101)2  and  b = (01000)2 
represent  the  subcube  c = 01*0*.  We  always  have  a & b = 0. 

The  “j-buddy”  of  a subcube  is  defined  whenever  aj  = 0,  by  changing  b to  b © 2j . 
For  example,  01*0*  has  three  buddies,  namely  its  4-buddy  11*0*,  its  3-buddy  00*0*. 
and  its  1-buddy  01*1*.  Every  subcube  cCV  can  be  assigned  a tag  value  (fn^i  . . . t0)2, 
where  tj  = 1 if  and  only  if  the  5-buddy  of  c is  defined  and  contained  in  V.  With  this 
definition,  c represents  a maximal  subcube  (hence  a prime  implicant)  if  and  only  if  its 
tag  is  zero. 

Use  these  concepts  to  design  an  algorithm  that  finds  all  maximal  subcubes  (a,  b) 
of  a given  set  V,  where  V is  represented  by  the  n-bit  numbers  v0  < vx  < ■ ■ ■ < vm-\- 

► 31.  [28]  The  algorithm  in  exercise  30  requires  a complete  list  of  all  points  where  a 
Boolean  function  is  true,  and  that  list  may  be  quite  long.  Therefore  we  may  prefer  to 
work  directly  with  subcubes,  never  going  down  to  the  level  of  explicit  n-tuples  unless 

r I 


7.1.1 


BOOLEAN  BASICS 


83 


necessary.  The  key  to  such  higher-level  methods  is  the  notion  of  consensus  between 
subcubes  c and  c , denoted  by  cUc'  and  defined  to  be  the  largest  subcube  c"  such  that 

c"  C cUc1,  c"  g c,  and  c"  g c. 

Such  a c"  does  not  always  exist.  For  example,  if  c = 000*  and  c'  = *111,  every  subcube 
contained  in  cUc'  is  contained  either  in  c or  in  c . 

a)  Prove  that  the  consensus,  when  it  exists,  can  be  computed  componentwise  using 
the  following  formulas  in  each  coordinate  position: 

xUx  = x\J*  = *Ux  = x and  xUx  = *U*  = *,  for  x = 0 and  x = 1. 

Furthermore,  cU  c exists  if  and  only  if  the  rule  x LJ  x = * has  been  used  in  exactly 
one  component. 

b)  A subcube  with  fc  asterisks  is  called  a fc-cube.  Show  that,  if  c is  a fc-cube  and  c' 
is  a fc'-cube,  and  if  the  consensus  c"  = c U d exists,  then  c"  is  a fc"-cube  where 
1 < k"  < min(fc,fc')  + 1. 

c)  If  C and  C'  are  families  of  subcubes,  let 

CUC  = { c U c | c £ C,  c'  € C1 , and  c U c'  exists}. 

Explain  why  the  following  algorithm  works. 

Algorithm  E (Find  maximal  subcubes).  Given  a family  C of  subcubes  of  the  n- 
cube,  this  algorithm  outputs  the  maximal  subcubes  of  V = |Jcec  c,  without  actually 
computing  the  set  V itself. 

El.  [Initialize.]  Set  j <-  0.  Delete  any  subcube  c of  C that  is  contained  in  another. 

E2.  [Done.'’]  (At  this  point,  every  j-cube  C V is  contained  in  some  element 
of  C,  and  C contains  no  fc-cubes  with  k < j.)  If  C is  empty,  the  algorithm 
terminates. 

E3.  [Take  consensuses.]  Set  C «—  C LI  C,  and  remove  all  subcubes  from  C'  that 
are  fc-cubes  for  fc  < j.  While  performing  this  computation,  also  output  any 
j-cube  c 6 C for  which  c U C does  not  produce'  a (j  + l)-cube  of  C' . 

E4.  [Advance.]  Set  C ■<—  CUC',  but  delete  all  j-cubes  from  this  union.  Then  delete 
any  subcube  c £ C that  is  contained  in  another.  Set  j «—  j+ 1 and  go  to  E2.  | 

(See  exercise  7.1.3-142  for  an  efficient  way  to  perform  these  computations.) 

► 32.  [ M29 } Let  ci,  . . . , cm  be  subcubes  of  the  n-cube. 

a)  Prove  that  c\  U • • • U cm  contains  at  most  one  maximal  subcube  c that  is  not 
contained  in  ci  U • • • U Cj-i  U c-,+1  U • • • U cm  for  any  j £ {1, . . . , m}.  (If  c exists,  we 
call  it  the  generalized  consensus  of  ci,  . . . , cm,  because  c = ci  U c2  in  the  notation 
of  exercise  31  when  m = 2.) 

b)  Find  a set  of  m subcubes  for  which  each  of  the  2m  — 1 nonempty  subsets  of 
{ci, . . . , Cm}  has  a generalized  consensus. 

c)  Prove  that  a DNF  with  m implicants  has  at  most  2m  — 1 prime  implicants. 

d)  Find  a DNF  that  has  m implicants  and  2m  — 1 prime  implicants. 

33.  [ M21 ] Let  f(x i,...,x„)  be  one  of  the  (^)  Boolean  functions  that  are  true  at 
exactly  m points.  If  / is  chosen  at  random,  what  is  the  probability  that  x\  A • • • A Xk 
is  (a)  an  implicant  of  /?  (b)  a prime  implicant  of  /?  [Give  the  answer  to  part  (b)  as  a 
sum;  but  evaluate  it  in  closed  form  when  fc  = n.] 
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► 34.  [HM37]  Continuing  exercise  33,  let  c(m,  n)  be  the  average  total  number  of  impli- 
cants,  and  let  p ( m . n)  be  the  average  total  number  of  prime  implicants. 

a)  If  0-  < m < 2 n/n,  show  that  m < c(m,n)  < f m + 0(m/n)  and  p(m,n)  > 
me  1 +0(m/n);  hence  p(m,n)  = 0(c(m,n))  in  this  range. 

b)  Now  let  2n/n  < m < (1  — e)2n,  where  e is  a fixed  positive  constant.  Define  the 
numbers  t and  amn  by  the  relations 


n 


-4/3 


< 


&mn 


< n 


-2/3 


integer  t. 


Express  the  asymptotic  values  of  c{m,n)  and  p{m,n)  in  terms  of  n,  t,  and  amn. 

[Hint:  Show  that  almost  all  of  the  implicants  have  exactly  n — torn  — f — 1 literals.] 

c)  Estimate  c(m,n)/p(m,n)  when  m = 2n~1  and  n = [(lnt  - lnlnt)22‘j,  integer  t. 

d)  Prove  that  c(m,n)/p(m,n)  = 0(log  log  n/ log  log  logn)  when  m < (1  — e)2n. 

► 35.  [M25]  A DNF  is  called  orthogonal  if  its  implicants  correspond  to  disjoint  sub- 
cubes. Orthogonal  disjunctive  normal  forms  are  particularly  useful  when  the  reliability 
polynomial  of  exercise  13  is  being  calculated  or  estimated. 

The  full  DNF  of  every  function  is  obviously  orthogonal,  because  its  subcubes 
are  single  points.  But  we  can  often  find  an  orthogonal  DNF  that  has  significantly 
fewer  implicants,  especially  when  the  function  is  monotone.  For  example,  the  function 
(X1AX2)  V (12A13)  V (13A14)  is  true  at  eight  points,  and  it  has  the  orthogonal  DNF 


(x1Ax2)  V (riAi2Ax3)  V (x2 Ax3Ax4). 

In  other  words,  the  overlapping  subcubes  11**,  *11*,  **11  can  be  replaced  by  the  dis- 
joint subcubes  11**,  Oil*,  *011.  Using  the  binary  notation  for  subcubes  in  exercise  30, 
these  subcubes  have  asterisk  codes  0011,  0001,  1000  and  bit  codes  1100,  0110,  0011. 

Every  monotone  function  can  be  defined  by  a list  of  bit  codes  Bk,  . . . , Bp,  when 
the  asterisk  codes  are  respectively  Bu  . . . , Bp.  Given  such  a list,  let  the  “shadow”  Sk 
of  Bk  be  the  bitwise  OR  of  B3  k Bk,  for  all  1 < j < k such  that  1 /{B,  & Bk)  = 1: 

S*  = /3lfc  | • • • | P(k-i)k,  Pjk  = {(BjkBk)  ® ((BjkBk)  - 1))  - {(BjkBk)  - 1). 

For  example,  when  the  bit  codes  are  (B1,B2,B3)  = (1100,0110,0011),  we  get  the 
shadow  codes  (Si,S2,S3)  = (0000,1000,0100). 

a)  Show  that  the  asterisk  codes  A'j  = Bj  — Sj  and  bit  codes  Bj  define  subcubes  that 
cover  the  same  points  as  the  subcubes  with  asterisk  codes  Aj  = Bj . 

b)  A list  of  bit  codes  f?i,  . . . , Bp  is  called  a shelling  if  Bj  k Sk  is  nonzero  for  all 
1 < j < k < p.  For  example,  (1100,0110,0011)  is  a shelling;  but  if  we  arrange 
those  bit  codes  in  the  order  (1100,0011,0110)  the  shelling  condition  fails  when 
j = 1 and  k = 2,  although  we  do  have  S3  = 1001.  Prove  that  the  subcubes  in 
part  (a)  are  disjoint  if  and  only  if  the  list  of  bit  codes  is  a shelling. 

c)  According  to  Theorem  Q,  every  prime  implicant  must  appear  among  the  B’s  when 
we  represent  a monotone  Boolean  function  in  this  way.  But  sometimes  we  need 
to  add  additional  implicants  if  we  want  the  subcubes  to  be  disjoint.  For  example, 
there  is  no  shelling  for  the  bit  codes  1100  and  0011.  Show  that  we  can,  however, 
obtain  a shelling  for  this  function  (xi  Ax2)  V (X3AX4)  by  adding  one  more  bit  code. 
What  is  the  resulting  orthogonal  DNF? 

d)  Permute  the  bit  codes  {11000,01100,00110,00011, 11010}  to  obtain  a shelling. 

e)  Add  two  bit  codes  to  the  set  {110000,011000,001100,000110,000011}  in  order  to 
make  a shellable  list. 
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36.  [M21]  Continuing  exercise  35,  let  / be  any  monotone  function,  not  identically  1. 
Show  that  the  set  of  bit  vectors 

B = {x  | f(x)  = 1 and  f(x')  =0},  x = x & (z-1), 

is  always  shellable  when  listed  in  decreasing  lexicographic  order.  (The  vector  x is 
obtained  from  x by  zeroing  out  the  rightmost  1.)  For  example,  this  method  produces 
an  orthogonal  DNF  for  (iiAr2)  V (X3AZ4)  from  the  list  (1100,1011,0111,0011). 

► 37.  [M31]  Find  a shellable  DNF  for  (aqAa^)  V (x3Ax4)  V • ■ ■ V {x2n-\ f\x2n)  that  has 
2"  - 1 implicants,  and  prove  that  no  orthogonal  DNF  for  this  function  has  fewer. 

38.  [05]  Is  it  hard  to  test  the  satisfiability  of  functions  in  disjunctive  normal  form? 

► 39.  [25]  Let  f(x  1, . . . ,xn)  be  a Boolean  formula  represented  as  an  extended  binary 
tree  with  N internal  nodes  and  IV  + 1 leaves.  Each  leaf  is  labeled  with  a variable  Xk, 
and  each  internal  node  is  labeled  with  one  of  the  sixteen  binary  operators  in  Table  1; 
applying  the  operators  from  bottom  to  top  yields  f{x  1, . . . , xn ) as  the  value  of  the  root. 

Explain  how  to  construct  a formula  F(xi, . . . , xn,  yi, . . . , yN)  in  3CNF,  having 
exactly  AN  + 1 clauses,  such  that  f(x  1,. . . ,xn)  = 3yi . . . 3yNF(xi,. . .,xn,yu. . .,yN). 
(Thus  / is  satisfiable  if  and  only  if  F is  satisfiable.) 

40.  [23]  Given  an  undirected  graph  G,  construct  the  following  clauses  on  the  Boolean 
variables  {puv  \ u ^ v}  U {qUvw  \ u yf  v,  u / w,  v / w,  u —f—  w},  where  u,  v,  and  w 
denote  vertices  of  G: 

Al  {Puv  V pVu ) A ( Puv  V Pvu]  | u ^ v }; 

B = /\{  {puv  V pvw  V puw)  \ u ^ v,  u w,  u / ut }; 

17  (.Qv.vwV  Puv)  A {quvwV  Pvw)  A {quvw^  PuvV  Pvw  JJ  / WJ,  u f w } 

D {qUvw  v qwvu))  | u / w,  u-f-w}. 

Prove  that  the  formula  A A B A.C  A D is  satisfiable  if  and  only  if  G has  a Hamiltonian 
path.  Hint:  Think  of  puv  as  the  statement  ‘u  < v’. 

41.  [20]  ( The  pigeonhole  principle.)  The  island  of  San  Serriffe  contains  m pigeons  and 
n holes.  Find  a conjunctive  normal  form  that  is  satisfiable  if  and  only  if  each  pigeon 
can  be  the  sole  occupant  of  at  least  one  hole. 

42.  [20]  Find  a short,  unsatisfiable  CNF  that  is  not  totally  trivial,  although  it  consists 
entirely  of  Horn  clauses  that  are  also  Krom  clauses. 

43.  [20]  Is  there  an  efficient  way  to  decide  satisfiability  of  a conjunctive  normal  form 
that  consists  entirely  of  Horn  clauses  and/or  Krom  clauses  (possibly  mixed)? 

44.  [M23]  Complete  the  proof  of  Theorem  H by  studying  the  implications  of  (33). 

45.  [M20]  (a)  Show  that  exactly  half  of  the  Horn  functions  of  n variables  are  definite, 
(b)  Also  show  that  there  are  more  Horn  functions  of  n variables  than  monotone 
functions  of  n variables  (unless  n = 0). 

46.  [20]  Which  of  the  11  x 11  character  pairs  xy  can  occur  next  to  each  other  in  the 
context-free  grammar  (34)? 

47.  [20]  Given  a sequence  of  relations  j -<  k with  1 < j,  k < n as  in  Algorithm  2.2.3T 
(topological  sorting),  consider  the  clauses 

Xj  1 A • • • A Xjt  Xk  for  1 < k < n, 

where  is  the  set  of  elements  such  that  ji  -<  k.  Compare  the  behavior  of 

Algorithm  C on  these  clauses  to  the  behavior  of  Algorithm  2.2.3T. 
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► 48.  [21]  What’s  a good  way  to  test  a set  of  Horn  clauses  for  satisfiability? 

49.  [22]  Show  that,  if  f(xi, . . . , x„)  and  g(x i, . . . , xn)  are  both  defined  by  Horn  clauses 
in  CNF,  there  is  an  easy  way  to  test  if  f(xu...,xn)  < g(x  i,...,xn)  for  all  xu  ...,  xn. 

50.  [HM42]  There  are  ( n + 2)2n  1 possible  Horn  clauses  on  n variables.  Select  c ■ 2n 
of  them  at  random,  with  repetition  permitted,  where  c > 0;  and  let  Pn(c)  be  the 
probability  that  all  of  the  selected  clauses  are  simultaneously  satisfiable.  Prove  that 

Jim  Pn(c)  = l-(l-e-c)(l-e-2c)(l  — e-4c)(l— e-8c).... 

► 51.  [22]  A great  many  two-player  games  can  be  defined  by  specifying  a directed  graph 
m which  each  vertex  represents  a game  position.  There  are  two  players,  Alice  and  Bob. 
who  construct  an  oriented  path  by  starting  at  a particular  vertex  and  taking  turns  to 
extend  the  path,  one  arc  at  a time.  Before  the  game  starts,  each  vertex  has  either 
been  marked  A (meaning  that  Alice  wins),  or  marked  B (meaning  that  Bob  wins),  or 
marked  C (meaning  that  the  cat  wins),  or  left  unmarked. 

When  the  path  reaches  a vertex  v marked  A or  B,  that  player  wins.  The  game 
stops  without  a winner  if  v has  been  visited  before,  with  the  same  player  to  move.  If  v 
is  marked  C,  the  currently  active  player  has  the  option  of  accepting  a draw;  otherwise 
he  or  she  must  choose  an  outgoing  arc  to  extend  the  path,  and  the  other  player  becomes 
active.  (If  v is  an  unmarked  vertex  with  out-degree  zero,  the  active  player  loses.) 

Associating  four  propositional  variables  A+(v),  A-(u),  B+(v),  and  B~  (v)  with 
every  vertex  v of  the  graph,  explain  how  to  construct  a set  of  definite  Horn  clauses 
such  that  A+  ( v ) is  in  the  core  if  and  only  if  Alice  can  force  a win  when  the  path  starts 
at  v and  she  moves  first;  A~  (v)  is  in  the  core  if  and  only  if  Bob  can  force  her  to  lose  in 
that  game;  B+(v)  and  B ( v ) are  similar  to  A+(u)  and  A~(v),  but  with  roles  reversed. 

52.  [25]  ( Boolean  games.)  Any  Boolean  function  f(x  1, . . . , xn)  leads  to  a game  called 
“two  steps  forward  or  one  step  back,”  in  the  following  way:  There  are  two  players, 
0 and  1,  who  repeatedly  assign  values  to  the  variables  Xj ; player  y tries  to  make 
f(xi, . . . ,xn)  equal  to  y.  Initially  all  variables  are  unassigned,  and  the  position  marker 
m is  zero.  Players  take  turns,  and  the  currently  active  player  either  sets  m «—  m + 2 
(if  m + 2 < n)  or  m (—  m - 1 (if  m — 1 > 1),  then  sets 

f xm  0 or  1,  if  xm  was  not  previously  assigned; 

£771  ^ 2-m  5 if  xm  was  previously  assigned. 

The  game  is  over  as  soon  as  a value  has  been  assigned  to  all  variables;  then  f(x\ , . . . , xn ) 
is  the  winner.  A draw  is  declared  if  the  same  state  (including  the  value  of  m)  is’ reached 
twice.  Notice  that  at  most  four  moves  are  possible  at  any  time. 

Study  examples  of  this  game  when  2 < n < 9,  in  the  following  four  cases: 

a)  f(x i>  • • • j xn)  = [xi  . . ,xn  < xn  . . . Xi]  (in  lexicographic  order); 

b)  f(x xn)  = X!  © • ••  ® xn; 

c)  f{x  1,  - - . , xn)  = [a;  1 . . . xn  contains  no  two  consecutive  Is]; 

d)  f(x  1, . . . , xn)  = [(xi  . . . xn)2  is  prime]. 

53.  [23]  Show  that  the  impossible  comedy  festival  of  (37)  can  be  scheduled  if  a 
change  is  made  to  the  requirements  of  only  (a)  Tomlin;  (b)  Unwin;  (c)  Vegas;  (d)  Xie- 
(e)  Yankovic;  (f)  Zany. 

54.  [20]  Let  S = {1x1,  u2, . . . , uk}  be  the  set  of  literals  in  some  strong  component  of  a 
digraph  that  corresponds  to  a 2CNF  formula  as  in  Fig.  6.  Show  that  S contains  both 
a variable  and  its  complement  if  and  only  if  Uj  = Ui  for  some  j with  2 < j < k. 
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► 55.  [30]  Call  f(xi, . . . ,xn ) a renamed  Horn  function  if  there  are  Boolean  constants 
2/i , • • • , Un  such  that  f(x i ® y\, . . . , xn  © yn)  is  a Horn  function. 

a)  Given  f(x i, . . . , xn)  in  CNF,  explain  how  to  construct  g(yi, . . . , yn ) in  2CNF  so  that 
the  clauses  of /(*i©t/i, . . . , xn(Byn)  are  Horn  clauses  if  and  only  if  g(yi, . . . , yn)  = l. 

b)  Design  an  algorithm  that  decides  in  O(m)  steps  whether  or  not  all  clauses  of  a 
given  CNF  of  length  m can  be  converted  into  Horn  clauses  by  complementing  some 
subset  of  the  variables. 

► 56.  [20]  The  satisfiability  problem  for  a Boolean  function  f(x i,x2, . . . ,xn)  can  be 
stated  formally  as  the  question  of  whether  or  not  the  quantified  formula 

3*1  3*2  .. . 3xn  f(xx,x2, . . . ,x„) 

is  true;  here  1 3xj  a'  means,  “there  exists  a Boolean  value  Xj  such  that  a holds.” 

A much  more  general  evaluation  problem  arises  when  we  replace  one  or  more  of 
the  existential  quantifiers  3 Xj  by  the  universal  quantifier  V*.,-,  where  ‘Vij  a’  means, 
“for  all  Boolean  values  Xj,  a holds.” 

Which  of  the  eight  quantified  formulas  3*  3y  3z  f(x,  y,  z),  3x  3y  'iz  f(x,  y,  z),  ..., 
V*  \/y  \/z  f(x , y,  z)  are  true  when  f(x,  y,  z ) = (*Vj/)  A (rVz)  A (j/Vz)? 

► 57.  [30]  (B.  Aspvall,  M.  F.  Plass,  and  R.  E.  Tarjan.)  Continuing  exercise  56,  design 
an  algorithm  that  decides  in  linear  time  whether  or  not  a given  fully  quantified  formula 
/(*!,...,*„)  is  true,  when  / is  any  formula  in  2CNF  (any  conjunction  of  Krom  clauses). 

► 58.  [37]  Continuing  exercise  57,  design  an  efficient  algorithm  that  decides  whether  or 
not  a given  fully  quantified  conjunction  of  Horn  clauses  is  true. 

► 59.  [M20]  (D.  Pehoushek  and  R.  Fraer,  1997.)  If  the  truth  table  for  f(x i,x2, . . . ,xn) 
has  a 1 in  exactly  k places,  show  that  exactly  k of  the  fully  quantified  formulas 
Q*iQ*2  • • * Qxn  f(x i,*2,-  • • ,xn ) are  true,  when  each  Q is  either  3 or  V. 

60.  [12]  Which  of  the  following  expressions  yield  the  median  ( xyz ),  as  defined  in  (43)? 

(a)  (zA y)  © (yAz)  © (xAz).  (b)  (xWy)  © (yVz)  © (iVz).  (c)  (x®y)  A (y®z)  A (*©2). 
(d)  ( x=y ) © ( y=z ) © ( x=z ).  (e)  (xA y)  A (yAz)  A (xAz).  (f)  (xAy)  V (yAz)  V (iAz). 

61.  [13]  True  or  false:  If  o is  any  one  of  the  Boolean  binary  operations  in  Table  1,  we 
have  the  distributive  law  w o (xyz)  = ((wox)(woy)(woz)). 

62.  [25]  (C.  Schensted.)  If  f(x\, . . . , xn)  is  a monotone  Boolean  function  and  n > 3, 
prove  the  median  expansion  formula 

/(*!,..  . ,Xn)  = (f(x  1,Xi,X3,X4,..  .,Xn)f(x  1,X2,X2,X4,  ■ ■ ■ , *n)  f(x3,  X2 , X3,  *4,  . . .,*„)). 

63.  [20]  Equation  (49)  shows  how  to  compute  the  median  of  five  elements  via  medians 
of  three.  Conversely,  can  we  compute  (xyz)  with  a subroutine  for  medians  of  five? 

64.  [23]  (S.  B.  Akers,  Jr.)  (a)  Prove  that  a Boolean  function  f(x  1,...  ,xn)  is  mono- 
tone and  self-dual  if  and  only  if  it  satisfies  the  following  condition: 

For  all  * = *1  . . . *„  and  y = y\  . . . yn  there  exists  k such  that  f(x)  = Xk  and  f(y)  = yk- 

(b)  Suppose  / is  undefined  for  certain  values,  but  the  stated  condition  holds  whenever 
both  f(x)  and  f(y)  are  defined.  Show  that  there  is  a monotone  self-dual  Boolean 
function  g for  which  g(x)  = f(x)  whenever  f(x)  is  defined. 

► 65.  [M21]  Any  subset  X of  {1,  2, . . . , n)  corresponds  to  a binary  vector  x = x\x2  . . . xn 
via  the  rule  Xj  = [j  € A ] . And  any  family  J7  of  such  subsets  corresponds  to  a Boolean 
function  f(x)  = f(x  1,  x2, . . . , x„)  of  n variables,  via  the  rule  f(x)  = [A  € J7].  Therefore 
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every  statement  about  families  of  subsets  corresponds  to  a statement  about  Boolean 
functions,  and  vice  versa. 

A family  J~  is  called  intersecting  if  A Pi  Y 0 whenever  A,  Y £ 2F.  An  intersecting 
family  that  loses  this  property  whenever  we  try  to  add  another  subset  is  said  to  be 
maximal.  Prove  that  J-  is  a maximal  intersecting  family  if  and  only  if  the  corresponding 
Boolean  function  / is  monotone  and  self-dual. 

► 66.  [M25]  A coterie  of  {1, ... , n}  is  a family  C of  subsets  called  quorums,  which  have 
the  following  properties  whenever  Q £ C and  Q'  e C:  (i)  Q n Q'  ^ 0;  (ii)  Q C Q' 
implies  Q = Q' . Coterie  C dominates  coterie  C'  if  C / C'  and  if,  for  every  Q'  £ C' , 
there  is  a Q € C with  Q C Q'.  For  example,  the  coterie  {{1,  2},  {2, 3}}  is  dominated 
by  {{1,  2},  {1,3},  {2, 3}}  and  also  by  {{2}}.  [Coteries  were  introduced  in  classic  papers 
by  L.  Lamport,  CACM  21  (1978),  558-565;  H.  Garcia-Molina  and  D.  Barbara,  JACM 
32  (1985),  841-860.  They  have  numerous  applications  to  distributed  system  protocols, 
including  mutual  exclusion,  data  replication,  and  name  servers.  In  these  applications 
C is  preferred  to  any  coterie  that  it  dominates.] 

Prove  that  C is  a nondominated  coterie  if  and  only  if  its  quorums  are  the  index 
sets  of  variables  in  the  prime  implicants  of  a monotone  self-dual  Boolean  function 
f(x i, . . . ,x„).  (Thus  Table  2 illustrates  the  nondominated  coteries  on  {1,2, 3, 4}.) 

► 67.  [M30\  (J.  W.  Milnor  and  C.  Schensted.)  A triangular  grid  of 
order  n,  illustrated  here  for  n = 3,  contains  (n  + 2)(n  + l)/2  points 
with  nonnegative  “bary centric  coordinates”  xyz,  where  x + y + z = n. 

Two  points  are  adjacent  if  they  differ  by  ±1  in  exactly  two  coordinate 
positions.  A point  is  said  to  lie  on  the  x side  if  its  x coordinate  is 
zero,  on  the  y side  if  its  y coordinate  is  zero,  or  on  the  2 side  if  its  2 
coordinate  is  zero;  thus  each  side  contains  n+ 1 points.  If  n > 0,  a point  lies  on  two  dif- 
ferent sides  if  and  only  if  it  occupies  one  of  the  three  corner  positions. 

A Y is  a connected  set  of  points  with  at  least  one  point  on  each  side.  Suppose 
each  vertex  of  a triangular  grid  is  covered  with  a white  stone  or  a black  stone.  For 
example,  the  52  black  stones  in 
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contain  a (somewhat  distorted)  Y;  but  if  any  of  them  is  changed  from  black  to  white, 
there  is  a white  Y instead.  A moment’s  thought  makes  it  intuitively  clear  that,  in  any 
placement,  the  black  stones  contain  a Y if  and  only  if  the  white  stones  do  not. 

We  can  represent  the  color  of  each  stone  by  a Boolean  variable,  with  0 for  white  and 
1 for  black.  Let  Y(t)  = 1 if  and  only  if  there’s  a black  Y,  where  t is  a triangular  grid  com- 
prising all  the  Boolean  variables.  This  function  Y is  clearly  monotone;  and  the  intuitive 
claim  made  in  the  preceding  paragraph  is  equivalent  to  saying  that  Y is  also  self-dual. 
The  purpose  of  this  exercise  is  to  prove  the  claim  rigorously,  using  median  algebra. 

Given  a,  b,  c > 0,  let  tabc  be  the  triangular  subgrid  containing  all  points  whose 
coordinates  xyz  satisfy  x > a,  y > b,  z > c.  For  example,  f0oi  denotes  all  points  except 
those  on  the  2 side  (the  bottom  row).  Notice  that,  if  a + b + c = n,  tabc  is  the  single 
point  with  coordinates  abc,  and  in  general,  tabc  is  a triangular  grid  of  order  n-a-b-c. 
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a)  Prove  that,  if  n > 0,  Y(t)  = (Y(tioo)Y(t0io)Y(t0oi)). 

b)  If  n > 0,  let  t*  be  the  triangular  grid  of  order  n — 1 defined  by  the  rule 

! xyz  (x-\-l)yztx(y+l)ztxy(z+l) ) > for  X -\~  y Z — Tl  1. 

Prove  that  Y(t)  = Y(t*).  [In  other  words,  t*  condenses  each  small  triangle  of 
stones  by  taking  the  median  of  their  colors.  Repeating  this  process  defines  a 
pyramid  of  stones,  with  the  top  stone  black  if  and  only  if  there  is  a black  Y at  the 
bottom.  It’s  fun  to  apply  this  condensation  principle  to  the  twisted  Y above.] 

68.  [46]  The  just-barely-Y  configuration  shown  in  the  previous  exercise  has  52  black 
stones.  What  is  the  largest  number  of  black  stones  possible  in  such  a configuration? 
(That  is,  how  many  variables  can  there  be  in  a prime  implicant  of  the  function  Y(t)l) 

► 69.  [M26]  (C.  Schensted.)  Exercise  67  expresses  the  Y function  in  terms  of  medians. 
Conversely,  let  f(x i, . . . ,xn)  be  any  monotone  self-dual  Boolean  function  with  m + 1 
prime  implicants  p0,  Pi,  •••,  Pm ■ Prove  that  f(x = Y(T),  where  T is  any 
triangular  grid  of  order  m — 1 in  which  Tabc  is  a variable  common  to  pa  and  pa+b+i, 
fora  + 6 + c = m-  l.  For  example,  when  f(w,  x,  y,  z)  = ( xwywz ) we  have  m = 3 and 

f(w,  x,  y,  z)  — (w  A x)  V (w  A y)  V (w  A z)  V (x  A y A z)  = Y[  ™ ™ ) . 

\x  y z) 

► 70.  [M20]  (A.  Meyerowitz,  1989.)  Given  any  monotone  self-dual  Boolean  function 
f(x)  = f(x i, . . . ,xn),  choose  any  prime  implicant  Xjx  A • • • A Xj3  and  let 

9(x)  = (f(x)  A [x^t])  V [x  = t], 

where  t = ti  . . . tn  is  the  bit  vector  that  has  Is  in  positions  {j i, . . . , js}.  Prove  that 
g(x)  is  also  monotone  and  self-dual.  (Notice  that  g( x)  is  equal  to  f(x)  except  at  the 
two  points  t and  t.) 

► 71.  [M21]  Given  the  axioms  (50),  (51),  and  (52)  of  a median  algebra,  prove  that  the 
long  distributive  law  (54)  is  a consequence  of  the  shorter  law  (53). 

72.  [M22]  Derive  (58),  (59),  and  (60)  from  the  median  laws  (so)-(53). 

73.  [M32]  (S.  P.  Avann.)  Given  a median  algebra  M,  whose  intervals  are  defined 
by  (57)  and  whose  corresponding  median  graph  is  defined  by  (61),  let  d(u,v)  denote 
the  distance  from  u to  v.  Also  let  l[uxv\  stand  for  the  statement  “ x lies  on  a shortest 
path  from  u to  v." 

a)  Prove  that  [uxv]  holds  if  and  only  if  d(u,v)  = d(u,  x)  + d(x,v). 

b)  Suppose  x 6 [u . . v]  and  u G [x . . y],  where  x ^ u and  y — v is  an  edge  of  the 
graph.  Show  that  x — u is  also  an  edge. 

c)  If  £ G [u . . t>],  prove  [uxv],  by  induction  on  d(u,v). 

d)  Conversely,  prove  that  [uxv]  implies  x G [u . . v], 

74.  [M21]  In  a median  algebra,  show  that  w = (xyz)  whenever  we  have  w G [x..y\, 
w G [x  . . z],  and  w G [y  . . z]  according  to  definition  (57). 

► 75.  [ M36 ] (M.  Sholander,  1954.)  Suppose  M is  a set  of  points  with  a betweenness 
relation  “x  lies  between  u and  v,"  symbolized  by  [uxv],  which  satisfies  the  following 
three  axioms: 

i)  If  [uvu]  then  u = v. 

ii)  If  [uxu]  and  [xyu]  then  [vyu]. 

iii)  Given  x,  y,  and  z,  exactly  one  point  w = (xyz)  satisfies  [xwy],  [xwz],  and  [ywz\. 
The  object  of  this  exercise  is  to  prove  that  M is  a median  algebra. 

a)  Prove  the  majority  law  (xxy)  = x,  Eq.  (50). 
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b)  Prove  the  commutative  law  ( xyz ) = (xzy)  = ■ ■ ■ = (zyx),  Eq.  (51). 

c)  Prove  that  [uxv]  if  and  only  if  x = (uxv). 

d)  If  [uxy]  and  [uyv],  prove  that  [xyv\. 

e)  If  [uxv]  and  [uyz]  and  [vyz],  prove  that  [xyz].  Hint:  Construct  the  points  w = 
(yuv),  p = (wux),  q = (wvx),  r = ( pxz >,  s = (qxz),  and  t = { rsz }. 

f)  Finally,  deduce  the  short  distributive  law,  Eq.  (53):  (( xyz)uv ) = ( x{yuv)(zuv )). 

76.  [ M33 ] Derive  the  betweenness  axioms  (i),  (ii),  and  (iii)  of  exercise  75,  starting 
from  the  three  median  axioms  (50),  (51),  and  (52),  letting  [uxv]  be  an  abbreviation  for 
“x  = {uxv)."  Do  not  use  the  distributive  law  (53).  Hint:  See  exercise  74. 

77.  [M28]  Let  G be  a median  graph  containing  the  edge  r — s.  For  each  edge  u v. 

call  u an  early  neighbor  of  v if  and  only  if  r is  closer  to  u than  to  v.  Partition  the 
vertices  into  “left”  and  “right”  parts,  where  left  vertices  are  closer  to  r than  to  s and 
right  vertices  are  closer  to  s than  to  r.  Each  right  vertex  v has  a rank , which  is  the 
shortest  distance  from  v to  a left  vertex.  Similarly,  each  left  vertex  u has  rank  1 — d, 
where  d is  the  shortest  distance  from  u to  a right  vertex.  Thus  u has  rank  zero  if  it  is 
adjacent  to  a right  vertex,  otherwise  its  rank  is  negative.  Vertex  r clearly  has  rank  0, 
and  s has  rank  1. 

a)  Show  that  every  vertex  of  rank  1 is  adjacent  to  exactly  one  vertex  of  rank  0. 

b)  Show  that  the  set  of  all  right  vertices  is  convex. 

c)  Show  that  the  set  of  all  vertices  with  rank  1 is  convex. 

d)  Prove  that  steps  13-19  of  Subroutine  I correctly  mark  all  vertices  of  ranks  1 and  2. 

e)  Prove  that  Algorithm  H is  correct. 

► 78.  [M26]  If  the  vertex  v is  examined  k times  in  step  14  during  the  execution  of 
Algorithm  H,  prove  that  the  graph  has  at  least  2k  vertices.  Hint:  There  are  k ways  to 
start  a shortest  path  from  v to  a;  thus  at  least  k Is  appear  in  l{v) . 

► 79.  [Mg 7]  (R.  L.  Graham.)  An  induced  subgraph  of  a hypercube  is  a graph  whose 
vertices  v can  be  labeled  with  bit  strings  l(v)  in  such  a way  that  u — v if  and  only  if 
l{u)  and  l(v)  differ  in  exactly  one  bit  position.  (Each  label  has  the  same  length.) 

a)  One  way  to  define  an  n-vertex  subgraph  of  a hypercube  is  to  let  l(v)  be  the 
binary  representation  of  v,  for  0 < v < n.  Show  that  this  subgraph  has  exactly 
f(n ) = '52k= 0 v{k)  edges,  where  is(k)  is  the  sideways  addition  function. 

b)  Prove  that  f(n)  < n[lgn]/2. 

c)  Prove  that  no  n-vertex  subgraph  of  a hypercube  has  more  than  f(n)  edges. 

80.  [27]  A partial  cube  is  an  “isometric”  subgraph  of  a hypercube,  namely  a subgraph 
in  which  the  distances  between  vertices  are  the  same  as  they  are  in  the  full  graph.  The 
vertices  of  a partial  cube  can  therefore  be  labeled  in  such  a way  that  the  distance 
from  u to  v is  the  “Hamming  distance”  between  l{u)  and  l(v),  namely  u(l(u)  © l(v)). 
Algorithm  H shows  that  every  median  graph  is  a partial  cube. 

a)  Find  an  induced  subgraph  of  the  4-cube  that  isn’t  a partial  cube. 

b)  Give  an  example  of  a partial  cube  that  isn’t  a median  graph. 

81.  [16]  Is  every  median  graph  bipartite? 

82.  [25]  ( Incremental  changes  in  service.)  Given  a sequence  of  vertices  (no,  tq, . . . , vt ) 
in  a graph  G,  consider  the  problem  of  finding  another  sequence  (u0,  ux , . . . , ut)  for  which 
u0  = v0  and  the  sum 

(d(u0,  mi)  + d(Ul,u2)  + ■■■  + d(ut-uut))  + (d(ui  ,Vl)  + d(u2,v2)  + ■■■  + d(ut,vt)) 
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is  minimized,  where  d{u,  v)  denotes  the  distance  from  u to  v.  (Each  Vk  can  be  regarded 
as  a request  for  a resource  needed  at  that  vertex;  a server  moves  to  Uk  as  those  requests 
are  handled  in  sequence.)  Prove  that  if  G is  a median  graph,  we  get  an  optimum  solution 
by  choosing  Uk  = (uk-iVkVk+i)  for  0 < k < t,  and  ut  = vt. 

► 83.  [38]  Generalizing  exercise  82,  find  an  efficient  way  to  minimize 

(d(u o,ui)  + d(u 1,112)  + • • • + d(ut-i,ut ))  + p(d(ui,vi)  + d(u 2,  V2)  + h d(ut,vt )) 

in  a median  graph,  given  any  positive  ratio  p. 

84.  [30]  Write  a program  to  find  all  monotone  self-dual  Boolean  functions  of  five 
variables.  What  are  the  edges  of  the  corresponding  median  graph?  (Table  2 illustrates 
the  four-variable  case.) 

► 85.  [M22]  Theorem  S tells  us  that  every  formula  in  2CNF  corresponds  to  a median 
set;  therefore  every  antisymmetric  digraph  such  as  Fig.  6 also  corresponds  to  a median 
set.  Precisely  which  of  those  digraphs  correspond  to  reduced  median  sets? 

86.  [15]  If  v,  w , x,  y , and  z belong  to  a median  set  X,  does  their  five-element  median 
( vwxyz ),  computed  componentwise,  always  belong  to  X? 

87.  [24]  What  Cl-net  does  the  proof  of  Theorem  F construct  for  the  free  tree  (63)? 

88.  [M21]  We  can  use  parallel  computation  to  condense  the  network  (74)  into 


' 

’ ' 

I 

iflW 

IM 

I T 

. - . -1 

: : : n 

7 — r 

— 

- 

I 

♦=3 

■HHi 

Ml 

1— | 

PM 

by  letting  each  module  act  at  the  earliest  possible  time.  Prove  that,  although  the 
network  constructed  in  the  proof  of  Theorem  F may  contain  Q(t2)  modules,  it  always 
requires  at  most  0(t  log  t)  levels  of  delay. 

89.  [24]  When  the  construction  (73)  appends  a new  cluster  of  modules  to  enforce 
the  condition  u — >■  v,  for  some  literals  u and  v,  prove  that  it  preserves  all  previously 
enforced  conditions  v!  — > v' . 

► 90.  [21]  Construct  a Cl-net  with  input  bits  x\  . . . xt  and  output  bits  yi . . ,yt,  where 
Vi  = ■ • ■ = J/t-i  = 0 and  yt  = aq  © • • • © xt.  Try  for  only  O(logt)  levels  of  delay. 

91.  [46]  Can  a retraction  mapping  for  the  labels  of  every  median  graph  of  dimension  t 
be  computed  by  a Cl-net  that  has  only  O(logf)  levels  of  delay?  [This  question  is  moti- 
vated by  the  existence  of  asymptotically  optimum  networks  for  the  analogous  problem 
of  sorting;  see  M.  Ajtai,  J.  Komlos,  and  E.  Szemeredi,  Combinatorica  3 (1983),  1-19.] 

92.  [46]  Can  a Cl-net  sort  n Boolean  inputs  with  fewer  modules  than  a “pure”  sorting 
network  that  has  no  inverters? 

93.  [M20]  Prove  that  every  retract  A of  a graph  G is  an  isometric  subgraph  of  G. 
(In  other  words,  distances  in  X are  the  same  as  in  G;  see  exercise  80.) 

94.  [M21]  Prove  that  every  retract  X of  a hypercube  is  a set  of  median  labels,  if  we 
suppress  coordinates  that  are  constant  for  all  ® € X. 

95.  [M25]  True  or  false:  The  set  of  all  outputs  produced  by  a comparator-inverter 
network,  when  the  inputs  range  over  all  possible  bit  strings,  is  always  a median  set. 

96.  [HM25]  Instead  of  insisting  that  the  constants  uq,  uq,  • • • , w„,  and  t in  (75)  must 
be  integers,  we  could  allow  them  to  be  arbitrary  real  numbers.  Would  that  increase 
the  number  of  threshold  functions? 
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97.  [10]  What  median/majority  functions  arise  in  (8i)  when  n = 2,  wi  = w2  = 1, 
and  t = -1,  0,  1,  2,  3,  or  4? 

98.  [M23]  Prove  that  any  self-dual  threshold  function  can  be  expressed  in  the  form 

f(x  1j  *2,  ■ ■ ■ i xn)  = [ viyi  + • ■ ■ + vnyn  > 0], 

where  each  Vj  is  either  Xj  or  Xj.  For  example,  2*i  +3^2  + 5*3  + 7x4  + lla*  + 13x6  > 21 
if  and  only  if  2xx  + 3*2  + 5*3  - 7*4  + llz5  - 13*6  > 0. 

► 99.  [20]  (J.  E.  Mezei,  1961.)  Prove  that 

((*i  . . ■x2a-i)y1  . . .y2t-2)  = (*i  . ■ ■ x2s_1yj  . . .t/|t_2). 

100.  [20]  True  or  false:  If  /(*i, . ..  ,x„)  is  a threshold  function,  so  are  the  functions 
/(*i,. . . ,xn)  A *n+i  and  f(x i, . . . ,xn ) V xn+i- 

101.  [M23]  The  Fibonacci  threshold  function  Fn{x i, . is  defined  by  the  formula 

{*!  *2 2 ■ . . i1xnn  2)  when  n > 3;  for  example,  F7(xx, . . . , *7)  = (*1*2*|*4*|*|*7). 

a)  What  are  the  prime  implicants  of  Fn(x i, . . . ,*„)? 

b)  Find  an  orthogonal  DNF  for  Fn(x i,  — , xn)  (see  exercise  35). 

c)  Express  Fn(x i,  . . . ,*„)  in  terms  of  the  Y function  (see  exercises  67  and  69). 

102.  [M21]  The  self-dualization  of  a Boolean  function  is  defined  by  the  formulas 

f(x  o,xi,...,xn)  = (ioA/(ii,  ...,*„))  V (*oA/(*i, ...,  xn)) 

= (*oV/(*i, . . . ,xn))  A (*o  V/(*i, . . .,*„)). 

a)  If  f(xi, . . . , *„)  is  any  Boolean  function,  prove  that  / is  self-dual. 

b)  Prove  that  / is  a threshold  function  if  and  only  if  / is  a threshold  function. 

103.  [HM25]  Explain  how  to  use  linear  programming  to  test  whether  or  not  a mono- 
tone, self-dual  Boolean  function  is  a threshold  function,  given  a list  of  its  prime 
implicants.  Also,  if  it  is  a threshold  function,  explain  how  to  minimize  the  size  of 
its  representation  as  a majority  function  (x™1  . . . *“’*). 

104.  [ 2o ] Apply  the  method  of  exercise  103  to  find  the  shortest  representations  of 
the  following  threshold  functions  as  majority  functions:  (a)  {xj x^xlxlxl1  x\3 x\7 x\9)- 
(b)  [{xiX2x3Xa)2  > f],  for  0 < t < 16  (17  cases);  (c)  (xj9 *|5*J9*f*P*l*?*!*|*10). 

105.  [M25]  Show  that  the  Fibonacci  threshold  function  in  exercise  101  has  no  shorter 
representation  as  a majority  function  than  the  one  used  to  define  it. 

► 106.  [M25]  The  median-of-three  operation  (xyz)  is  true  if  and  only  if  x > y + z. 

a)  Generalizing,  show  that  we  can  test  the  condition  (*i*2  • . .*„)2  >(t/it/2  . . -yn)2+z 
by  performing  a median  of  2n+1  - 1 Boolean  variables. 

b)  Prove  that  no  median  of  fewer  than  2n+1  - 1 will  suffice  for  this  problem. 

107.  [17]  Calculate  N(f)  and  E (/)  for  the  16  functions  in  Table  1.  (See  Theorem  T.) 

108.  [M21]  Let  g(x o,*i, . . . ,*n)  be  a self-dual  function;  thus  IV((/)  = 2n  in  the  nota- 
tion of  Theorem  T.  Express  N(f)  and  E (/)  in  terms  of  E (<?),  when  f(xu.  ,.,xn)  is 
(a)  9(0, xi,...,  Xn);  (b)  9(1,  *1, . . . , xn). 

109.  [M2j]  The  binary  string  a = ax...an  is  said  to  majorize  the  binary  string 
P = bx...bn,  written  a t P or  /3  ^ a,  if  ai  + ■ ■ • + ak  > bx  + • • • + bk  for  0 < k < n. 

a)  Let  a = di  . . . an.  Show  that  a Y /3  if  and  only  if  P > a. 
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length  5.  (See  exercise  109.) 
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b)  Show  that  any  two  binary  strings  of  length  n have  a greatest  lower  bound  a A f3, 
which  has  the  property  that  a y 7 and  (3  > 7 if  and  only  if  a A f3  y 7.  Explain 
how  to  compute  a A /3,  given  a and  /3. 

c)  Similarly,  explain  how  to  compute  a least  upper  bound  a V /3,  with  the  property 
that  7 y a and  7 > f3  if  and  only  if  7 >7  a V (3. 

d)  True  or  false:  a A (/3V 7)  = (aA/3)  V (aA7);  a V (/3A7)  = (aV/3)  A (aV 7). 

e)  Say  that  a covers  /3  if  a y f3  and  a ^ f3,  and  if  a >7  7 V /3  implies  that  we 
have  either  7 = a or  7 = /3.  For  example,  Fig.  8 illustrates  the  covering  relations 
between  binary  strings  of  length  5.  Find  a simple  way  to  describe  the  strings  that 
are  covered  by  a given  binary  string. 

f)  Show  that  every  path  a = Qo,  qi,  . . . , ar  = 0 . . . 0 from  a given  string  a to  0 ...  0, 
where  aj-i  covers  atj  for  1 < j < r,  has  the  same  length  r = r(a). 

g)  Let  m(a)  be  the  number  of  binary  strings  f3  with  /3  y a.  Prove  that  m(la)  = 
m(a ) and  m(0a)  = m(a ) + m(a'),  where  a is  a with  its  leftmost  1 (if  any) 
changed  to  0. 

h)  How  many  strings  a of  length  n satisfy  a V a? 

110.  [M23]  A Boolean  function  is  called  regular  if  x -<  y implies  that  f(x)  < f(y) 
for  all  vectors  x and  y , where  A is  the  majorization  relation  in  exercise  109.  Prove  or 
disprove  the  following  statements: 

a)  Every  regular  function  is  monotone. 

b)  If  f is  a threshold  function  (75)  for  which  Wi  > W2  > ■ ■ ■ > wn,  f is  regular. 

c)  If  / is  as  in  (b)  and  E (/)  = (si, . . . , s„),  then  si  > s2  > • • ■ > sn. 

d)  If  / is  a threshold  function  (75)  with  > s2  > ■ ■ ■ > sn,  then  wi  > w2  > ■ ■ ■ > w„. 

111.  [M36]  An  optimum  coterie  for  a system  with  working  probabilities  (pi, . . . ,pn)  is 

a coterie  that  corresponds  to  a monotone  self-dual  function  with  maximum  availability, 
among  all  monotone  self-dual  functions  with  n variables.  (See  exercises  14  and  66.) 

a)  Prove  that  if  1 > pi  > ■ ■ • > pn  > | , at  least  one  self-dual  function  with  maximum 
availability  is  a regular  function.  Describe  such  a function. 

b)  Furthermore,  it  suffices  to  test  the  optimality  of  a regular  self-dual  function  / at 
points  y of  the  binary  majorization  lattice  for  which  f{y)  = 1 but  f(x)  = 0 for 
all  x covered  by  y. 

c)  What  coterie  is  optimum  when  some  probabilities  are  < |? 
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► 112.  [M3  7]  (J.  Hastad.)  If  f(x  i,x2,...,xm)  is  a Boolean  function,  let  M(f)  be  its 
representation  as  a multilinear  polynomial  with  integer  coefficients  (see  exercise  12). 
Arrange  the  terms  in  this  polynomial  by  using  Chase’s  sequence  a0  = 00 . . . 0,  ax  = 
10... 0,  ...,  a2m_ i = 11...  1 to  order  the  exponents;  Chase’s  sequence,  obtained 
by  concatenating  the  sequences  An0,  A{n_1)u  . . . , A0n  of  7.2.1.3-(35),  has  the  nice 
property  that  aq  is  identical  to  aq+i  except  for  a slight  change,  either  0 — > 1 or  01  — > 10 
or  001  -A  100  or  10  -4  01  or  100  — > 001.  For  example,  Chase’s  sequence  is 

0000, 1000, 0010,0001, 0100, 1100, 1010, 1001,  0011, 0101, 0110, 1110, 1101, 1011, 0111. 1111 

when  m = 4,  corresponding  to  the  respective  terms  1,  xu  x3,  x4,  x2,  Xlx2,  . . . , x2x3x4 , 
xlX2x3Xi-  so  the  relevant  representation  of,  say,  ((aq  © x2)  A x3)  V (xx  A 23  A x4)  is 

%3  — X1X3  -f-  X1X4  — X2X3  + 2X\X2Xs  — X1X3X4 

when  the  terms  have  been  arranged  in  this  order.  Now  let 

F(f)  = [tlle  most  significant  coefficient  of  M(f)  is  positive]. 

For  example,  the  most  significant  (final)  nonzero  term  of  ((aq  © x2)  A23)  V (xx  A23  A24) 
is  — X1X3X4  in  Chase’s  ordering,  SO  F(f)  = 0 in  this  case. 

a)  Determine  F{f)  for  each  of  the  16  functions  in  Table  1. 

b)  Show  that  F(f)  is  a threshold  function  of  the  n = 2m  entries  {/o...oo,/o  01 

/1...11}  of  the  truth  table  for  /.  Write  this  function  out  explicitly  when  m = 2. 

c)  Prove  that,  when  m is  large,  all  the  weights  in  any  threshold  representation  of  F 
must  be  huge:  Their  absolute  values  must  all  exceed 

3(  3 ) 7(T)  15(T)  . . . (2m-1_l)(m) 

i (l-0(n-1))  = 2 mn/2-"-2(3/2)m/ln2+o((5/4)m) 

Hint:  Consider  discrete  Fourier  transforms  of  the  truth  table  entries. 

113.  [24]  Show  that  the  following  three  threshold  operations  suffice  to  evaluate  the 
function  S2t3)e^i9(xx, . . . ,xX2)  in  (91): 

gi(xi,-  ■ -,xu)  = [ux>&]  = (laq  . . .aq2); 

<72  (aq, . . . , X12)  = [vx  — 6<q  >2]  = (l3a:i...  aq2<7i); 

53(2:1,...,  2i2 ) = [-2i22  + 13ffl  +752  >1]  = (05xi...x212g}3g72). 

Also  find  a four-threshold  scheme  that  evaluates  Si,3,5,8(aq, . . . ,Xl2). 

114.  [20]  (D.  A.  Huffman.)  What  is  the  function  S3,6(2, 2, 2, 2, 5,  5, 2)? 

115.  [M2 2]  Explain  why  (92)  correctly  computes  the  parity  function  20©2i©-  • -©22m. 

► 116.  [HM28]  (B.  Dunham  and  R.  Fridshal,  1957.)  By  considering  symmetric  functions, 
one  can  prove  that  Boolean  functions  of  n variables  might  have  many  prime  implicants. 

a)  Suppose  0 < j ^ k < n.  For  which  symmetric  functions  /( aq, . , . , Xn)  is  the  term 
21  A • ■ • A Xj  A Xj+i  A ■ ■ • A 2 k a prime  implicant? 

b)  How  many  prime  implicants  does  the  function  S3,4,5,6(aq, . . . , 29)  have? 

c)  Let  b(n)  be  the  maximum  number  of  prime  implicants,  over  all  symmetric  Boolean 
functions  of  n variables.  Find  a recurrence  formula  for  b(n),  and  compute  S(9). 

d)  Prove  that  6(n)  = 0(3n/«)- 

e)  Show  that,  furthermore,  there  are  symmetric  functions  f(xi,...,xn)  for  which 
both  / and  / have  0(23n/2/n)  prime  implicants. 
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117.  [M26]  A disjunctive  normal  form  is  called  irredundant  if  none  of  its  implicants 
implies  another.  Let  b*  (n)  be  the  maximum  number  of  implicants  in  an  irredundant 
DNF,  over  all  Boolean  functions  of  n variables.  Find  a simple  formula  for  fe*(n),  and 
determine  its  asymptotic  value. 

118.  [29]  How  many  Boolean  functions  f(x i, £2, £3, £4)  have  exactly  m prime  impli- 
cants, for  m = 0,  1,  . . . ? 

119.  [M48]  Continuing  the  previous  exercises,  let  b(n)  be  the  maximum  number  of 
prime  implicants  in  a Boolean  function  of  n variables.  Clearly  b(n)  < b(n)  < b*(n); 
what  is  the  asymptotic  value  of  b(n)l 

120.  [23]  What  is  the  shortest  DNF  for  the  symmetric  functions  (a)  x\  ©2:2©-  • -©xj 
(b)  So,  1,3, 4,6, 7(2:1, . . . , 2:7)?  (c)  Prove  that  every  Boolean  function  of  n variables  can  be 
expressed  as  a DNF  with  at  most  2n_1  prime  implicants. 

► 121.  [M23]  The  function  (l(xi 9x2)919293)  is  partially  symmetric,  since  it  is  symmet- 
ric in  {£1,2:2}  and  in  {91,92,93},  but  not  in  all  five  variables  {2:1, 2:2, 91, 92, 2/3 } . 

a)  Exactly  how  many  Boolean  functions  /( x\, . . . , xm,9i, . . . , yn)  are  symmetric  in 
{2:1  7 • • • 5 *£ra  } and  {91,. . .,y„}4 

b)  How  many  of  those  functions  are  monotone? 

c)  How  many  of  those  functions  are  self-dual? 

d)  How  many  of  those  functions  are  monotone  and  self-dual? 

122.  [M25]  Continuing  exercises  110  and  121,  find  all  Boolean  functions  /( 2:1,  *2, 2:3, 
91,92,93,94,93,9%)  that  are  simultaneously  symmetric  in  {2:1, X2, £3},  symmetric  in 
{91,92,  ■ ■ ■ , 9% } , self-dual,  and  regular.  Which  of  them  are  threshold  functions? 

123.  [46]  Determine  the  exact  number  of  self-dual  Boolean  functions  of  ten  variables 
that  are  threshold  functions. 

124.  [20]  Find  a Boolean  function  of  four  variables  that  is  equivalent  to  767  other 
functions,  under  the  ground  rules  of  Table  5. 

125.  [18]  Which  of  the  function  classes  in  (95)  are  canalizing? 

126.  [23]  (a)  Show  that  a Boolean  function  is  canalizing  if  and  only  if  its  sets  of  prime 
implicants  and  prime  clauses  have  a certain  simple  property,  (b)  Show  that  a Boolean 
function  is  canalizing  if  and  only  if  its  Chow  parameters  N(f)  and  £(/)  have  a certain 
simple  property  (see  Theorem  T).  (c)  Define  the  Boolean  vectors 

V(/)  = \/{x  | f(x)  = i}  and  A(/)  = /\{*  I /(®)  = !}; 

by  analogy  with  the  integer  vector  E(/).  Show  that  it’s  possible  to  decide  whether  or 
not  / is  canalizing,  given  only  the  four  vectors  V(/),  V(/),  A (/),  and  A (/). 

127.  [M25]  Which  canalizing  functions  are  (a)  self-dual?  (b)  definite  Horn  functions? 

► 128.  [20]  Find  a noncanalizing  f(x\, . . . ,x„)  that  is  true  at  exactly  two  points. 

129.  [M25]  How  many  different  canalizing  functions  of  n variables  exist? 

130.  [M21]  According  to  Table  3,  there  are  168  monotone  Boolean  functions  of  four 
variables.  But  some  of  them,  like  x Ay,  depend  on  only  three  variables  or  fewer. 

a)  How  many  4-variable  monotone  Boolean  functions  actually  involve  each  variable? 

b)  How  many  of  those  functions  are  distinct  under  permutation,  as  in  Table  4? 

131.  [HM42]  Table  3 makes  it  clear  that  there  are  many  more  Horn  functions  than 
Krom  functions.  What  is  the  asymptotic  number,  asn->  00? 
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► 132.  [ HM30 } The  Boolean  function  g(x)  = g(xx , . . . , x„)  is  called  affine  if  it  can  be 
written  in  the  form  yo  © (xi  A y\ ) ® • • • 0 ( xn  Ayn)  = (yo  -\-x-y)  mod  2 for  some  Boolean 
constants  y0,  yi,  ■ ■ ■ , yn- 

a)  Given  any  Boolean  function  f(x),  show  that  some  affine  function  agrees  with  f(x) 
at  2n_1  + 2"/2-1  or  more  points  x.  Hint:  Let  s(y)  = ^2X(— l)^x^+x'v,  and  prove 
that  Yly  s(y)s{y  © z)  = 2 2n[z  = 0 ...  0]  for  all  vectors  2. 

b)  The  Boolean  function  f(x)  is  called  bent  if  no  affine  function  agrees  with  it  at 
more  than  2n_1  + 2n/2-1  points.  Prove  that 

(xi  A a>2 ) © (2:3  A £4)  © • • • © ( Xn-1  A Xn)  © h(x 2,X4,  . . . , Xn) 

is  a bent  function,  when  n is  even  and  h(yi,y2, yn/ 2)  is  arbitrary. 

c)  Prove  that  f(x)  is  a bent  function  if  and  only  if 

0 f(x  0 V))  = 2”_1  for  all  y / 0 . . . 0. 

X 

d)  If  a bent  function  f(x  i,...,xn)  is  represented  by  a multilinear  polynomial  mod  2 
as  in  (19),  show  that  it  never  contains  the  term  xi  . . . xr  when  r > n/2  > 1. 

► 133.  [20]  (Mark  A.  Smith,  1990.)  Suppose  we  flip  n independent  coins  to  get  n 
random  bits,  where  the  fcth  coin  produces  bit  1 with  probability  pk.  Find  a way  to 
choose  (pi, . . . ,pn)  so  that  f(x  1, . . . , x„)  = 1 with  probability  (t0t\  . . . t2»-i)2/(22"  — 1), 
where  foti  . ..f2n-i  is  the  truth  table  of  the  Boolean  function  /.  (Thus,  n suitable 
random  coins  can  generate  a probability  with  2n-bit  precision.) 

By  and  large  the  minimization  of  switching  components 
outweighs  aii  other  engineering  considerations 
in  designing  economical  logic  circuits. 
— H.  A.  CURTIS,  A New  Approach  to  the  Design  of  Switching  Circuits  (1962) 

He  must  be  a great  calculator  indeed  who  succeeds. 

Simplify,  simplify. 

— HENRY  D.  THOREAU,  Walden ; or,  Life  in  the  Woods  (1854) 

7.1.2.  Boolean  Evaluation 

Our  next  goal  is  to  study  the  efficient  evaluation  of  Boolean  functions,  much  as 
we  studied  the  evaluation  of  polynomials  in  Section  4.6.4.  One  natural  way  to 
investigate  this  topic  is  to  consider  chains  of  basic  operations,  analogous  to  the 
polynomial  chains  discussed  in  that  section. 

A Boolean  chain,  for  functions  of  n variables  (xi, . . . ,xn),  is  a sequence 
(x„+i, . . . , xn+r)  with  the  property  that  each  step  combines  two  of  the  preceding 
steps: 

xi  — xj(i ) °i  xk(i),  for  n + 1 < i < n + r,  (1) 

where  1 < j(i)  < i and  1 < k(i)  < i,  and  where  o,  is  one  of  the  sixteen  binary 
operators  of  Table  7. 1.1-1.  For  example,  when  n — 3 the  two  chains 
X4  — xi  A x2  Xi  — x2  ® x3 

X5  = Xi  A £3  and  £5  = x\  A £4  (2) 

£6  = £4  V £5  £6  = £3  © £5 

both  evaluate  the  “mux”  or  “if-then-else”  function  x6  = (. £j ? £2:  £3),  which 
takes  the  value  x2  or  £3  depending  on  whether  £1  is  1 (true)  or  0 (false). 
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(Notice  that  the  left-hand  example  in  (2)  uses  the  simplified  notation  ‘x5  = 
£1  A13’  to  specify  the  NOTBUT  operation,  instead  of  the  form  ‘x5  = x\  C £3’ 
that  appears  in  Table  7. 1.1-1.  The  main  point  is  that,  regardless  of  notation, 
every  step  of  a Boolean  chain  is  a Boolean  combination  of  two  prior  results.) 

Boolean  chains  correspond  naturally  to  electronic  circuits,  with  each  step 
in  the  chain  corresponding  to  a “gate”  that  has  two  inputs  and  one  output. 
Electrical  engineers  traditionally  represent  the  Boolean  chains  of  (2)  by  circuit 
diagrams  such  as 


They  need  to  design  economical  circuits  that  are  subject  to  various  technological 
constraints;  for  example,  some  gates  might  be  more  expensive  than  others,  some 
outputs  might  need  to  be  amplified  if  reused,  the  layout  might  need  to  be  planar 
or  nearly  so,  some  paths  might  need  to  be  short.  But  our  chief  concern  in  this 
book  is  software,  not  hardware,  so  we  don’t  have  to  worry  about  such  things. 
For  our  purposes,  all  gates  have  equal  cost,  and  all  outputs  can  be  reused  as 
often  as  desired.  (Jargonwise,  our  Boolean  chains  boil  down  to  circuits  in  which 
all  gates  have  fan-in  2 and  unlimited  fan-out.) 

Furthermore  we  shall  depict  Boolean  chains  as  binary  trees  such  as 


2 3 


(4) 


instead  of  using  circuit  diagrams  like  (3).  Such  binary  trees  will  have  overlapping 
subtrees  when  intermediate  steps  of  the  chain  are  used  more  than  once.  Every 
internal  node  is  labeled  with  a binary  operator;  external  nodes  are  labeled  with 
an  integer  k,  representing  the  variable  xfc.  The  label  ‘©’in  the  left  tree  of  (4) 
stands  for  the  NOTBUT  operator,  since  x A y = [x  <y];  similarly,  the  BUTNOT 
operator,  x Ay,  can  be  represented  by  the  node  label  ‘©’. 

Several  different  Boolean  chains  might  have  the  same  tree  diagram.  For 
example,  the  left-hand  tree  of  (4)  also  represents  the  chain 


£4  = X\  A £3,  £5  = £1  A £2,  Xq  — £5  V £4. 

Any  topological  sorting  of  the  tree  nodes  yields  an  equivalent  chain. 

Given  a Boolean  function  / of  n variables,  we  often  want  to  find  a Boolean 
chain  such  that  xn+r  = f(xi, . . . ,xn),  where  r is  as  small  as  possible.  The 
combinational  complexity  C(f ) of  a function  / is  the  length  of  the  shortest  chain 
that  computes  it.  To  save  excess  verbiage,  we  will  simply  call  C(f)  the  “cost 
of  /.”  The  mux  function  in  our  examples  above  has  cost  3,  because  one  can  show 
by  exhaustive  trials  that  it  can’t  be  produced  by  any  Boolean  chain  of  length  2. 

The  DNF  and  CNF  representations  of  /,  which  we  studied  in  Section  7.1.1, 
rarely  tell  us  much  about  C(f),  since  substantially  more  efficient  schemes  of 
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calculation  are  usually  possible.  For  example,  in  the  discussion  following  7.1.1 
(30)  we  found  that  the  more-or-less  random  function  of  four  variables  whose 
truth  table  is  1100  1001  0000  1111  has  no  DNF  expression  shorter  than 

(xi  A x2  A x3)  V (x!  A x3  A x4)  V (x2  A x3  A x4)  V (xx  A x2).  (5) 

This  formula  corresponds  to  a Boolean  chain  of  10  steps.  But  that  function  can 
also  be  expressed  more  cleverly  as 


(((x2  A x4)  © x3)  A xi)  ©x2, 


(6) 


so  its  complexity  is  at  most  4. 

How  can  nonobvious  formulas  like  (6)  be  discovered?  We  will  see  that  a 
computer  can  find  the  best  chains  for  functions  of  four  variables  without  doing  an 
enormous  amount  of  work.  Still,  the  results  can  be  quite  startling,  even  for  people 
who  have  had  considerable  experience  with  Boolean  algebra.  Typical  examples 
of  this  phenomenon  can  be  seen  in  Fig.  9,  which  illustrates  the  four-variable 
functions  that  are  perhaps  of  greatest  general  interest,  namely  the  functions 
that  are  symmetric  under  all  permutations  of  their  variables. 

Consider,  for  example,  the  function  S2(x i,x2,x3,x4),  for  which  we  have 


Xl 

x2 

x3 

X4 

X5  — Xl  © X3 

x6  = Xl  © x2 

X7  = x3  © x4 
X8  = X5  V Xq 
Xg  — X6  © X7 
X10  — Xs  A Xg 


0000  0000  1111  1111 
0000  1111  0000  1111 
0011  0011  0011  0011 
0101  0101  0101  0101 
0011  0011  1100  1100 

0000  1111  1111  oooo 
0110  0110  0110  0110 
0011  1111  1111  1100 
0110  1001  1001  0110 

0001  0110  0110  1000 


(7) 


according  to  Fig.  9.  Truth  tables  are  shown  here  so  that  we  can  easily  verify 
each  step  of  the  calculation.  Step  x8  yields  a function  that  is  true  whenever 
Xi  / x2  or  Xi  / x3;  and  Xg  = Xi  © x2  © x3  © x4  is  the  parity  function  (xx  + x2  + 
x3  + x4)  mod  2.  Therefore  the  final  result,  x4o,  is  true  precisely  when  exactly  two 
of  {xx,x2,x3,x4}  are  1 ; these  are  the  cases  that  satisfy  xg  and  have  even  parity. 

Several  of  the  other  computational  schemes  of  Fig.  9 can  also  be  justified 
intuitively.  But  some  of  the  chains,  like  the  one  for  <Sj)4,  are  quite  amazing. 

Notice  that  the  intermediate  result  x6  is  used  twice  in  (7).  In  fact,  no  six- 
step  chain  for  the  function  52(xi,  x2,  x3,  x4)  is  possible  without  making  double 
use  of  some  intermediate  subexpression;  the  shortest  algebraic  formulas  for  S2, 
including  nice  symmetrical  ones  like 


((xi  A x2)  V (x3  A x4 ) ) © ((xi  V x2)  A (x3  V x4)),  (8) 

all  have  cost  7.  But  Fig.  9 shows  that  the  other  symmetric  functions  of  four  vari- 
ables can  all  be  evaluated  optimally  via  “pure”  binary  trees,  without  overlapping 
subtrees  except  at  external  nodes  (which  represent  the  variables). 
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1 2 3 4 1 2 3 4 


12341234 


Fig.  9.  Optimum  Boolean  chains  for  the  symmetric  functions  of  four  variables. 


In  general,  if  f(x i, . . . ,xn)  is  any  Boolean  function,  we  say  that  its  length 
L(f ) is  the  number  of  binary  operators  in  the  shortest  formula  for  /.  Obviously 
L{f)  > C(f);  and  we  can  easily  verify  that  L(f)  = C(f)  whenever  n < 3,  by 
considering  the  fourteen  basic  types  of  3-variable  functions  in  7.1.1-(95).  But  we 
have  just  seen  that  L(S2)  = 7 exceeds  C(S2)  — 6 when  n — 4,  and  in  fact  L(f) 
is  almost  always  substantially  larger  than  C(f)  when  n is  large  (see  exercise  49). 

The  depth  D(f)  of  a Boolean  function  / is  another  important  measure  of  its 
inherent  complexity:  We  say  that  the  depth  of  a Boolean  chain  is  the  length  of  the 
longest  downward  path  in  its  tree  diagram,  and  D(f)  is  the  minimum  achievable 
depth  when  all  Boolean  chains  for  / are  considered.  All  of  the  chains  illustrated 
in  Fig.  9 have  not  only  the  minimum  cost  but  also  the  minimum  depth  — except 
in  the  cases  52,3  and  5i,2,  where  we  cannot  simultaneously  achieve  cost  6 and 
depth  3.  The  formula 

S2,3(xi,x2,  x3,x4)  = ((a:i  A x2)  © (x3  A x4))  V ((x4  V x2)  A (x3  Q x4))  (9) 

shows  that  D(S2t3)  = 3,  and  a similar  formula  works  for  ,S\,2- 

Optimum  chains  for  n = 4.  Exhaustive  computations  for  4- variable  functions 
are  feasible  because  such  functions  have  only  216  = 65,536  possible  truth  tables. 
In  fact  we  need  only  consider  half  of  those  truth  tables,  because  the  complement  / 
of  any  function  / has  the  same  cost,  length,  and  depth  as  / itself. 
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Let’s  say  that  f(x is  normal  if  /( 0, . . . , 0)  = 0,  and  in  general  that 

f(Xl,...,xn)  ® /(0,...,0)  (10) 

is  the  •normalization”  off.  Any  Boolean  chain  can  be  normalized  by  normalizing 
each  of  its  steps  and  by  making  appropriate  changes  to  the  operators;  for  if 

r1/  V ‘ : C<-V  aie  the  normalizati°ns  of  (Xl, ... , **_!)  and  if  Xi  = xj(i)  ot  xk(t)  as 
m (i),  then  Xl  is  clearly  a binary  function  of  £j(i)  and  xk(i).  (Exercise  7 presents 
an  example.)  Therefore  we  can  restrict  consideration  to  normal  Boolean  chains 
without  loss  of  generality. 

Notice  that  a Boolean  chain  is  normal  if  and  only  if  each  of  its  binary 
operators  o*  is  normal.  And  there  are  only  eight  normal  binary  operators  — 
three  of  which,  namely  _L,  L,  and  R,  are  trivial.  So  we  can  assume  that  all 
Boolean  chains  of  interest  are  formed  from  the  five  operators  A,  C 5 V and  © 
which  are  denoted  respectively  by  ®,  ©,  ©,  ©,  and  © in  Fig.  9.’  Furthermore 
we  can  assume  that  j(i)  < k{i)  in  each  step. 

There  are  215  = 32,768  normal  functions  of  four  variables,  and  we  can  com- 
pute their  lengths  without  difficulty  by  systematically  enumerating  all  functions 
o length  0 1,  2 etc.  Indeed,  L(f)  = r implies  that  / = g o h for  some  g and  h. 

w eie  (g)  + (ft.)  — r - 1 and  o is  one  of  the  five  nontrivial  normal  operators- 
so  we  can  proceed  as  follows: 


Algorithm  L (Find,  normal  lengths).  This  algorithm  determines  L(f ) for  all 
normal  truth  tables  0 < / < 22"~\  by  building  lists  of  all  nonzero  normal 
functions  of  length  r for  r > 0. 

LI.  [Initialize.]  Let  L(0)  •<—  0 and  L(f ) •<—  oo  for  1 < / < 22"~1.  Then,  for 
1 < k < n,  set  L(xk)  <—  0 and  put  xk  into  list  0,  where 

xk  = (22"  - l)/(22"-fc  + 1)  (ll) 

is  the  truth  table  for  xk.  (See  exercise  8.)  Finally,  set  c <-  22"-1  - n - 1; 
c is  the  number  of  places  where  L(f)  = oo. 

L2.  [Loop  on  r.]  Do  step  L3  for  r = 1,  2,  . . . ; eventually  the  algorithm  will 
terminate  when  c becomes  0. 

L3.  [Loop  on  j and  k.}  Do  step  L4  for  j = 0,  1,  , . . , and  k = r - 1 - j while 
j <k. 

L4.  [Loop  on  g and  h]  Do  step  L5  for  all  g in  list  j and  all  h in  list  k.  (If  j = k 
it  suffices  to  restrict  h to  functions  that  follow  g in  list  k.) 

L5.  [Loop  on  /.[Do  step  L6  for  f = gkh,  f = Sbh.  / = ghk,  / = g\h,  end 
1 ~9®  h-  (Here  9 & h denotes  the  bitwise  AND  of  the  integers  g and  h ■ we 
are  representing  truth  tables  by  integers  in  binary  notation.) 

L6.  [Is  / new?]  If  L(f)  = oo,  set  L(f)  Fr.cFc-l,  and  put  / in  list  r. 
terminate  the  algorithm  if  c = 0.  | 

Exercise  10  shows  that  a similar  procedure  will  compute  all  depths  D(f). 

With  a little  more  work,  we  can  in  fact  modify  Algorithm  L so  that  it  finds 
better  upper  bounds  on  C(f),  by  computing  a heuristic  bit  vector  f(f ) called 
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Table  1 

THE  NUMBER  OF  FOUR- VARIABLE  FUNCTIONS  WITH  GIVEN  COMPLEXITY 


C(f) 

Classes 

Functions 

L(f) 

Classes 

Functions 

D(f ) 

Classes 

Functions 

0 

2 

10 

0 

2 

10 

0 

2 

10 

1 

2 

60 

1 

2 

60 

1 

2 

60 

2 

5 

456 

2 

5 

456 

2 

17 

1458 

3 

20 

2474 

3 

20 

2474 

3 

179 

56456 

4 

34 

10624 

4 

34 

10624 

4 

22 

7552 

5 

75 

24184 

5 

75 

24184 

5 

0 

0 

6 

72 

25008 

6 

68 

24640 

6 

0 

0 

7 

12 

2720 

7 

16 

3088 

7 

0 

0 

the  ‘‘footprint”  of  /.  A normal  Boolean  chain  can  begin  in  only  5 (”)  different 
ways,  since  the  first  step  xn+i  must  be  either  Xi  A x2  or  x\  A x2  or  Xi  A x2  or 
Xi  V x2  or  x\  © x2  or  xi  A x3  or  • • • or  xn_i  © xn.  Suppose  4>(f)  is  a bit  vector  of 
length  5(2)  and  U(f)  is  an  upper  bound  on  C(/),  with  the  following  property: 
Every  1 bit  in  </(/)  corresponds  to  the  first  step  of  some  Boolean  chain  that 
computes  / in  U(f)  steps. 

Such  pairs  ( U(f),(p(f ))  can  be  computed  by  extending  the  basic  strategy  of 
Algorithm  L.  Initially  we  set  !/(/)  <—  1 and  we  set  4>{f)  to  an  appropriate  vector 
0 . . . 010  ...  0,  for  all  functions  / of  cost  1.  Then,  for  r = 2,  3,  . . . , we  proceed  to 
look  for  functions  f = goh  where  U(g)  + U(h)  = r - 1,  as  before,  but  with  two 
changes:  (1)  If  the  footprints  of  g and  h have  at  least  one  element  in  common, 
namely  if  4>(g)  & 4>(h)  ^ 0,  then  we  know  that  C(/)  < r — 1,  so  we  can  decrease 
U(f)  if  it  was  > r.  (2)  If  the  cost  of  g o h is  equal  to  (but  not  less  than)  our 
current  upper  bound  U(f),  we  can  set  </(/)  </(/)  | (<j)(g)  | 4>(h))  if  [/(/)  = r, 

/>(/)  t—  />(/)  I (/>( g ) & (f>{h ))  if  U(f)  — r — 1.  Exercise  11  works  out  the  details. 

It  turns  out  that  this  footprint  heuristic  is  powerful  enough  to  find  chains  of 
optimum  cost  U(f)  = C(f)  for  all  functions  /,  when  n = 4.  Moreover,  we’ll  see 
later  that  footprints  also  help  us  solve  more  complicated  evaluation  problems. 

According  to  Table  7.1.1  5,  the  216  = 65,536  functions  of  four  variables 
belong  to  only  222  distinct  classes  when  we  ignore  minor  differences  due  to 
permutation  of  variables  and/or  complementation  of  values.  Algorithm  L and 
its  variants  lead  to  the  overall  statistics  shown  in  Table  1. 

*Evaluation  with  minimum  memory.  Suppose  the  Boolean  values  aq,  . . . , xn 
appear  in  n registers,  and  we  want  to  evaluate  a function  by  performing  a 
sequence  of  operations  having  the  form 


^j(i)  f © •i'fc(i)?  for  1 ^ i v , (12) 

where  1 < j(i)  < n and  1 < k(i)  < n and  is  a binary  operator.  At  the  end  of 
the  computation,  the  desired  function  value  should  appear  in  one  of  the  registers. 
When  n = 3,  for  example,  the  four-step  sequence 


X\  <r-  Xl  © x2 
X3  <—  X3  A xi 

X2  i — X2  A Xl 
X3  i X3  V X2 


(xi  = 00001111 
(xi  = 00111100 
(xi  = 00111100 
(xi  = 00111100 
(xi  = 00111100 


x2  = 00110011 
x2  = 00110011 
x2  = 00110011 
x2  = 00000011 
X2  = 00000011 


x3  = 01010101) 
x3  = 01010101) 
x3  = 00010100) 
x3  = 00010100) 
x3  = 00010111) 


(13) 
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computes  the  median  ( xxx2x3 ) and  puts  it  into  the  original  position  of  x3.  (All 
eight  possibilities  for  the  register  contents  are  shown  here  as  truth  tables,  before 
and  after  each  operation.) 

In  fact  we  can  check  the  calculation  by  working  with  only  one  truth  table  at  a 
time,  instead  of  keeping  track  of  all  three,  if  we  analyze  the  situation  backwards. 
Let  fi{x1,...,xn)  denote  the  function  computed  by  steps  /,  l + 1,  . . . , r of  the 
sequence,  omitting  the  first  l - 1 steps;  thus,  in  our  example,  f2{x x,x2,  x3)  would 
be  the  result  in  x3  after  the  three  steps  x3  4-  x3Axi,  x2  4-  x2  Ax1;  x3  <-x3Vi2. 
Then  the  function  computed  in  register  x3  by  all  four  steps  is 

/i(xi,x2,x3)  = f2(xx  ©x2,x2,x3).  (14) 

Similarly  /2(x1;x2,x3)  — /3(x i,x2,x3  Axi),  /3(x i,x2,x3)  = /4(x i,x2  Axi,x3), 
fi{xx,x2,x3)  = f3(xx,x2,x3  V x2),  and  fs(xx,  x2,  x3)  = x3.  We  can  therefore  go 
back  from  f3  to  /4  to  • • • to  j\  by  operating  on  truth  tables  in  an  appropriate  way. 
For  example,  suppose  f(x x,x2,x3)  is  a function  whose  truth  table  is 

t = aoal®2®3a4®5®6a7i 

then  the  truth  table  for  g(xi,x2,x3)  = f(xx  ® x2,x2,x3)  is 

U = CLq(L\ CLq  CL j (140,^0,2  (^3  5 

obtained  by  replacing  ax  by  ax' , where 

x - (xix2x3)2  implies  x'  = ((xi®x2)x2x3)2. 

Similarly  the  truth  table  for,  say,  h(xi,x2,x3)  = f(xx,x2,x3  A x4)  is 

V = fto^Oa2a2a4a5a6a7  • 

And  we  can  use  bitwise  operations  to  compute  u and  v from  t (see  7.1.3-(83)): 

« = l®((f©(f»4)e(i«4p  (00110011)2);  (15) 

v = t ® ((t  ® (t  > 1))  & (01010000)2).  (16) 

Let  Cm(f)  be  the  length  of  a shortest  minimum- memory  computation  for  /. 
The  backward-computation  principle  tells  us  that,  if  we  know  the  truth  tables 
of  all  functions  / with  Cm(f)  < r,  we  can  readily  find  all  the  truth  tables  of 
functions  with  Cm(f)  = r.  Namely,  we  can  restrict  consideration  to  normal 
functions  as  before.  Then,  for  all  normal  g such  that  Cm(g)  = r — 1,  we  can 
construct  the  5 n(n  - 1)  truth  tables  for 

d(x  1 , . . . , Xj—\ , Xj  o Xfc,  Xj_j_4  , . . . , xn)  (17) 

and  mark  them  with  cost  r if  they  haven’t  previously  been  marked.  Exercise  14 
shows  that  those  truth  tables  can  all  be  computed  by  performing  simple  bitwise 
operations  on  the  truth  table  for  g. 

When  n = 4,  all  but  13  of  the  222  basic  function  types  turn  out  to  have 
C<m(/)  = C(f),  so  they  can  be  evaluated  in  minimum  memory  without  increasing 
the  cost.  In  particular,  all  of  the  symmetric  functions  have  this  property  — 
although  that  fact  is  not  at  all  obvious  from  Fig.  9.  Five  classes  of  functions 
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have  C(f)  = 5 but  Cm(f)  = 6;  eight  classes  have  C(f)  = 6 but  Cm(f)  = 7.  The 
most  interesting  example  of  the  latter  type  is  probably  the  function  (xx  V x2)  ® 
(x3  V x4)  ® (a?!  A x2  A x3  A x4),  which  has  cost  6 because  of  the  formula 

*i  ® {x3  V x4)  ® ( x2  A (x4  V (2:3  A x4))),  (18) 

but  it  has  no  minimum-memory  chain  of  length  less  than  7.  (See  exercise  15.) 

*Determining  the  minimum  cost.  The  exact  value  of  C{f)  can  be  found 
by  observing  that  all  optimum  Boolean  chains  (:rn_|_i, . . . ,xn4.r)  for  / obviously 
satisfy  at  least  one  of  three  conditions: 

i)  xn+r  = Xj  o Xk,  where  x:j  and  Xf;  use  no  common  intermediate  results; 

ii)  xn+i  = Xj  o Xk,  where  either  Xj  or  Xk  is  not  used  in  steps  xn+2,  . . . , xn+r; 

iii)  Neither  of  the  above,  even  when  the  intermediate  steps  are  renumbered. 

I11  case  (i)  we  have  / = g o h,  where  C(g)  + C{h)  = r — 1,  and  we  can  call  this 
a “top-down”  construction.  In  case  (ii)  we  have  /( x4,  ...,xn)  = g(x  1, . . . , Xj-4, 
XjOXk,Xj+ 1, . . . ,x„),  where  C(g)  = r—  1;  we  call  this  construction  “bottom- up.” 

The  best  chains  that  recursively  use  only  top-down  constructions  correspond 
to  minimum  formula  length,  L(f).  The  best  chains  that  recursively  use  only 
bottom-up  constructions  correspond  to  minimum-memory  calculations,  of  length 
Cm(f).  We  can  do  better  yet,  by  mixing  top-down  constructions  with  bottom-up 
constructions;  but  we  still  won’t,  know  that  we’ve  found  C(/),  because  a special 
chain  belonging  to  case  (iii)  might  be  shorter. 

Fortunately  such  special  chains  are  rare,  because  they  must  satisfy  rather 
strong  conditions,  and  they  can  be  exhaustively  listed  when  n and  r aren’t  too 
large.  For  example,  exercise  19  proves  that  no  special  chains  exist  when  r < n+ 2; 
and  when  n = 4,  r = 6,  there  are  only  25  essentially  different  special  chains  that 
cannot  obviously  be  shortened: 


By  systematically  trying  5'  possibilities  in  every  special  chain,  one  for  each  way 
to  assign  a normal  operator  to  the  internal  nodes  of  the  tree,  we  will  find  at  least 


an 
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one  function  / in  every  equivalence  class  for  which  the  minimum  cost  C(f)  is 
achievable  only  in  case  (iii). 

In  fact,  when  n = 4 and  r = 6,  these  25  • 56  = 390,625  trials  yield  only 
one  class  of  functions  that  can’t  be  computed  in  6 steps  by  any  top-down-plus- 
bottom-up  chain.  The  missing  class,  typified  by  the  partially  symmetric  function 
{{x\x2xz)  V x4)  © (xxAx2Ax3),  can  be  reached  in  six  steps  by  appropriately 
specializing  any  of  the  first  five  chains  illustrated  above;  for  example,  one  way  is 

x$  = xi  A x2,  xq  ~ x\V  x2,  xy  = x3  (B  x5, 

Xg  X4  A Xg,  Xg  — Xg  A Xy,  X\g  = Xg  V £9,  (19) 

corresponding  to  the  first  special  chain.  Since  all  other  functions  have  L(f)  < 7, 
these  trial  calculations  have  established  the  true  minimum  cost  in  all  cases. 

Historical  notes:  The  first  concerted  attempts  to  evaluate  all  Boolean  func- 
tions f(w,x,y,z ) optimally  were  reported  in  Annals  of  the  Computation  Labo- 
ratory of  Harvard  University  27  (1951),  where  Howard  Aiken’s  staff  presented 
heuristic  methods  and  extensive  tables  of  the  best  switching  circuits  they  were 
able  to  construct.  Their  cost  measure  V{f)  was  different  from  the  cost  C(f) 
that  we’ve  been  considering,  because  it  was  based  on  “control  grids”  of  vacuum 
tubes:  They  had  four  kinds  of  gates,  NOT(/),  NAND(/,  g),  OR(/1; . . . , fk),  and 
AND(/i,  . . . , fk),  respectively  costing  1,  2,  k,  and  0.  Every  input  to  NOT,  NAND, 
or  OR  could  be  either  a variable,  or  the  complement  of  a variable,  or  the  result 
of  a previous  gate;  every  input  to  AND  had  to  be  the  output  of  either  NOT  or 
NAND  that  wasn’t  also  used  elsewhere. 

With  those  cost  criteria,  a function  might  not  have  the  same  cost  as  its 
complement.  One  could,  for  instance,  evaluate  x A y as  AND(NOT(x),  NOT(j/)) . 
with  cost  2;  but  the  cost  of  x V (y  A z)  = NAND(x,  OR (y,z))  was  4 while  its 
complement  x A (y  V z)  = AND (NOT(x), NAND (y,z))  cost  only  3.  Therefore 
the  Harvard  researchers  needed  to  consider  402  essentially  different  classes  of 
4-variable  functions  instead  of  222  (see  the  answer  to  exercise  7.1.1-125).  Of 
course  in  those  days  they  worked  mostly  by  hand.  They  found  V(f)  < 20  in  all 
cases,  except  for  the  64  functions  equivalent  to  S0,i(w,  x,  y,  z)  V (S2(w,  x,  y)  A z), 
which  they  evaluated  with  20  control  grids  as  follows: 

91  = AND(NOT(w),  NOT(x)),  g2  = NAND(y,  z), 
g3  = AND(NOT(w),  NOT(x)); 

/ = AND(NAND(ffl,52),NAND(53,  AND(NOT(y),NOT(f))), 
NOT(AND(NOT(53),  NOT(j/),  NOT(*))), 

NOT(AND(NOT(5l),  NOT(#2),  NOT(«fe)))).  (20) 

The  first  computer  program  to  find  provably  optimum  circuits  was  written 
by  Leo  Hellerman  [IEEE  Transactions  EC-12  (1963),  198-223],  who  determined 
the  fewest  NOR  gates  needed  to  evaluate  any  given  function  f(x,y,z).  He  re- 
quired every  input  of  every  gate  to  be  either  an  uncomplemented  variable  or 
the  output  of  a previous  gate;  fan-in  and  fan-out  were  limited  to  at  most  3. 
When  two  circuits  had  the  same  gate  count,  he  preferred  the  one  with  smallest 
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Table  2 

THE  NUMBER  OF  FIVE- VARIABLE  FUNCTIONS  WITH  GIVEN  COMPLEXITY 


C(f) 

Classes 

Functions 

L(f) 

Classes 

Functions 

D(f) 

Classes 

Functions 

0 

2 

12 

0 

2 

12 

0 

2 

12 

1 

2 

100 

1 

2 

100 

1 

2 

100 

2 

5 

1140 

2 

5 

1140 

2 

17 

5350 

3 

20 

11570 

3 

20 

11570 

3 

1789 

6702242 

4 

93 

109826 

4 

93 

109826 

4 

614316 

4288259592 

5 

389 

995240 

5 

366 

936440 

5 

0 

0 

6 

1988 

8430800 

6 

1730 

7236880 

6 

0 

0 

7 

11382 

63401728 

7 

8782 

47739088 

7 

0 

0 

8 

60713 

383877392 

8 

40297 

250674320 

8 

0 

0 

9 

221541 

1519125536 

9 

141422 

955812256 

9 

0 

0 

10 

293455 

2123645248 

10 

273277 

1945383936 

10 

0 

0 

11 

26535 

195366784 

11 

145707 

1055912608 

11 

0 

0 

12 

1 

1920 

12 

4423 

31149120 

12 

0 

0 

sum-of- inputs.  For  example,  he  computed  x — NOR(x)  with  cost  1;  x V y V z = 
NOR(NOR(x,  y,  z))  with  cost  2;  (. xyz ) = NOR(NOR(x,  y),  NOR(x,  z),  NOR(y,  z)) 
with  cost  4;  Si(x,y,z)  = NOR(NOR(x,  y,  z),  (xyz))  with  cost  6;  etc.  Since  he 
limited  the  fan-out  to  3,  he  found  that  every  function  of  three  variables  could  be 
evaluated  with  cost  7 or  less,  except  for  the  parity  function  x®y®z  = ( x=y)  = z , 
where  x=y  has  cost  4 because  it  is  NOR(NOR(x,  NOR(x,  y)),  NOR (y,  NOR(x,  y))). 

Electrical  engineers  continued  to  explore  other  cost  criteria;  but  four- variable 
functions  seemed  out  of  reach  until  1977,  when  Frank  M.  Liang  established  the 
values  of  C(f)  shown  in  Table  1.  Liang’s  unpublished  derivation  was  based  on 
a study  of  all  chains  that  cannot  be  reduced  by  the  bottom-up  construction. 

The  case  n = 5.  There  are  616,126  classes  of  essentially  different  functions 
f(x i,X2,X3,x4,x5),  according  to  Table  7. 1.1-5.  Computers  are  now  fast  enough 
that  this  number  is  no  longer  frightening;  so  the  author  decided  while  writing 
this  section  to  investigate  C(f)  for  all  Boolean  functions  of  five  variables.  Thanks 
to  a bit  of  good  luck,  complete  results  could  indeed  be  obtained,  leading  to  the 
statistics  shown  in  Table  2. 

For  this  calculation  Algorithm  L and  its  variants  were  modified  to  deal 
with  class  representatives,  instead  of  with  the  entire  set  of  231  normal  truth 
tables.  The  method  of  exercise  7.2.1.2-20  made  it  easy  to  generate  all  functions 
of  a class,  given  any  one  of  them,  resulting  in  a thousand-fold  speedup.  The 
bottom-up  method  was  enhanced  slightly,  allowing  it  to  deduce  for  example  that 
/(x i A x2,xi  V x2,X3,X4,x5)  has  cost  < r if  C(f)  — r — 2.  After  all  classes 
of  cost  10  had  been  found,  the  top-down  and  bottom-up  methods  were  able  to 
find  chains  of  length  < 11  for  all  but  seven  classes  of  functions.  Then  the  time- 
consuming  part  of  the  computation  began,  in  which  approximately  53  million 
special  chains  with  n — 5 and  r = 11  were  generated;  every  such  chain  led  to 
511  = 48,828,125  functions,  some  of  which  would  hopefully  fall  into  the  seven 
remaining  mystery  classes.  But  only  six  of  those  classes  were  found  to  have  11- 
step  solutions.  The  lone  survivor,  whose  truth  table  is  169ae443  in  hexadecimal 
notation,  is  the  unique  class  for  which  C(f)  — 12,  and  it  also  has  L(f)  — 12. 
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Fig.  10.  Boolean  chains  of  minimum  cost 
for  symmetric  functions  of  five  variables. 


The  resulting  constructions  of  symmetric  functions  are  shown  in  Fig.  10. 
Some  of  them  are  astonishingly  beautiful;  some  of  them  are  beautifully  simple; 
and  others  are  simply  astonishing.  (Look,  for  example,  at  the  8-step  computation 
$2,3  (2R  5 ®2,  £3,  ;r4,  £5),  or  the  elegant  formula  for  $2,3,4,  or  the  nonmonotonic 
chains  for  $4,5  and  $3,4,5-)  Incidentally,  Table  2 shows  that  all  5- variable  func- 
tions have  depth  < 4,  but  no  attempt  to  minimize  depth  has  been  made  in  Fig.  10. 

It  turns  out  that  all  of  these  symmetric  functions  can  be  evaluated  in 
minimum  memory  without  increasing  the  cost.  No  simple  reason  is  known. 
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Multiple  outputs.  We  often  want  to  evaluate  several  different  Boolean  func- 
tions / i(*i, . . . , xn),  . . . , fm(x i, . . . , xn)  at  the  same  input  values  xx,  . . . , xn; 
in  other  words,  we  often  want  to  evaluate  a multibit  function  y = f(x),  where 
V - fi  ■ ■ ■ fm  is  a binary  vector  of  length  m and  x = xi . . . xn  is  a binary 
vector  of  length  n.  With  luck,  much  of  the  work  involved  in  the  computation  of 
one  component  value  fj(x\, . . . ,xn)  can  be  shared  with  the  operations  that  are 
needed  to  evaluate  the  other  component  values  fk(xi,  • . . ,x„). 

Let  C(f)  = C(f\  . . . fm)  be  the  length  of  a shortest  Boolean  chain  that  com- 
putes all  of  the  nontrivial  functions  fy  More  precisely,  the  chain  (xn+1, . . . , xn+T ) 
should  have  the  property  that,  for  1 < j < m,  either  fj(x i, . . . ,xn ) = x^y  or 
fj{xi,. . . ,xn)  - X;(j),  for  some  l(j)  with  0 < l(j)  <n  + r , where  x0  = 0.  Clearly 
C(f)  < C(/i)  + • • • + C(fm),  but  we  might  be  able  to  do  much  better. 

For  example,  suppose  we  want  to  compute  the  functions  z\  and  z0  defined  by 

(z\Zo)2  = xi+x2  + x3,  (21) 

the  two-bit  binary  sum  of  three  Boolean  variables.  We  have 

zi  = (xix2x3)  and  z0  = xi®x2©x3,  (22) 

so  the  individual  costs  are  C(zx)  = 4 and  C(z0)  = 2.  But  it’s  easy  to  see  that 

the  combined  cost  C(ziz0)  is  at  most  5,  because  x\  © x2  is  a suitable  first  step 

in  the  evaluation  of  each  bit  zy. 

X4  = Xi  © x2,  z0  — x5  = x3  © x4; 

xq  — x3  A x4 , x7  = xiAx2,  Zi  = Ig  = x6  V x7.  (23) 

Furthermore,  exhaustive  calculations  show  that  C(ziZo)  > 4;  hence  C(z\Zo)  — 5. 

Electrical  engineers  traditionally  call  a circuit  for  (21)  a full  adder , because 
n such  building  blocks  can  be  hooked  together  to  add  two  n-bit  numbers.  The 
special  case  of  (22)  in  which  x3  = 0 is  also  important,  although  it  boils  down 
simply  to 

z\  = Xi  A x2  and  z0  — x\  © x2  (24) 

and  has  complexity  2;  engineers  call  it  a “half  adder”  in  spite  of  the  fact  that 
the  cost  of  a full  adder  exceeds  the  cost  of  two  half  adders. 

The  general  problem  of  radix-2  addition 

(xn_x . . . XiX0)2 

(Vn- !•••  Vl  2/0)2  (25) 

(•^n  Z-n  — 1 • • • Z\  2^0)2 

is  to  compute  n + 1 Boolean  outputs  zn  . . . Z\Zq  from  the  2 n Boolean  inputs 
xn_i . . . xiXoi/„-i . . . 2/12/0;  and  it  is  readily  solved  by  the  formulas 

Cj+i  — {xjVjCj),  Zj  = Xj  © 2/j  © Cj,  for  0 < j < n,  (26) 

where  the  cj  are  “carry  bits”  and  we  have  c0  — 0,  zn  = cn.  Therefore  we  can 
use  a half  adder  to  compute  cx  and  z0,  followed  by  n - 1 full  adders  to  compute 
the  other  c’s  and  z’s,  accumulating  a total  cost  of  5 n — 3.  And  in  fact  N.  P. 
Red’kin  [Problemy  Kibernetiki  38  (1981),  181-216]  has  proved  that  5n  — 3 steps 
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are  actually  necessary,  by  constructing  an  elaborate  35-page  proof  by  induction 
which  concludes  with  Case  2.2.2.3.1.2.3.2.4.3(!).  But  the  depth  of  this  circuit' 
2 n ~ 1,  is  far  too  large  for  practical  parallel  computation,  so  a great  deal  of  effort 
has  gone  into  the  task  of  devising  circuits  for  addition  that  have  depth  O(logn) 
as  well  as  reasonable  cost.  (See  exercises  41-44.) 

Now  let’s  extend  (21)  and  try  to  compute  a general  “sideways  sum” 

izllgn]  ■ ■ ■ zizo)2  = ^1  + X2  + ■ ■ ■ + Xn.  (27 ) 

If  n = 2k  + 1,  we  can  use  k full  adders  to  reduce  the  sum  to  (xi  4 f xn)  mod  2 

plus  k bits  of  weight  2,  because  each  full  adder  decreases  the  number  of  weight-1 
bits  by  2.  For  example,  if  n = 9 and  k = 4 the  computation  is 

Xio=Xi®x2®x3,  xu=xA®xb®x&,  *i2=x7®a;8©a;9,  xi3=abo©afii®:Ei2, 
yi  = (x1x2x3),  y2  = (x4x5x6),  y3  = (x7x8x9),  y4  = (x1  0xux12), 

and  we  have  xt  + • • • + x9  = x13  + 2(Vl  + y2  + y3  + y4).  If  n = 2k  is  even,  a 
similar  reduction  applies  but  with  a half  adder  at  the  end.  The  bits  of  weight  2 
can  then  be  summed  in  the  same  way;  so  we  obtain  the  recurrence 

s(n)  = 5|n/2j  — 3[n  even]  + s([n/2j),  s(0)  = 0,  (28) 

for  the  total  number  of  gates  needed  to  compute  ^LlgnJ  . . . Zlz0.  (A  closed  formula 
for  s(n)  appears  in  exercise  30.)  We  have  s(n)  < 5 n,  and  the  first  values 

n — 1 2 3 4 5 6 7 8 9 10  11  12  13  14  15  16  17  18  19  20 

s(n)  = 0 2 5 9 12  17  20  26  29  34  37  44  47  52  55  63  66  71  74  81 

show  that  the  method  is  quite  efficient  even  for  small  n.  For  example,  when 
n = 5 it  produces 


which  computes  three  different  symmetric  functions  22  = S^aq, . . . ,x5),  z3  = 
S*’3(xi,  ■ ■ -,x5),  zQ  = Sli3^(x1,. . . ,x5)  in  just  12  steps.  The  10-step  computa- 
tion of  S4:5  is  optimum,  according  to  Fig.  10;  of  course  the  4-step  computation 
of  Sr ,3,5  is  also  optimum.  Furthermore,  although  C(S3,3)  = 8,  the  function  S2  3 
is  computed  here  in  a clever  10-step  way  that  shares  all  but  one  gate  with  S4'5. 

Notice  that  we  can  now  compute  any  symmetric  function  efficiently,  because 
every  symmetric  function  of  {x4 , . . . , xn  } is  a Boolean  function  of  zy ig  „j  ...  Zl  z0 . 
We  know,  for  example,  that  any  Boolean  function  of  four  variables  has  complexity 
<7-,  therefore  any  symmetric  function  Sklt...M(Xl,. . . ,x15)  costs  at  most  s(15)  + 
7-62.  Surprise:  The  symmetric  functions  of  n variables  were  among  the  hardest 
of  all  to  evaluate,  when  n was  small,  but  they’re  among  the  easiest  when  n > 10. 
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We  can  also  compute  sets  of  symmetric  functions  efficiently.  If  we  want,  say, 
to  evaluate  all  n + 1 symmetric  functions  Sk(x i, . . . ,xn)  for  0 < k < n with  a 
single  Boolean  chain,  we  simply  need  to  evaluate  the  first  n+1  minterms  of  z0,  zlt 
■ ■ ■ , zpgnj  • F°r  example,  when  n = 5 the  minterms  that  give  us  all  functions  S k 
are  respectively  S0  — z0  A zx  A z2,  Si  = z0  A Zi  A z2,  . . . , S5  = z0  A Zi  A z2. 

How  hard  is  it  to  compute  all  2"  minterms  of  n variables?  Electrical 
engineers  call  this  function  an  n- to- 2"  binary  decoder , because  it  converts  n bits 
xi . . ,xn  into  a sequence  of  2n  bits  d0di  . . . d2-_i,  exactly  one  of  which  is  1.  The 
principle  of  “divide  and  conquer”  suggests  that  we  first  evaluate  all  minterms 
on  the  first  \n/ 2]  variables,  as  well  as  all  minterms  on  the  last  |_n/2_|;  then  2n 
AND  gates  will  finish  the  job.  The  cost  of  this  method  is  t(n),  where 

f(0)=t(l)  = 0;  t(n)  = 2"  + t(\n/2\)  + t([n/2\)  for  n > 2.  (30) 

So  t(n)  = 2"  + 0(2n/2);  there’s  roughly  one  gate  per  minterm.  (See  exercise  32.) 

Functions  with  multiple  outputs  often  help  us  build  larger  functions  with 
single  outputs.  For  example,  we’ve  seen  that  the  sideways  adder  (27)  allows 
us  to  compute  symmetric  functions;  and  an  n-to-2"  decoder  also  has  many 
applications,  in  spite  of  the  fact  that  2"  can  be  huge  when  n is  large.  A case  in 
point  is  the  2 m -way  multiplexer  Mm(aq, . . . , xm;  y0,  y1: . . . , y2™-i),  also  known 
as  the  rn-bit  storage  access  function , which  has  n = m + 2m  inputs  and  takes 
the  value  y when  (aq  . . . xm)2  = k.  By  definition  we  have 

2m  — 1 

Mm(xi, . . . ,xm;yo,yi, . . . = \f(dkAyk),  (31) 

k= 0 

where  dk  is  the  fcth  output  of  an  m-to-2m  binary  decoder;  thus,  by  (30),  we  can 
evaluate  Mm  with  2m  + (2m-l)  + t(m)  = 3n  + 0(y/n)  gates.  But  exercise  39 
shows  that  we  can  actually  reduce  the  cost  to  only  2 n + 0(\/n).  (See  also 
exercise  79.) 

Asymptotic  facts.  When  the  number  of  variables  is  small,  our  exhaustive- 
search  methods  have  turned  up  lots  of  cases  where  Boolean  functions  can  be 
evaluated  with  stunning  efficiency.  So  it’s  natural  to  expect  that,  when  more 
variables  are  present,  even  more  opportunities  for  ingenious  evaluations  will  arise. 
But  the  truth  is  exactly  the  opposite,  at  least  from  a statistical  standpoint: 

Theorem  S.  The  cost  of  almost  every  Boolean  function  /(aq, . . . , xn)  exceeds 
2 n/n.  More  precisely , if  c(n,r)  Boolean  functions  have  complexity  < r,  we  have 

(r  — l)\c(n,r)  < 22r+1(n  + r - l)2r . (32) 

Proof.  If  a function  can  be  computed  in  r - 1 steps,  it  is  also  computable  by 
an  r-step  chain.  (This  statement  is  obvious  when  r = 1;  otherwise  we  can  let 
xn+r  = xn+r-i  A xn+r-i.)  We  will  show  that  there  aren’t  very  many  r-step 
chains,  hence  we  can’t  compute  very  many  different  functions  with  cost  < r. 

Let  7r  be  a permutation  of  {1, . . . , n + r}  that  takes  1 i-t  1 ru-in,  and 

n+r  n+r;  there  are  (r  — 1)!  such  permutations.  Suppose  (xn+i, . . . , xn+r ) is  a 
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Boolean  chain  in  which  each  of  the  intermediate  steps  xn+i,  . . . , xn+r_l  is  used 
in  at  least  one  subsequent  step.  Then  the  permuted  chains  defined  by  the  rule 

x*  ~ °i  xk'(i)  = xj(in)n-  °in  Xk(in)n- , for  n < i < n + r,  (33) 

are  distinct  for  different  n.  (If  tt  takes  a p-a  b,  we  write  b = ai r and  a = for-.) 
For  example,  if  7i  takes  5i-i6i->-7i-a8i-i9i-^5,  the  chain  (7)  becomes 

Original  Permuted 


*5  — *1  ® *3, 
*6  = X±  ® *2, 
*7  = *3  © x4, 
*8  = *5  V x6, 
*9  = x6  ® X7, 
xio  = Xg  A xg; 


*5  = *1  ® X2, 
*6  = X3  © X4, 

x7  = x9  V x5, 
x8  — x5  ® xe, 
Xg  = Xx®  x3, 
*10  = *7  A x8. 


(34) 


Notice  that  we  might  have  j'(i)  > k'(i)  or  j'(i)  > i or  k'(i)  > i,  contrary  to  our 
usual  rules.  But  the  permuted  chain  computes  the  same  function  xn+r  as  before, 
and  it  doesn’t  have  any  cycles  by  which  an  entry  is  defined  indirectly  in  terms 
of  itself,  because  the  permuted  x,  is  the  original  xl7r. 

We  can  restrict  consideration  to  normal  Boolean  chains,  as  remarked  earlier. 
So  the  c(n,r)/ 2 normal  Boolean  functions  of  cost  < r lead  to  (r  - l)!c(n,  r)/2 
different  permuted  chains,  where  the  operator  ot  in  each  step  is  either  A,  V D 
or  ®.  And  there  are  at  most  4r(n+r-  l)2''  such  chains,  because  there  are  four 
choices  for  o,  and  n+r-1  choices  for  each  of  j(i)  and  k(i),  for  n < i < n + r. 
Equation  (32)  follows;  and  we  obtain  the  opening  statement  of  the  theorem  by 
setting  r = \ 2n/n\.  (See  exercise  46.)  | 


On  the  other  hand,  there’s  also  good  news  for  infinity-minded  people:  We 
can  actually  evaluate  every  Boolean  function  of  n variables  with  only  slightly 
more  than  2 n/n  steps  of  computation,  even  if  we  avoid  ® and  =,  using  a technique 
devised  by  C.  E.  Shannon  and  improved  by  O.  B.  Lupanov  [Bell  System  Tech.  J. 
28  (1949),  59-98,  Theorem  6;  Isvestiia  VUZov,  RadioRzika  1 (1958),  120-140], 
In  fact,  the  Shannon-Lupanov  approach  leads  to  useful  results  even  when 
n is  small,  so  let’s  get  acquainted  with  it  by  studying  a small  example.  Consider 


/(*  1>  *2,  *3>  *4,  *5,  *6)  — [(*1*2*32:4*5*6)2  is  prime] , (35) 

a function  that  identifies  all  6-bit  prime  numbers.  Its  truth  table  has  26  = 64 
bits,  and  we  can  work  with  it  conveniently  by  using  a 4 x 16  array  to  look  at 
those  bits  instead  of  confining  ourselves  to  one  dimension: 


13  = 00000000  1 1111111 

14  = 0000  1 11100001111 

15  = 00  1 1001100110011 

16  = 0101010101010101 


| Group  1 
| Group  2 


xix2  = 00 
X1X2  = 01 
X\x7  = 10 
xix2  = 11 


0011010100010100 

0101000100000101 

0000010001010001 

0000010000010100 


(36) 
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The  rows  have  been  divided  into  two  groups  of  two  rows  each;  and  each  group 
of  rows  has  16  columns,  which  are  of  four  basic  types,  namely  § , J , or  } . Thus 
we  see  that  the  function  can  be  expressed  as 


/Oi,  . ..,x6) 


{[x\x2  £ {00}]  A [x3x4x5x6  £ {0010, 0101, 1011}]) 

V ([xix2  G {01}]  A [x3x4x5x6  £ {0001, 1111}]) 

V ([xix2  G {00, 01}]  A [x3x4x5x6  £ {0011, 0111, 1101}]) 

V (]xxx2g{10}]  A [x3x4x5x6  £ {1001, 1111}]) 

V ([xix2  G {11}]  A [x3x4x5x6  £ {1101}]) 

V (jxxx2  G {10,  11}]  A [x3x4x5x6  £ {0101, 1011}]).  (37) 


(The  first  line  corresponds  to  group  1,  type  J , then  comes  group  1,  type  4 , etc.; 
the  last  line  corresponds  to  group  2 and  type  }.)  A function  like  [x3x4x5X6  G 
{0010,0101, 1011}]  is  the  OR  of  three  minterms  of  {x3,  x4,  x3,  xq}. 

In  general  we  can  view  the  truth  table  as  a 2fc  x 2n~k  array,  with  l groups 
of  rows  having  either  [2k/l\  or  \2k/l]  rows  in  each  group.  A group  of  size  m 
will  have  columns  of  2m  basic  types.  We  form  a conjunction  {gu(x  1, . . . , Xfc)  A 
hit(xk+ 1, . . - , x„))  for  each  group  i and  each  nonzero  type  t,  where  ga  is  the  OR 
of  all  minterms  of  {xi, . . . , Xfc}  for  the  rows  of  the  group  where  t has  a 1,  while 
hit  is  the  OR  of  all  minterms  of  {x*,+i, . . . , xn}  for  the  columns  having  type  t in 
group  i.  The  OR  of  all  these  conjunctions  ( ga  A ha)  gives  /(x  1, . . . , x„). 

Once  we’ve  chosen  the  parameters  k and  l,  with  1 < /,:  < n— 2 and  1 < l < 2k , 
the  computation  starts  by  computing  all  the  minterms  of  {xi,...,Xfc}  and  all 
the  minterms  of  {x^+1, . . . ,xn},  in  f(fc)  + t(n  — k)  steps  (see  (30)).  Then,  for 
1 < i < I,  we  let  group  i consist  of  rows  for  the  values  of  (xi, . . . , Xfc)  such  that 
[i  - 1)2 k/l  < (xj . . . Xfe)2  < i2k/l\  it  contains  TOj  = fz2fe/ Z”|  — {(i  — l)2k/V\  rows. 
We  form  all  functions  ga  for  t £ Si,  the  family  of  2mi  — 1 nonempty  subsets  of 
those  rows;  2mi  — rn,  — 1 ORs  of  previously  computed  minterms  will  accomplish 
that  task.  We  also  form  all  functions  ha  representing  columns  of  nonzero  type  t; 
for  this  purpose  we’ll  need  at  most  2n~k  OR  operations  in  each  group  i , since  we 
can  OR  each  minterm  into  the  h function  of  the  appropriate  type  t.  Finally  we 
compute  / = V i=i  Vtgs,  (hit  A ha)',  each  AND  operation  is  compensated  by  an 
unnecessary  first  OR  into  ha.  So  the  total  cost  is  at  most 

1 

t(k)  + t(n  — k)  + (1  — 1)  + ^((2m--m8-l)  + 2n~k  + (2m--2));  (38) 

i= 1 

we  want  to  choose  k and  l so  that  this  upper  bound  is  minimized.  Exercise  52 
discusses  the  best  choice  when  n is  small.  And  when  n is  large,  a good  choice 
yields  a provably  near-optimum  chain,  at  least  for  most  functions: 


Theorem  L.  Let  C(n)  denote  the  cost  of  the  most  expensive  Boolean  functions 
of  n variables.  Then  as  n — » 00  we  have 
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Proof.  Exercise  48  shows  that  the  lower  bound  (39)  is  a consequence  of  The- 
orem S.  For  the  upper  bound,  we  set  k = [21gnJ  and  l = \2 k/(n  - 31gn)]  in 
Lupanov’s  method;  see  exercise  53.  | 


Synthesizing  a good  chain.  Formula  (37)  isn’t  the  best  way  to  implement  a 6- 
bit  prime  detector,  but  it  does  suggest  a decent  strategy.  For  example,  we  needn’t 
let  variables  aq  and  *2  govern  the  rows:  Exercise  51  shows  that  a better  chain 
results  if  the  rows  are  based  on  *52:6  while  the  columns  come  from  X1X2X3X4, 
and  in  general  there  are  many  ways  to  partition  a truth  table  by  playing  k of 
the  variables  against  the  other  n - k. 

Furthermore,  we  can  improve  on  (37)  by  using  our  complete  knowledge  of 
all  4-variable  functions;  there’s  no  need  to  evaluate  a function  like  [X3X4X5X6  £ 
{0010, 0101, 1011}]  by  first  computing  the  minterms  of  {x3,  x4,  x5,  x6},  if  we  know 
the  best  way  to  evaluate  every  such  function  from  scratch.  On  the  other  hand,  we 
do  need  to  evaluate  several  4-variable  functions  simultaneously,  so  the  minterm 
approach  might  not  be  such  a bad  idea  after  all.  Can  we  really  improve  on  it? 

Let  s try  to  find  a good  way  to  synthesize  a Boolean  chain  that  computes  a 
given  set  of  4- variable  functions.  The  six  functions  of  X3X4X5X6  in  (37)  are  rather 
tame  (see  exercise  54),  so  we’ll  learn  more  by  considering  a more  interesting 
example  chosen  from  everyday  life. 

A seven-segment  display  is  a now- ubiquitous  way  to  represent 
a 4-bit  number  (2:12:2X32:4)2  in  terms  of  seven  cleverly  positioned 
segments  that  are  either  visible  or  invisible.  The  segments  are 
traditionally  named  (a,  b,  c,  d,  e,  /,  g)  as  shown;  we  get  a ‘0’  by 
turning  on  segments  ( a,b,c,d,e,f ),  but  a T uses  only  segments 
(6,  c).  (Incidentally,  the  idea  for  such  displays  was  invented  by  F.  W. 

Wood,  U.S.  Patent  974943  (1910),  although  Wood’s  original  design 
used  eight  segments  because  he  thought  that  a ‘4’  requires  a diagonal  stroke.) 
Seven-segment  displays  usually  support  only  the  decimal  digits  ‘O’,  T’,  . . . , ‘9’; 
but  of  course  a computer  scientist’s  digital  watch  should  display  also  hexadecimal 
digits.  So  we  shall  design  seven-segment  logic  that  displays  the  sixteen  digits 


n cnooot icc 

uil  j 1 ju  fujiiuLOcr 


(41) 


when  given  the  respective  inputs  xix2x3x4  = 0000,  0001,  0010,  . . . , 1111. 

In  other  words,  we  want  to  evaluate  seven  Boolean  functions  whose  truth 
tables  are  respectively 


a = 1011  0111  1110  0011, 
b = 1111  1001  1110  0100, 

c = 1101  1111  1111  0100, 
d = 1011  0110  1101  1110, 
e = 1010  0010  1011  nil, 

/ = 1000  1111  1111  0011, 
g = 0011  1110  1111  1111. 


(42) 
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If  we  simply  wanted  to  evaluate  each  function  separately,  several  methods  that 
we’ve  already  discussed  would  tell  us  how  to  do  it  with  minimum  costs  C(a)  = 5, 
C(b)  = C(c)  = C(d)  = 6,  C(e ) = C(/)  — 5,  and  C(g)  — 4;  the  total  cost  for  all 
seven  functions  would  then  be  37.  But  we  want  to  find  a single  Boolean  chain 
that  contains  them  all,  and  the  shortest  such  chain  is  presumably  much  more 
efficient.  How  can  we  discover  it? 

Well,  the  task  of  finding  a truly  optimum  chain  for  {a,b,  c,  d,  e,  f,  g}  is 
probably  infeasible  from  a computational  standpoint.  But  a surprisingly  good 
solution  can  be  found  with  the  help  of  the  “footprint”  idea  explained  earlier. 
Namely,  we  know  how  to  compute  not  only  a function’s  minimum  cost,  but  also 
the  set  of  all  first  steps  consistent  with  that  minimum  cost  in  a normal  chain. 
Function  e,  for  example,  has  cost  5,  but  only  if  we  evaluate  it  by  starting  with 
one  of  the  instructions 


*5  = *i©*4  or  x5—x2/\x3  or  i5  = i2Vi3. 


Fortunately,  one  of  the  desirable  first  steps  belongs  to  four  of  the  seven 
footprints:  Functions  c,  d,  f,  and  g can  all  be  evaluated  optimally  by  starting 
with  £5  = x2  ©£3.  So  that  is  a natural  choice;  it  essentially  saves  us  three  steps, 
because  we  know  that  at  most  33  of  the  original  37  steps  will  be  needed  to  finish 
Now  we  can  recompute  the  costs  and  footprints  of  all  216  functions,  proceed- 
ing as  before  but  also  initializing  the  cost  of  the  new  function  £5  to  zero.  The 
costs  of  functions  c,  d , /,  and  g decrease  by  1 as  a result,  and  the  footprints 
change  too.  For  example,  function  a still  has  cost  5,  but  its  footprint  has 
increased  from  {xi  © x3,  x2  A x3}  to  {xx  © x3,  xx  A £4,  xr  A £4,  x2  A x3,  x2  A £4, 
x2  © X4,  £4  A £5,  X4  © xs}  when  the  function  £5  = x2  © £3  is  available  for  free. 

In  fact,  £6  = ii  A £4  is  common  to  four  of  the  new  footprints,  so  again  we 
have  a natural  way  to  proceed.  And  when  everything  is  recalculated  with  zero 
cost  given  to  both  £5  and  x6,  the  subsequent  step  £7  = x3  A x6  turns  out  to  be 
desirable  in  five  of  the  newest  footprints.  Continuing  in  this  “greedy”  fashion,  we 
aren’t  always  so  lucky,  but  a chain  of  22  steps  does  emerge;  and  David  Stevenson 
has  shown  that  only  21  steps  are  actually  needed  if  we  choose  x10  non-greedily: 


*5  = £2  © *3, 
Xq  — £4  A £4, 
£7  £3  A Xq  , 

£8  = £1  © £2, 
£9  = *4  © *5, 
XlO  = £3  V £9, 
£ll  £5  © £10? 


£12  = £1  A £2, 
£13  = £9  A £12, 
£14  = £3  A £13 , 
£15  = £5  © *14, 
£46  *1  © £7, 

£17  = £1  V £5, 

*18  = *6  © *13, 


® = *19  — *15  © *18, 

b — *20  = *11  A £13, 
c = £21  — £8  A £n, 
d — £22  = *9  A £16,  (43) 

e = £23  = *6  V £14, 

/ = *24  = *8  A £15, 

9 ~ *25  — *7  V £17. 


(This  is  a normal  chain,  so  it  contains  the  normalizations  {a,  b,  c,  d,  e, /,  g} 
instead  of  {a,b,c,d,e,  f,g}.  Simple  changes  will  produce  the  unnormalized 
functions  without  changing  the  cost.) 


Partial  functions.  In  practice  the  output  value  of  a Boolean  function  is  often 
specified  only  at  certain  inputs  £!...£„,  and  the  outputs  in  other  cases  don’t 
really  matter.  We  might  know,  for  example,  that  some  of  the  input  combinations 
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will  never  arise.  In  such  cases,  we  place  an  asterisk  into  the  corresponding 
positions  of  the  truth  table,  instead  of  specifying  0 or  1 everywhere. 

The  seven-segment  display  provides  a case  in  point,  because  most  of  its 
applications  involve  only  the  ten  binary-coded  decimal  inputs  for  which  we  have 
(xix2x3x4)2  < 9.  We  don’t  care  what  segments  are  visible  in  the  other  six  cases. 
So  the  truth  tables  of  (42)  actually  become 


a = 1011  0111  11**  ****, 
b — 1111  1001  11**  ****, 
c = 1101 1111  11**  ****5 

d = 1011  0110  11**  ****, 
c = 1010  0010  10**  ****? 
/ = 1000  111*  11**  ****? 
Q = 0011  1110  11**  ****. 


(44) 


(Function  / here  has  an  asterisk  also  in  position  x4 x2x3x4  = 0111,  because  a ‘7’ 
can  be  displayed  as  either  ^ or  rj.  Both  of  these  styles  appeared  about  equally 
often  in  the  display  units  available  to  the  author  when  this  section  was  written. 
Truncated  variants  of  the  £ and  the  5 were  sometimes  seen  in  olden  days,  but 
they  have  thankfully  disappeared.) 

Asterisks  in  truth  tables  are  generally  known  as  don’t-cares  — a quaint  term 
that  could  only  have  been  invented  by  an  electrical  engineer.  Table  3 shows  that 
^i6\^e(*0m  t0  c^loose  arbitrary  outputs  is  advantageous.  For  example,  there  are 
( 3 )2  = 4,587,520  truth  tables  with  3 don’t-cares;  69%  of  them  cost  4 or  less, 

even  though  only  21%  of  the  asterisk-free  truth  tables  permit  such  economy.  On 
the  other  hand,  don’t-cares  don’t  save  us  as  much  as  we  might  hope;  exercise  63 
proves  that  a random  function  with,  say,  30%  don’t-cares  in  its  truth  table  tends 
to  save  only  about  30%  of  the  cost  of  a fully  specified  function. 

What  is  the  shortest  Boolean  chain  that  evaluates  the  seven  partially  spec- 
ified functions  in  (44)?  Our  greedy-footprint  method  adapts  itself  readily  to 
the  presence  of  don’t-cares,  because  we  can  OR  together  the  footprints  of  all  2d 
functions  that  match  a pattern  with  d asterisks.  The  initial  costs  to  evaluate  each 
function  separately  are  now  reduced  to  C(a)  = 3,  C(b)  = C(c)  = 2,  C(d ) = 5, 
C(e)  = 2,  C(f)  = 3,  C(g)  = 4,  totalling  just  21  instead  of  37.  Function  g hasn’t 
gotten  cheaper,  but  it  does  have  a larger  footprint.  Proceeding  as  before,  but 
taking  advantage  of  the  don’t-cares,  we  now  can  find  a suitable  chain  of  length 
only  13  a chain  with  fewer  than  two  operations  per  output(I): 


x5  = Xi  ® x2 , 
xg  = x3  A x4, 
X7  = £1  ® x3, 
Xg  — x2  A x6, 
Xg  = X3V  x4, 


e = xio  = x4  V x8, 
g = x a = x7  ® xs, 

%12  = X4  © X11, 

d — x13  = £10  A x12, 
a = x44  = x3  A x13, 


b = x i5  = x2  A x13, 

C = Xi6=X2A  Xg, 
f = X17  — X5  A Xg.  (45) 


Tic-tac-toe.  Let’s  turn  now  to  a slightly  larger  problem,  based  on  a popular 
children’s  game.  Two  players  take  turns  filling  the  cells  of  a 3 x 3 grid.  One 
player  writes  X’s  and  the  other  writes  O’s,  continuing  until  there  either  are  three 
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Table  3 

THE  NUMBER  OF  4- VARIABLE  FUNCTIONS  WITH  d DON’T-CARES  AND  COST  c 


c = 0 

c = 1 

<1  = 0 

10 

60 

d = 1 

160 

960 

d = 2 

1200 

7200 

d = 3 

5600 

33600 

d = 4 

18200 

108816 

d = 5 

43680 

257472 

d = 6 

80080 

455616 

d = 7 

114400 

606944 

d = 8 

128660 

604756 

d = 9 

114080 

440960 

d=  10 

78960 

224144 

d=  11 

41440 

72064 

d=  12 

15480 

12360 

d = 1.3 

3680 

800 

d = 14 

480 

d=  15 

32 

d=  16 

1 

c = 2 

c — 3 

c = 4 

456 

2474 

10624 

7296 

35040 

131904 

52736 

221840 

700512 

228992 

831232 

2045952 

666528 

2034408 

3505344 

1367776 

3351488 

3491648 

2015072 

3648608 

1914800 

2115648 

2474688 

533568 

1528808 

960080 

71520 

707488 

197632 

4160 

189248 

20160 

25472 

800 

1280 

c = 5 

c = 6 

c = 7 

24184 

25008 

2720 

227296 

119072 

2560 

816448 

166144 

1381952 

60192 

1118128 

3296 

433568 

86016 

12032 

896 

32 

x’s  or  three  O’s  in  a straight  line  (in  which  case  that  player  wins)  or  all  nine 
cells  are  filled  without  a winner  (in  which  case  it’s  a “cat’s  game”  or  tie).  For 
example,  the  game  might  proceed  thus: 


# # * 


OIXIX 


To 


(46) 


X has  won.  Our  goal  is  to  design  a machine  that  plays  tic-tac-toe  optimally  — 
making  a winning  move  from  each  position  in  which  a forced  victory  is  possible, 
and  never  making  a losing  move  from  a position  in  which  defeat  is  avoidable. 

More  precisely,  we  will  set  things  up  so  that  there  are  18  Boolean  variables 
£1,  . . . , £9,  01,  . . . , o9,  which  govern  lamps  to  illuminate  cells  of  the  current 
position.  The  cells  are  numbered  f||  as  on  a telephone  dial.  Cell  j displays 
an  X if  Xj  = 1,  an  O if  Oj  = 1,  or  remains  blank  if  Xj  = Oj  = 0.*  We  never 
have  Xj  — oj  = 1,  because  that  would  display  '(S’.  We  shall  assume  that  the 
variables  x\ . . . xg eq  . . . 09  have  been  set  to  indicate  a legal  position  in  which 
nobody  has  won;  the  computer  plays  the  X’s,  and  it  is  the  computer’s  turn  to 
move.  For  this  purpose  we  want  to  define  nine  functions  jq,  . . . , yg,  where  yj 
means  “change  Xj  from  0 to  1.”  If  the  current  position  is  a cat’s  game,  we  should 
make  y1  = ■ ■ ■ = yg  = 0;  otherwise  exactly  one  yj  should  be  equal  to  1,  and  of 
course  the  output  value  yj  = 1 should  occur  only  if  xj  — Oj  — 0. 

With  18  variables,  each  of  our  nine  functions  yj  will  have  a truth  table  of 
size  218  = 262,144.  It  turns  out  that  only  4520  legal  inputs  X\  . . . XgOi . . . 09  are 


* This  setup  is  based  on  an  exhibit  from  the  early  1950s  at  the  Museum  of  Science  and 
Industry  in  Chicago,  where  the  author  was  first  introduced  to  the  magic  of  switching  circuits. 
The  machine  in  Chicago,  designed  circa  1940  by  W.  Keister  at  Bell  Telephone  Laboratories, 
allowed  me  to  go  first;  yet  I soon  discovered  that  there  was  no  way  to  defeat  it.  Therefore 
I decided  to  move  as  stupidly  as  possible,  hoping  that  the  designer  had  not  anticipated  such 
bizarre  behavior.  In  fact  I allowed  the  machine  to  reach  a position  where  it  had  two  winning 
moves;  and  it  seized  both  of  them!  Moving  twice  is  of  course  a flagrant  violation  of  the  rules, 
so  I had  won  a moral  victory  even  though  the  machine  announced  that  I had  lost. 
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I commenced  an  examination  of  a game  called  “tit-tat-to" 
to  ascertain  what  number  of  combinations  were  required 
for  all  the  possible  variety  of  moves  and  situations. 

I found  this  to  be  comparatively  insignificant. 

...  A difficulty,  however,  arose  of  a novel  kind. 

When  the  automaton  had  to  move,  it  might  occur  that  there  were 
two  different  moves,  each  equally  conducive  to  his  winning  the  game. 

■ . . Unless,  also,  some  provision  were  made, 
the  machine  would  attempt  two  contradictory  motions. 

CHARLES  BABBAGE.  Passages  from  the  Life  of  a Philosopher  (1864) 

possible,  so  those  truth  tables  are  98.3%  filled  with  don’t-cares.  Still,  4520  is 
uncomfortably  large  if  we  hope  to  design  and  understand  a Boolean  chain  that 
makes  sense  intuitively.  Section  7.1.4  will  discuss  alternative  ways  to  represent 
Boolean  functions,  by  which  it  is  often  possible  to  deal  with  hundreds  of  variables 
even  though  the  associated  truth  tables  are  impossibly  large. 

Most  functions  of  18  variables  require  more  than  21S/18  gates,  but  let’s  hope 
we  can  do  better.  Indeed,  a plausible  strategy  for  making  suitable  moves  in 
tic-tac-toe  suggests  itself  immediately,  in  terms  of  several  conditions  that  aren’t 
hard  to  recognize: 


Wj,  an  X in  cell  j will  win,  completing  a line  of  x’s; 

bj>  an  ° in  cell  j would  lose,  completing  a line  of  O’s; 

fj,  an  X in  cell  j will  give  X two  ways  to  win; 

dj,  an  O in  cell  j would  give  O two  ways  to  win. 


For  example,  X’s  move  to  the  center  in  (46)  was  needed  to  block  O,  so  it  was  of 
fyPe  &5!  fortunately  it  was  also  of  type  /5,  forcing  a win  on  the  next  move. 


Let  L - {{1,2, 3},  {4, 5, 6},  {7, 8, 9},  {1,4, 7},  {2, 5, 8},  {3, 6, 9},  {1,5, 9},  {3, 5, 7}} 
be  the  set  of  winning  lines.  Then  we  have 


mj  = Xj  A Oj ; 

wj=mj  A y {itjik}eL{xi  Axk); 
bj  — mj  A \J{lJMeL(otAoky, 
fj  = mj  A S2({aik  \ {i,j,k}  £ L}); 
dj  = mj  A S2({Pik  | {i,j,  k}  £ L})- 


[moving  in  cell  j is  legal] 
[moving  in  cell  j wins] 
[moving  in  cell  j blocks] 
[moving  in  cell  j forks] 
[moving  in  cell  j defends] 


here  aik  and  0lk  denote  a single  X or  O together  with  a blank,  namely 


(47) 

(48) 

(49) 

(50) 

(51) 


aik  — (xif\mk)  V (rrii/\xk),  f3ik  = (oi/\mk)  V (mi/\ok).  (52) 

For  example,  =TOl  a ((02A03)V{04A07)V(05A09));  f2  = m2AS2(a13,a58)  = 

m2  A a13  A a58;  d5  = m5  A S2(/319,  /?28, 037, 046). 

W ith  these  definitions  we  might  try  rank-ordering  our  moves  thus: 

{UR,  ...,W9}>  {blt.  69}  > {/l,  ...,  /9}  > {d1;  ...,  d9}  >{«!,...  , mg}.  (53) 

“Win  if  you  can;  otherwise  block  if  you  can;  otherwise  fork  if  you  can;  otherwise 
defend  if  you  can;  otherwise  make  a legal  move.”  Furthermore,  when  choosing 
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between  legal  moves  it  seems  sensible  to  use  the  ordering 


ma  > mi  > m3  > mQ  > m7  > m2  > m6  > m$  > ra4,  (54) 

because  5,  the  middle  cell,  occurs  in  four  winning  lines,  while  a corner  move  to 
1,  3,  9,  or  7 occurs  in  three,  and  a side  cell  2,  6,  8,  or  4 occurs  in  only  two.  We 
might  as  well  adopt  this  ordering  of  subscripts  within  all  five  groups  of  moves 
{wj},  {6j},  {/,},  {dj},  and  {mj}  in  (53). 

To  ensure  that  at  most  one  move  is  chosen,  we  define  w' , 6',  /j,  d'v  m'  to 
mean  “a  prior  choice  is  better.”  Thus,  w'5  = 0,  w[  = w5,  w’3  = w1  V w[,  . . . , 
w4  ~ w8  v w8>  K = w4  V W4,  b[  = b5  V 65,  . . . , TO4  = rag  V mg.  Then  we  can 
complete  the  definition  of  a tic-tac-toe  automaton  by  letting 

Vj  = KA§')  V (bjAb'j)  V V (djAdj)  V (m/m'),  for  1 < j < 9.  (55) 

So  we’ve  constructed  9 gates  for  the  m’s,  48  for  the  w’s,  48  for  the  6’s,  144  for 
the  a’s  and  /3’s,  35  for  the  /’ s (with  the  help  of  Fig.  9),  35  for  the  d’s,  43  for  the 
primed  variables,  and  80  for  the  y’s.  Furthermore  we  can  use  our  knowledge  of 
partial  4-variable  functions  to  reduce  the  six  operations  in  (52)  to  only  four, 

uik  = (xt®xk)  V (os®ofe),  ftfc  = (xi®xk)  V (oi©ofc).  (56) 


This  trick  saves  48  gates;  so  our  design  has  cost  396  gates  altogether. 

The  strategy  for  tic-tac-toe  in  (47)-(s6)  works  fine  in  most  cases,  but  it  also 
has  some  glaring  glitches.  For  example,  it  loses  ignominiously  in  the  game 


ft 


01  IX 

JX_ 

o|x|o 


(57) 


the  second  X move  is  d3,  defending  against  a fork  by  O,  yet  it  actually  forces  O 
to  fork  in  the  opposite  corner!  Another  failure  arises,  for  example,  after  position 
=)=|=,  when  move  m5  leads  to  the  cat’s  game  1^,  1^,  zjlji,  z|g,  e||,  ||gg,  instead 
of  to  the  victory  for  X that  appeared  in  (46).  Exercise  65  patches  things  up  and 
obtains  a fully  correct  Boolean  tic-tac-toe  player  that  needs  just  445  gates. 


*Functional  decomposition.  If  the  function  f{x\, . . . , xn)  can  be  written  in 
the  form  g(x  1, . . . , xk,  h(xk+i, . . . , xn)),  it’s  usually  a good  idea  to  evaluate  y = 
H^k+i,  •■■,xn)  first  and  then  to  compute  g(x  1, . . .,xk,y).  Robert  L.  Ashenhurst 
inaugurated  the  study  of  such  decompositions  in  1952  [see  Annals  Computation 
Lab.  Harvard  University  29  (1957),  74-116],  and  observed  that  there’s  an  easy 
way  to  recognize  when  / has  this  special  property:  If  we  write  the  truth  table 
for  / in  a 2k  x 2n~k  array  as  in  (36),  with  rows  for  each  setting  of  Xi . . . xk  and 
columns  for  each  setting  of  xk+i  . . . xn , then  the  desired  subfunctions  g and  h 
exist  if  and  only  if  the  columns  of  this  array  have  at  most  two  different  values. 
For  example,  the  truth  table  for  the  function  ( xix2{x3x4x5 ))  is 


00000000 
0 0 0 1 0 1 1 1 
0 0 0 1 0 1 1 1 
11111111 
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when  expressed  in  this  two-dimensional  form.  One  type  of  column  corresponds 
to  the  case  h(xk+i,. . ,,xn)  = 0;  the  other  corresponds  to  h( xk+x, . . ,,xn)  = 1. 

In  general  the  variables  X = {xi, . . . ,xn}  might  be  partitioned  into  any  two 
disjoint  subsets  Y = {t/i, . . . , yk]  and  Z = {zi, . . . , and  we  might  have 

f(x)  ~ 9{y,h{z)).  We  could  test  for  a (Y,Z)  decomposition  by  looking  at  the 
columns  of  the  2k  x 2n~k  truth  table  whose  rows  correspond  to  values  of  y.  But 
there  are  2”  such  ways  to  partition  X ; and  all  of  them  are  potential  winners, 
except  for  trivial  cases  when  |1  | = 0 or  \Z\  < 1.  How  can  we  avoid  examining 
such  a humungous  number  of  possibilities? 

A practical  way  to  proceed  was  discovered  by  V.  Y.-S.  Shen,  A.  C.  McKellar, 
and  P.  Weiner  [IEEE  Transactions  C-20  (1971),  304-309],  whose  method  usually 
needs  only  0(n2)  steps  to  identify  any  potentially  useful  partition  (Y,  Z)  that 
may  exist.  The  basic  idea  is  simple:  Suppose  Xi  £ Z,  xj  £ Z,  and  xrn  £ Y . 
Define  eight  binary  vectors  Si  for  l = (lil2l3)2,  where  Si  has  (Zi,  I2, I3)  respectively 
in  components  {i,j,  m),  and  zeros  elsewhere.  Consider  any  randomly  chosen 
vector  x = xi . . ,xn,  and  evaluate  /,  = f{x®5i)  for  0 < l < 7.  Then  the  four  pairs 

© © © © 

will  appear  in  a 2 x 4 submatrix  of  the  2k  x 2n~k  truth  table.  So  a decomposition 
is  impossible  if  these  pairs  are  distinct,  or  if  they  contain  three  different  values. 

Let’s  call  the  pairs  “good”  if  they’re  all  equal,  or  if  they  have  only  two 
different  values.  Otherwise  they’re  “bad.”  If  / has  essentially  random  behavior, 
we’ll  soon  find  bad  pairs  if  we  do  this  experiment  with  several  different  randomly 
chosen  vectors  x,  because  only  88  of  the  256  possibilities  for  /0/i  . . . f7  correspond 
to  a good  set  of  pairs;  the  probability  of  finding  good  pairs  ten  times  in  a row  is 
only  (25?;)  ° ~ -00002.  And  when  we  do  discover  bad  pairs,  we  can  conclude  that 

Xi  £ Z and  x3  £ Z =>•  xm  £ Z,  (59) 

because  the  alternative  xm  £ Y is  impossible. 

Suppose,  for  example,  that  n — 9 and  that  / is  the  function  whose  truth 
table  11001001000011 . . . 00101  consists  of  the  512  most  significant  bits  of  7r,  in 
binary  notation.  (This  is  the  “more-or-less  random  function”  that  we  studied 
for  n = 4 in  (5)  and  (6)  above.)  Bad  pairs  for  this  7r  function  are  quickly 
found  in  each  of  the  cases  ( i,j,m ) for  which  m 7^  i < j / m.  Indeed,  in 
the  author’s  experiments,  170  of  those  252  cases  were  decided  immediately;  the 
average  number  of  random  x vectors  per  case  was  only  1.52;  and  only  one  case 
needed  as  many  as  eight  x’s  before  bad  pairs  appeared.  Thus  (59)  holds  for  all 
relevant  and  the  function  is  clearly  indecomposable.  In  fact,  exercise 

73  points  out  that  we  needn’t  make  252  tests  to  establish  the  indecomposability 
of  this  7r  function;  only  (£)  = 36  of  them  would  have  been  sufficient. 

Turning  to  a less  random  function,  let  /( xlf  ...,x9)  = (det  X)  mod  2,  where 

(xi  x2  x3 
x4  x5  x6 

X7  X&  Xg 


X = 


(60) 
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This  function  does  not  satisfy  condition  (59)  when  i — 1,  j = 2,  and  m = 3, 
because  there  are  no  bad  pairs  in  that  case.  But  it  does  satisfy  (59)  for  4 < m < 9 
when  {i,j}  = {1,  2}.  We  can  denote  this  behavior  by  the  convenient  abbreviation 
‘12=4-456789’;  the  full  set  of  implications,  for  all  pairs  {i,j},  is 

12=4-456789  18=4-34569  27=4-34569  37=4-24568  48=4-12369  67=4-12358 

13=^456789  19=4-24568  28=^134679  38=^14567  49=4-12358  68=4-12347 

14=4-235689  23=4-456789  29=4-14567  39=4-124578  56=4-123789  69=4-124578 

15=4-36789  24=4-36789  34=4-25789  45=4-123789  57=4-12369  78=4-123456 

16=4-25789  25=4-134679  35=4-14789  46=4-123789  58=4-134679  79=4-123456 

17=4-235689  26=4-14789  36=4-124578  47=4-235689  59=4-12347  89=4-123456 

(see  exercise  69).  Bad  pairs  are  a little  more  difficult  to  find  when  we  probe 
this  function  at  random:  The  average  number  of  x’s  needed  in  the  author’s 
experiments  rose  to  about  3.6,  when  bad  pairs  did  exist.  And  of  course  there 
was  a need  to  limit  the  testing,  by  choosing  a tolerance  threshold  t and  then 
giving  up  when  t consecutive  trials  failed  to  find  any  bad  pairs.  Choosing  t = 10 
would  have  found  all  but  8 of  the  198  implications  listed  above. 

Implications  like  (59)  are  Horn  clauses,  and  we  know  from  Section  7.1.1  that 
it’s  easy  to  make  further  deductions  from  Horn  clauses.  Indeed,  the  method  of 
exercise  74  will  deduce  that  the  only  possible  partition  with  \Z\  > 1 is  the  trivial 
one  (Y  = 0,  Z = {xi, . . . ,2:9}),  after  looking  at  fewer  than  50  cases  ( i,j,m ). 

Similar  results  occur  when  f{x\, . . . ,x9)  = [per  A > 0],  where  per  denotes 
the  permanent  function.  (In  this  case  / tells  us  if  there  is  a perfect  matching 
in  the  bipartite  subgraph  of  A'3  3 whose  edges  are  specified  by  the  variables 
Xi . . . Xq.)  Now  there  are  just  180  implications, 

12=4-456789  18=4-3459  27=4-3459  37=4-2468  48=4-1269  67=4-1358 

13=4-456789  19=4-2468  28=4-134679  38=4-1567  49=4-1358  68=4-2347 

14=4-235689  23=4-456789  29=4-1567  39=4-124578  56=4-123789  69=4-124578 

15=>3678  24=4-3678  34=4-2579  45=4-123789  57=4-1269  78=4-123456 

16=4-2579  25=4-134679  35=4-1489  46=4-123789  58=4-134679  79=4-123456 

17=4-235689  26=4-1489  36=4-124578  47=4-235689  59=4-2347  89=4-123456, 

only  122  of  which  would  have  been  discovered  with  t = 10  as  the  cutoff  threshold. 
(The  best  choice  of  t is  not  clear;  perhaps  it  should  vary  dynamically.)  Still,  those 
122  Horn  clauses  were  more  than  enough  to  establish  indecomposability. 

What  about  a decomposable  function?  With  / = {x2XzXqX^{xiX4X^xtx^)) 
we  get  iAj=4-m  for  all  m £ {*,  j } , except  when  {i,  j}  C {1, 4,  5,  7, 8};  in  the  latter 
case,  m must  also  belong  to  {1, 4,  5,  7, 8}.  Although  only  185  of  these  212  impli- 
cations were  discovered  with  tolerance  t = 10,  the  partition  Y = {X2,  X3,  xq,  xg}, 
Z = {xi,X4,X5,xr,xs}  emerged  quickly  as  a strong  possibility. 

Whenever  a potential  decomposition  is  supported  by  the  evidence,  we  need 
to  verify  that  the  corresponding  2fc  x 2n~k  truth  table  does  indeed  have  only 
one  or  two  distinct  columns.  But  we’re  happy  to  spend  2n  units  of  time  on  that 
verification,  because  we’ve  greatly  simplified  the  evaluation  of  /. 


120  COMBINATORIAL  SEARCHING 


The  comparison  function  / = [( xxx2x3x4)2  > (x5x6x7x8)2  + x9] 
interesting  case.  Its  184  potentially  deducible  implications  are 
12=^3456789  18=>2345679  27=^34689  37=>489  48=^9 

13=^2456789  19=^2345678  28=»34679  38=>479  49=^8 

14=^2356789  23=^46789  29=^34678  39=^478  56=^1234789 

15=^2346789  24=t>36789  34=^789  45=^1236789  57=>1234689 

16=^2345789  25=^1346789  35=t-1246789  46=^-23789  58=^1234679 

17=>2345689  26=>34789  36=>24789  47=>389  59=^1234678 


7.1.2 
is  another 

67=^-23489 
68=>23479 
69=>23478 
78^-349 
79=^348 
89=^4, 

and  145  of  them  were  found  when  t = 10.  Three  decompositions  reveal  them- 
selves in  this  case,  having  Z = {x4,x8,x9},  Z = {x3,x4,x7,x8,x9},  and  Z = 
{X2,x3,x4,x6,x7,x8,x9},  respectively.  Ashenhurst  proved  that  we  can  reduce  / 
immediately  as  soon  as  we  find  a nontrivial  decomposition;  the  other  decompo- 
sitions will  show  up  later,  when  we  try  to  reduce  the  simpler  functions  g and  h. 

^Decomposition  of  partial  functions.  When  the  function  / is  only  partially 
specified,  a decomposition  with  partition  (Y,  Z)  hinges  on  being  able  to  assign 
values  to  the  don’t-cares  so  that  at  most  two  different  columns  appear  in  the 
corresponding  2k  x 2n~k  truth  table. 

Two  vectors  ux . . . um  and  vx  . . . vm  consisting  of  Os,  Is,  and  *s  are  said  to 
be  incompatible  if  either  u. , - 0 and  Vj  = 1 or  u}  = 1 and  Vj  = 0,  for  some  j - 
equivalently,  if  the  subcubes  of  the  m-cube  specified  by  u and  v have  no  points 
m common.  Consider  the  graph  whose  vertices  are  the  columns  of  a truth  table 
with  don’t-cares,  where  u — v if  and  only  if  u and  „ are  incompatible.  We  can 
assign  values  to  the  *s  to  achieve  at  most  two  distinct  columns  if  and  only  if  this 
graph  is  bipartite.  For  if  m,  . . . , ut  are  mutually  compatible,  their  generalized 
consensus  uxU  • ■ • U uh  defined  in  exercise  7.1.1  -32,  is  compatible  with  all  of  them 
[See  S.  L.  Hight,  IEEE  Trans.  C-22  (1973),  103-110;  E.  Boros,  V.  Gurvich,  P.  L. 
Hammer,  T.  Ibaraki,  and  A.  Kogan,  Discrete  Applied  Math.  62  (1995),  51-75.1 
Since  a graph  is  bipartite  if  and  only  if  it  contains  no  odd  cycles,  we  can  easily 
test  this  condition  with  a depth-first  search  (see  Section  7.4.1). 

Consequently  the  method  of  Shen,  McKellar,  and  Weiner  works  also  when 
don’t-cares  are  present:  The  four  pairs  in  (58)  are  considered  bad  if  and  only 
if  three  of  them  are  mutually  incompatible.  We  can  operate  almost  as  before 
although  bad  pairs  will  naturally  be  harder  to  find  when  there  are  lots  of  *s  (see 
exercise  72).  However,  Ashenhurst’s  theorem  no  longer  applies.  When  several 
decompositions  exist,  they  all  should  be  explored  further,  because  they  might  use 
different  settings  of  the  don’t-cares,  and  some  might  be  better  than  the  others. 

Although  most  functions  f(x)  have  no  simple  decomposition  g{y,h{z)),  we 
needn’t  give  up  hope  too  quickly,  because  other  forms  like  g(y,  hx(z),  h2(z))  might 
weli  lead  to  an  efficient  chain.  If,  for  example,  / is  symmetric  in  three  of  its  vari- 
ables {zx,z2,  z3},  we  can  always  write  f(x)  = g(y , Sh2(zu  z2,  z3),  S1,3(z1,z2,  z3)), 
since  j\,2\zX,  z2,z3)  and  Sli3(zi,  z2,  z3)  characterize  the  value  of  zx  + z2  + z3. 
(Notice  that  just  four  steps  will  suffice  to  compute  both  S12  and  Sx  3.) 

In  general,  as  observed  by  H.  A.  Curtis  [JACM  8 (1961),  484-496 ],  f(x)  can 
be  expressed  in  the  form  g(y,  hx(z), . . . , hr(z))  if  and  only  if  the  2k  x 2n~k  truth 
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table  corresponding  to  Y and  Z has  at  most  2r  different  columns.  And  when 
don’t-cares  are  present,  the  same  result  holds  if  and  only  if  the  incompatibility 
graph  for  Y and  Z can  be  colored  with  at  most  2r  colors. 

For  example,  the  function  /( x)  — (detX)mod2  considered  above  turns 
out  to  have  eight  distinct  columns  when  Z = {x4,xs,X6,X7,x$,Xq}-,  that’s  a 
surprisingly  small  number,  considering  that  the  truth  table  has  8 rows  and 
64  columns.  From  this  fact  we  might  be  led  to  discover  how  to  expand  a 
determinant  by  cofactors  of  the  first  row, 

f(x)  = xihhi(x4, . . . ,x9)  ® x2Ah2(x4, . . . ,x9)  © x3Ah3(x4, . . . ,x9), 
if  we  didn’t  already  know  such  a rule. 

When  there  are  d < 2r  different  columns,  we  can  think  of  f(x)  as  a function 
of  y and  h(z),  where  h takes  each  binary  vector  z\  . . . zn-k  into  one  of  the 
values  {0, 1, . . . , d — 1}.  Thus  (hi,...,hr)  is  essentially  an  encoding  of  the 
different  column  types,  and  we  hope  to  find  very  simple  functions  h\,  ...  ,hr  that 
provide  such  an  encoding.  Moreover,  if  d is  strictly  less  than  2r,  the  function 
g(y,  hi, ... , hr)  will  have  many  don’t-cares  that  may  well  decrease  its  cost. 

The  distinct  columns  might  also  suggest  a function  g for  which  the  h's  have 
don't-cares.  For  example,  we  can  use  g{yi,y2,hi,hi)  = (yi  ©(hi  Ay2))  Ah2  when 
all  columns  are  either  (0,0,0,0)T  or  (0,0, 1,1)T  or  (0, 1, 1, 0)T;  then  the  value 
of  h\{z)  is  arbitrary  when  z corresponds  to  an  all-zero  column.  H.  A.  Curtis 
has  explained  how  to  exploit  this  idea  when  |F|  = 1 and  \Z\  — n — 1 [see  IEEE 
Transactions  C-25  (1976),  1033-1044]. 

For  a comprehensive  discussion  of  decomposition  techniques,  see  Richard  M. 
Karp,  J.  Society  for  Industrial  and  Applied  Math.  11  (1963),  291-335. 

Larger  values  of  n.  We’ve  been  considering  only  rather  tiny  examples  of 
Boolean  functions.  Theorem  S tells  us  that  large,  random  examples  are  inher- 
ently difficult;  but  practical  examples  might  well  be  highly  nonrandom.  So  it 
makes  sense  to  search  for  simplifications  using  heuristic  methods. 

When  n grows,  the  best  ways  currently  known  for  dealing  with  Boolean 
functions  generally  start  with  a Boolean  chain  — not  with  a huge  truth  table  — 
and  they  try  to  improve  that  chain  via  “local  changes.”  The  chain  can  be 
specified  by  a set  of  equations.  Then,  if  an  intermediate  result  is  used  in  com- 
paratively few  subsequent  steps,  we  can  try  to  eliminate  it,  temporarily  making 
those  subsequent  steps  into  functions  of  three  variables,  and  reformulating  those 
functions  in  order  to  make  a better  chain  when  possible. 

For  example,  suppose  the  gate  Xi  = Xj  o is  used  only  once,  in  the  gate 
xi  — Xi  oxm,  so  that  xi  — (xj  o Xk)  axm.  Other  gates  might  already  exist,  by 
which  we  have  computed  other  functions  of  Xj,  Xk,  and  xm-,  and  the  definitions 
of  Xj,  Xk,  and  xm  may  imply  that  some  of  the  joint  values  of  (xj,Xk,xm)  are 
impossible.  Thus  we  might  be  able  to  compute  xi  from  other  gates  by  doing 
just  one  further  operation.  For  example,  if  X{  — Xj  A Xf.  and  xi  = xm,  and 
if  the  values  Xj  V xm  and  Xk  V xm  appear  elsewhere  in  the  chain,  we  can  set 
xi  = ( XjVxm ) A (xkVxm);  this  eliminates  Xi  and  reduces  the  cost  by  1.  Or  if, 
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say,  xj  A (xfc®xm)  appears  elsewhere  and  we  know  that  x,xfcxm  + 101  we  can 
set  Xl  = xm  ® (Xj  A (xkexm)). 

■ 18  Td  °nly  I11  V and  Xl  iS  USed  °nly  in  then  §ate  xp  depends  on  four 

variables,  and  we  might,  be  able  to  reduce  the  cost  by  using  our  total  knowledge  of 

four-variable  functions,  obtaining  xp  in  a better  way  while  eliminating  x2  and  x;. 

lmilarly,  if  xt  appears  only  in  x,  and  xp , we  can  eliminate  xz  if  we  find  a better 
way  to  evaluate  two  different  functions  of  four  variables,  possibly  with  don’t- 
cares  and  with  other  functions  of  those  four  variables  available  for  free  Again 
we  know  how  to  solve  such  problems,  using  the  footprint  method  discussed  above’ 

When  no  local  changes  are  able  to  decrease  the  cost,  we  can  also  try  local 
changes  that  preserve  or  even  increase  the  cost,  in  order  to  discover  different 
kinds  of  chains  that  might  simplify  in  other  ways.  We  shall  discuss  such  local 
search  methods  extensively  in  Section  7.10. 

Excellent  surveys  of  techniques  for  Boolean  optimization,  which  electrical 
engineers  call  the  problem  of  “multilevel  logic  synthesis,”  have  been  published 
hyf  . I'  achte1,  and  A-  L'  Sangiovanni-Vincentelli,  Proceedings 

j!  h.C  ^ E 78  1990)’  264'300’  and  in  the  book  Synthesis  and  Optimization  of 
Digital  Circuits  by  G.  De  Micheli  (McGraw-Hill,  1994), 

Lower  bounds.  Theorem  S tells  us  that  nearly  every  Boolean  function  of 
n 12  variables  is  hard  to  evaluate,  requiring  a chain  whose  length  exceeds  2 n/n 
Yet  modern  computers,  which  are  built  from  logic  circuits  involving  electric 
signals  that  represent  thousands  of  Boolean  variables,  happily  evaluate  zillions 
of  Boolean  functions  every  microsecond.  Evidently  there  are  plenty  of  important 
unctions  that  can  be  evaluated  quickly,  in  spite  of  Theorem  S.  Indeed,  the  proof 
of  that  theorem  was  indirect;  we  simply  counted  the  cases  of  low  cost,  so  we 
earned  absolutely  nothing  about  any  particular  examples  that  might  arise  in 
practice.  When  we  want  to  compute  a given  function  and  we  can  only  think  of  a 
laborious  way  to  do  the  job,  how  can  we  be  sure  that  there’s  no  tricky  shortcut? 

The  answer  to  that  question  is  almost  scandalous:  After  decades  of  concen- 
trated research  computer  scientists  have  been  unable  to  find  any  explicit  family 
of  functions  /(*!,.„, x„)  whose  cost  is  inherently  nonlinear,  as  n increases. 

ie  true  behavior  is  2 n/n,  but  no  lower  bound  as  strong  as  n log  log  log  n has 
yet  been  proved!  Of  course  we  could  rig  up  artificial  examples,  such  as  “the 
lexicographically  smallest  truth  table  of  length  2"  that  isn’t  achievable  by  any 
Boolean  chain  of  length  [2"/n\  - 1”;  but  such  functions  are  surely  not  explicit. 
The  truth  tabk  of  an  explicit  function  f(xu...,xn)  should  be  computable  in 
a most,  say  2 units  of  time  for  some  constant  c;  that  is,  the  time  needed  to 

1 yT^  j°f  ^ functl0n  vahles  should  be  polynomial  in  the  length  of  the  truth 
table.  Under  those  ground  rules,  no  family  of  single-output  functions  is  currently 

rc110WM  di  haVe  a combmatlonal  complexity  that  exceeds  3 n + 0(1)  as  n -a  oo. 
[bee  IN.  Blum,  Theoretical  Computer  Science  28  (1984),  337-345.] 

The  picture  is  not  totally  bleak,  because  several  interesting  linear  lower 
bounds  have  been  proved  for  functions  of  practical  importance.  A basic  way  to 
obtain  such  results  was  introduced  by  N.  P.  Red’kin  in  1970:  Suppose  we  have 
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an  optimum  chain  of  cost  r for  f(xi, . . . ,xn).  By  setting  x„  <—  0 or  xn  <—  1,  we 
obtain  reduced  chains  for  the  functions  g(x\, . . . , x„_i)  = f(x 1; . . . , xn-i , 0)  and 
h(x i, . . . , £n_i)  = f(x i, . . . , xn-i , 1),  having  cost  r — u if  xn  was  used  as  an  input 
to  u different  gates.  Moreover,  if  xn  is  used  in  a “canalizing”  gate  Xi  — xn  o x*,, 
where  the  operator  o is  neither  ® nor  =,  some  setting  of  xn  will  force  x,  to 
be  constant,  thereby  further  reducing  the  chain  for  g or  h.  Lower  bounds  on  g 
and/or  h therefore  lead  to  a lower  bound  on  /.  (See  exercises  77-81.) 

But  where  are  the  proofs  of  nonlinear  lower  bounds?  Almost  every  problem 
with  a yes-no  answer  can  be  formulated  as  a Boolean  function,  so  there’s  no 
shortage  of  explicit  functions  that  we  don’t  know  how  to  evaluate  in  linear 
time,  or  even  in  polynomial  time.  For  example,  any  directed  graph  G with 
vertices  {ui, . . . , vm}  can  be  represented  by  its  adjacency  matrix  X,  where  xtj  = 
[■ Vi  — > Vj  ] ; then 

f(xi2,...,xim,...,xmi,...,xm(m_i))  — [G  has  a Hamiltonian  path]  (6i) 

is  a Boolean  function  of  n = m(m  — 1)  variables.  We  would  dearly  love  to  be 
able  to  evaluate  this  function  in,  say,  n4  steps.  We  do  know  how  to  compute 
the  truth  table  for  / in  0(m!  2")  = 2n+°('/"  lo§n)  steps,  since  only  m!  potential 
Hamiltonian  paths  exist;  thus  / is  indeed  “explicit.”  But  nobody  knows  how  to 
evaluate  / in  polynomial  time,  or  how  to  prove  that  there  isn’t  a 4n-step  chain. 

For  all  we  know,  short  Boolean  chains  for  / might  exist,  for  each  n.  After  all, 
Figs.  9 and  10  reveal  the  existence  of  fiendishly  clever  chains  even  in  the  cases  of 
4 and  5 variables.  Efficient  chains  for  all  of  the  larger  problems  that  we  ever  will 
need  to  solve  might  well  be  “out  there”  — yet  totally  beyond  our  grasp,  because 
we  don’t  have  time  to  find  them.  Even  if  an  omniscient  being  revealed  the  simple 
chains  to  us,  we  might  find  them  incomprehensible,  because  the  shortest  proof 
of  their  correctness  might  be  longer  than  the  number  of  cells  in  our  brains. 

Theorem  S rules  out  such  a scenario  for  most  Boolean  functions.  But  fewer 
than  2100  Boolean  functions  will  ever  be  of  practical  importance  in  the  entire 
history  of  the  world,  and  Theorem  S tells  us  zilch  about  them. 

In  1974,  Larry  Stockmeyer  and  Albert  Meyer  were,  however,  able  to  con- 
struct a Boolean  function  / whose  complexity  is  provably  huge.  Their  / isn’t 
“explicit,”  in  the  precise  sense  described  above,  but  it  isn’t  artificial  either;  it 
arises  naturally  in  mathematical  logic.  Consider  symbolic  statements  such  as 


048+1015*1063;  (62) 

Vm3n(m<n+1) ; (63) 

Vn3m(m+l<n) ; (64) 

VaVb(b>a+2=>3ab(a<abAab<b)  ) ; (65) 

VAVB(A=B#-i3n(n€AAn(?BVn6BAn£A) ) ; (66) 

VA(3n(neA)=>3m(m€AAVn(n6A=tm<n)))  ; (67) 

VA(3n(neA)=>3m(m€AAVn(neA*m>n)))  ; (68) 

3PVa((aep#a+3eP)#a<1000)  ; (69) 

VAVB  (VCVc  (C=AAc=lvC=BAc=0*  (Vn(n€C#n+l€C)4>c=l)  ) =>-iAsB)  . (70) 
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Stockmeyer  and  Meyer  defined  a language  L by  using  the  63-character  alphabet 

V3-,()2€e+AV=>»<<=*>>abcdefghijklmnopqABCDEFGHIJKLMN0PQ0123456789 

and  giving  conventional  meanings  to  these  symbols.  Strings  of  lowercase  letters 
wRhm  the  sentences  of  L,  like  ‘ab’  in  (65),  represent  numeric  variables,  restricted 
to  nonnegative  integers;  strings  of  uppercase  letters  represent  set  variables, 
restricted  to  finite  sets  of  such  numbers.  For  example,  (66)  means,  “For  all 
mte  sets  A and  B.  we  have  A = B if  and  only  if  there  doesn’t  exist  a number  n 
that  is  in  A but  not  in  B,  or  in  B but  not  in  A."  Some  of  these  statements  are 
true,  others  are  false.  (See  exercise  82.) 

All  of  the  strings  (62)-(7o)  belong  to  L,  but  the  language  is  actually  quite 
restricted.  The  only  arithmetic  operation  allowed  on  a number  is  to  add  a 
constant;  we  can  write  ‘a+13’  but  not  ‘a+b’.  The  only  relation  allowed  between 
a number  and  a set  is  elementhood  (e  or  €).  The  only  relation  allowed  between 
sets  is  equality  (-).  Furthermore  all  variables  must  be  quantified  by  3 or  V * 

Every  sentence  of  L that  has  length  k < n can  be  represented  by  a binary 
vector  of  length  6n  with  zeros  in  the  last  6 (n  - k)  bits.  Let  f(x)  be  a Boolean 
function  of  6n  variables  such  that  /(x)  = 1 whenever  x represents  a true  sentence 
ot  L,  and  f(x)  _ 0 whenever  x represents  a sentence  that  is  false;  the  value  of  f (x) 
is  unspecified  when  x doesn’t  represent  a meaningful  sentence.  The  truth  table 
for  such  a function  / can  be  constructed  in  a finite  number  of  steps,  according 
o theorems  of  Bucln  and  Elgot  [Zeitschrift  fiir  math.  Logik  und  Grundlagen  der 
Math.  6 (1960),  66-92;  Transactions  of  the  Amer.  Math.  Soc.  98  (1961),  21-511. 
ut  finite”  does  not  mean  “feasible” : Stockmeyer  and  Meyer  proved  that 

C(/)  > 2r-5  whenever  n > 460  + ,302r  + 5.08 In r and  r > 36.  (7i) 

In  particular,  we  have  C(f)  > 2™  > i0i»  when  „ = 621.  A Boolean  chain  with 
that  many  gates  could  never  be  built,  since  10128  is  a generous  upper  bound  on 

the  number  of  protons  in  the  universe.  So  this  is  a fairly  small,  finite  problem 
that  will  never  be  solved. 

Details  of  Stockmeyer  and  Meyer’s  proof  appear  in  JACM  49  (2002).  753- 
4.  The  basic  idea  is  that  the  language  L,  though  limited,  is  rich  enough  to 
describe  truth  tables  and  the  complexity  of  Boolean  chains,  using  fairly  short 
sentences;  hence  / has  to  deal  with  inputs  that  essentially  refer  to  themselves. 

*For  further  reading.  Thousands  of  significant  papers  have  been  written  about 
networks  of  Boolean  gates,  because  such  networks  underlie  so  many  aspects  of 
theory  and  practice.  We  have  focused  in  this  section  chiefly  on  topics  that  are 
relevant  to  computer  programming  for  sequential  machines.  But  other  topics 
have  also  been  extensively  investigated,  of  primary  relevance  to  parallel  compu- 
a ion,  such  as  the  study  of  small-depth  circuits  in  which  gates  can  have  any 
number  of  inputs  (“unlimited  fan-in”).  Ingo  Wegener’s  book  The  Complexity  of 

one^ucctsiT^Xrks?;  "TT6"  °f  u be‘°ng  ‘°  “W6ak  second-”der  monadic  logic  with 
c W k second-°rder  logic  allows  quantification  over  finite  sets-  monadic  loe-ic 

with  k successors  is  the  theory  of  unlabeled  k- ary  trees.  monadic  logic 
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Boolean  Functions  (Teubner  and  Wiley,  1987)  provides  a good  introduction  to 
the  entire  subject. 

We  have  mostly  considered  Boolean  chains  in  which  all  binary  operators 
have  equal  importance.  For  our  purposes,  gates  such  as  ® or  C are  neither  more 
nor  less  desirable  than  gates  such  as  A or  V.  But  it’s  natural  to  wonder  if  we 
can  get  by  with  only  the  monotone  operators  A and  V when  we  are  computing  a 
monotone  function.  Alexander  Razborov  has  developed  striking  proof  techniques 
to  show  that,  in  fact,  monotone  operators  by  themselves  have  inherently  limited 
capabilities.  He  proved,  for  example,  that  all  AND-OR  chains  to  determine 
whether  the  permanent  of  an  n x n matrix  of  Os  and  Is  is  zero  or  nonzero 
must  have  cost  nn(1°sn).  [See  Doklady  Akademii  Nauk  SSSR  281  (1985),  798- 
801;  Matematicheskie  Zametki  37  (1985),  887-900.]  By  contrast,  we  will  see  in 
Section  7.5.1  that  this  problem,  equivalent  to  “bipartite  matching,”  is  solvable 
in  only  0(n2  5)  steps.  Furthermore,  the  efficient  methods  in  that  section  can 
be  implemented  as  Boolean  chains  of  only  slightly  larger  cost,  when  we  allow 
negation  or  other  Boolean  operations  in  addition  to  A and  V.  (Vaughan  Pratt 
has  called  this  “the  power  of  negative  thinking.”)  An  introduction  to  Razborov’s 
methods  appears  in  exercises  85  and  86. 

EXERCISES 

1.  [24]  The  “random”  function  in  formula  (6)  corresponds  to  a Boolean  chain  of 
cost  4 and  depth  4.  Find  a formula  of  depth  3 that  has  the  same  cost. 

2.  [21]  Show  how  to  compute  (a)  w © ( xyz ) and  (b)  w A ( xyz ) with  formulas  that 
have  depth  3 and  cost  5. 

3.  [M23]  (B.  I.  Finikov,  1957.)  If  the  Boolean  function  f(x i,...,i„)  is  true  at 
exactly  k points,  prove  that  L(f)  < 2n+(k-2)2k~1 . Hint:  Think  of  k = 3 and  n = 106. 

4.  [M26]  (P.  M.  Spira,  1971.)  Prove  that  the  minimum  depth  and  formula  length  of 
a Boolean  function  satisfy  lg  L(f)  < D(f)  < algL(/)  + 1,  where  a = 2/  lg(§)  « 3.419. 
Hint:  Every  binary  tree  with  r > 3 internal  nodes  contains  a subtree  with  s internal 
nodes,  where  < s < | r. 

► 5.  [21]  The  Fibonacci  threshold  function  Fn(x i, . . . ,xn ) = (xf1®^2  . . . ^n-i1  xn"~2) 
was  analyzed  in  exercise  7.1.1-101,  when  n > 3.  Is  there  an  efficient  way  to  evaluate  it? 

6.  [20]  True  or  false:  A Boolean  function  f(x i,...,i„)  is  normal  if  and  only  if  it 
satisfies  the  general  distributive  law  /(x i, . . . , xn ) A y = f(x i A y, . . . , x„  A y). 

7.  [20]  Convert  the  Boolean  chain  ‘x5  = ii  V i4,  x6  = i2  V 15,  xr  = ®i  A X3, 
xs  = xe  = X7’  to  an  equivalent  chain  (x5,  xe,  X7,  xg)  in  which  every  step  is  normal. 

► 8.  [20]  Explain  why  (11)  is  the  truth  table  of  variable  Xk- 

9.  [20]  Algorithm  L determines  the  lengths  of  shortest  formulas  for  all  functions  /, 
but  it  gives  no  further  information.  Extend  the  algorithm  so  that  it  also  provides  actual 
minimum-length  formulas  like  (6). 

► 10.  [20]  Modify  Algorithm  L so  that  it  computes  D(f ) instead  of  L(f). 

► 11.  [22]  Modify  Algorithm  L so  that,  instead  of  lengths  L(f),  it  computes  upper 
bounds  U(f)  and  footprints  4>(!)  as  described  in  the  text. 

12.  [15]  What  Boolean  chain  is  equivalent  to  the  minimum-memory  scheme  (13)? 
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13.  [16]  What  are  the  truth  tables  of  /i,  /2,  /3,  /4,  and  /5  in  example  (13)? 

14.  [22]  What’s  a convenient  way  to  compute  the  5n(n - 1)  truth  tables  of  (17),  given 
the  truth  table  of  g ? (Use  bitwise  operations  as  in  (15)  and  (16).) 

15.  [28]  Find  short.-as-possible  ways  to  evaluate  the  following  functions  using  mini- 
mum memory:  (a)  S2(*i, x2, x3, *4);  (b)  S1(x1,x2,x3,x 4);  (c)  the  function  in  (18). 

16.  [HM33]  Prove  that  fewer  than  2118  of  the  2128  Boolean  functions  f(Xl,...,x7) 
are  computable  in  minimum  memory. 

► 17.  [25]  (M.  S.  Paterson,  1977.)  Although  Boolean  functions  f(x i,...,xn)  cannot 
always  be  evaluated  in  n registers,  prove  that  n + 1 registers  are  always  sufficient.  In 
other  words,  show  that  there  is  always  a sequence  of  operations  like  (13)  to  compute 
f(xi,  ■ ■ ■ ,xn)  if  we  allow  0 < j(i),k(i)  < n. 

► 18.  [35]  Investigate  optimum  minimum- memory  computations  for  f(xi,x2,  x3,  s4,  x5): 

How  many  classes  of  five-variable  functions  have  Cm(f)  = r,  for  r = 0,  1,  2 ? 

19.  [M22]  If  a Boolean  chain  uses  n variables  and  has  length  r < n + 2,  prove  that  it 
must  be  either  a top-down’  or  a “bottom-up”  construction. 

► 20.  [40]  (R.  Schroeppel,  2004.)  A Boolean  chain  is  canalizing  if  it  does  not  use  the 
operators  © or  = Find  the  optimum  cost,  length,  and  depth  of  all  4-variable  functions 
under  this  constraint.  Does  the  footprint  heuristic  still  give  optimum  results? 

21.  [46]  For  how  many  four- variable  functions  did  the  Harvard  researchers  discover 
an  optimum  vacuum-tube  circuit  in  1951? 

22.  [21]  Explain  the  chain  for  S3  in  Fig.  10,  by  noting  that  it  incorporates  the  chain 
for  S2, 3 in  Fig.  9.  Find  a similar  chain  for  S2  (*1,0:2,  x3,x4,x5). 

► 23.  [23]  Figure  10  illustrates  only  16  of  the  64  symmetric  functions  on  five  elements. 
Explain  how  to  write  down  optimum  chains  for  the  others. 

24.  [47]  Does  every  symmetric  function  / have  Cm(/)  = C(/)? 

► 25.  [17]  Suppose  we  want  a Boolean  chain  that  includes  all  functions  of  n variables: 
Let  fk(x  1, . . . ,xn)  be  the  function  whose  truth  table  is  the  binary  representation  of  k 
for  0 < k < m = 22”.  What  is  C(/0/i  . . . /m-t)? 

26  [25]  True  or  false:  If  /(*„,  ...,*„)  = (*o  A g(xu. . . , *„))  0 h(xu.  ..,*„),  where  g 
and  h are  nontrivial  Boolean  functions  whose  joint  cost  is  C(gh),  then  C(f)  = 2 + C(gh). 

► 27.  [23]  Can  a full  adder  (22)  be  implemented  in  five  steps  using  only  minimum 
memory  (that  is,  completely  inside  three  one-bit  registers)? 

28.  [26]  Prove  that  <7(uV)  = C(u"v")  = 5 for  the  two-output  functions  defined  by 

(«V)2  = (x  + y-  (uv)2)  mod  4,  (u'v")2  = (-x  - y - (uv)2)  mod  4. 

Use  these  functions  to  evaluate  [(xi  + • • • + xn)  mod  4 = 0]  in  fewer  than  2.5n  steps. 

29.  [M28]  Prove  that  the  text’s  circuit  for  sideways  addition  (27)  has  depth  0(log n ). 

30.  [M25]  Solve  the  binary  recurrence  (28)  for  the  cost  s(n)  of  sideways  addition. 

31-  W]  If  f(xi,...,Xn)  is  symmetric,  prove  that  C(f)  < 5n  + 0(n/log  n). 

32.  [HM16]  Why  does  the  solution  to  (30)  satisfy  f(n)  = 2n  + 0( 2"/2)? 

33.  [HM22]  True  or  false:  If  1 < N < 2n,  the  first  N minterms  of  {Xl, . . . ,xn\ 
all  be  evaluated  in  N + 0(VN ) steps,  as  n -4  00  and  IV  00. 


can 


7.1.2 


BOOLEAN  EVALUATION 


127 


► 34.  [22]  A priority  encoder  has  n = 2m  - 1 inputs  xi  . . . xn  and  m outputs  yi...ym, 
where  (y1  . . . ym)2  = k if  and  only  if  k = max{  j \ j = 0 or  xj  = 1}.  Design  a priority 
encoder  that  has  cost  O(n)  and  depth  0(m). 

35.  [23]  If  n > 1,  show  that  the  conjunctions  x\  A • • • A xk-\  A xk+i  A • • • A x„  for 
1 < k < n can  all  be  computed  from  (xi, . . . , xn)  with  total  cost  < 3n  - 6. 

► 36.  [M28]  (R.  E.  Ladner  and  M.  J.  Fischer,  1980.)  Let  i ik  be  the  “prefix”  xi  A • • • Axk 
for  1 < k < n.  Clearly  C(y\  . . ,yn)  = n — 1 and  D(y\  . . ,yn)  = [lgn];  but  we  can’t 
simultaneously  minimize  both  cost  and  depth.  Find  a chain  of  optimum  depth  [lg  n] 
that  has  cost  < 4n. 

37.  [M28]  (Marc  Snir,  1986.)  Given  n > m > 1,  consider  the  following  algorithm: 

51.  [Upward  loop.]  For  t <-  1,  2,  . . , , [lgm],  set  xm,u(m^k)  x2.(fc_1/2)  A 

for  k > 1 and  2t{k  - 1/2)  < m. 

52.  [Downward  loop.]  For  t <-  [lgm]  - 1,  [lgm]  - 2,  ...,  1,  set  x2t(k+1/2)  <- 
x2tk  A *2t(fc+i/2)  for  k > 1 and  2 t(k  + 1/2)  < m. 

53.  [Extension.]  For  k t-  m + 1,  m + 2,  . . . , n,  set  xk  f-  xk-\  A xk.  | 

a)  Prove  that  this  algorithm  solves  the  prefix  problem  of  exercise  36:  It  transforms 
(xi,x2, . . . , xn)  into  (xi,xi  A x2, . . . , xi  A x2  A ■ • • A *„). 

b)  Let  c(m,  n ) and  d(m,  n ) be  the  cost  and  depth  of  the  corresponding  Boolean  chain. 
Prove  for  fixed  m that,  if  n is  sufficiently  large,  c(m,  n)  + d(m,  n)  = 2n  - 2. 

c)  Given  n > 1,  what  is  d(n)  = mini <m<nd(m,n)?  Show  that  d(n)  < 21g  n. 

d)  Prove  that  there’s  a Boolean  chain  of  cost  2 n — 2 — d and  depth  d for  the  prefix 
problem  whenever  d(n)  < d < n.  (This  cost  is  optimum,  by  exercise  81.) 

38.  [25]  In  Section  5.3.4  we  studied  sorting  networks,  by  which  S(n)  comparator 
modules  are  able  to  sort  n numbers  [x\,  x2, ...  ,xn)  into  ascending  order.  If  the  inputs 
Xj  are  Os  and  Is,  each  comparator  module  is  equivalent  to  two  gates  (iAj.iV  y); 
so  a sorting  network  corresponds  to  a certain  kind  of  Boolean  chain,  which  evaluates 
n particular  functions  of  (x\,x2, . . . , xn ). 

a)  What  are  the  n functions  /1/2  • • ■ fn  that  a sorting  network  computes? 

b)  Show  that  those  functions  {/1,  f2, . . . , /„}  can  be  computed  in  O(n)  steps  with  a 
chain  of  depth  O(logn).  (Hence  sorting  networks  aren’t  asymptotically  optimal, 
Booleanwise.) 

► 39.  [M21]  (M.  S.  Paterson  and  P.  Klein,  1980.)  Implement  the  2m-way  multiplexer 

Mm(x1, . . . ,Xm',yo,yi,  ■ ■ ■ of  (31)  with  an  efficient  chain  that  simultaneously 

establishes  the  upper  bounds  C(Mm ) < 2n  + 0(s/n)  and  D(Mm)  < m + O(logm). 

40.  [25]  If  n > k > 1,  let  fnk(x  1, . . . ,xn)  be  the  “fc  in  a row”  function, 

(xi  A • ■ • A xk)  V (x2  A ■ • • A xfc+i)  V • • • V (xn+1„k  A • • • A xn). 

Show  that  the  cost  C(fnk)  of  this  function  is  less  than  4n  — 3 k. 

41.  [M23]  (Conditional-sum  adders.)  One  way  to  accomplish  binary  addition  (25) 
with  depth  O(logn)  is  based  on  the  multiplexer  trick  of  exercise  4:  If  (xx')2  + (yy') 2 = 
(zz')2,  where  |x'|  = \y'\  = \z'\,  we  have  either  (x)2  + (y)2  = (z)2  and  (x')2  + (y')2  = ( z')2 , 
or  (x)2  + (y)2  + 1 = ( z)2  and  (x')2  + (y')2  = (1  z')2.  To  save  time,  we  can  compute  both 
(x)2  + (y) 2 and  ( x)2  + (y)2  + 1 simultaneously  as  we  compute  (x')2  + (y')2.  Afterwards, 
when  we  know  whether  or  not  the  less  significant  part  (x')2  + (y')2  produces  a carry, 
we  can  use  multiplexers  to  select  the  correct  bits  for  the  most  significant  part. 
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If  this  method  is  used  recursively  to  build  2n-bit  adders  from  n-bit  adders,  how 
many  gates  are  needed  when  n = 2m?  What  is  the  corresponding  depth? 

42.  [25]  In  the  binary  addition  (25),  let  uk  = xk  A yk  and  vk  = xk  ® yk  for  0 < k < n. 

a)  Show  that  zk  = vk  © ck , where  the  carry  bits  ck  satisfy 

Ck  = Mfc_i  V (vk-\  A (uk-.2  V ( Vk-2  A (■  ■ • (m  A u 0)  • • •)))). 

b)  Let  U£  = 0,  Vkk  = 1,  and  Uk+1  =uk\/  ( vk  A Uk),  Vk+1  = vk  A Vf,  for  k > j. 
Prove  that  ck  = U$,  and  that  Uk  = Uk  V (V/  A Uf),  Vk  = Vk  A V?  for  i < j < k. 

c)  Let  h(m)  = 2m(m_1)/2.  Show  that  when  n = h(m),  the  carries  a,  . . . , cn  can  all 
be  evaluated  with  depth  (m  + l)m/2  « lg  n + y/2  lg  n and  with  total  cost  0(2mn). 

► 43.  [28]  A finite-state  transducer  is  an  abstract  machine  with  a finite  input  alpha- 
bet A,  a finite  output  alphabet  B,  and  a finite  set  of  internal  states  Q.  One  of  those 
states,  (jo i is  called  the  initial  state.”  Given  a string  ot  = ai  . . . dn , where  each  u,  C A, 
the  machine  computes  a string  0 = bi...bn,  where  each  bj  € B,  as  follows: 

Tl.  [Initialize.]  Set  j 1 and  q «—  q0. 

T2.  [Done?]  Terminate  the  algorithm  if  j > n. 

T3.  [Output  bj.]  Set  bj  «—  c(q,aj). 

T4.  [Advance  j.]  Set  q <—  d(q,aj),  j <—  j + 1,  and  return  to  step  T2.  | 

The  machine  has  built-in  instructions  that  specify  c(q,  a)  G B and  d(q , a)  £ Q for  every 
state  q £ Q and  every  character  a £ A.  The  purpose  of  this  exercise  is  to  show  that,  if 
the  alphabets  A and  B of  any  finite  state  transducer  are  encoded  in  binary,  the  string 
can  be  computed  from  a by  a Boolean  chain  of  size  0(n)  and  depth  O(logn). 

a)  Consider  the  problem  of  changing  a binary  vector  ai  . . . an  to  b\  ...  bn  by  setting 

bj  <-  aj  ® [aj  = aj-!  = • • • = aj-k  = 1 and  aj-k-i  = 0,  where  fc  > 1 is  odd], 

assuming  that  a0  = 0.  For  example,  a = 1100100100011111101101010  ^ = 

1000100100010101001001010.  Prove  that  this  transformation  can  be  carried  out 
by  a finite  state  transducer  with  |A|  = |B|  = |Q|  = 2. 

b)  Suppose  a finite  state  transducer  with  |Q|  = 2 is  in  state  q,  after  reading  ax  ..  ,aj-X. 
Explain  how  to  compute  the  sequence  qx  . . . qn  with  a Boolean  chain  of  cost  O(n) 
and  depth  O(logn),  using  the  construction  of  Ladner  and  Fischer  in  exercise  36. 
(From  this  sequence  qx  . . . qn  it  is  easy  to  compute  bx  . . . bn,  since  bj  = c(qj,aj).) 

c)  Apply  the  method  of  (b)  to  the  problem  in  (a). 

► 44.  [26]  (R.  E.  Ladner  and  M.  J.  Fischer,  1980.)  Show  that  the  problem  of  binary 
addition  (25)  can  be  viewed  as  a finite  state  transduction.  Describe  the  Boolean  chain 
that  results  from  the  construction  of  exercise  43  when  n = 2m,  and  compare  it  to  the 
conditional-sum  adder  of  exercise  41. 

45.  [ HM20 ] Why  doesn’t  the  proof  of  Theorem  S simply  argue  that  the  number  of 
ways  to  choose  j(i)  and  k(i)  so  that  1 < j(i),k(i)  < i is  n2(n+l)2. . . (n+r-1)2? 

► 46.  [ HM21 ] Let  a(n)  = c(n,  [2  /n])/22  be  the  fraction  of  n-variable  Boolean  func- 
tions f(x  1, . . . ,xn)  for  which  C(f)  < 2 n/n.  Prove  that  a(n)  ->  0 rapidly  as  n ->  00. 

47.  [M23]  Extend  Theorem  S to  functions  with  n inputs  and  m outputs. 

48.  [ HM23 ] Find  the  smallest  integer  r = r(n)  such  that  (r-1)!  22"<  22r+1(n+r-l)2r, 
(a)  exactly  when  1 < n < 16;  (b)  asymptotically  when  n — > 00. 
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49.  [HM25]  Prove  that,  as  n ->  oo,  almost  all  Boolean  functions  f(x\, . . . ,xn)  have 
minimum  formula  length  L(f)  > 2n/\gn  - 2n+2/(lg  n)2. 

50.  [24]  What  are  the  prime  implicants  and  prime  clauses  of  the  prime-number  func- 
tion (35)?  Express  that  function  in  (a)  DNF  (b)  CNF  of  minimum  length. 

51.  [20]  What,  representation  of  the  prime-number  detector  replaces  (37),  if  rows  of 
the  truth  table  are  based  on  x3xe  instead  of  11X2? 

52.  [23]  What  choices  of  k and  l minimize  the  upper  bound  (38)  when  5 < n < 16? 

53.  [HM22]  Estimate  (38)  when  k = [21gnJ  and  l = [2 k/(n  - 31gn)}  and  n -/  00. 

54.  [29]  Find  a short  Boolean  chain  to  evaluate  all  six  of  the  functions  f } ( x ) = 
[x! XiXaXiSAj],  where  A4  = {0010,0101,1011},  A2  = {0001,1111},  A3  = {0011,0111, 
1101},  A4  = {1001,1111},  A5  = {1101},  Aq  = {0101,1011}.  (These  six  functions 
appear  in  the  prime-number  detector  (37).)  Compare  your  chain  to  the  minterm-first 
evaluation  scheme  of  Lupanov’s  general  method. 

55.  [34]  Show  that  the  cost  of  the  6-bit  prime-detecting  function  is  at  most  14. 

► 56.  [16]  Explain  why  all  functions  with  14  or  more  don’t-cares  in  Table  3 have  cost  0. 
57.  [19]  What  seven-segment  “digits”  are  displayed  when  ( X!X2X3X4)2  > 9 in  (45)? 

► 58.  [30]  A 4 x 4-bit  S-box  is  a permutation  of  the  4-bit  vectors  {0000, 0001, . . . , 1111}; 
such  permutations  are  used  as  components  of  well-known  cryptographic  systems  such 
as  the  Russian  standard  GOST  28147  (1989).  Every  4x4-bit  S-box  corresponds  to 
a sequence  of  four  functions  fi(xi,  x2,  x3,  x4),  ...,  f4(xi,x2,X3,x4),  which  transform 
X4X2X3XA  (-4-  /1/2/3/4. 

Find  all  4x4-bit  S-boxes  for  which  C(/i)  = C{f2)  = C(f3)  = C(f4)  = 7. 

59.  [29]  One  of  the  S-boxes  satisfying  the  conditions  of  exercise  58  takes  (0, . . . , f ) 
(0,6,5,b,3,9,f,e,c,4,7,8,d,2,a,  1);  in  other  words,  the  truth  tables  of  (/1,  f2,  fs,  f4) 
are  respectively  (179a,  63e8,  5b26,  3e29).  Find  a Boolean  chain  that  evaluates  these 
four  “maximally  difficult”  functions  in  fewer  than  20  steps. 

60.  [23]  (Frank  Ruskey.)  Suppose  z = ( x + y ) mod  3,  where  x = (xix2)2,  y = (2/11/2)2, 
2 = (^122)2,  and  each  two-bit  value  is  required  to  be  either  00,  01,  or  10.  Compute  z\ 
and  z2  from  x 1 , x2,  yi , and  y2  in  six  Boolean  steps. 

61.  [34]  Continuing  exercise  60,  find  a good  way  to  compute  z = (x  + y)  mod  5,  using 
the  three-bit  values  000,  001,  010,  Oil,  100. 

62.  [HM23]  Consider  a random  Boolean  partial  function  of  n variables  that  has  2nc 
cares  and  2 d don  t-cares,”  where  c.  + d = 1.  Prove  that  the  cost  of  almost  all  such 

partial  functions  exceeds  2 "c/n. 

63.  [HM35]  (L.  A.  Sholomov,  1969.)  Continuing  exercise  62,  prove  that  all  such 
functions  have  cost  < 2nc/n(l  + 0(n^  log n)).  Hint:  There  is  a set  of  2m(l  + k) 
vectors  xi  . . . xk  that  intersects  every  ( k - m)-dimensional  subcube  of  the  lc-cube. 

64.  [25]  ( Magic  Fifteen.)  Two  players  alternately  select  digits  from  1 to  9,  using  no 
digit  twice;  the  winner,  if  any,  is  the  first  to  get  three  digits  that  sum  to  15.  What’s  a 
good  strategy  for  playing  this  game? 

► 65.  [35]  Modify  the  tic-tac-toe  strategy  of  (47)“(s6)  so  that  it  always  plays  correctly. 
66.  [20]  Criticize  the  moves  chosen  in  exercise  65.  Are  they  always  optimum? 
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► 67.  [40]  Instead  of  simply  ending  one  correct  move  for  each  position  in  tic-tac-toe 
we  might,  prefer  to  find  them  all.  In  other  words,  given  Xl  . . . x9ox  . . . og,  we  could  try 
to  compute  nine  outputs  gx...g9,  where  g3  = 1 if  and  only  if  a move  into  cell  j is  legal 
and  mimrmires  X’s  worst-case  outcome.  For  example,  exclamation  marks  indicate  all 
ot  the  right  moves  for  X in  the  following  typical  positions: 


* ft;  ft; 


ft; 


A machine  that  chooses  randomly  among  these  possibilities  is  more  fun  to  play  against 
than  a machine  that  has  only  one  fixed  strategy. 

One  attractive  way  to  solve  the  all-good-moves  problem  is  to  use  the  fact  that 
tic-tac-toe  has  eight  symmetries.  Imagine  a chip  that  has  18  inputs  an  . ..x90l  o9 
and  three  outputs  (, c,s,m ),  for  “corner,”  “side,”  and  “middle,”  with  the  property 
that  the  desired  functions  9j  can  be  computed  by  hooking  together  eight  of  the  chips 
appropriately: 


9 1 C{^'1'^‘2'J'3X4X5XqX7XsXq0i020s04050q070s0q^ 

V C^XiX4X7X2X5XsXsXqXqOi  O4O7O2O5O8 O3 Oq O9  ) , 

92  S{XlX2X3X4X5XeX7XsXg0i020304050Q07080g) 

V s (X3X2X1XQX5X4 £9^8^7030201060504090807), 

93  ^(*^3^2^1^6^5^4^9^8^7030201060504090807) 

V c(x3XQXgX2XsX8X\X4X7030Q0g0205080\0407), 

94  s(xlx4X7X2X5X8X3XQXg0i0407020308030e0g) 

V s(x7X4XiX8X5X2x9XqX307040i080502090q03^  , 

99  C(X9x8x7x6x5x4x3x2xl0g08070e050403020i) 

V c{^XgXQX3X8x5x2x7x4xlOg Oq O3O8 O5 O2O7O4O1 ) , 

and  g5  is  the  OR  of  the  m outputs  from  all  eight  chips. 

Design  the  logic  for  such  a chip,  using  fewer  than  2000  gates. 

68.  [M25]  Consider  the  n-bit  tt  function  7rn(xx  . . . xn),  whose  value  is  the  (an  . . . xn)2th 
bit  to  the  right  of  the  most  significant  bit  in  the  binary  representation  of  7 r.  Does  the 
method  of  exercise  4.3.1-39,  which  describes  an  efficient  way  to  compute  arbitrary  bits 
of  7 r,  prove  that  C(nn)  < 2 n/n  for  sufficiently  large  n? 

69.  [M24]  Let  the  multilinear  representation  of  / be 


Qooo  © aooia:m 


oioioXj  © Qona:ja:m  © Qiooaii  © oiioiXiXm 


© anio XiXj  © ama:jXja:m, 


where  each  coefficient  ai  is  a 

a)  Prove  that  the  pairs  (58) 


function  of  the  variables  {xu  . . . , xn}  \ {Xi,  Xj , Xrn}. 
are  ‘good  if  and  only  if  the  coefficients  satisfy 


aoioanoi  — Q-oiiQioo,  aioiauo  = aiooam,  and  an0a0ii  = Qmaoio. 

b)  For  which  values  are  the  pairs  bad,  when  / = (det  X)  mod  2?  (See  (60).) 

► 70.  [M27]  Let  X be  the  3 x 3 Boolean  matrix  (60).  Find  efficient  chains  for  the 
Boolean  functions  (a)  (det  X)  mod  2;  (b)  [perX>0];  (c)  [detX>0]. 

► 71.  [. M26 ] Suppose  f{x)  is  equal  to  0 with  probability  p at  each  point  x = Xl...Xn 
independent  of  its  value  at  other  points. 

a)  What  is  the  probability  that  the  pairs  (58)  are  good? 

b)  What  is  the  probability  that  bad  pairs  (58)  exist? 

c)  What  is  the  probability  that  bad  pairs  (58)  are  found  in  at  most  t random  trials? 

d)  What  is  the  expected  time  to  test  case  (i,j,  m),  as  a function  of  p,  t,  and  n? 
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72.  [M24\  Extend  the  previous  exercise  to  the  case  of  partial  functions,  where  f(x)  = 
0 with  probability  p,  f(x ) = 1 with  probability  q,  and  f(x)  = * with  probability  r. 

► 73.  [20]  If  bad  pairs  (58)  exist  for  all  ( i,j,m ) with  m / i / j ^ m,  show  that  the 
indecomposability  of  / can  be  deduced  after  testing  only  (!()  well-chosen  triples  ( i , j , m) . 

74.  [25]  Extend  the  idea  in  the  previous  exercise,  suggesting  a strategy  for  choosing 
successive  triples  ( i,j,m ) when  using  the  method  of  Shen,  McKellar,  and  Weiner. 

75.  [20]  What  happens  when  the  text’s  decomposition  procedure  is  applied  to  the 
“all-equal”  function  So,n(a;i, . . . ,xn)7 

► 76.  [M25]  (D.  Uhlig,  1974.)  The  purpose  of  this  exercise  is  to  prove  the  amazing  fact 
that,  for  certain  functions  /,  the  best  chain  to  evaluate  the  Boolean  function 

F(ui,...,Un,Vi,...,Vn)  = /(«!,...  ,Un)  V f(vi,  ..  .,V„) 

costs  less  than  2 C(/);  hence  functional  decomposition  is  not  always  a good  idea. 

We  let  n = m + 2m  and  write  /(n, . . . , im,  x0, . . . , x2m-i)  = fi(x),  where  i is 
regarded  as  the  number  (ix  . ..im)2.  Then  (ui,.. . ,u„)  = (*i, . . . , im,  x0, . . .,x2™_ 1), 
(vi,  ...,vn)  = (j  1, . . .,jm,yo,  ■ ■ ■ ,2/2m_r),  and  F{u,  v)  = fi{x)  V fj(y). 

a)  Prove  that  a chain  of  cost  0(n/logn)2  suffices  to  evaluate  the  2m  + 1 functions 

zi  = x®  ({[l<i]  ® [i  <j])  A (x®y)),  0 < l < 2m, 

from  given  vectors  i,  j,  x,  and  y\  each  zi  is  a vector  of  length  2m. 

b)  Let  gi(x)  = fi(x)  ® fi-i(x)  for  0 < i < 2m,  where  f-i(x)  = f2™{x)  = 0.  Estimate 
the  cost  of  computing  the  2m  + 1 values  cj  = gi(zi),  given  the  vectors  zi,  for 
0 < l < 2m. 

c)  Let  c'j  = ci  A ([?'  < j]  = [l  < i])  and  c'/  = c;  A ([i  < j]  = [j  > /]).  Prove  that 

fi(x)  = Co  ® ci  © • • • ® c2m,  fj{y)  = Co  ffic'/  © • • • ©ci'm. 

d)  Conclude  that  C(F ) < 2n/n  + 0(2"(log n)/n2).  (When  n is  sufficiently  large,  this 
cost  is  definitely  less  than  2 n+1/n,  but  functions  / exist  with  C(f)  > 2 n/n.) 

e)  For  clarity,  write  out  the  chain  for  F when  m = 1 and  f(i,x 0,2:1)  = (i  A xo)  V xi. 

► 77.  [35]  (N.  P.  Red’kin,  1970.)  Suppose  a Boolean  chain  uses  only  the  operations 
AND,  OR,  or  NOT;  thus,  every  step  is  either  Xi  = ajj-jq  A xk (q  or  Xi  = Xj(q  V xk (,) 
or  x,  = Xj(i).  Prove  that  if  such  a chain  computes  either  the  “odd  parity”  function 
fn(x j, . . . , x„)  = x\  © • • ■ © xn  or  the  “even  parity”  function  fn(x  1, . . . ,xn)  = 1 © xi  © 
• ■ ■ © xn,  where  n > 2,  the  length  of  the  chain  is  at  least  4(n  — 1). 

78.  [26]  (W.  J.  Paul,  1977.)  Let  f(xi,...,  xm,  yo, ...,  y2m-i ) be  any  Boolean  function 
that  equals  yk  whenever  (n  . . . xm)2  = k G S,  for  some  given  set  S C (0, 1, . . . , 2m  - 1}; 
we  don’t  care  about  the  value  of  / at  other  points.  Show  that  C(f)  > 2||5||  — 2 whenever 
S is  nonempty.  (In  particular,  when  S = {0, 1, . . . , 2m  - 1},  the  multiplexer  chain  of 
exercise  39  is  asymptotically  optimum.) 

79.  [32]  (C.  P.  Schnorr,  1976.)  Say  that  variables  u and  v are  “mates”  in  a Boolean 
chain  if  there  is  exactly  one  simple  path  between  them  in  the  corresponding  binary  tree 
diagram.  Two  variables  can  be  mates  only  if  they  are  each  used  only  once  in  the  chain; 
but  this  necessary  condition  is  not  sufficient.  For  example,  variables  2 and  4 are  mates 
in  the  chain  for  Sii2,3  in  Fig.  9,  but  they  are  not  mates  in  the  chain  for  S2. 

a)  Prove  that  a Boolean  chain  on  n variables  with  no  mates  has  cost  >2n  — 2. 

b)  Prove  that  C(f)  = 2n  — 3 when  / is  the  all-equal  function  So,n(xi, . . . , x„). 
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► 80.  [M2 7]  (L.  J.  Stockmeyer,  1977.)  Another  notation  for  symmetric  functions  is 
sometimes  convenient:  If  a = a0a1...an  is  any  binary  string,  let  Sa(x)  = a„x.  For 
examp  e,  (xix2x3)  Soon  and  xi  © x2  ® x3  = Soioi  in  this  notation.  Notice  that 


Sa(0,x2, . . . ,x„)  - Sa/(x2, . . . ,xn)  and  Sa(  l,x2,...,xn)  = S>a(  x2,...,x 
and  a stand  respectively  for  a with  its  last  or  first  element  deleted.  Also 


x„),  where  a' 


S'a'  {x3 , . . . , Xn  ) 


Sa(f(x3,  . . . ,Xn),  f(x 3,  . . . ,Xn),X3,  . . . ,Xn ) 

when  / is  any  Boolean  function  of  n - 2 variables. 

a)  A parity  function  has  a0  / fll  * a2  * • • • * a„.  Assume  that  n > 2.  Prove  that  if 

is  not  a parity  function  and  S/Q,  isn’t  constant,  then 

c(Sa)  > max(C(Sa,)+2,  C(S,a)+2,  mm(C(Sa,)+3,  C(S,a)  + 3,  C(5w)+5)). 

b)  What  lower  bounds  on  C(Sk)  and  C(S>k)  follow  from  this  result,  when  0 < k < n? 

81.  [23]  (M.  Snir,  1986.)  Show  that  any  chain  of  cost  c and  depth  d for  the  prefix 
problem  of  exercise  36  has  c + d > 2n  - 2.  P 

► 82.  [M2 3]  Explain  the  logical  sentences  (62)-(7o).  Which  of  them  are  true? 

83.  [21]  If  there’s  a Boolean  chain  for  f(xu.  ..,*„)  that  contains  p canalizing  oper- 
ations, show  that  C(f)  < (p  + l)(n  + p/2).  8 P 

84.  [M20]  A monotone  Boolean  chain  is  a Boolean  chain  in  which  every  operator  o, 
is  monotone.  The  length  of  a shortest  monotone  chain  for  / is  denoted  by  C+(f)  If 
there  s a monotone  Boolean  chain  for  f(xu  ...,*„)  that  contains  p occurrences  of  A 
and  g occurrences  of  V,  show  that  C+(f)  < min((p  + 1)(„  + p/2),  {q  + 1){n  + q/2))_ 

► 85.  [M28]  Let  Mn  be  the  set  of  all  monotone  functions  of  n variables.  If  L is  a family 

of  functions  contained  in  Mn,  let  y 


xUy=  /\{z  eL\zDxVy} 


and 


xny  = \J{z  e L \ z C x /\y}. 


We  call  L “legitimate”  if  it  includes  the  constant  functions  0 and  1 as  well  as  the 
projec  ion  functions  X]  for  ! < j < n,  and  if  z U y € L,  x n y 6 L whenever  x,y  e L 

a)  When  n 3 we  can  write  M3  = (00,  01,  03,  05,  11,  07,  13,  15,  Of,  33,  55.  17,  If, 

5f’  7I;67rf’  ff>’  rePresenting  each  function  by  its  hexadecimal  truth 
able.  There  are  2 families  L such  that  {00,  Of,  33, 55,  ff}CLCM3;  how  many 
ot  them  are  legitimate? 

b)  If  A is  a subset  of  {1,  n},  let  \A]  = \J aeAxa-  also  let  fool  = 1.  Suppose  A 

a family  of  subsets  of  {1, ....  n)  that  contains  all  sets  of  size  < 1 and  is  closed 
under  intersection;  in  other  words,  ArBe  A whenever  A e A and  B € A.  Prove 
that  the  family  L = {rAl|AeAUoo}is  legitimate. 

c)  Let  (xn+1,...,Xn+r)  be  a monotone  Boolean  chain  (i).  Suppose  (x„^  r , 1 
is  obtained  from  the  same  Boolean  chain,  but  with  every  operator  A chknged  to  n 
and  with  every  operator  V changed  to  U,  with  respect  to  some  legitimate  family  L. 
Prove  that,  for  n + 1 < / < n + r,  we  must  have 


£l  c X,V  \J  {Xi  (B  (xHi)  v xk(i))  K = v}; 

i=n+l 

l 

Xt  c x,v  V {£i  © A £fc(j))  | Oj  = a}. 
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86.  [HM3 7]  A graph  G on  vertices  {l,...,n}  can  be  defined  by  IV  = (")  Boolean 
variables  xuv  for  1 < u < v < n,  where  xuv  = [ u — v in  G].  Let  / be  the  function 
f(x)  = [G  contains  a triangle];  for  example,  when  n = 4,  f{x12,  *13,  xi4,  x23,  x24,  x34)  = 
(xi2  A X13  A X23 ) V (X12  A X14  A 2:24)  V (2:13  A X14  A X34)  V (*23  A *24  A X34).  The  purpose 
of  this  exercise  is  to  prove  that  the  monotone  complexity  C+(f)  is  Q(n/logn)3. 

a)  If  Uj  — Vj  for  1 < j < r in  a graph  G,  call  S = {{rti,i'i},  • • • , {ur,Wr}}  an  r- 

family , and  let  A (S)  = Ui<i<j<r({uii  , Vj})  be  the  elements  of  its  pairwise 

intersections.  Say  that  G is  r -closed  if  we  have  u — v whenever  A (5)  C {u,v}  for 
some  r-family  5.  It  is  strongly  r-closed  if,  in  addition,  we  have  |A(S)|  > 2 for  all 
r-families  S.  Prove  that  a strongly  r-closed  graph  is  also  strongly  (r  + l)-closed. 

b)  Prove  that  the  complete  bigraph  Krn.n  is  strongly  r-closed  when  r > max(m,n). 

c)  Prove  that  a strongly  r-closed  graph  has  at  most  (r  — l)2  edges. 

d)  Let  L be  the  family  of  functions  {1}  U {[G]  | G is  a strongly  r-closed  graph  on 
{1, . . . , n}}.  (See  exercise  85(b);  we  regard  G as  a set  of  edges.  For  example,  when 
the  edges  are  1 — 3,  1 — 4,  2 — 3,  2 — 4,  we  have  |"G]  = Xi3  Vi14Vi23Vi24.) 
Is  L legitimate? 

e)  Let  ajjv+i , . . . , XN+p+q  = f be  a monotone  Boolean  chain  with  p A-steps  and  q 
V-steps,  and  consider  the  modified  chain  xjv+i,  . ..,  xN+p+q  = f based  on  the 
family  L in  (d).  If  / ^ 1,  show  that  2 (r  - l)3p+  (r  - l)2(n  - 2)  > (").  Hint:  Use 
the  second  formula  in  exercise  85(c). 

f)  Furthermore,  if  / = 1 we  must  have  r2q  > 2r_1. 

g)  Therefore  p = Q(n/logn)3.  Hint:  Let  r ss  61gn  and  apply  exercise  84. 

87.  [M20]  Show  that  when  nonmonotonic  operations  are  permitted,  the  triangle  func- 
tion of  exercise  86  has  cost  C(f)  = 0(nlg7(logn)2)  = 0{n2  81).  Hint:  A graph  has  a 
triangle  if  and  only  if  the  cube  of  its  adjacency  matrix  has  a nonzero  diagonal. 

88.  [40]  A median  chain  is  analogous  to  a Boolean  chain,  but  it  uses  median-of-three 
steps  Xi  = {xj(i')Xk(i')Xi(i'))  for  n + 1 < i < n + r,  instead  of  the  binary  operations  in  (1). 

Study  the  optimum  length,  depth,  and  cost  of  median  chains,  for  all  self-dual  mono- 
tone Boolean  functions  of  7 variables.  What  is  the  shortest  chain  for  (xiX2X3X4XSX6XT)r! 

Lady  Caroline.  Psha!  that’s  such  a hack! 

Sir  Simon.  A hack,  Lady  Caroline,  that 

the  knowing  ones  have  warranted  sound. 

— GEORGE  COLMAN,  John  Bull,  Act  3,  Scene  1 (1803) 

7.1.3.  Bitwise  Tricks  and  Techniques 

Now  comes  the  fun  part:  We  get  to  use  Boolean  operations  in  our  programs. 

People  are  more  familiar  with  arithmetic  operations  like  addition,  subtrac- 
tion, and  multiplication  than  they  are  with  bitwise  operations  such  as  “and,” 
“exclusive-or,  ’ and  so  on,  because  arithmetic  has  a very  long  history.  But  we  will 
see  that  Boolean  operations  on  binary  numbers  deserve  to  be  much  better  known. 
Indeed,  they’re  an  important  component  of  every  good  programmer’s  toolkit. 

Early  machine  designers  provided  fullword  bitwise  operations  in  their  com- 
puters primarily  because  such  instructions  could  be  included  in  a machine’s 
repertoire  almost  for  free.  Binary  logic  seemed  to  be  potentially  useful,  although 


134  COMBINATORIAL  SEARCHING 


7.1.3 


only  a few  applications  were  originally  foreseen.  For  example,  the  EDSAC  com- 
puter, completed  in  1949,  included  a “collate”  command  that  essentially  per- 
formed the  operation  z <—  z + (x  & y),  where  z was  the  accumulator,  x was 
the  multiplier  register,  and  y was  a specified  word  in  memory;  it  was  used  for 
unpacking  data.  The  Manchester  Mark  I computer,  built  at  about  the  same 
time,  included  not  only  bitwise  AND,  but  also  OR  and  XOR.  When  Alan  Turing 
wrote  the  first  programming  manual  for  the  Mark  I in  1950,  he  remarked  that 
bitwise  NOT  can  be  obtained  by  using  XOR  (denoted  in  combination  with  a 
row  of  Is.  R.  A.  Brooker,  who  extended  Turing’s  manual  in  1952  when  the  Mark 
II  computer  was  being  designed,  remarked  further  that  OR  could  be  used  “to 
round  off  a number  by  forcing  1 into  its  least  significant  digit  position.”  By  this 
time  the  Mark  II,  which  was  to  become  the  prototype  of  the  Ferranti  Mercury, 
had  also  acquired  new  instructions  for  sideways  addition  and  for  the  position  of 
the  most  significant  1. 

Keith  Tocher  published  an  unusual  application  of  AND  and  OR  in  1954, 
which  has  subsequently  been  reinvented  frequently  (see  exercise  85).  And  dur- 
ing the  ensuing  decades,  programmers  have  gradually  discovered  that  bitwise 
operations  can  be  amazingly  useful.  Many  of  these  tricks  have  remained  part  of 
the  folklore;  the  time  is  now  ripe  to  take  advantage  of  what  has  been  learned. 

A trick  is  a clever  idea  that  can  be  used  once,  while  a technique  is  a trick 

that  can  be  used  at  least  twice.  We  will  see  in  this  section  that  tricks  tend  to 
evolve  naturally  into  techniques. 

Enriched  arithmetic.  Let’s  begin  by  officially  defining  bitwise  operations  on 
integers  so  that,  if  x = ( . . . x2x\x0)2 , y = ( . . . 1/22/12/0)2,  and  z = ( . . . ,222120)2 
in  binary  notation,  we  have 

xk,y  = z 4=k  xk  A yk  — Zk,  for  all  k > 0;  (1) 

x I V — z xk^Uk  = Zk,  for  all  k > 0;  (2) 

x ®y  = z 4=1-  xk  © yk  = zk,  for  all  k > 0.  (3) 

(It  would  be  tempting  to  write  ‘ccA y'  instead  of  xhy,  and  lx\Zy'  instead  of  x\y;  but 
when  we  study  optimization  problems  we  ll  find  it  better  to  reserve  the  notations 
a:  A y and  z V y for  min(a;,i/)  and  ma x(x,y),  respectively.)  Thus,  for  example, 

5 & 11  = 1,  5 | 11  = 15,  and  5 0 11  = 14, 

since  5 = (0101)2,  11  = (1011)2,  1 = (0001)2,  15  = (1111)2,  and  14  = (1110)2. 
Negative  integers  are  to  be  thought  of  in  this  connection  as  infinite-precision 
numbers  in  two’s  complement  notation,  having  infinitely  many  Is  at  the  left;  for 
example,  — 5 is  ( . . . 1111011)2.  Such  infinite-precision  numbers  are  a special  case 
of  2-adic  integers , which  are  discussed  in  exercise  4.1-31,  and  in  fact  the  operators 
&,  I,  © make  perfect  sense  when  they  are  applied  to  arbitrary  2-adic  numbers. 

Mathematicians  have  never  paid  much  attention  to  the  properties  of  & and  | 
as  operations  on  integers.  But  the  third  operation,  ©,  has  a venerable  history, 
because  it  describes  a winning  strategy  in  the  game  of  nim  (see  exercises  8-16)! 
For  this  reason  x © y has  often  been  called  the  “nim  sum”  of  the  integers  x and  y. 
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All  three  of  the  basic  bitwise  operations  turn  out  to  have  many  useful 
properties.  For  example,  every  relation  between  A,  V,  and  © that  we  studied  in 
Section  7.1.1  is  automatically  inherited  by  k,  |,  and  © on  integers,  since  the  rela- 
tion holds  in  every  bit  position.  We  might  as  well  recap  the  main  identities  here: 


xky  = ykx,  x \ y — y \ x,  x ® y = y ® x;  (4) 

(xky)kz  = xk{ykz),  (x\y)\z  = x\(y\z),  (x®y)®z  = x®(y®z);  (5) 

{x\y)kz  = (xkz)\  {ykz),  (x  k y)  \ z = (x  \ z)  k (y  \ z);  (6) 

(x  © y)  k z = (x  k z)  © (y  k z);  (7) 

(xky)  \x  = x,  {x\y)kx  = x-  (8) 

(xky)®{x\y)  = x®y,  (9) 

x&0  = 0,  x | 0 = x,  x®0  = x;  (10) 

xkx  = x,  x \ x — x,  x ® x = 0;  (11) 

xk-l  = x,  x | -1  = — 1,  x®-l  — x;  (12) 

xkx  — 0,  x\x  = -l,  x®x  — -l\  (13) 

xky  = x\y,  x\y  = xkij , x®y  = x®y  = x®y.  (14) 


The  notation  x in  (12),  (13),  and  (14)  stands  for  bitwise  complementation  of  x, 
namely  ( . . . x2xix0)2,  also  written  ~x.  Notice  that  (12)  and  (13)  aren’t  quite 
the  same  as  7.1.1-(io)  and  7.1.1-(i8);  we  must  now  use  — 1 = ( . . . 1111)2  instead 
of  1 = ( . . . 0001)2  in  order  to  make  the  formulas  bitwise  correct. 

We  say  that  x is  contained  in  y,  written  x C y or  y D x,  if  the  individual 
bits  of  x and  y satisfy  xy.  < y j.  for  all  k > 0.  Thus 

x C y xky-  x x | y = y x k y = 0.  (15) 

Of  course  we  needn’t  use  bitwise  operations  only  in  connection  with  each 
other:  we  can  combine  them  with  all  the  ordinary  operations  of  arithmetic.  For 
example,  from  the  relation  x + x = ( . . . 1 1 1 1)2  = — 1 we  can  deduce  the  formula 

x — x + 1,  (16) 

which  turns  out  to  be  extremely  important.  Replacing  x by  x — 1 gives  also 

-x  = x - 1;  (17) 

and  in  general  we  can  reduce  subtraction  to  complementation  and  addition: 

x - y = x + y.  (18) 

We  often  want  to  shift  binary  numbers  to  the  left  or  right.  These  operations 
are  equivalent  to  multiplication  and  division  by  powers  of  2,  with  appropriate 
rounding,  but  it  is  convenient  to  have  special  notations  for  them: 

x <C  k = 2;  shifted  left  k bits  = [2fcxJ;  (19) 

x » k — x shifted  right  A bits  = \_2~kx\.  (20) 

Here  A can  be  any  integer,  possibly  negative.  I11  particular  we  have 

x<^(—k)  — x^>k  and  i>(-A)=r<A,  (21) 
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for  every  infinite-precision  number  x.  Also  (x  ky)  <C  k — (x  < k)  & (y  < k),  etc. 

When  bitwise  operations  are  combined  with  addition,  subtraction,  multi- 
plication, and/or  shifting,  extremely  intricate  results  can  arise,  even  when  the 
formulas  are  quite  short.  A taste  of  the  possibilities  can  be  seen,  for  example, 
in  Fig.  11.  Furthermore,  such  formulas  do  not  merely  produce  purposeless, 
chaotic  behavior:  A famous  chain  of  operations  known  as  “Gosper’s  hack,”  first 
published  in  1972,  opened  people’s  eyes  to  the  fact  that  a large  number  of  useful 
and  nontrivial  functions  can  be  computed  rapidly  (see  exercise  20).  Our  goal  in 
this  section  is  to  explore  how  such  efficient  constructions  might  be  discovered. 


Fig.  11.  A small  portion  of 
the  patchwork  quilt  defined  by 
the  bitwise  function  f(x,y)  = 
((i®j)&((i-  350)  »3))2; 
the  square  cell  in  row  x and 
column  y is  painted  white  or 
black  according  as  the  value  of 
((/(».  J/)  » 12)  & 1)  is  0 or  1. 
(Design  by  D.  Sleator,  1976; 
see  also  exercise  18.) 


Packing  and  unpacking.  We  studied  algorithms  for  multiple-precision  arith- 
metic in  Section  4.3.1,  dealing  with  situations  where  integers  are  too  large  to  fit  in 
a single  word  of  memory  or  a single  computer  register.  But  the  opposite  situation, 
when  integers  are  significantly  smaller  than  the  capacity  of  one  computer  word,  is 
actually  much  more  common;  D.  H.  Lehmer  called  this  “fractional  precision.”  We 
can  often  deal  with  several  integers  at  once,  by  packing  them  into  a single  word. 

For  example,  a date  x that  consists  of  a year  number  y , a month  number  m, 
and  a day  number  d,  can  be  represented  by  using  4 bits  for  m and  5 bits  for  d: 

x = i({y  < 4)  + to)  < 5)  + d.  (22) 

We’ll  see  below  that  many  operations  can  be  performed  directly  on  dates  in  this 
packed  form.  For  example,  x < x'  when  date  x precedes  date  x' . But  if  necessary 
the  individual  components  ( y,m,d ) can  readily  be  unpacked  when  x is  given: 

d = x mod  32,  m = (x  » 5)  mod  16,  y = x > 9.  (23) 

And  these  “mod”  operations  do  not  require  division,  because  of  the  important 
law 

x mod  2"  = x & (2n  — 1)  (24) 

for  any  integer  n > 0.  We  have,  for  instance,  d = x & 31  in  (22)  and  (23). 

Such  packing  of  data  obviously  saves  space  in  memory,  and  it  also  saves  time: 
We  can  more  quickly  move  or  copy  items  of  data  from  one  place  to  another  when 
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they’ve  been  packed  together.  Moreover,  computers  run  considerably  faster  when 
they  operate  on  numbers  that  fit  into  a cache  memory  of  limited  size. 

The  ultimate  packing  density  is  achieved  when  we  have  1-bit  items,  because 
we  can  then  cram  64  of  them  into  a single  64-bit  word.  Suppose,  for  example, 
that  we  want  a table  of  all  odd  prime  numbers  less  than  1024,  so  that  we  can 
easily  decide  the  primality  of  a small  integer.  No  problem;  only  eight  64-bit 
numbers  are  required: 

P0  = 0111011011010011001011010010011001011001010010001011011010000001, 
Pi  = 0100110000110010010100100110000110110000010000010110100110000100, 
P2  = 1001001100101100001000000101101000000100100001101001000100100101, 
P3  = 0010001010001000011000011001010010001011010000010001010001010010, 
P4  = 0000110000000010010000100100110010000100100110010010110000010000, 
P5  = 1101001001100000101001000100001000100001000100100101000100101000, 
P6  = 1010000001000010000011000011011000010000001011010000001011010000, 
P7  = 0000010100010000100010100100100000010100100100010010000010100110. 
To  test  whether  2k  + 1 is  prime,  for  0 < k < 512,  we  simply  compute 

P[k/ 64J  < (k  & 63)  (25) 

in  a 64-bit  register,  and  see  if  the  leftmost  bit  is  1.  For  example,  the  following 
MMIX  instructions  will  do  the  job,  if  register  phase  holds  the  address  of  P0: 


SRU 

$0,k,3 

$0  <-  [fc/8j  (i.e.,  3). 

LD0U 

$1 .phase , $0 

$1  P|_®o/8j  (be.,  P[fc/64j  )■ 

AND 

$0,k,#3f 

$0  k mod  64  (i.e.,  k & #3f ). 

SLU 

$1,$1,$0 

$1  <-  ($l<$0)mod  264. 

BN 

$1, PRIME 

Branch  to  PRIME  if  s($l)  < 0.  | 

Notice  that  the  leftmost  bit  of  a register  is  1 if  and  only  if  the  register  contents 
are  negative. 

We  could  equally  well  pack  the  bits  from  right  to  left  in  each  word: 

Q0  — 1000000101101101000100101001101001100100101101001100101101101110, 

<2i  =0010000110010110100000100000110110000110010010100100110000110010, 

q2  = 1010010010001001011000010010000001011010000001000011010011001001, 
q3  =0100101000101000100000101101000100101001100001100001000101000100, 

<2  4 — 0000100000110100100110010010000100110010010000100100000000110000, 
<25  = 0001010010001010010010001000010001000010001001010000011001001011, 

<2e  =0000101101000000101101000000100001101100001100000100001000000101, 

Qr  = 0110010100000100100010010010100000010010010100010000100010100000; 
here  Qj  — Pj1.  Instead  of  shifting  left  as  in  (25),  we  now  shift  right, 

<2|fc/64j  > (fc&63),  (27) 

and  look  at  the  rightmost  bit  of  the  result.  The  last  two  lines  of  (26)  become 

SRU  $1,$1,$0  $1<-$1»$0. 

BOD  $1 , PRIME  Branch  to  PRIME  if  $1  is  odd.  | 

(And  of  course  we  use  qbase  instead  of  phase.)  Either  way,  the  classic  sieve  of 
Eratosthenes  will  readily  set  up  the  basic  table  entries  Pj  or  Qj  (see  exercise  24). 
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Table  1 

THE  BIG-ENDIAN  VIEW  OF  A 32-BYTE  MEMORY 


octa  0 

tetra  0 

tetra  4 

wyde  0 

wyde  2 

wyde  4 

wyde  6 

byteO  byte  1 ^ byte  2 ^ byte  3 '"""byte  4 " byte  5 byte  6 ' byte  7 ' 

• 07  “8  ■ • • a 15  a 16  . . . a 23  024  ■ ■ ■ a 31  <132  . . . 039  <140  . . . 047  04 g . . . 055  ' a56  . a63  ' 

octa  8 

11  ^ 


tetra  8 tetra  12 


wyde  8 

wyde  10 

wyde  12 

wyde  14 

byte  8 

byte  9 

byte 

10 

byte  11  byte  12 

byte  13  byte  14 

byte  15 

“64  ■ ■ • «71 

a 72  • 

■ • 079 

a.80  ■ • 

• «87 

«88 

. . . <195  <196  ■ 

octa  16 

• ■ <1103 

<1104  ■ 

■ • am  an2  . . . aug  ai2o  • 

N s 

■ ■ <1127 

tetra  16 

tetra  20 

wyde  16 

wyde  18 

wyde  20 

wyde  22 

byte  16 

byte  17 

byte  18 

byte  19  byte  20 

byte  21  byte  22 

byte  23 

ai28  ■ ■ ■ ai35  ai36  . 

■ . ai43  ai44  . . 

■ 0151  <U52 

. . . <1159  a 160 

. . . ai67  <1168  ■ 

■ • 0175  ai76  . . . ai83  ais4  ■ 

■ • «i9i 

octa  24 

tetra  24 

tetra  28 

wyde  24 

wyde  26 

wyde  28 

wyde  30 

byte  24 

byte  25 

byte  26 

byte  27  byte  28 

byte  29  byte  30 

byte  31 

a192  . . . a igg  0200  ■ ■ 

• a 207  «208  • ■ ■ 

«215  <1216 

■ • ■ (1223  <*224  ■ 

■ • <1231  (1232  ■ ■ 

• 0239  0240  • • ■ 0247  0248  ■ ■ 

s s 

■ 0255 

Big-endian  and  little-endian  conventions.  Whenever  we  pack  bits  or  bytes 
into  words,  we  must  decide  whether  to  place  them  from  left  to  right  or  from  right 
to  left.  The  left-to-right  convention  is  called  “big-endian,”  because  the  initial 
items  go  into  the  most  significant  positions;  thus  they  will  have  bigger  significance 
than  their  successors,  when  numbers  are  compared.  The  right-to-left  convention 
is  called  little-endian  ; it  puts  the  first  items  where  little  numbers  go. 

A big-endian  approach  seems  more  natural  in  many  cases,  because  we’re  ac- 
customed to  reading  and  writing  from  left  to  right.  But  a little-endian  placement 
has  advantages  too.  For  example,  let’s  consider  the  prime  number  problem  again; 
let  cik  = [2fc+l  is  prime].  Our  table  entries  {To,  Pi, ... , P7}  are  big-endian,  and 
we  can  regard  them  as  the  representation  of  a single  multiple-precision  integer 
that  is  512  bits  long: 

(P0Pi...Pr)2e4  = (doGt!  ...  0511)2-  (29) 

Similarly,  our  little-endian  table  entries  represent  the  multiprecise  integer 

{Q7  . . . QiQo)264  = (<1511  ...  0100)2-  (30) 

The  latter  integer  is  mathematically  nicer  than  the  former,  because  it  is 

511  511  oo 

^ ak  = X/  [2fc+l  is  prime]  = 2k[2k+l  is  prime]'!  mod  2512.  (31) 

fc=0  k=0  k= 0 


7.1.3 


BITWISE  TRICKS  AND  TECHNIQUES  139 

Table  2 

THE  LITTLE-ENDIAN  VIEW  OF  A 32-BYTE  MEMORY 
octa  24 

- A. 

tetra  28 tetra  24 

wyde  30 wyde  28  wyde  26  wyde  24 

byte  31  byte  30  byte  29  byte  28  byte  27  byte  26  byte  25  byte  24 

“255  • ■ ■ <*248  «247  • ■ ■ 0240  0239  • ■ • 0232  0231  ■ ■ • “224  0223  ■ • • 0216  0215  • • • 0208  0207  • ■ • O 200  “199  • • • “192 

octa  16 

tetra  20  tetra  16 

wyde  22 wyde  20  wyde  18  wyde  16 

byte  23  byte  22  byte  21  byte  20  byte  19  byte  18  byte  17  byte  16 
0191  • ■ • “184  0183  ■ • ■ 0176  “175  . . . 0168  “167  ■ • • 0 160  O159  . . . 0152  0151  . . . 0144  0143  . . . 0136  0135  . . . 0128 

octa  8 

tetra  12  tetra  8 

- ; s, , 

wyde  14  wyde  12  wyde  10  wyde  8 

* : — s, — w —*■ „ 

byte  15  byte  14  byte  13  byte  12  byte  11  byte  10  byte  9 byte  8 

ai27  ■ • • 0120  0119  . . . aii2  om  . . . aio4  0103  • • • age  age  . . . ass  agr  . . . ago  079  . . . 072  071  . . . 064 

octa  0 

tetra  4 tetra  0 

' ^ N /■  1 " ^ v 

wyde  6 wyde  4 wyde  2 wyde  0 

' ^ S /"  ■ ^ N /»  . ■ ^ v 

byte  7 byte  6 byte  5 byte  4 byte  3 byte  2 byte  1 byte  0 

“63  ■ ■ • 056  055  . . . 048  047  . . . 040  039  . . . 032  031  . . . 024  O23  • • ■ “16  015  ...  08  07  ...  00 

Notice,  however,  that  we  used  (Q7  . . . QiQo)264  to  get  this  simple  result,  not 
{Q0Q1  ■ ■ ■ Q 7)264-  The  other  number, 

[QoQl  ■ ■ ■ Ql )264  = (a63  • • • fllC*Oa127  • • • a65°64a191  • • ■ <2385°384a511  • • ■ a449a448)2 

is  in  fact  quite  weird,  and  it  has  no  really  nice  formula.  (See  exercise  25.) 

Endianness  has  important  consequences,  because  most  computers  allow  in- 
dividual bytes  of  the  memory  to  be  addressed  as  well  as  register-sized  units.  MMIX 
has  a big-endian  architecture;  therefore  if  register  x contains  the  64-bit  number 
#0123456789abcdef , and  if  we  use  the  commands  ‘STOU  x,0;  LDBU  y,l’  to 
store  x into  octabyte  location  0 and  read  back  the  byte  in  location  1,  the  result 
in  register  y will  be  *23.  On  machines  with  a little-endian  architecture,  the 
analogous  commands  would  set  y #cd  instead;  #23  would  be  byte  6. 

Tables  1 and  2 illustrate  the  competing  “world  views”  of  big-endian  and 
little-endian  aficionados.  The  big-endian  approach  is  basically  top-down,  with 
bit  0 and  byte  0 at  the  top  left;  the  little-endian  approach  is  basically  bottom- up, 
with  bit  0 and  byte  0 at  the  bottom  right.  Because  of  this  difference,  great  care 
is  necessary  when  transmitting  data  from  one  kind  of  computer  to  another,  or 
when  writing  programs  that  are  supposed  to  give  equivalent  results  in  both  cases. 
On  the  other  hand,  our  example  of  the  Q table  for  primes  shows  that  we  can 
perfectly  well  use  a little-endian  packing  convention  on  a big-endian  computer 
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like  MMIX,  or  vice  versa.  The  difference  is  noticeable  only  when  data  is  loaded 
and  stored  in  different-sized  chunks,  or  passed  between  machines. 

Working  with  the  rightmost  bits.  Big-endian  and  little-endian  approaches 
aren  t readily  interchangeable  in  general,  because  the  laws  of  arithmetic  send 
signals  leftward  from  the  bits  that  are  “least  significant.”  Some  of  the  most 
important  bitwise  manipulation  techniques  are  based  on  this  fact. 

If  x is  almost  any  nonzero  2-adic  integer,  we  can  write  its  bits  in  the  form 


x = (a  01a10b)2;  (32) 

in  other  words,  x consists  of  some  arbitrary  (but  infinite)  binary  string  a.  followed 
by  a 0,  which  is  followed  by  a + 1 ones,  and  followed  by  b zeros,  for  some  a > 0 
and  b > 0.  (The  exceptions  occur  when  x = - 2b ; then  a = 00.)  Consequently 


s = (aiOa01b)2,  (33) 

*-1  = (a01“016)2,  (34) 

-*  = (dlOa10b)2;  (35) 


and  we  see  that  x + 1 - -x  = x - 1,  in  agreement  with  (16)  and  (17).  With  two 
operations  we  can  therefore  compute  relatives  of  x in  several  useful  ways: 


x&  (x-1)  = ( a 01a006)2 
xk-x  = (0°°00a106)2 
x\-x=  (l°°lla10fc)2 
x©-x  = (1°°11o006)2 
x | (x-1)  = ( a 01all6)2 
X©  (x-1)  = (0°°00all6)2 
xk(x-l)  = (0°°00a016)2 


[remove  the  rightmost  1]; 

[extract  the  rightmost  1]; 

[smear  the  rightmost  1 to  the  left]; 

[remove  and  smear  it  to  the  left]; 

[smear  the  rightmost  1 to  the  right]; 
[extract  and  smear  it  to  the  right]; 

[extract,  remove,  and  smear  it  to  the  right]. 


(36) 

(37) 

(38) 

(39) 

(40) 

(41) 

(42) 


And  two  further  operations  produce  yet  another  variant: 


((x|(x  1))+1)  k x - ( a 00a00b)2  [remove  the  rightmost  run  of  Is],  (43) 

When  x = 0,  five  of  these  formulas  produce  0,  the  other  three  give  -1.  [For- 
mula (36)  is  due  to  Peter  Wegner,  CACM  3 (1960),  322;  and  (43)  is  due  to 
H.  Tim  Gladwin,  CACM  14  (1971),  407-408.  See  also  Henry  S.  Warren  Jr 
CACM  20  (1977),  439-441.] 

The  quantity  b in  these  formulas,  which  specifies  the  number  of  trailing  zeros 
ln  x>  is  called  the  ruler  function  of  x and  written  px,  because  it  is  related  to 

tlleleilgtllS.0fthe.tiCk  markS  that  ^ °ften  US6d  t0  indicate  fractions  of  an  inch: 

' ' ' ' " 1 In  general,  px  is  the  largest  integer  k such  that  2k  divides  x, 

when  i/0;  and  we  define  pO  = 00.  The  recurrence  relations 


p(2x+l)-0,  p(2x)  = p(x)  + 1 (44) 

also  serve  to  define  px  for  nonzero  x.  Another  handy  relation  is  worthy  of  note, 

P(x-y)  = p(x  © y).  (45) 
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The  elegant  formula  x & -x  in  (37)  allows  us  to  extract  the  rightmost  1 bit 
very  nicely,  but  we  often  want  to  identify  exactly  which  bit  it  is.  The  ruler 
function  can  be  computed  in  many  ways,  and  the  best  method  often  depends 
heavily  on  the  computer  that  is  being  used.  For  example,  a two-instruction 
sequence  due  to  J.  Dallos  does  the  job  quickly  and  easily  on  MMIX  (see  (42)): 

SUBUt,x,l;  SADD  rho,t,x.  (46) 

(See  exercise  30  for  the  case  x — 0.)  We  shall  discuss  here  two  approaches  that 
do  not  rely  on  exotic  commands  like  SADD;  and  later,  after  learning  a few  more 
techniques,  we’ll  consider  a third  way. 

The  first  general-purpose  method  makes  use  of  “magic  mask”  constants  p k 
that  prove  to  be  useful  in  many  other  applications,  namely 

Ho  = ( • • • 101010101010101010101010101010101)2  = -1/3, 

Hi  — (■■■  100110011001100110011001100110011)2  = -1/5,  (47) 

H2  = {-.- 100001111000011110000111100001111)2  = -1/17, 
and  so  on.  In  general  Hk  is  the  infinite  2-adic  fraction  -l/(22fe+  1),  because 
(22  + 1 )Hk  = (hfc  ^ 2fc)  + Hk  = ( • • • 11111)2  = — 1.  On  a computer  that  has 
2d-bit  registers  we  don’t  need  infinite  precision,  of  course,  so  we  use  the  truncated 
constants  d 

Hd,k  = (22  - l)/(22  +1)  for  0 < k < d.  (48) 

These  constants  are  familiar  from  our  study  of  Boolean  evaluation,  because  they 
are  the  truth  tables  of  the  projection  functions  Xd~k  (see,  for  example,  7.1.2— (7)). 
When  a:  is  a power  of  2,  we  can  use  these  masks  to  compute 

px  = [x  k Ho  = 0]  + 2[x  & pi  = 0]  + A[x  k p2  = 0]  + 8[x  & p3  = 0]  H , (49) 

because  [2J  & Hk  = 0]  = jk  when  j = ( . . . jzj2j\jo)2-  Thus,  on  a 2d-bit  computer, 
we  can  start  with  p <-  0 and  y 4-  x & -x;  then  set  p <-  p + 2k  if  y k Hd,k  = 0,  for 
0 < k < d.  This  procedure  gives  p = px  when  x / 0.  (It  also  gives  pO  = 2d  - 1, 
an  anomalous  value  that  may  need  to  be  corrected;  see  exercise  30.) 

For  example,  the  corresponding  MMIX  program  might  look  like  this: 


mO  GREG  #5555555555555555  ;ml  GREG  #3333333333333333 
m2  GREG  #0f  Of  Of  Of  Of  Of  Of  Of  ;m3  GREG  #00f  f OOf  fOOffOOff 
m4  GREG  #0000f fffOOOOffff  ;m5  GREG  #00000000ff fff fff 


NEGUy.x;  AND  y,x,y;  AND  q,y,m5;  ZSZ  rho,q,32; 
AND  q,y,m4;  ADD  t ,rho , 16 ; CSZrho,q,t; 


AND  q , y , m3 ; 
AND  q,y,m2; 
AND  q,y,ml; 
AND  q,y,m0; 


ADD  t , rho , 8 
ADD  t , rho , 4 
ADD  t , rho , 2 
ADD  t , rho , 1 


CSZ  rho,q,t; 
CSZ  rho,q,t; 
CSZ  rho,q,t; 
CSZ  rho,q,t; 


total  time  = 19u.  Or  we  could  replace  the  last  three  lines  by 


(50) 


SRU  y,y,rho;  LDB  t ,rhotab,y ; ADD  rho, rho, t (51) 

where  rhotab  points  to  the  beginning  of  an  appropriate  129-byte  table  (only 
eight  of  whose  entries  are  actually  used).  The  total  time  would  then  be  p+  13v. 
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The  second  general-purpose  approach  to  the  computation  of  px  is  quite 
different.  On  a 64-bit  machine  it  starts  as  before,  with  y <-  x & -x;  but  then  it 
simply  sets 

p <-  decode  [((a  • y)  mod  264)  58] , (52) 

where  a is  a suitable  multiplier  and  decode  is  a suitable  64-byte  table.  The 
constant  a — (0.6,3  • • ■ «i  0,0)2  must  have  the  property  that  its  64  substrings 

a63a62  . . . a58,  062^61  as7,  •••,  a5a4...a0,  a4a3a2aia00,  ao00000 

are  distinct.  Exercise  2.3.1  — 1 23  shows  that  many  such  “de  Bruijn  cycles”  exist; 
for  example,  we  can  use  M.  H.  Martin’s  constant  #03f  79d71b4ca8b09,  which 
is  discussed  in  exercise  3.2.2-17.  The  decoding  table  decode[ 0], . . . , decode  [63]  is 
then 

00, 01,  56, 02, 57, 49,  28, 03, 61,  58, 42,  50, 38, 29, 17, 04, 
62,47,59,36,45,43,51,22,53,39,33,30,24,18,12,05, 

63,  55, 48,  27, 60, 41,  37, 16, 46,  35, 44,  21,  52, 32, 23, 1 1, 

54,  26, 40, 15,  34,  20, 31, 10,  25, 14, 19,  09, 13, 08, 07, 06. 

[This  technique  was  devised  in  1967  by  Luther  Woodrum  of  IBM’s  Systems  De- 
velopment Division  (unpublished);  many  other  programmers  have  subsequently 
discovered  it  independently.] 

Working  with  the  leftmost  bits.  The  function  Ax  = [lgxj,  which  is  dual  to 
px  because  it  locates  the  leftmost  1 when  x > 0,  was  introduced  in  Eq.  4.6.3-(6). 
It  satisfies  the  recurrence 

A1  — 0;  A(2x)  = A(2x  + 1)  = A(x)  + 1 for  x > 0;  (54) 

and  it  is  undefined  when  x is  not  a positive  integer.  What  is  a good  way  to 
compute  it?  Once  again  MMIX  provides  a quick-but-tricky  solution: 

FL0TU  y,R0UND_D0WN,x;  SUB  y,y, tone;  SR  lam, y, 52  (55) 

where  tone  = #3ff 0000000000000  is  the  floating  point  representation  of  1.0. 
(Total  time  6w.)  This  code  floats  x,  then  extracts  the  exponent. 

But  if  floating  point  conversion  is  not  readily  available,  a binary  reduction 
strategy  works  fairly  well  on  a 2d-bit.  machine.  We  can  start  with  A <-  0 and 
V *;  then  we  set  A A + 2fc  and  y <-  y > 2fc  if  y > 2k  ^ 0,  for  k = d-  1, 

• • . , 1,  0 (or  until  k is  reduced  to  the  point  where  a short  table  can  be  used  to 
finish  up).  The  MMIX  code  analogous  to  (50)  and  (51)  is  now 

SRU  y,x,32;  ZSNZ  lam,y,32; 

ADD  t, lam, 16;  SRUy.x.t;  CSNZlam,y,t; 

ADD  t, lam, 8;  SRUy.x.t;  CSNZ  lam,y,t ; ^ 

SRU  y,x,lam;  LDB  t.lamtab.y;  ADD  lam, lam, t; 

and  the  total  time  is  p + llv.  In  this  case  table  lamtab  has  256  entries,  namely 
Ax  for  0 < x < 256.  Notice  that  the  “conditional  set”  (CS)  and  “zero  or  set” 
(ZS)  instructions  have  been  used  here  and  in  (50)  instead  of  branch  instructions. 
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There  appears  to  be  no  simple  way  to  extract  the  leftmost  1 bit  that  appears 
in  a register,  analogous  to  the  trick  by  which  we  extracted  the  rightmost  1 in  (37). 
For  this  purpose  we  could  compute  y <—  Ax  and  then  1<Ci/,  if  x ^ 0;  but  a binary 
“smearing  right”  method  is  somewhat  shorter  and  faster: 

Set  y <—  x,  then  y ^ y \ (j/>  2k ) for  0 < k < d. 

The  leftmost  1 bit  of  x is  then  y — (y  1).  ^ 

[These  non-floating-point  methods  have  been  suggested  by  H.  S.  Warren,  Jr.] 
Other  operations  at  the  left  of  a register,  like  removing  the  leftmost  run  of 
Is,  are  harder  yet;  see  exercise  39.  But  there  is  a remarkably  simple,  machine- 
independent  way  to  determine  whether  or  not  Ax  = A y,  given  unsigned  integers 
x and  (/,  in  spite  of  the  fact  that  we  can’t  compute  Ax  or  A y quickly: 

Ax  = A y if  and  only  if  x®y<xky.  (58) 

[See  exercise  40.  This  elegant  relation  was  discovered  by  W.  C.  Lynch  in  2006.] 
We  will  use  (58)  below,  to  devise  another  way  to  compute  Ax. 

Sideways  addition.  Binary  n-bit  numbers  x = (x„_i  . . . *1X0)2  are  often  used 
to  represent  subsets  X of  the  n-element  universe  {0, 1, ...  ,n  — 1},  with  k E X 
if  and  only  if  2k  C x.  The  functions  Ax  and  px  then  represent  the  largest  and 
smallest  elements  of  X.  The  function 

vx  = x„_i  H hXi+xo,  (59) 

which  is  called  the  “sideways  sum”  or  “population  count”  of  x,  also  has  obvious 
importance  in  this  connection,  because  it  represents  the  cardinality  |X|,  namely 
the  number  of  elements  in  X.  This  function,  which  we  considered  in  4.6.3-(7), 
satisfies  the  recurrence 

vO  = 0;  v{2x)  = v(x)  and  ^(2x  + l)  = v(x)  + 1,  for  x > 0.  (60) 

It  also  has  an  interesting  connection  with  the  ruler  function  (exercise  1.2.5-11), 

n 

px  — 1 + v(x  — 1)  — vx;  equivalently,  pk  = n — vn.  (61) 

fc= 1 

The  first  textbook  on  programming,  The  Preparation  of  Programs  for  an 
Electronic  Digital  Computer  by  Wilkes,  Wheeler,  and  Gill,  second  edition  (Read- 
ing, Mass.:  Addison- Wesley,  1957),  155,  191  193,  presented  an  interesting  sub- 
routine for  sideways  addition  due  to  D.  B.  Gillies  and  J.  C.  P.  Miller.  Their 
method  was  devised  for  the  35-bit  numbers  of  the  EDS  AC,  but  it  is  readily 
converted  to  the  following  64-bit  procedure  for  vx  when  x = (*63  . . .*1X0)2: 

Set  y <r-  x — ((x  3>  1)  & po).  (Now  y — (u 31 . . . uiWo)4:  where  Uj  = *2j+i  + *2j-) 
Set  y <-  {ykpi)  + ((j/»  2)  & /Xi).  (Now  y = (v15  . . .wiu0)i6,  Vj  - u2j+i+u2j.) 
Set  y <-  (y  + (y  > 4))  & p2.  (Now  y - (w7  . ..Wiw0)256,  Wj  = v2j+1  + v2j.) 
Finally  v <—  ((a  • y)  mod  264)  56,  where  a = (11111111)256-  (62) 

The  last  step  cleverly  computes  y mod  255  = w7  + - ■ --l-tni-l-mo  via  multiplication, 
using  the  fact  that  the  sum  fits  comfortably  in  eight  bits.  [David  Muller  had 
programmed  a similar  method  for  the  ILLIAC  I machine  in  1954.] 
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If  £ is  expected  to  be  “sparse,”  having  at  most  a few  1-bits,  we  can  use  a 
faster  method  [P.  Wegner,  CACM  3 (1960),  322]: 

Set  v <-  0,  y <-  x.  Then  while  y / 0,  set  v «-  v + 1,  y «-  y & (y  - l).  (63) 

A snmlar  approach,  using  y <- y | (y+ 1),  works  when  x is  expected  to  be  “dense.” 


Bit  reversal.  For  our  next  trick,  let’s  change  x = (x63  . ..Xlx0)2  to  its  left- 
right  mirror  image,  x - (xoXl . . .x63)2.  Anybody  who  has  been  following  the 
developments  so  far,  seeing  methods  like  (50),  (56),  (57),  and  (62),  will  probably 
think,  Aha  once  again  we  can  divide  by  2 and  conquer!  If  we’ve  already 
discovered  how  to  reverse  32-bit  numbers,  we  can  reverse  64-bit  numbers  almost 
as  fast,  because  (xy)  - y x . All  we  have  to  do  is  apply  the  32-bit  method  in 
parallel  to  both  halves  of  the  register,  then  swap  the  left  half  with  the  right  half.” 

Right.  For  example,  we  can  reverse  an  8-bit  string  in  three  easy  steps: 


Given 
Swap  bits 
Swap  nyps 
Swap  nybbles 


X7XqX5X4X3X2XxX0 

X6X7X4X5X2X3X0Xx 

X4X5X6X7X0XxX2X3 

XoXxX2X3X4X5XqX7 


(64) 


And  six  such  easy  steps  will  reverse  64  bits.  Fortunately,  each  of  the  swapping 
operations  turns  out  to  be  quite  simple  with  the  help  of  the  magic  masks  /rfc: 

y <—  (x  1)  & p 0,  z <—  (x  & p0)  < 1,  x<-y\z; 

z <-  (x  & fix)  -c  2,  X <-  y | z; 

z (x  & H 2)  <4,  x y | z; 

z (x  & p3)  -C  8,  x <r-  y | z; 

z <-  (x  &;  p4)  -C  16,  x <-  y I z: 


y <r-  (x»  2)  & p1; 

y t-  (x>4)  &p2, 
y <r-  (x  > 8)  & p3, 

1/ (x  > 16)  & p4, 


x (x  > 32)  | ((x  <C  32)  mod  264). 


[Christopher  Strachey  foresaw  some  aspects  of  this  construction  in  CACM  4 
(1961),  146,  and  a similar  ternary  method  was  devised  in  1973  by  Bruce  Baum 
gart  (see  exercise  49).  The  mature  algorithm  (65)  was  presented  by  Henry  S. 
Warren,  Jr.,  in  Hacker’s  Delight  (Addison- Wesley,  2002),  102.] 

But  MMIX  is  once  again  able  to  trump  this  general-purpose  technique  with 
less  traditional  commands  that  do  the  job  much  faster.  Consider 


rev  GREG  #0102040810204080;  M0R  x.x.rev;  M0R  x.rev.x;  (66) 

the  first  M0R  instruction  reverses  the  bytes  of  x from  big-endian  to  little-endian 
or  vice  versa,  while  the  second  reverses  the  bits  within  each  byte. 

Bit  swapping.  Suppose  we  only  want  to  interchange  two  bits  within  a register. 
X]  0 xh  where  * > 3-  What  would  be  a good  way  to  proceed?  (Dear  reader 
please  pause  for  a moment  and  solve  this  problem  in  your  head,  or  with  pencil 
and  paper  without  looking  at  the  answer  below.) 

Let  8 = i — j.  Here  is  one  solution  (but  don’t  peek  until  you’re  ready): 
y t—  (xi$>S)  & 2\  z (x  & 2-' ) <C  S,  x (x  & m)  \ y \ z,  where  rh  = 2*  1 2-7 . (67) 
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It  uses  two  shifts  and  five  bitwise  Boolean  operations,  assuming  that  i and  j 
are  given  constants.  It  is  like  each  of  the  first  lines  of  (65),  except  that  a new 
mask  m is  needed  because  y and  z don’t  account  for  all  of  the  bits  of  x. 

We  can,  however,  do  better,  saving  one  operation  and  one  constant: 

y (x  © 0 > <!))  & 23,  x *r-  x©y©  (y  «tf).  (68) 

The  first  assignment  now  puts  xt  © xj  into  position  j ; the  second  changes  X*  to 

xi  © (xt  © Xj)  and  Xj  to  Xj  © ( Xi  © Xj),  as  desired.  In  general  it’s  often  wise  to 

convert  a problem  of  the  form  “change  x to  f(x)”  into  a problem  of  the  form 
“change  x to  x © g(x),”  since  the  bit-difference  g(x)  might  be  easy  to  calculate. 

On  the  other  hand,  there’s  a sense  in  which  (67)  might  be  preferable  to  (68), 
because  the  assignments  to  y and  2 in  (67)  can  sometimes  be  performed  simulta- 
neously. When  expressed  as  a circuit,  (67)  has  a depth  of  4 while  (68)  has  depth  5. 

Operation  (68)  can  of  course  be  used  to  swap  several  pairs  of  bits  simulta- 
neously, when  we  use  a mask  6 that’s  more  general  than  2J: 

y (x  © (x  » 6))  & 0,  x <-  x®y@  (y<6).  (69) 

Let  us  call  this  operation  a “<5-swap,”  because  it  allows  us  to  swap  any  non- 
overlapping pairs  of  bits  that  are  S places  apart.  The  mask  9 has  a 1 in  the  right- 
most position  of  each  pair  that’s  supposed  to  be  swapped.  For  example,  (69)  will 
swap  the  leftmost  25  bits  of  a 64-bit  word  with  the  rightmost  25  bits,  while  leav- 
ing the  14  middle  bits  untouched,  if  we  let  6 = 39  and  9 — 225  - 1 = #lffffff . 
Indeed,  there’s  an  astonishing  way  to  reverse  64  bits  using  d-swaps,  namely 
y <r-  (x  > 1)  & p0,  2 4-  (x  & p0)  < 1,  x <- y \ z, 

y <-  (x  © (x  > 4))  & #0300c0303030c303,  x <-  x © y © (y  « 4), 
y <-  (x®  (x»8))&#00c0300c03f0003f,  x <-  x © y © (y  « 8),  (70) 

y <-  (x  © (x  > 20))  & #00000ff  c00003fff , x^x©y©(j/<  20), 

x 4-  (x  > 34)  | ((x  < 30)  mod  264), 

saving  two  of  the  bitwise  operations  in  (65)  even  though  (65)  looks  “optimum.” 

*Bit  permutation  in  general.  The  methods  we’ve  just  seen  can  be  extended  to 
obtain  an  arbitrary  permutation  of  the  bits  in  a register.  In  fact,  there  always  ex- 
ist masks  60,  . . . , 65,  64,  . . . , 9q  such  that  the  following  operations  transform  x = 
(x63  . . . xjx0)2  into  any  desired  rearrangement  xn  = (x637r  . . . xi7rx07r)2  of  its  bits: 

x e-  2fc-swap  of  x with  mask  9k,  for  k = 0,1,  2,  3,  4,  5; 
x t—  2fc-swap  of  x with  mask  9k,  for  k — 4,  3,  2,  1,  0. 

In  general,  a permutation  of  2d  bits  can  be  achieved  with  2d -l  such  steps,  using 
appropriate  masks  6k  and  9k,  where  the  swap  distances  are  respectively  2°,  21, 
...,  2d~1,  ...,  21,  2°. 

To  prove  this  fact,  we  can  use  a special  case  of  the  permutation  networks 
discovered  independently  by  A.  M.  Duguid  and  J.  Le  Corre  in  1959,  based  on 
earlier  work  of  D.  Slepian  [see  V.  E.  Benes,  Mathematical  Theory  of  Connecting 
Networks  and  Telephone  Traffic  (New  York:  Academic  Press,  1965),  Section  3.3], 
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Figure  12  shows  a permutation  network  P(2n)  for  2 n elements  constructed  from 
two  permutation  networks  for  n elements,  when  n = 4.  Each  ‘ J ’ connection 
between  two  lines  represents  a crossbar  module  that  either  leaves  the  line  contents 
unaltered  or  interchanges  them,  as  the  data  flows  from  left  to  right.  To  start 
the  recursion  when  n = 1,  we  let  P( 2)  consist  of  a single  crossbar.  Every  setting 
of  the  individual  crossbars  clearly  causes  P(2n)  to  produce  a permutation  of  its 
inputs;  conversely,  we  will  show  that  any  permutation  of  the  2n  inputs  can  be 
achieved  if  we  are  clever  enough  to  set  the  crossbars  appropriately. 

The  construction  of  Fig.  12  is  best  understood  by  considering  an  example. 
Suppose  we  want  to  route  the  inputs  (0, 1, 2, 3, 4, 5, 6,  7)  to  (3, 2, 4, 1,  6, 0,  5.  7), 
respectively.  The  first  job  is  to  determine  the  contents  of  the  lines  just  after  the 
first  column  of  crossbars  and  just  before  the  last  column,  since  we  can  then  use 
a similar  method  to  set  the  crossbars  in  the  inner  P(4)’s.  Thus,  in  the  network 


we  want  to  find  permutations  abcdefgh  and  ABCDEFGH  such  that  {a.  b}  = {0, 1}. 
{c,d}  = {2,3},  ...,  {g,h}  = {6,7},  {a,  c,  e,  g}  = {A,C,E,G},  {b.d.f.h}  = 
{B,D,F,H},  {A,  B}  = {3,2},  {C,D}  = {4,1},  ...,  {G,H}  = {5,7}.  Starting  at 
the  bottom,  let  us  choose  h = 7,  because  we  don’t  wish  to  disturb  the  contents 
of  that  line  unless  necessary.  Then  the  following  choices  are  forced'. 


H = 7;  G = 5;  e = 5;  f = 4;  D = 4;  C=l;  a=l;  b = 0;  F = 0;  E = 6;  g = 6.  (73) 

If  we  had  chosen  h = 6,  the  forcing  pattern  would  have  been  similar  but  reversed. 

F = 6;  E = 0;  a = 0;  b = 1;  D = l;  C = 4;  e = 4;  f = 5;  H = 5;  G = 7;  g=7.  (74) 

Options  (73)  and  (74)  can  both  be  completed  by  choosing  either  d = 3 (hence 
B = 3,  A = 2,  c = 2)  or  d = 2 (hence  B = 2,  A = 3,  c = 3). 

In  general  the  forcing  pattern  will  go  in  cycles,  no  matter  what  permutation 
we  begin  with.  To  see  this,  consider  the  graph  on  eight  vertices  {ab,  cd.  ef . gh. 
AB,  CD,  EF,  GH}  that  has  an  edge  from  uv  to  UV  whenever  the  pair  of  inputs 
connected  to  uv  has  an  element  in  common  with  the  pair  of  outputs  connected 

to  UV.  Thus,  in  our  example  the  edges  are  ab  — EF,  ab  — CD,  cd  AB. 

cd  — AB,  ef  — CD,  ef  — GH,  gh  — EF,  gh  — GH.  We  have  a “double  bond”’ 
between  cd  and  AB,  since  the  inputs  connected  to  c and  d are  exactly  the  outputs 
connected  to  A and  B;  subject  to  this  slight  bending  of  the  strict  definition  of 
a graph,  we  see  that  each  vertex  is  adjacent  to  exactly  two  other  vertices,  and 
lowercase  vertices  are  always  adjacent  to  uppercase  ones.  Therefore  the  graph 
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crossbar  modules 


Fig.  12.  The  inside  of  a black  box  P(2n)  that  permutes  2 n elements 
in  all  possible  ways,  when  n > 1.  (Illustrated  for  n = 4.) 

always  consists  of  disjoint  cycles  of  even  length.  In  our  example,  the  cycles  are 

. EF  — gh . 

ab  ^,GH  cd  = AB.  (75) 

CD  — ef 

where  the  longer  cycle  corresponds  to  (73)  and  (74).  If  there  are  k different 
cycles,  there  will  be  2fc  different  ways  to  specify  the  behavior  of  the  first  and  last 
columns  of  crossbars. 

To  complete  the  network,  we  can  process  the  inner  4-element  permutations 
in  the  same  way;  and  any  2rf-element  permutation  is  achievable  in  this  same 
recursive  fashion.  The  resulting  crossbar  settings  determine  the  masks  Oj  and  Oj 
of  (71).  Some  choices  of  crossbars  may  lead  to  a mask  that  is  entirely  zero;  then 
we  can  eliminate  the  corresponding  stage  of  the  computation. 

If  the  input  and  output  are  identical  on  the  bottom  lines  of  the  network,  our 
construction  shows  how  to  ensure  that  none  of  the  crossbars  touching  those  lines 
are  active.  For  example,  the  64-bit  algorithm  in  (71)  could  be  used  also  with  a 
60-bit  register,  without  needing  the  four  extra  bits  for  any  intermediate  results. 

Of  course  we  can  often  beat  the  general  procedure  of  (71)  in  special  cases. 

For  example,  exercise  52  shows  that  method  (71)  needs  nine  swapping  steps  to 
transpose  an  8 x 8 matrix,  but  in  fact  three  swaps  suffice: 

Given  7-swap  14-swap  28-swap 

00  01  02  03  04  05  06  07  00  10  02  12  04  14  06  16  00  10  20  30  04  14  24  34  00  10  20  30  40  50  60  70 

10  11  12  13  14  15  16  17  01  11  03  13  05  15  07  17  01  11  21  31  05  15  25  35  01  11  21  31  41  51  61  71 

20  21  22  23  24  25  26  27  20  30  22  32  24  34  26  36  02  12  22  32  06  16  26  36  02  12  22  32  42  52  62  72 

30  31  32  33  34  35  36  37  21  31  23  33  25  35  27  37  03  13  23  33  07  17  27  37  03  13  23  33  43  53  63  73 

40  41  42  43  44  45  46  47  40  50  42  52  44  54  46  56  40  50  60  70  44  54  64  74  04  14  24  34  44  54  64  74 

50  51  52  53  54  55  56  57  41  51  43  53  45  55  47  57  41  51  61  71  45  55  65  75  05  15  25  35  45  55  65  75 

60  61  62  63  64  65  66  67  60  70  62  72  64  74  66  76  42  52  62  72  46  56  66  76  06  16  26  36  46  56  66  76 

70  71  72  73  74  75  76  77  61  71  63  73  65  75  67  77  43  53  63  73  47  57  67  77  07  17  27  37  47  57  67  77 

The  “perfect  shuffle”  is  another  bit  permutation  that  arises  frequently  in 
practice.  If  x — ( . . . x2xiXo)2  and  U — ( ■ • • 2/22/12/0)2  are  any  2-adic  integers,  we 
define  x\y  (“x  zip  y,"  the  zipper  function  of  x and  y)  by  interleaving  their  bits: 

x\y  = (...x2?/2Zi2/i  £02/0)2-  (76) 
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This  operation  has  important  applications  to  the  representation  of  2-dimensional 
data  because  a small  change  in  either  x or  y usually  causes  only  a small  change 
mxty  (see  exercise  86).  Notice  also  that  the  magic  mask  constants  (47)  satisfy 

9-k  t 9k  = Hk+ 1-  (77) 

H a;  appears  in  the  left  half  of  a register  and  y appears  in  the  right  half,  a perfect 
s ulfic  is  the  permutation  that  changes  the  register  contents  to  x J y. 

A sequence  of  d - 1 swapping  steps  will  perfectly  shuffle  a 2d-bit  register:  in 
fact,  exercise  53  shows  that  there  are  several  ways  to  achieve  this.  Once  again 
therefore,  we  are  able  to  improve  on  the  (2 d- l)-step  method  of  (71)  and  Fig  12’ 
Conversely,  suppose  we’re  given  the  shuffled  value  z = x%y  in  a 2d-bit 
register;  is  there  an  efficient  way  to  extract  the  original  value  of  y?  Sure:  If  the 

v 1®waps  tllat  do  a Perfect  shuffle  are  performed  in  reverse  order,  they’ll  undo 
the  shuffle  and  recover  both  x and  y.  But  if  only  y is  wanted,  we  can  save  half  of 
the  work:  Start  with  y 4-  z & /x0;  then  set  y <-  (y  + (y  » 2k~1))  & for  k = 1. 

i;' ’ d ~ L For  examPle,  when  d = 3 this  procedure  goes  (O^O^OiqOwoD  >-> 

( 2/32/2 00?/!  1/0)2  (0000i/32/2i/i2/o)2.  “Divide  and  conquer”  conquers  again. 

Consider  now  a more  general  problem,  where  we  want  to  extract  and  com- 
press an  arbitrary  subset  of  a register’s  bits.  Suppose  we’re  given  a 2d-bit  word 

2 __(Z2Vv.'"Zlf?)2  and  a = (X2--1.-.XiXo)2  that  has  s 1-bits;  thus 

VX  — s-  f he  problem  is  to  assemble  the  compact  subword 


V - (2/5-1  - - - 2/12/0)2  = {zj3_1  . . . ZjlZj 0)2, 
^ J 1 > jo  are  the  indices  where  \j  = 1. 


(78) 


where  Js  x > ...  > 3l  > j0  are  the  indices  where  Xj  = 1 . For  example,  if 
d - 3 and  X (10110010)2,  we  want  to  transform  2 = (2/3^32/22/1  x2ximxQ)2  into 
V (2/32/22/i2/o)2-  (The  problem  of  going  from  * \ y to  y,  considered  above,  is  the 
special  case  x = Mo-)  We  know  from  (71)  that  y can  be  found  by  ^-swapping, 
at  most  2d  - 1 times;  but  in  this  problem  the  relevant  data  always  moves  to  the 
right,  so  we  can  speed  things  up  by  doing  shifts  instead  of  swaps. 

Let  s say  that  a 5-shift  of  x with  mask  9 is  the  operation 


X <T- 


x ® ((z®  (x>5))&0), 


(79) 


W ich  changes  bit  Xj  to  xj+s  if  9 has  1 in  position  j but  leaves  x}  unchanged 
otherwise.  Guy  Steele  discovered  that  there  always  exist  masks  6>0,  9X  9d_l 
so  that  the  general  extraction  problem  (78)  can  be  solved  with  a few  5-shifts: 

Start  with  x «-  z;  then  do  a 2fc-shift  of  x with  mask  9k, 

for  k = 0,  1,  . . . , d — 1;  finally  set  y <—  x.  (®°) 

In  fact,  the  idea  for  finding  appropriate  masks  is  surprisingly  simple.  Every  bit 
that  wants  to  move  a total  of  exactly  / = (/d_x . . . Z1/0)2  places  to  the  right  should 
be  transported  in  the  2fc-shifts  for  which  lk  = 1. 

For  example  suppose  d = 3 and  * = (10110010)2.  (We  must  assume  that 
A ^ .)  Remembering  that  some  Os  need  to  be  shifted  in  from  the  left  we  can 
set  90  = (00011001)2,  = (00000110)2  , 92  = (1111 1000)2 ; then  (80)  maps 

(93X39291X2X^0X0)2  *-»•  (y3Xzy292yixi9oyo)2  ^ (93X3929291929190)2  ^ (00002/32/22/12/0)2 
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Exercise  69  proves  that  the  bits  being  extracted  will  never  interfere  with  each 
other  during  their  journey.  Furthermore,  there’s  a slick  way  to  compute  suitable 
masks  9k  dynamically  from  x,  in  0(d2)  steps  (see  exercise  70). 

A “sheep-and-goats”  or  “grouping”  operation  has  been  suggested  for  com- 
puter hardware,  extending  (78)  to  produce  the  general  unshuffled  word 

(xr-i . . . xixoys-i  ■ ■ ■ 2/12/0)2  = (zir_ ! ...zilziozja_1...zjlzj  0)2;  (81) 

here  ir_i  > ■ ■ ■ > ix  > i0  are  the  indices  where  Xi  = 0.  But  another  operation 
called  “gather-flip,”  which  reverses  the  order  of  the  unmasked  bits  and  gives 

(xqXi  . . . xr_i ys-i  . . . 2/12/0)2  — {zioZii  • • • zir-\  zj,~  1 • • • Zj1  Zj0) 2,  (8i/) 

turns  out  to  be  more  useful  and  easier  to  implement.  Any  permutation  of  2d  bits 
is  achievable  by  using  either  operation,  at  most  d times  (see  exercises  72  and  73). 

Shifting  also  allows  us  to  go  beyond  permutations,  to  arbitrary  mappings  of 
bits  within  a register.  Suppose  we  want  to  transform 

X=(x2d_1...X  1*0)2  '-t  XV  = (x^d-i)^  . . .xivx0v,)2i  (82) 

where  p is  any  of  the  (2d)2d  functions  from  the  set  {0, 1, . . . , 2d  - 1}  into  itself. 
K.  M.  Chung  and  C.  K.  Wong  [IEEE  Transactions  C-29  (1980),  1029-1032] 
introduced  an  attractive  way  to  do  this  in  O(d)  steps  by  using  cyclic  5-shifts, 
which  are  like  (79)  except  that  we  set 

x X-  x ® ((x  ® (x  » S)  © (x  < {2d  - 5)))  & 6).  (83) 

Their  idea  is  to  let  c;  be  the  number  of  indices  j such  that  jip  = l,  for  0 < l < 2d . 
Then  they  find  masks  90,  9 1,  . . . , with  the  property  that  a cyclic  2fc-shift 
of  x with  mask  9k , done  successively  for  0 < k < d,  will  transform  x into  a 
number  x'  that  contains  exactly  c/  copies  of  bit  X[  for  each  Z.  Finally  the  general 
permutation  procedure  (71)  can  be  used  to  change  x'  i->  xv . 

For  example,  suppose  d = 3 and  x^  = (x3XiXiXoX3X7XsX5)2-  Then  we  have 
(c0,  ci,c2,  c3,  c4,  c5,  c6,  c7)  — (1, 2, 0,  2, 0,  2, 0, 1).  Using  masks  90  = (00011100)2, 
81  = (00001000)2,  and  92  = (01100000)2,  three  cyclic  2fe-shifts  now  take  x = 
(x7XeX5X4X3X2XiX0)2  ( X’7X6XsX5X4X3XiX0)2  ( X7X6XsX5X5X3XxXo)2 

(xjx3xix5x5x3xix0)2  = x'.  Then,  some  5-swaps:  x'  h->  {x3x7x^xix3x5xix0)2  i->- 
(0:30:12:5X7X3*5X1X0)2  1-^  (x3XiXiXoX3X5X5X7)2  I y (x3XiXiXoX3X7X5X5)2  = x^; 
we’re  done!  Of  course  any  8-bit  mapping  can  be  achieved  more  quickly  by  brute 
force,  one  bit  at  a time;  the  method  of  Chung  and  Wong  becomes  much  more 
impressive  in  a 256-bit  register.  Even  with  MMIX’s  64-bit  registers  it’s  pretty 
good,  needing  at  most  96  cycles  in  the  worst  case. 

To  find  90,  we  use  the  fact  that  C[  = 2d,  and  we  look  at  Eeven  = Ylc2 1 
and  Eodd  = Xlc2;+i-  If  Seven  = Sodd  — 2d~1 , we  can  set  90  — 0 and  omit  the 
cyclic  1-shift.  But  if,  say,  Eeven  < E0dd,  we  find  an  even  l with  c;  = 0.  Cyclically 
shifting  into  bits  Z,  1 + 1,  . . . , l + t (modulo  2d)  for  some  t will  produce  new  counts 
(cj»  • • • , c'2d_x)  for  which  E'even  = E'odd  = 2d~1\  so  90  = 2l  + ■ ■ ■ + 2('+()  mod  2<i. 
Then  we  can  deal  with  the  bits  in  even  and  odd  positions  separately,  using  the 
same  method,  until  getting  down  to  1-bit  subwords.  Exercise  74  has  the  details. 
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Working  with  fragmented  fields.  Instead  of  extracting  bits  from  various 
parts  of  a word  and  gathering  them  together,  we  can  often  manipulate  those  bits 
directly  in  their  original  positions. 

For  example,  suppose  we  want  to  run  through  all  subsets  of  a given  set  U. 
where  (as  usual)  the  set  is  specified  by  a mask  y such  that  [k  e U]  = (y  > k)  & 1.’ 
If  ai  C y and  x / y,  there’s  an  easy  way  to  calculate  the  next  largest  subset  of  U 
m lexicographic  order,  namely  the  smallest  integer  x'  > x such  that  x'  C y: 

(84) 

I11  the  special  case  when  x = 0 and  x^O,  we’ve  already  seen  in  (37)  that  this  for- 
mula produces  the  rightmost  bit  of  y,  which  corresponds  to  the  lexicographically 
smallest  nonempty  subset  of  U. 

Why  does  formula  (84)  work?  Imagine  adding  1 to  the  number  x | y,  which 
has  Is  wherever  y is  0.  A carry  will  propagate  through  those  Is  until  it  reaches 
the  rightmost  bit  position  where  r has  a 0 and  y has  a 1;  furthermore  all  bits 
to  the  right  of  that  position  will  become  zero.  Therefore  x'  = ((x  I y)  + 1)  & y. 
But  we  have  (x  | y)  + 1 = (x  + y)  + 1 = x + (y  + 1)  = x - y when  x C y.  QED. 

Notice  further  that  s'  = 0 if  and  only  if  * = y.  So  we’ll  know  when  we’ve 
found  the  largest  subset.  Exercise  79  shows  how  to  go  back  to  x,  given  x' . 

We  might  also  want  to  run  through  all  elements  of  a subcube-  for  example, 
to  find  all  bit  patterns  that  match  a specification  like  *10*1*01,  consisting  of 
Os  Is,  and  *s  (don’t-cares).  Such  a specification  can  be  represented  by  asterisk 
co  es  a (®n- 1 • • • a 0)2  and  bit  codes  b — (6„_r . . . 60)2,  as  in  exercise  7.1  T 30' 
our  example  corresponds  to  o = (10010100)2,  b = (01001001)2.  The  problem  of 
enumerating  all  subsets  of  a set  is  the  special  case  where  a = y and  6 = 0.  In 
the  more  general  subcube  problem,  the  successor  of  a given  bit  pattern  x is 

x = ((x  — (a  + 6))  & a)  + b.  (gg) 

Suppose  the  bits  of  z = (z„_  1 . . . z0)2  have  been  stitched  together  from  two 
subwords  x = (xr_1...x0)2  and  y = (ya^...y0)2,  where  r + s = n,  using 
an  arbitrary  mask  y for  which  i/y  = s to  govern  the  stitching.  For  example, 
z - (y2x4x3yix2yoXiXoh  When  n = 8 and  y = (10010100)2.  We  can  think  of  z 
as  a scattered  accumulator,”  in  which  alien  bits  x,  lurk  among  friendly  bits  y, . 

10m  thls  vlewpoint  the  problem  of  finding  successive  elements  of  a subcube  is 
essentially  the  problem  of  computing  y + 1 inside  a scattered  accumulator  2, 
without  changing  the  value  of  x.  The  sheep-and-goats  operation  (81)  would 
untangle  x and  y;  but  it’s  expensive,  and  (85)  shows  that  we  can  solve  the 
problem  without  it.  We  can,  in  fact,  compute  y + y'  when  y'  = [y'  W ) 
is  any  value  inside  a scattered  accumulator  z' , if  y and  y'  both  appear  in  the 
positions  specified  by  y:  Consider  t = z & y and  t'  = z'  & y.  If  we  form  the 
sum  (t  | y)  + t',  all  carries  that  occur  in  a normal  addition  y + y'  will  propagate 
through  the  blocks  of  Is  in  y,  just  as  if  the  scattered  bits  were  adjacent.  Thus 

((z&iX)  + (z'  | y))&y 

is  the  sum  of  y and  y' , modulo  2s,  scattered  according  to  the  mask  y. 


(86) 
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Tweaking  several  bytes  at  once.  Instead  of  concentrating  on  the  data  in  one 
field  within  a word,  we  often  want  to  deal  simultaneously  with  two  or  more  sub- 
words, performing  calculations  on  each  of  them  in  parallel.  For  example,  many 
applications  need  to  process  long  sequences  of  bytes,  and  we  can  gain  speed  by 
acting  on  eight  bytes  at  a time;  we  might  as  well  use  all  64  bits  that  our  machine 
provides.  General  multibyte  techniques  were  introduced  by  Leslie  Lamport  in 
CACM  18  (1975),  471-475,  and  subsequently  extended  by  many  programmers. 

Suppose  first  that  we  simply  wish  to  take  two  sequences  of  bytes  and  find 
their  sum,  regarding  them  as  coordinates  of  vectors,  doing  arithmetic  mod- 
ulo 256  in  each  byte.  Algebraically  speaking,  we’re  given  8-byte  vectors  x = 
(x7  .. . *1X0)256  and  y = (y7  . . . 1/11/0)256;  we  want  to  compute  z = (z7  . . . zi  .20)256, 
where  Zj  = (xj  + yj)  mod  256  for  0 < j < 8.  Ordinary  addition  of  x to  y doesn’t 
quite  work,  because  we  need  to  prevent  carries  from  propagating  between  bytes. 
So  we  extract  the  high-order  bits  and  deal  with  them  separately: 

z (x®y)kh,  where  h = #8080808080808080; 
z <-  ((x  k h)  + (y  k h))  ® z.  (87) 

The  total  time  for  MMIX  to  do  this  is  6u,  plus  3/z  + 3v  if  we  also  count  the  time  to 
load  x,  load  y,  and  store  z.  By  contrast,  eight  one-byte  additions  (LDBU,  LDBU, 
ADDU,  and  STBU,  repeated  eight  times)  would  cost  8 x (3/x  + Av)  = 2Ap  + 32w. 
Parallel  subtraction  of  bytes  is  just  as  easy  (see  exercise  88). 

We  can  also  compute  bytewise  averages , with  zj  = [(xj  + yj)/ 2J  for  each  j: 

z 4-  ((x  © y)  k T)  > 1,  where  l = #0101010101010101; 
z<-(xky)  + z.  (88) 

This  elegant  trick,  suggested  by  H.  G.  Dietz,  is  based  on  the  well-known  formula 
x + y = (x  © y)  + ((x  k y)  « 1)  (89) 

for  radix-2  addition.  (We  can  implement  (88)  with  four  MMIX  instructions,  not 
five,  because  a single  MOR  operation  will  change  x ® y to  ((x  (By)  kl)  S>  1.) 

Exercises  88-93  and  100  104  develop  these  ideas  further,  showing  how  to  do 
mixed-radix  arithmetic,  as  well  as  such  things  as  the  addition  and  subtraction  of 
vectors  whose  components  are  treated  modulo  m when  m needn’t  be  a power  of  2. 

In  essence,  we  can  regard  the  bits,  bytes,  or  other  subfields  of  a register  as  if 
they  were  elements  of  an  array  of  independent  microprocessors,  acting  indepen- 
dently on  their  own  subproblems  yet  tightly  synchronized,  and  communicating 
with  each  other  via  shift  instructions  and  carry  bits.  Computer  designers  have 
been  interested  for  many  years  in  the  development  of  parallel  processors  with  a 
so-called  SIMD  architecture,  namely  a “Single  Instruction  stream  with  Multiple 
Data  streams”;  see,  for  example,  S.  H.  Unger,  Proc.  IRE  46  (1958),  1744-1750. 
The  increased  availability  of  64-bit  registers  has  meant  that  programmers  of 
ordinary  sequential  computers  are  now  able  to  get  a taste  of  SIMD  processing. 
Indeed,  computations  such  as  (87),  (88),  and  (89)  are  called  SWAR  methods  — 
“SIMD  Within  A Register,”  a name  coined  by  R.  J.  Fisher  and  H.  G.  Dietz  [see 
Lecture  Notes  in  Computer  Science  1656  (1999),  290-305],  See  also  R.  B.  Lee, 
IEEE  Micro  16,4  (August  1996),  51-59. 
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Of  course  bytes  often  contain  alphabetic  data  as  well  as  numbers,  and  one 
of  the  most  common  programming  tasks  is  to  search  through  a long  string  of 
characters  in  order  to  find  the  first  appearance  of  some  particular  byte  value.  For 
example,  strings  are  often  represented  as  a sequence  of  nonzero  bytes  terminated 
by  0.  In  order  to  locate  the  end  of  a string  quickly,  we  need  a fast  way  to 
determine  whether  all  eight  bytes  of  a given  word  x are  nonzero  (because  they 
usually  are).  Several  fairly  good  solutions  to  this  problem  were  found  by  Lamport 

and  others;  but  Alan  Mycroft  discovered  in  1987  that  three  instructions  actually 
suffice: 


t i-  hk(x  -l)  kx,  (90) 

where  h and  / appear  in  (87)  and  (88).  If  each  byte  x3  is  nonzero,  t will  be  zero; 
for  (xj-l)kxj  will  be  2px*'  — l,  which  is  always  less  than  #80  = 27.  But  if  x ■ — 0 
while  its  right  neighbors  Xj-U  (if  any)  are  all  nonzero,  the  subtraction 

x l will  produce  #ff  in  byte  j,  and  t will  be  nonzero.  In  fact,  pt  will  be  8 j + 7. 

Caution:  Although  the  computation  in  (90)  pinpoints  the  rightmost  zero 
byte  of  x,  we  cannot  deduce  the  position  of  the  leftmost  zero  byte  from  the  value 
of  t alone.  (See  exercise  94.)  In  this  respect  the  little-endian  convention  proves 
to  be  preferable  to  the  corresponding  big-endian  behavior.  An  application  that 
needs  to  locate  the  leftmost  zero  byte  can  use  (90)  to  skip  quickly  over  nonzeros, 
but  then  it  must  fall  back  on  a slower  method  when  the  search  has  been  narrowed 
down  to  eight  finalists.  The  following  4-operation  formula  produces  a completely 
precise  test  value  t = (t7  . . . tit0)25e,  in  which  t3  = 128 [xj  = 0]  for  each  j: 


t <-  fl&~(x  | ((x  | />)  _ /))_  (gx) 

The  leftmost  zero  byte  of  x is  now  xj,  where  Xt.  = 8j  + 7. 

Incidentally,  the  single  MMIX  instruction  ‘BDIF  t,l,x’  solves  the  zero-byte 
problem  immediately  by  setting  each  byte  t3  of  t to  [Xj  =0],  because  1-x  = 
[x  — 0].  But  we  are  primarily  interested  here  in  fairly  universal  techniques  that 
don’t  rely  on  exotic  hardware;  MMIX’s  special  features  will  be  discussed  later. 

Now  that  we  know  a fast  way  to  find  the  first  0,  we  can  use  the  same  ideas 
to  search  for  any  desired  byte  value.  For  example,  to  test  if  any  byte  of  x is  the 
newline  character  (#a),  we  simply  look  for  a zero  byte  in  X®#0a0a0a0a0a0a0a0a. 

And  these  techniques  also  open  up  many  other  doors.  Suppose,  for  instance, 
that  we  want  to  compute  2 = (27  . . . z1z0)256  from  x and  y,  where  z3  = x3 
when  Xj  = y3  but  2,  - when  Xj  ? y3.  (Thus  if  x = "beaching"  and 
y - "belching",  we’re  supposed  to  set  2 «-  "beaching".)  It’s  easy: 

t<-hk((x®y)  | (((x  © y)  | h)  - l )); 

m<-(t«l)-(f»7);  (92) 

2 i — x®  ((x®  "**♦*****")  km). 


The  first 
where  x3 

xi  ~ Vii 
(x  km)  | 


step  uses  a variant  of  (91)  to  flag  the  high-order  bits  in  each  byte 
7-  Uj • The  next  step  creates  a mask  to  highlight  those  bytes:  ^00  if 
otherwise  *ff.  And  the  last  step,  which  could  also  be  written  2 <- 
( ********  km),  sets  z3  x3  or  z3  t—  depending  011  the  mask. 
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Operations  (90)  and  (91)  were  originally  designed  as  tests  for  bytes  that  are 
zero;  but  a closer  look  reveals  that  we  can  more  wisely  regard  them  as  tests  for 
bytes  that  are  less  than  1.  Indeed,  if  we  replace  l by  c ■ l = (cccccccc) 256  in 
either  formula,  where  c is  any  positive  constant  < 128,  we  can  use  (90)  or  (91) 
to  see  if  x contains  any  bytes  that  are  less  than  c.  Furthermore  the  comparison 
values  c need  not  be  the  same  in  every  byte  position;  and  with  a bit  more  work 
we  can  also  do  bytewise  comparison  in  the  cases  where  c > 128.  Here’s  an  8-step 
formula  that  sets  tj  t—  128 [xj  < y3]  for  each  byte  position  j in  the  test  word  t: 

t h&n~{xyz),  where  2 = (x  \ h)  — (y  & h) . (93) 

(See  exercise  96.)  The  median  operation  in  this  general  formula  can  often  be 
simplified;  for  example,  (93)  reduces  to  (91)  when  y — l,  because  (x(— \)z)  — x\z. 

Once  we’ve  found  a nonzero  t in  (90)  or  (91)  or  (93),  we  might  want  to 
compute  pt  or  At  in  order  to  discover  the  index  j of  the  rightmost  or  leftmost 
byte  that  has  been  flagged.  The  problem  of  calculating  p or  A is  now  simpler 
than  before,  since  t can  take  on  only  256  different  values.  Indeed,  the  operation 

256  — 1 

j «—  table[((a  ■ t)  mod  264)  56],  where  a = — , (94) 

now  suffices  to  compute  j,  given  an  appropriate  256- byte  table.  And  the  mul- 
tiplication here  can  often  be  performed  faster  by  doing  three  shift-and-add 
operations,  “t  t—  t + (t  -C  7),  1 t—  t + (t  <C  14),  t t—  t + (f  <C  28),”  instead. 

Broadword  computing.  We’ve  now  seen  more  than  a dozen  ways  in  which 
a computer’s  bitwise  operations  can  produce  astonishing  results  at  high  speed, 
and  the  exercises  below  contain  many  more  such  surprises. 

Elwyn  Berlekamp  has  remarked  that  computer  chips  containing  N flip-flops 
continue  to  be  built  with  ever  larger  values  of  N , yet  in  practice  only  0(log  N)  of 
those  components  are  flipping  or  flopping  at  any  given  moment.  The  surprising 
effectiveness  of  bitwise  operations  suggests  that  computers  of  the  future  might 
make  use  of  this  untapped  potential  by  having  enhanced  memory  units  that  are 
able  to  do  efficient  n-bit  computations  for  fairly  large  values  of  n.  To  prepare  for 
that  day,  we  ought  to  have  a good  name  for  the  concept  of  manipulating  “wide 
words.”  Lyle  Ramshaw  has  suggested  the  pleasant  term  broadword , so  that  we 
can  speak  of  n-bit  quantities  as  broadwords  of  width  n. 

Many  of  the  methods  we’ve  discussed  are  2-adic,  in  the  sense  that  they  work 
correctly  with  binary  numbers  that  have  arbitrary  (even  infinite)  precision.  For 
example,  the  operation  x & — x always  extracts  2px,  the  least  significant  1 bit  of 
any  nonzero  2-adic  integer  x.  But  other  methods  have  an  inherently  broadword 
nature,  such  as  the  methods  that  use  O(d)  steps  to  perform  sideways  addition 
or  bit  permutation  of  2d-bit  words.  Broadword  computing  is  the  art  of  dealing 
with  n-bit  words,  when  n is  a parameter  that  is  not  extremely  small. 

Some  broadword  algorithms  are  of  theoretical  interest  only,  because  they  are 
efficient  only  in  an  asymptotic  sense  when  n exceeds  the  size  of  the  universe.  But 
others  are  eminently  practical  even  when  n = 64.  And  in  general,  a broadword 
mindset  often  suggests  good  techniques. 
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One  fascinating-but-impractical  fact  about  broadword  operations  is  the  dis- 
covery by  M.  L.  Fredman  and  D.  E.  Willard  that  0(1)  broadword  steps  suffice 
to  evaluate  the  function  Ax  = [lgx]  for  any  nonzero  n-bit  number  x,  no  matter 
how  big  n is.  Here  is  their  remarkable  scheme,  when  n = g2  and  g is  a power  of  2: 

ti  <-  hk(x  \ ((x  \ h)  - /)),  where  h = 29~1l  and  Z = ( 2"  - 1) /(29  - 1); 

V (((a  ■ h)  mod  2")  > (n  - g))  ■ l , where  a = ( 2n~g  - l)/(2fl_1  - 1): 
t2  <-  h & (y  | ((y  | h)  - b)),  where  b = ( 2n+ 9 - 1) /(29+1  - 1); 
m {t2  Cl)  - (t2  » (g  - 1)),  m <-  in  © (m  > g);  (or) 

2 1-  (((*  ■ (x  & m))  mod  2n)  » (n  - g))  ■ /; 
t3  ■<—  h & {z  | ((z  | h)  - b))\ 

((l-  ({t2  > (25  - lgff  - 1))  + (f3  > (2g  - 1))))  mod  2n)  » (n  - ff). 

(See  exercise  106.)  The  method  fails  to  be  practical  because  five  of  these  29  steps 
are  multiplications,  so  they  aren’t  really  “bitwise”  operations.  In  fact,  we’ll  prove 
later  that  multiplication  by  a constant  requires  at  least  ft(logn)  bitwise  steps. 

A multiplication-free  way  to  find  Ax,  with  only  O(loglogn)  bitwise  broad- 
word operations,  was  discovered  in  1997  by  Gerth  Brodal,  whose  method  is  even 
more  remarkable  than  (95).  It  is  based  on  a formula  analogous  to  (49), 

Ax  = [Ax  = A(x&p0)]  + 2[Ax  = A(x&/l1)]  + 4[Ax  = A(x&^2)]  + -..,  (96) 

and  the  fact  that  the  relation  Ax  = A y is  easily  tested  (see  (58)): 

Algorithm  B ( Binary  logarithm).  This  algorithm  uses  n-bit  operations  to 
compute  Ax  = [lgzj,  assuming  that  0 < x < 2n  and  n = d ■ 2d. 

Bl.  [Scale  down.]  Set  A<-0.  Then  set  A <-  A + 2k  and  x x » 2k  if  x > 22\ 
for  k = fig n]  - 1,  [dgn]  -2,  ...,  d. 

B2.  [Replicate.]  (At  this  point  0 < x < 2 2 ; the  remaining  task  is  to  increase 
A by  fig  xj . We  will  replace  x by  d copies  of  itself,  in  2d-bit  fields  ) Set 
x <-  x | (x  •<  2d+k)  for  0 < k < rigd], 

B3.  [Change  leading  bits.]  Set  y i-  x&~(Mdid_j  . . . • (See  (48).) 

B4.  [Compare  all  fields.]  Set  t *-  h & (y  \ ((y  | h)  - (x  ® y))),  where  h = 

(22  _1 . . . 22  _122  _1)22d . 

B5.  [Compress  bits.]  Set  tf-(t  + (t«  ( 2d+k  - 2k )))  mod  2"  for  0 < k < fig d). 
B6.  [Finish.]  Finally,  set  A «-  A + (t  > (n  - d)).  | 

This  algorithm  is  almost  competitive  with  (56)  when  n = 64  (see  exercise  107). 

Another  surprisingly  efficient  broadword  algorithm  was  discovered  in  2006 
by  M.  S.  Paterson  and  the  author,  who  considered  the  problem  of  identifying 
all  occurrences  of  the  pattern  01”  in  a given  n-bit  binary  string.  This  problem, 
which  is  related  to  finding  r contiguous  free  blocks  when  allocating  storage,  is 
equivalent  to  computing 

q = x k (x  « 1)  & (x  « 2)  & (x  « 3)  & • • • & (x  « r)  (97) 
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when  x = (x„_i  . ..XiXo)2  is  given.  For  example,  when  n = 16,  r = 3,  and 
x = (1110111101100111)2,  we  have  q = (0001000000001000)2-  One  might  expect 
intuitively  that  fl(logr)  bitwise  operations  would  be  needed.  But  in  fact  the 
following  20-step  computation  does  the  job  for  all  n > r > 0:  Let  s = |>/2], 
l = Sfc>o  mod  2",  h — (2 S-11)  mod  2",  and  a — (X]fc>o(~l)fc+122fcs)  mod  2n. 

y «—  h & x & ((2:  & h)  + l ); 
f -f-  (x  + j/)  & x & — 2r; 

u <r-  t & a,  v «—  t & a;  (98) 

m<-(«-(n>r))|(!)-(i)>  r)); 
q t & ((a;  & m)  + ((f  » r)  & ~(m  <C  1))). 

Exercise  111  explains  why  these  machinations  are  valid.  The  method  has  little 
or  no  practical  value;  there’s  an  easy  way  to  evaluate  (97)  in  2|~lgr]  + 2 steps, 
so  (98)  is  not  advantageous  until  r > 512.  But  (98)  is  another  indication  of  the 
unexpected  power  of  broadword  methods. 

*Lower  bounds.  Indeed,  the  existence  of  so  many  tricks  and  techniques  makes 
it  natural  to  wonder  whether  we’ve  only  been  scratching  the  surface.  Are  there 
many  more  incredibly  fast  methods,  still  waiting  to  be  discovered?  A few 
theoretical  results  are  known  by  which  we  can  derive  certain  limitations  on  what 
is  possible,  although  such  studies  are  still  in  their  infancy. 

Let’s  say  that  a 2-adic  chain  is  a sequence  (xo,  xi, . . . , xr ) of  2-adic  integers 
in  which  each  element  Xi  for  i > 0 is  obtained  from  its  predecessors  via  bitwise 
manipulation.  More  precisely,  we  want  the  steps  of  the  chain  to  be  defined  by 
binary  operations 


Xi  xk(i)  Ci  X^^jj  Or  ^j(i)  ^‘i * (99) 

where  each  o t is  one  of  the  operators  | ,©,=,  C,  D,  C,  D,  A,  V,  <,  »} 

and  each  is  a constant.  Furthermore,  when  the  operator  is  a left  shift  or 
right  shift,  the  amount  of  shift  must  be  a positive  integer  constant;  operations 
such  as  Xj(i)<Cxfc(j)  or  Cj3>x^(q  are  not  permitted.  (Without  the  latter  restriction 
we  couldn’t  derive  meaningful  lower  bounds,  because  every  0-1  valued  function 
of  a nonnegative  integer  x would  be  computable  in  two  steps  as  “(c  S>  x)  & 1” 
for  some  constant  c.) 

Similarly,  a broadword  chain  of  width  n,  also  called  an  n-bit  broadword 
chain,  is  a sequence  (xo,  xi, . . . , xr)  of  n-bit  numbers  subject  to  essentially  the 
same  restrictions,  where  n is  a parameter  and  all  operations  are  performed 
modulo  2".  Broadword  chains  behave  like  2-adic  chains  in  many  ways,  but 
subtle  differences  can  arise  because  of  the  information  loss  that  occurs  at  the  left 
of  n-bit  computations  (see  exercise  113). 

Both  types  of  chains  compute  a function  /(x)  = xr  when  we  start  them 
out  with  a given  value  x = xo-  Exercise  114  shows  that  an  mn-bit  broadword 
chain  is  able  to  do  m essentially  simultaneous  evaluations  of  any  function  that 
is  computable  with  an  n-bit  chain.  Our  goal  is  to  study  the  shortest  chains  that 
are  able  to  evaluate  a given  function  /. 
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, „ ^ny  2_ad: 10  or  broadword  chain  {x0,  Xl , . . . , xr)  has  a sequence  of  “shift  sets” 

1°  r A;"  , } and  “bounds”  (B0,B1,...,Bt),  defined  as  follows:  Start  with 

*->o  — and  B0  = 1;  then  for  i > 1,  let 


Sk 

Sj(i), 

Sj(i)  + Cj, 


and  Bi 


MiBj(i)Bk(i 

^iBk(i), 

MiBj(i), 


if  Xi  — Xj^  oi 

if  Xi  = a o i xk[i), 

if  Xi  'Ej(i)  °i  Ci , 
if  Xi  Xj( j)  3>  a , 
if  Xi  Xj(i)  ^ Ci, 


(ioo) 


i,  l Bj(i), 

where  Mt  — 2 if  °z  G {+,  — } and  M,  = 1 otherwise,  and  these  formulas  assume 
that  Oj  f {<,>}.  For  example,  consider  the  following  7-step  chain: 


Xi 

5, 

Bt 

Xq  — X 

{0} 

1 

xi  = x0  & -2 

{0} 

1 

x2  = X!  + 2 

{0} 

2 

X3  — x2  1 

{1} 

2 (101) 

X4  = X2  + X3 

{0,1} 

8 

x$  — X4  4 

{4,5} 

8 

xe  = X4  + x$ 

{0, 1,4,5} 

128 

x7  = x6  » 4 

{4,  5, 8, 9} 

128 

chain  in  exercise  4.4-9,  which  proved  that  these  operations 

I/I  7 mat  LJL1COC  uptflcllioi 

will  yield  x7  = [z/lOj  for  0 < x < 160  when  performed  with  8-bit  arithmetic.) 

To  begin  a theory  of  lower  bounds,  let’s  notice  first  that  the  high-order  bits 
°f  x ~ x°  cannot  influence  any  low-order  bits  unless  we  shift  them  to  the  right. 

Lemma  A.  Given  a 2-adic  or  broadword  chain,  let  the  binary  representation  of 
Xi  be  ( . . . xl2XiiXio)2.  Then  bit  xip  can  depend  on  bit  x0q  only  if  q < p + max  S, 

Proof.  By  induction  on  i we  can  in  fact  show  that,  if  Bt  = 1,  bit  xlp  can  depend 
on  bit  x0q  only  if  q - p G St.  Addition  and  subtraction,  which  force  B,  > 1, 
allow  any  particular  bit  of  their  operands  to  affect  all  bits  that  lie  to  the  left  in 
the  sum  or  difference,  but  not  those  that  lie  to  the  right.  | 

Corollary  I.  The  function  x - 1 cannot  be  computed  by  a 2-adic  chain,  nor 
can  any  function  for  which  at  least  one  bit  of  f{x)  depends  on  an  unbounded 
number  of  bits  of  x.  | 

Corollary  W.  An  n-bit  function  f(x)  can  be  computed  by  an  n-bit  broadword 
chain  without  shifts  if  and  only  if  x = y (modulo  2P)  implies  fix ) = f(y) 
(modulo  2p)  for  0 < p < n.  W 


Proof.  If  there  are  no  shifts  we  have  5,  = {0}  for  all  i.  Thus  bit  xrp  cannot 
depend  on  bit  x0q  unless  q < p.  In  other  words  we  must  have  xr  = yr  (modulo  2P ) 
whenever  x0  = y0  (modulo  2P). 

Conversely,  all  such  functions  are  achievable  by  a sufficiently  long  chain 
Fxercise  119  gives  shift-free  n-bit  chains  for  the  functions 

fpy(x)  = 2p[x  mod  2P+1  = y],  when  0 < p < n and  0 < y < 2P+1,  (102) 
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from  which  all  the  relevant  functions  arise  by  addition.  [H.  S.  Warren,  Jr.,  gener- 
alized this  result  to  functions  of  m variables  in  CACM  20  (1977),  439-441.]  | 

Shift  sets  Si  and  bounds  D,  are  important  chiefly  because  of  a fundamental 
lemma  that  is  our  principal  tool  for  proving  lower  bounds: 

Lemma  B.  Let  Xpqr  — {xr&r[2p  — 2q  \ | £0  G Vpqr}  in  an  n-bit  broadword  chain, 
where 

Vpqr  = {a;  | a;  & |_2P+S  — 2q+s\  — 0 for  all  sgSr]  (103) 

and  p > q.  Then  |XpgT.|  < Br.  (Here  p and  q are  integers,  possibly  negative.) 
This  lemma  states  that  at  most  Br  different  bit  patterns  a:r(p-i)  • • • xrq  can  occur 
within  /(a;),  when  certain  intervals  of  bits  in  x are  constrained  to  be  zero. 

Proof.  The  result  certainly  holds  when  r — 0.  Otherwise  if,  for  example,  xr  = 
Xj  + Xk,  we  know  by  induction  that  \Xpqj\  < Bj  and  \Xpqk\  < Bk . Furthermore 
Vpqr  = Vpqj  ft  Vpqk,  since  Sr  = Sj  U 5fc.  Thus  at  most  B^Bk  possibilities  for 
(xj  + Xk)  & [2P  — 2q\  arise  when  there’s  no  carry  into  position  q,  and  at  most 
BjBk  when  there  is  a carry,  making  a grand  total  of  at  most  Br  = 2B:jBp 
possibilities  altogether.  Exercise  122  considers  the  other  cases.  | 

We  now  can  prove  that  the  ruler  function  needs  fl(loglogn)  steps. 

Theorem  R.  If  n = d ■ 2d,  every  n-bit  broadword  chain  that  computes  px  for 
0 < x < 2”  has  more  than  lg  d steps  that  are  not  shifts. 

Proof.  If  there  are  l nonshift  steps,  we  have  |Sr|  < 2l  and  Br  < 22'-1.  Apply 
Lemma  B with  p = d and  q = 0,  and  suppose  |Xdor|  — 2d  — t.  Then  there  are  t 
values  of  k < 2d  such  that 

{2k,  2k+*d , 2k+22d , ...,2k+^2“}  £ Vd0r. 

But  Vdor  excludes  at  most  2 ld  of  the  n possible  powers  of  2;  so  t < 2l . 

If  l < lgd,  Lemma  B tells  us  that  2d  — t < Br  < 2d~1\  hence  2d~l  < t < 
2 1 < d.  But  this  is  impossible  unless  d < 2,  when  the  theorem  clearly  holds.  | 

The  same  proof  works  also  for  the  binary  logarithm  function: 

Corollary  L.  If  n = d ■ 2d  > 2,  every  n-bit  broadword  chain  that  computes  Ax 
for  0 < x < 2n  has  more  than  lg  d steps  that  are  not  shifts.  | 

By  using  Lemma  B with  q > 0 we  can  derive  the  stronger  lower  bound 
fi(logn)  for  bit  reversal,  and  hence  for  bit  permutation  in  general. 

Theorem  P.  If  2 < g < n,  every  n-bit  broadword  chain  that  computes  the 
g-bit  reversal  xR  for  0 < x < 29  has  at  least  lg g\  steps  that  are  not  shifts. 

Proof.  Assume  as  above  that  there  are  l nonshifts.  Let  h = [\/g\  and  suppose 
that  l < [lg(h  + 1)J.  Then  Sr  is  a set  of  at  most  2l  < |(h  + 1)  shift  amounts  s. 
We  shall  apply  Lemma  B with  p = q + h,  where  p < g and  q > 0,  thus  in  g — h + 1 
cases  altogether.  The  key  observation  is  that  xR  & [2P  — 29J  is  independent  of 
x & L2P+S  — 2?+sJ  whenever  there  are  no  indices  j and  k such  that  0 < j,k  < h, 
and  g — 1 — q — j — q + s + k.  The  number  of  “bad”  choices  of  q for  which  such 
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indices  exist  is  at  most  \{h+  l)h2  < g — h;  therefore  at  least  one  “good”  choice 
of  q yields  \Xpqr  \ — 2h . But  then  Lemma  B leads  to  a contradiction,  because  we 
obviously  cannot  have  2 h < Br  < | 

Corollary  M.  Multiplication  by  certain  constants,  modulo  2”,  requires  fl(logn) 
steps  in  an  n-bit  broadword  chain. 

Proof.  In  Hack  167  of  the  classic  memorandum  HAKMEM  (M.I.T.  A. I.  Lab- 
oratory, 1972),  Richard  Schroeppel  observed  that  the  operations 

((az)mod2n)&6,  y «-  ((cf)  mod  2n)  > (n  - g)  (104) 

compute  y = xR  whenever  n = g2  and  0 < x < 2»,  using  the  constants  a = 
(2 n+9  - l)/(2»+1  - 1),  b = 2g~1(2n  - l)/(2g  - 1),  and  c = (2n~»  - l)/{2g~1  - 1). 
(See  exercise  123.)  | 

At  this  point  the  reader  might  well  be  thinking,  “Okay,  I agree  that  broad- 
word chains  sometimes  have  to  be  asymptotically  long.  But  programmers  needn’t 
be  shackled  by  such  chains;  we  can  use  other  techniques,  like  conditional  branches 
or  references  to  precomputed  tables,  which  go  beyond  those  restrictions.” 

Right.  And  we’re  in  luck,  because  broadword  theory  can  also  be  extended 
to  more  general  models  of  computation.  Consider,  for  example,  the  follow- 
ing idealization  of  an  abstract  reduced-instruction-set  computer,  called  a basic 
RAM.  The  machine  has  n-bit  registers  r±,  ...,  n,  and  n-bit  memory  words 
{M[0], . . . , M[2m  - 1]}.  It  can  perform  the  instructions 

rt  4-  r}  ±rk,  r,  y-  Tj  o rk,  rt  y-  rj  > rk,  n y-  c, 

Vi  y-  M[rj  mod  2m],  M[rj  mod  2m]  y-  n,  (105) 

where  o is  any  bitwise  Boolean  operator,  and  where  rk  in  the  shift  instruction  is 
treated  as  a signed  integer  in  two’s  complement  notation.  The  machine  is  also 
able  to  branch  if  r,  < rj,  treating  r,  and  rj  as  unsigned  integers.  Its  state  is  the 
entire  contents  of  all  registers  and  memory,  together  with  a “program  counter” 
that  points  to  the  current  instruction.  Its  program  begins  in  a designated  state, 
which  may  include  precomputed  tables  in  memory,  and  with  an  n-bit  input 
value  x in  register  r1;  This  initial  state  is  called  Q(x,  0),  and  Q(x,  t ) denotes  the 
state  after  t instructions  have  been  performed.  When  the  machine  stops,  n will 
contain  some  n-bit  value  /(x).  Given  a function  f(x),  we  want  to  find  a lower 
bound  on  the  least  t such  that  rx  is  equal  to  f{x)  in  state  Q(x,  t),  for  0 < x < 2". 

Theorem  R . Let  e = 2 e.  A basic  n-bit  RAM  with  memory  parameter  m < 
n requires  at  least  lg  lg  n—e  steps  to  evaluate  the  ruler  function  px,  as  n — > oo. 

Proof.  Let  n = 22  , so  that  m < 22  f ~2’ . Exercise  124  explains  how  an 

omniscient  observer  can  construct  a broadword  chain  from  a certain  class  of 
inputs  x,  in  such  a way  that  each  x causes  the  RAM  to  take  the  same  branches, 
use  the  same  shift  amounts,  and  refer  to  the  same  memory  locations.  Our  earlier 
methods  can  then  be  used  to  show  that  this  chain  has  length  > /.  | 

A skeptical  reader  may  still  object  that  Theorem  R'  has  no  practical  value, 
because  lglgn  never  exceeds  6 in  the  real  world.  To  this  argument  there  is  no 
rebuttal.  But  the  following  result  is  slightly  more  relevant: 
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Theorem  P'.  A basic  n-bit  RAM  requires  at  least  | lgg  steps  to  compute  the 
g-bit  reversal  xR  for  0 < x < 29 , if  g < n and 

max(m,l  + lgn)  < ^(h^Y)\  - 2’  /l=Lv/ffJ-  (lo6) 

Proof.  An  argument  like  the  proof  of  Theorem  R'  appears  in  exercise  125.  | 

Lemma  B and  Theorems  R,  P,  R',  P'  and  their  corollaries  are  due  to 
A.  Brodnik,  P.  B.  Miltersen,  and  J.  I.  Munro,  Lecture  Notes  in  Comp.  Sci. 
1272  (1997),  426-439,  based  on  earlier  work  of  Miltersen  in  Lecture  Notes  in 
Comp.  Sci.  1099  (1996),  442-453. 

Many  unsolved  questions  remain  (see  exercises  126-130).  For  example,  does 
sideways  addition  require  Q(logn)  steps  in  an  n-bit  broadword  chain?  Can  the 
parity  function  (na:)mod2,  or  the  majority  function  [vx>n/ 2],  be  computed 
substantially  faster  than  vx  itself,  broadwordwise? 

An  application  to  directed  graphs.  Now  let’s  use  some  of  what  we’ve  learned, 
by  implementing  a simple  algorithm.  Given  a digraph  on  a set  of  vertices  V,  we 
write  u — > v when  there’s  an  arc  from  u to  v.  The  reachability  problem  is  to 
find  all  vertices  that  lie  on  oriented  paths  beginning  in  a specified  set  Q C V-, 
in  other  words,  we  seek  the  set 

R — {v  | u — >*  v for  some  u G Q},  (107) 

where  u — >*  v means  that  there  is  a sequence  of  t arcs 

u = Uq — >ui — >••• — >ut=v,  for  some  t > 0.  (108) 

This  problem  arises  frequently  in  practice.  For  example,  we  encountered  it  in 
Section  2.3.5  when  marking  all  elements  of  Lists  that  are  not  “garbage.” 

If  the  number  of  vertices  is  small,  say  |Vj  < 64,  we  may  want  to  approach 
the  reachability  problem  in  quite  a different  way  than  we  did  before,  by  working 
directly  with  subsets  of  vertices.  Let 

S[u]  = {i>  | u — tv}  (109) 

be  the  set  of  successors  of  vertex  m,  for  all  u G V.  Then  the  following  algorithm 
is  almost  completely  different  from  Algorithm  2.3.5E,  yet  it  solves  the  same 
abstract  problem: 

Algorithm  R ( Reachability ).  Given  a simple  directed  graph,  represented  by 
the  successor  sets  S[u]  in  (109),  this  algorithm  computes  the  elements  R that 
are  reachable  from  a given  set  Q. 

Rl.  [Initialize.]  Set  R 4—  Q and  X «—  0.  (I11  the  following  steps,  X is  the  subset 
of  vertices  u G R for  which  we’ve  looked  at  S[w].) 

R2.  [Done?]  If  X — R,  the  algorithm  terminates. 

R3.  [Examine  another  vertex.]  Let  u be  an  element  of  R \ X.  Set  X GlU  u, 
R R U S[u],  and  return  to  step  R2.  | 

The  algorithm  is  correct  because  (i)  every  element  placed  into  R is  reachable; 

(ii)  every  reachable  element  Uj  in  (108)  is  present  in  R , by  induction  on  j;  and 

(iii)  termination  eventually  occurs,  because  step  R3  always  increases  |X|. 
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1°  implement  Algorithm  R we  will  assume  that  V = {0, 1 n - 1}.  with 

n < 64.  The  set  X is  conveniently  represented  by  the  integer  a(X)  — ]r{2“  | 
M G X},  and  the  same  convention  works  nicely  for  the  other  sets  Q , R,  and 
S^m].  Notice  that  the  bits  of  5[0],  5[1],  . . . , S[n-  1]  are  essentially  the  adjacency 
matrix  of  the  given  digraph,  as  explained  in  Section  7,  but  in  little-endian  order: 
The  “diagonal”  elements,  which  tell  us  whether  or  not  u G S[u],  go  from  right  to 
left.  For  example,  if  n = 3 and  the  arcs  are  {0->0,0-)-l,  l->0,2^0},  we  have 
S[0]  = (011)2  and  S[  1]  = 5[2]  = (001)2,  while  the  adjacency  matrix  is  (US). 

Step  R3  allows  us  to  choose  any  element  of  R\X,  so  we  use  the  ruler  function 
u <-  p{a{R)  - <t(X))  to  choose  the  smallest.  The  bitwise  operations  require  no 
further  trickery  when  we  adapt  the  algorithm  to  MMIX: 


Program  R ( Reachability ).  The  input  set  Q is  given  in  register  q,  and  each 
successor  set  S[u]  appears  in  octabyte  M8[suc  + 8u].  The  output  set  R will 
appear  in  register  r;  other  registers  s,  t,  tt,  u,  and  x hold  intermediate  results. 


01 

1H 

SET 

r.q 

1 

Rl.  Initialize,  r <r( 

02 

SET 

x,0 

1 

x <—  cr(0). 

03 

JMP 

2F 

1 

To  R2. 

04 

3H 

SUBU 

tt  ,t , 1 

|R| 

R3.  Examine  another 

05 

SADD 

u,tt,t 

1*1 

u «-  p(t)  [see  (46)]. 

06 

SLU 

s,u,3 

1*1 

s *—  8m. 

07 

LD0U 

s , sue  , s 

1*1 

s 4-  <t(5[m]). 

08 

ANDN 

tt,t,tt 

1*1 

tt  -f-  t &~tt  = 2“. 

09 

OR 

x,x,tt 

1*1 

X <—  X U m;  that  is,  x 

10 

OR 

r,r,s 

1*1 

R 4-  R U S[m[;  that  is, 

11 

2H 

SUBU 

t.r.x 

1*1  + 1 

R2.  Done?  t e-  r — x 

12 

PBNZ 

t,3B 

1*1  + 1 

To  R3  if  R ^ X.  | 

WJ. 


1. 


2“,  since  x = a(X). 
r | s,  since  r = a(R). 


The  total  running  time  is  (p  + 9n)|R|  + 7v.  By  contrast,  exercise  131  imple- 
ments Algorithm  R with  linked  lists;  the  overall  execution  time  then  grows  to 
(3S  + 4|R| -2\Q\  + l)M  + (55  + 12|R|  - 5|Q|  + 4)v,  where  5 = J2ueR  |S[u]|.  (But 
of  course  that  program  is  also  able  to  handle  graphs  with  millions  of  vertices.) 

Exercise  132  presents  another  instructive  algorithm  where  bitwise  operations 
work  nicely  on  not-too-large  graphs. 


Application  to  data  representation.  Computers  are  binary,  but  (alas?) 
the  world  isn’t.  We  often  must  find  a way  to  encode  nonbinary  data  into  Os 
and  Is.  One  of  the  most  common  problems  of  this  sort  is  to  choose  an  efficient 
representation  for  items  that  can  be  in  exactly  three  different  states. 

Suppose  we  know  that  x G {a,  b,  c},  and  we  want  to  represent  x by  two 
bits  xixr.  We  could,  for  example,  map  a 00,  b 01,  and  c .->■  10.  But  there 
are  many  other  possibilities  — in  fact,  4 choices  for  a,  then  3 choices  for  b,  and 
2 for  c,  making  24  altogether.  Some  of  these  mappings  might  be  much  easier  to 
deal  with  than  others,  depending  on  what  we  want  to  do  with  x. 

Given  two  elements  x,y  e {a,  b,  c},  we  typically  want  to  compute  z = xoy 
for  some  binary  operation  o.  If  a:  = xtxr  and  y = ylVr  then  * = Zlzr,  where 

zl  = fl(xi,xr,yi,yr)  and  zr  = fr(xu  xr,  yu  t/r);  (no) 
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these  Boolean  functions  /;  and  fr  of  four  variables  depend  on  o and  the  chosen 
representation.  We  seek  a representation  that  makes  ft  and  fr  easy  to  compute. 
Suppose,  for  example,  that  {a,b,c}  — { — 1,0, +1}  and  that  o is  multiplica- 


tion. If  we  decide  to  use  the  natural  mapping  x 1-4  x mod  3,  namely 

Oh^OO,  +1ha01,  -1h->10,  (111) 

so  that  x = xT  — xi,  then  the  truth  tables  for  /;  and  fr  are  respectively 

ft  44  000*001*010*****  and  fr  44  000*010*001*****.  (112) 

(There  are  seven  “don’t-cares,”  for  cases  where  xixr  = 11  and/or  yiyr  = 11.) 
The  methods  of  Section  7.1.2  tell  us  how  to  compute  z;  and  zr  optimally,  namely 

zi  = (xi®yt)  A(xr®yr),  zr  = (xt  © yr)  A (xr  0 yi)]  (113) 

unfortunately  the  functions  ft  and  fr  in  (112)  are  independent,  in  the  sense  that 
they  cannot  both  be  evaluated  in  fewer  than  C(fi)  + C(fr)  — 6 steps. 

On  the  other  hand  the  somewhat  less  natural  mapping  scheme 

+ I1-4OO,  OhaOI,  -1  h->  10  (114) 

leads  to  the  transformation  functions 

ft  44  001*000*100*****  and  fr  44  010*111*010*****,  (115) 

and  three  operations  now  suffice  to  do  the  desired  evaluation: 

zr  — xrV  yr,  zi  = (xi  ®yi)  Azr.  (116) 


Is  there  an  easy  way  to  discover  such  improvements?  Fortunately  we  don’t 
need  to  try  all  24  possibilities,  because  many  of  them  are  basically  alike.  For 
example,  the  mapping  x 1-4  xrxi  is  equivalent  to  1 4 xixr,  because  the  new 
representation  x\x'T  = xrxl  obtained  by  swapping  coordinates  makes 

fl(xh  Xr ' Vl  ■ y r ) — zl  zr  f r(x  1 • Xr  ,yi,yr  ) , 

the  new  transformation  functions  //  and  f'r  defined  by 
fi(xi,xr,yi,yr)  = fr(xr,xi,yr,yt),  f'r(xi,xr,yuyr)  = fi(xr,xuyr,yi)  (1x7) 

have  the  same  complexity  as  ft  and  fr.  Similarly  we  can  complement  a coordi- 
nate, letting  x\x'T  = xlxr ; then  the  transformation  functions  turn  out  to  be 

f l (xi,  Xr , yi , yr)  — fl(xi,xr,  yi,yr),  fri.'^li'^r^yhyr')  fr  (xi , xr , y/ , yr ) , (118) 

and  again  the  complexity  is  essentially  unchanged. 

Repeated  use  of  swapping  and/or  complementation  leads  to  eight  mappings 
that  are  equivalent  to  any  given  one.  So  the  24  possibilities  reduce  to  only  three, 
which  we  shall  call  classes  I,  II,  and  III: 


Class  I Class  II  Class  III 

, s ^ ~ ^ / * ^ 

a K4  00  01  10  11  00  10  01  11  00  01  10  11  00  10  01  11  00  01  10  11  00  10  01  11; 

b H4  01  00  11  10  10  00  11  01  01  00  11  10  10  00  11  01  11  10  01  00  11  01  10  00;  (119) 

c 1-4  10  11  00  01  01  11  00  10  11  10  01  00  11  01  10  00  01  00  11  10  10  00  11  01. 
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To  choose  a representation  we  need  consider  only  one  representative  of  each 
class.  For  example,  if  a = +1,  b = 0,  and  c = -1,  representation  (m)  belongs 
to  class  II,  and  (114)  belongs  to  class  I.  Class  III  turns  out  to  have  cost  3,  like 
class  I.  So  it  appears  that  representation  (114)  is  as  good  as  any,  with  z computed 
by  (116),  for  the  3-element  multiplication  problem  we’ve  been  studying. 

Appearances  can,  however,  be  deceiving,  because  we  need  not  map  {a,b,c} 
into  unique  two-bit  codes.  Consider  the  one-to-many  mapping 


+ 1 1 t 00,  0 i->  01  or  11,  —1  i-a  10, 


(120) 


where  both  01  and  11  are  allowed  as  representations  of  zero.  The  truth  tables 
for  /,  and  fr  are  now  quite  different  from  (112)  and  (115),  because  all  inputs  are 
legal  but  some  outputs  can  be  arbitrary: 

ft  <A  0*1*****1*0*****  and  fr  4r¥  0101111101011111.  (121) 

And  in  fact,  this  approach  needs  just  two  operations,  instead  of  the  three  in  (116): 


zl  — X(  © yi , zr  = xr\J  yr. 


(122) 


A moment’s  thought  shows  that  indeed,  these  operations  obviously  yield  the 
product  z = x-y  when  the  three  elements  { + 1,  0,  -1}  are  represented  as  in  (120). 

Such  nonunique  mappings  add  36  more  possibilities  to  the  24  that  we  had 
before.  But  again,  they  reduce  under  “2-cube  equivalence”  to  a small  number  of 
equivalence  classes.  First  there  are  three  classes  that  we  call  IVa,  IVb,  and  IVr. 
depending  on  which  element  has  an  ambiguous  representation: 


Class  IV0 


Class  IVi, 


Class  IV,, 


a 1— 1 0*  0*  1*  1*  *0  *0  *1  *1  11  10  01  00  11  01  10  00  10  11  00  01  01  11  00  10; 

^ 1 ^ 10  11  00  01  01  11  00  10  0*  0*  1*  1*  *0  *0  *1  *1  11  10  01  00  11  01  10  00;  (121) 

c ha  11  10  01  00  1101  10  00  10  1100  0101  1100  10  0*0*  1*  1*  *0*0*1  *L 

(Representation  (120)  belongs  to  class  IVb.  Classes  IVa  and  IVC  don’t  work  well 

for  z — x y .)  Then  there  are  three  further  classes  with  only  four  mappings  each: 


Class  V0 


Class  V(, 


Class  Vr 


a ha  tt  tt 

b ha  01  00 

c ha  10  11 


tt 

11 

00 


tt 

10 

01 


10 

tt 

01 


11 

tt 

00 


00 

tt 

11 


01  01  00 

tt  10  11 

10  tt  ti 


11 

00 

tt 


10; 

01; 

tt. 


(124) 


These  classes  are  a bit  of  a nuisance,  because  the  indeterminacy  in  their  truth 
tables  cannot  be  expressed  simply  in  terms  of  don’t-cares  as  we  did  in  (121).  For 
example,  if  we  try 

+1  ha  00  or  11,  0 ha  01,  -1  ha  10,  (125) 

which  is  the  first  mapping  in  class  Va,  there  are  binary  variables  pqrst  such  that 
/;  -fA  p01(/000010rls01<  and  fr  <a  plOgllllOlrOslOt.  (126) 
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Furthermore,  mappings  of  classes  Va,  Vj,,  and  Vc  almost  never  turn  out  to 
be  better  than  the  mappings  of  the  other  six  classes  (see  exercise  138).  Still, 
representatives  of  all  nine  classes  must  be  examined  before  we  can  be  sure  that 
an  optimal  mapping  has  been  found. 

In  practice  we  often  want  to  perform  several  different  operations  on  ternary- 
valued variables,  not  just  a single  operation  like  multiplication.  For  example,  we 
might  want  to  compute  max(x,y)  as  well  as  x-y.  With  representation  (120),  the 
best  we  can  do  is  zi  — x/  Ayt,  zr  — (xi  A yr)  V (xr  A (yi  V yr))\  but  the  “natural” 
mapping  (111)  now  shines,  with  zi  — xi  Ayi,  zr  = xr  V yr.  Class  III  turns  out 
to  have  cost  4;  other  classes  are  inferior.  To  choose  between  classes  II,  III,  and 
IV;,  in  this  case,  we  need  to  know  the  relative  frequencies  of  x-y  and  max(x,j/). 
And  if  we  add  min(x,  y)  to  the  mix,  classes  II,  III,  and  IVf,  compute  it  with  the 
respective  costs  2,  5,  5;  hence  (111)  looks  better  yet. 

The  ternary  max  and  min  operations  arise  also  in  other  contexts,  such  as  the 
three- valued  logic  developed  by  Jan  Lukasiewicz  in  1917.  [See  his  Selected  Works, 
edited  by  L.  Borkowski  (1970),  84-88,  153-178.]  Consider  the  logical  values 
“true,”  “false,”  and  “maybe,”  denoted  respectively  by  1,  0,  and  *.  Lukasiewicz 
defined  the  three  basic  operations  of  conjunction,  disjunction,  and  implication 
on  these  values  by  specifying  the  tables 


y 


0" 

* 

T 

f° 

0 

0 

0 

1 * 

0 

* 

* 

ll 

0 

* 

1 

X 

A 
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* 
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1 

li 
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1 

1 

X 
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* 

7 

f° 

1 

1 

1 

X < * 

* 

1 

1 

ll 

0 

* 

1 

X 

y 

(127) 


For  these  operations  the  methods  above  show  that  the  binary  representation 


Oh- 00,  * h 01,  1 H 11  (128) 

works  well,  because  we  can  compute  the  logical  operations  thus: 

xixrA  yiyr  = (xiAyi)(xrAyr),  xixr  V yiyr  - (xi\/yi){xrWyr),  ^ 

xixr=>yiyr  — ((x;  V yi)  A (xr  V yr))  (x;  V yT). 

Of  course  x need  not  be  an  isolated  ternary  value  in  this  discussion;  we  often 
want  to  deal  with  ternary  vectors  x = X1X2  . . .xn,  where  each  Xj  is  either  a,  b, 
or  c.  Such  ternary  vectors  are  conveniently  represented  by  two  binary  vectors 


X[  X\iX“2 1 . . . Xfii  and  Xr  — X\^X2r  • • • -^nri  (130) 

where  Xj  H XjiXjr  as  above.  We  could  also  pack  the  ternary  values  into  two-bit 
fields  of  a single  vector, 

X X\iX\rX2lX2r  • • • Xn[Xnr , (131) 

that  would  work  fine  if,  say,  we’re  doing  Lukasiewicz  logic  with  the  operations  A 
and  V but  not  =>.  Usually,  however,  the  two-vector  approach  of  (130)  is  better, 
because  it  lets  us  do  bitwise  calculations  without  shifting  and  masking. 
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Applications  to  data  structures.  Bitwise  operations  offer  many  efficient  ways 
to  represent  elements  of  data  and  the  relationships  between  them.  For  example 
chess-playing  programs  often  use  a “bit  board”  to  represent  the  positions  of 
pieces  (see  exercise  143). 

In  Chapter  8 we  shall  discuss  an  important  data  structure  developed  by 
Peter  van  Emde  Boas  for  representing  a dynamically  changing  subset  of  integers 
between  0 and  N.  Insertions,  deletions,  and  other  operations  such  as  “find  the 
largest  element  less  than  x”  can  be  done  in  0(log  log  N)  steps  with  his  methods; 
the  general  idea  is  to  organize  the  full  structure  recursively  as  y/N  substructures 
for  subsets  of  intervals  of  size  y/N , together  with  an  auxiliary  structure  that 
tells  which  of  those  intervals  are  occupied.  [See  Information  Processing  Letters 
6 (1977),  80-82;  also  P.  van  Emde  Boas,  R.  Kaas,  and  E.  Zijlstra,  Math.  Systems 
Theory  10  (1977),  99-127.]  Bitwise  operations  make  those  computations  fast. 

Hierarchical  data  can  sometimes  be  arranged  so  that  the  links  between 
elements  are  implicit  rather  than  explicit.  For  example,  we  studied  “heaps” 
m Section  5.2.3,  where  n elements  of  a sequential  array  implicitly  have  a binary 
tree  structure  like 


(0101) 

(Iooo)(iooIXIoIo)~ 


(!32) 


when,  say,  n = 10.  (Node  numbers  are  shown  here  both  in  decimal  and  binary 
notation.)  There  is  no  need  to  store  pointers  in  memory  to  relate  node  j of  a 
heap  to  its  parent  (which  is  node  j > 1 if  j j,  1),  0r  to  its  sibling  (which  is  node 
j © 1 if  j f 1),  or  to  its  children  (which  are  nodes  j - Cl  and  (j  <C  1)  + 1 if  those 
numbers  don’t  exceed  n),  because  a simple  calculation  leads  directly  from  j to 
any  desired  neighbor. 

Similarly,  a sideways  heap  provides  implicit  links  for  another  useful  family 
of  n-node  binary  tree  structures,  typified  by 


(!33) 


(OOlp)  (Olio) 

(00©@Il)(5l^^IT)(To^ 


(IqIqT 


when  n - 10.  (We  sometimes  need  to  go  beyond  n when  moving  from  a node  to 
its  parent,  as  in  the  path  from  10  to  12  to  8 shown  here.)  Heaps  and  sideways 
heaps  can  both  be  regarded  as  nodes  1 to  n of  infinite  binary  tree  structures: 
The  heap  with  n — oc  is  rooted  at  node  1 and  has  no  leaves;  by  contrast,  the 
sideways  heap  with  n = oo  has  infinitely  many  leaves  1,  3,  5,  ... , but  no  root(!). 

The  leaves  of  a sideways  heap  are  the  odd  numbers,  and  their  parents  are  the 
odd  multiples  of  2.  The  grandparents  of  leaves,  similarly,  are  the  odd  multiples 
of  4;  and  so  on.  Thus  the  ruler  function  pj  tells  how  high  node  j is  above  leaf  level. 

The  parent  of  node  j in  the  infinite  sideways  heap  is  easily  seen  to  be  node 


( j ~ k)  | (k  <C  1),  where  k = j & — j- 


(i34) 
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this  formula  rounds  j to  the  nearest  odd  multiple  of  2l+pP  And  the  children  are 

1)  and  j + (fc>  1)  (135) 

when  j is  even.  In  general  the  descendants  of  node  j form  a closed  interval 

\j-2”  + l ..j  + 2pi -1],  (136) 

arranged  as  a complete  binary  tree  of  21+pi  — 1 nodes.  (These  are  the  “inclusive” 
descendants,  including  j itself.)  The  ancestor  of  node  j at  height  h is  node 

O'  I (K  h))  & -(1  < h)  = ((j  > h)  | 1)  « h (137) 

when  h > pj.  Notice  that  the  symmetric  order  of  the  nodes,  also  called  inorder, 
is  just  the  natural  order  1,  2,  3,  ...  . 

Dov  Harel  noted  these  properties  in  his  Ph.D.  thesis  (U.  of  California,  Irvine, 
1980),  and  observed  that  the  nearest  common  ancestor  of  any  two  nodes  of  a 
sideways  heap  can  also  be  easily  calculated.  Indeed,  if  node  l is  the  nearest 
common  ancestor  of  nodes  i and  j,  where  i < j,  there  is  a remarkable  identity 

pi  = ma  x{px  | i < x < j}  = A (j  & -i),  (138) 

which  relates  the  p and  A functions.  (See  exercise  146.)  We  can  therefore  use 
formula  (137)  with  h = A (j  & —i)  to  calculate  l. 

Subtle  extensions  of  this  approach  lead  to  an  asymptotically  efficient  algo- 
rithm that  finds  nearest  common  ancestors  in  any  oriented  forest  whose  arcs 
grow  dynamically  [D.  Harel  and  R.  E.  Tarjan,  SICOMP  13  (1984),  338-355]. 
Baruch  Schieber  and  Uzi  Vishkin  [SICOMP  17  (1988),  1253-1262]  subsequently 
discovered  a much  simpler  way  to  compute  nearest  common  ancestors  in  an 
arbitrary  (but  fixed)  oriented  forest,  using  an  attractive  and  instructive  blend  of 
bitwise  and  algorithmic  techniques  that  we  shall  consider  next. 

Recall  that  an  oriented  forest  with  m trees  and  n vertices  is  an  acyclic 
digraph  with  n — m arcs.  There  is  at  most  one  arc  from  each  vertex;  the  vertices 
with  out-degree  zero  are  the  roots  of  the  trees.  We  say  that  v is  the  parent  of  u 
when  u — >v,  and  v is  an  (inclusive)  ancestor  of  u when  u — >*  v.  Two  vertices 
have  a common  ancestor  if  and  only  if  they  belong  to  the  same  tree.  Vertex  w 
is  called  the  nearest  common  ancestor  of  u and  v when  we  have 

u — >*  z and  v — >*  z if  and  only  if  w — f z.  (139) 

Schieber  and  Vishkin  preprocess  the  given  forest,  mapping  its  vertices  into 
a sideways  heap  5 of  size  n by  computing  three  quantities  for  each  vertex  v: 

ttv,  the  rank  of  v in  preorder  (1  < irv  < n); 

/3v,  a node  of  the  sideways  heap  S (1  < /3v  < n ); 
av , a (1  + An)-bit  routing  code  (1  < av  < 21+Xn). 

If  u — » v we  have  nu  > ttv  by  the  definition  of  preorder.  Node  /3v  is  defined  to 
be  the  nearest  common  ancestor  of  all  sideways-heap  nodes  nu  such  that  v is  an 
ancestor  of  vertex  u (always  meaning  an  inclusive  ancestor).  And  we  define 

= E{2P/J“I  v-+*w}.  (140) 


av 
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For  example,  here  s an  oriented  forest  with  ten  vertices  and  two  trees: 


Each  node  has  been  labeled  with  its  preorder  rank,  from  which  we  can  compute 
the  P and  a codes: 


v=ABCDEFGH  I J 
TTV  = 0001  1000  0010  0100  1001  0011  0101  0111  1010  0110 

/3v  = 0100  1000  0010  0100  1010  0011  0110  0111  1010  0110 

av  = 0100  1000  0110  0100  1010  0111  0110  0101  1010  0110 

Notice  that,  for  instance,  (3A  = 4 = 0100  because  the  preorder  ranks  of  the 

descendants  of  A are  {1, 2,  3, 4,  5, 6,  7}.  And  aH  = 0101  because  the  ancestors 
of  H have  p codes  {pH,pD,pA}  = {0111,0100}.  One  can  prove  without 
difficulty  that  the  mapping  v H > f3v  satisfies  the  following  key  properties: 
i)  If  u — t v in  the  forest,  then  f3u  is  a descendant  of  Pv  in  S. 
n)  If  several  vertices  have  the  same  value  of  Pv,  they  form  a path  in  the  forest. 
Property  (ii)  holds  because  exactly  one  child  u of  v has  /3u  = Pv  when  pv  / 7 tv. 

Now  let’s  imagine  placing  every  vertex  v of  the  forest  into  node  Pv  of  5: 


If  k vertices  map  into  node  j,  we  can  arrange  them  into  a path 


u0— t >vk_i— >vk,  where  Pv0  = pVl  = •■•  = pvk^x  = j.  (143) 

These  paths  are  illustrated  in  (142);  for  example,  J — >G — >D  is  a path  in  (141), 
and  0— appears  with  node  0110  = pj  = pG. 

The  preprocessing  algorithm  also  computes  a table  tj  for  all  nodes  j of  S, 
containing  pointers  to  the  vertices  vk  at  the  tail  ends  of  (143): 

j = 0001  0010  0011  0100  0101  0110  0111  1000  1001  1010 
Tj—  AACAADDAAB 

Exercise  149  shows  that  all  four  tables  ttv,  Pv,  av,  and  rj  can  be  prepared  in 
0(n)  steps.  And  once  those  tables  are  ready,  they  contain  just  enough  informa- 
tion to  identify  the  nearest  common  ancestor  of  any  two  given  vertices  quickly: 

Algorithm  V ( Nearest  common  ancestors).  Suppose  ttv,  Pv,  av,  and  rj  are 
known  lor  all  n vertices  v of  an  oriented  forest,  and  for  1 < j < n.  A dummy 
vertex  A is  also  assumed  to  be  present,  with  tt  A = /3A  = a A - 0.  This  algorithm 
computes  the  nearest  common  ancestor  z of  any  given  vertices  x and  y,  returning 
z = A if  x and  y belong  to  different  trees.  We  assume  that  the  values  Xj  = [lgjj 
have  been  precomputed  for  1 < j < n,  and  that  A0  = An. 
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VI.  [Find  common  height.]  If  (3x  < f3y,  set  h 4-  A (J5y  k —fix);  otherwise  set 
h 4—  \(/3x  k —/3y).  (See  (138).) 

V2.  [Find  true  height.]  Set  k 4—  ax  k ay  k —(1  <C  h),  then  h 4—  A (k  k —k). 

V3.  [Find  f3z.]  Set  j 4—  ((13 x » h)  | 1)  <C  h.  (Now  j — f3z,  if  z ^ A.) 

V4.  [Find  x and  y.]  (We  now  seek  the  lowest  ancestors  of  x and  y in  node  j.) 
If  j = (3x,  set  x 4—  X]  otherwise  set  l 4—  \(ax  k ((1  -C  h)  — 1))  and  a:  4— 
t(((/3x  l)  | 1)  <C  /)•  Similarly,  if  j — j3y,  set  y 4—  y;  otherwise  set  l 4— 
A (ay  k ((1  <C  h)  - 1))  and  y 4-  r((((3y  > /)  | 1)  < l). 

V5.  [Find  z.]  Set  z <—  x if  ttx  < ny,  otherwise  z 4—  y.  | 

These  artful  dodges  obviously  exploit  (137);  exercise  152  explains  why  they  work. 


Sideways  heaps  can  also  be  used  to  implement  an  interesting  type  of  priority 
queue  that  J.  Katajainen  and  F.  Vitale  call  a “navigation  pile,”  illustrated  here 
for  n — 10: 


1 3 5 7 9 11  13  15  17  19 


(i44) 


Data  elements  go  into  the  leaf  positions  1,  3,  . . . , 2n  — 1 of  the  sideways  heap; 
they  can  be  many  bits  wide,  and  they  can  appear  in  any  order.  By  contrast,  each 
branch  position  2,  4,  6,  . . . contains  a pointer  to  its  largest  descendant.  And  the 
novel  point  is  that  these  pointers  take  up  almost  no  extra  space  — fewer  than  two 
bits  per  item  of  data,  on  average  — because  only  one  bit  is  needed  for  pointers  2, 
6,  10,  . . . , only  two  bits  for  pointers  4,  12,  20,  . . . , and  only  pj  for  pointer  j in 
general.  (See  exercise  153.)  Thus  the  navigation  pile  requires  very  little  memory, 
and  it  behaves  nicely  with  respect  to  cache  performance  on  a typical  computer. 
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Fig.  13.  Two  views  of  five  lines 
in  the  hyperbolic  plane. 


*Cells  in  the  hyperbolic  plane.  Hyperbolic  geometry  suggests  an  instructive 
implicit  data  structure  that  has  a rather  different  flavor.  The  hyperbolic  plane  is 
a fascinating  example  of  non-Euclidean  geometry  that  is  conveniently  viewed  by 
projecting  its  points  into  the  interior  of  a circle.  Its  straight  lines  then  become 
circular  arcs,  which  meet  the  rim  at  right  angles.  For  example,  the  lines  PP', 
QQ',  and  RR'  in  Fig.  13  intersect  at  points  O,  A,  B , and  those  points  form  a 
triangle.  Lines  SQ1  and  QQ1  are  parallel:  They  never  touch,  but  their  points 
get  closer  and  closer  together.  Line  QT  is  also  parallel  to  QQ' . 
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We  get  different  views  by  focusing  on  different  center  points.  For  example, 
the  second  view  in  Fig.  13  puts  O smack  in  the  center.  Notice  that  if  a line  passes 
through  the  very  center,  it  remains  straight  after  being  projected;  such  diameter- 
spanning  chords  are  the  special  case  of  a “circular  arc”  whose  radius  is  infinite. 

Most  of  Euclid’s  axioms  for  plane  geometry  remain  valid  in  the  hyperbolic 
plane.  For  example,  exactly  one  line  passes  through  any  two  distinct  points;  and 
if  point  A lies  on  line  PP'  there’s  exactly  one  line  QQ'  such  that  angle  PAQ  has 
any  given  value  9,  for  0 < 9 < 180°.  But  Euclid’s  famous  fifth  postulate  does  not 
hold:  If  point  C is  not  on  line  QQ' , there  always  are  exactly  two  lines  through  C 
that  are  parallel  to  QQ' . Furthermore  there  are  many  pairs  of  lines,  like  RR ’ 
and  SQ'  in  Fig.  13,  that  are  totally  disjoint  or  ultraparallel , in  the  sense  that 
their  points  never  become  arbitrarily  close.  [These  properties  of  the  hyperbolic 
plane  were  discovered  by  G.  Saccheri  in  the  early  1700s,  and  made  rigorous  by 
N.  I.  Lobachevsky,  J.  Bolyai,  and  C.  F.  Gauss  a century  later.] 

Quantitatively  speaking,  when  points  are  projected  onto  the  unit  disk  |z|  < 1, 
the  arc  that  meets  the  circle  at  eie  and  e~iB  has  center  at  sec  9 and  radius 
tan  9.  The  actual  distance  between  two  points  whose  projections  are  z and  z'  is 
d(z,z')  = ln( 1 1 — zz'\  + \z  — z'\)  — ln(|l  — zz'\  — \z  — z'\).  Thus  objects  far  from 
the  center  appear  dramatically  shrunken  when  we  see  them  near  the  circle’s  rim. 

The  sum  of  the  angles  of  a hyperbolic  triangle  is  always  less  than  180°.  For 
example,  the  angles  at  O,  A,  and  B in  Fig.  13  are  respectively  90°,  45°,  and  36°. 
Ten  such  36°-45°-90°  triangles  can  be  placed  together  to  make  a regular  pentagon 
with  90°  angles  at  each  corner.  And  four  such  pentagons  fit  snugly  together  at 
their  corners,  allowing  us  to  tile  the  entire  hyperbolic  plane  with  right  regular 
pentagons  (see  Fig.  14).  The  edges  of  these  pentagons  form  an  interesting  family 
of  lines,  every  two  of  which  are  either  ultraparallel  or  perpendicular;  so  we  have 
a grid  structure  analogous  to  the  unit  squares  of  the  ordinary  plane.  We  call  it 
the  pentagrid,  because  each  cell  now  has  five  neighbors  instead  of  four. 

There’s  a nice  way  to  navigate  in  the  pentagrid  using  Fibonacci  numbers, 
based  on  ideas  of  Maurice  Margenstern  [see  F.  Herrmann  and  M.  Margenstern, 
Theoretical  Comp.  Sci.  296  (2003),  345-351],  Instead  of  the  ordinary  Fibonacci 
sequence  (Fn),  however,  we  shall  use  the  negaFibonacci  numbers  (i7_n),  namely 

F-1  = !.  F—2  = -1,  3 = 2,  F_ 4 = -3,  . . . , F_n  = {-l)n~lFn.  (145) 

Exercise  1.2.8-34  introduced  the  Fibonacci  number  system,  in  which  every  non- 
negative integer  x can  be  written  uniquely  in  the  form 

x = Fkx  + Fk2  H b Fkr,  where  kx  k2  ■ ■ ■ »-  kr  yy  0;  (146) 

here  j »-  k means  lj  > fc  + 2’.  But  there’s  also  a negaFibonacci  number  system, 
which  suits  our  purposes  better:  Every  integer  x,  whether  positive,  negative,  or 
zero,  can  be  written  uniquely  in  the  form 

x = Fkl  + Fk2  H b Fkr,  where  Aq  k2  kr  1.  (147) 

For  example,  4 = 5 - 1 = F_5  + F_2  and  -2  = -3  + 1 = F_4  + F_  1.  This 
representation  can  conveniently  be  expressed  as  a binary  code  a = . . . a3a2ai, 
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Fig.  14.  The  pentagrid, 
in  which  identical  pentagons 
tile  the  hyperbolic  plane. 


A circular  regular  tiling,  confined  on  all  sides 
by  infinitely  small  shapes,  is  really  wonderful. 


— M.  C.  ESCHER,  letter  to  George  Escher  (9  November  1958) 


standing  for  N(a)  — ^2k  akF-k,  with  no  two  Is  in  a row.  For  example,  here  are 
the  negaFibonacci  representation  codes  of  all  integers  between  —14  and  +15: 


-14  = 10010100 

-8  = 100000 

-2  = 1001 

4 = 10010 

10  = 1001000 

-13  = 10010101 

-7  = 100001 

-1  = 10 

5 = 10000 

11  = 1001001 

-12  = 101010 

-6  = 100100 

0 = 0 

6 = 10001 

12  = 1000010 

-11  = 101000 

-5  = 100101 

1 = 1 

7 = 10100 

13  = 1000000 

-10  = 101001 

-4  = 1010 

2 = 100 

8 = 10101 

14  = 1000001 

-9  = 100010 

-3  = 1000 

3 = 101 

9 = 1001010 

15  = 1000100 

As  in  the  negadecimal  system  (see 

4.1-(6)  and  (7)),  we  can 

tell  whether  x is 

negative  or  not  by  seeing  if  its  representation  has 

an  even  or  odd  number  of  digits. 

The  predecessor  a—  and  successor  a+  of  any  negaFibonacci  binary  code  a 
can  be  computed  recursively  by  using  the  rules 

(c*01)-  = a00,  (aOOO)—  = aOlO,  (alOO)-  = aOOl,  (alO)- = (a-)01, 
(alO)+  = aOO,  (a00)+  = aOl,  (al)+  = (a— )0.  (148) 

(See  exercise  157.)  But  ten  elegant  2-adic  steps  do  the  calculation  directly: 

y x ® pQ,  zf-y®(y±  1),  where  x — (a)2; 
z 4-  2 | (x&  (2  « 1));  (149) 

w f—  x © 2 © ((2  + 1)  2);  then  w = (a±)2- 

We  just  use  y—  1 in  the  top  line  to  get  the  predecessor,  y + 1 to  get  the  successor. 
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And  now  here’s  the  point:  A negaFibonacci  code  can  be  assigned  to  each 
cell  of  the  pentagrid  in  such  a way  that  the  codes  of  its  five  neighbors  are  easy  to 
compute.  Let’s  call  the  neighbors  n,  s,  e,  w,  and  o,  for  “north,”  “south,”  “east,” 
“west,”  and  “other.”  If  a is  the  code  assigned  to  a given  cell,  we  define 

an  = a^>  2,  a,  = (t<2,  ae  — as+,  aw  = as  — ; (150) 

thus  asn  = a,  and  also  aen  = (a01)n  = a.  The  “other”  direction  is  trickier: 

_ J <*„+,  if  a & 1 = 1; 

if  a & 1 — 0.  (,5l) 

For  example,  1000o  = 101001  and  101001o  = 1000.  This  mysterious  interloper 
lies  between  north  and  east  when  ce  ends  with  1,  but  between  north  and  west 
when  a ends  with  0. 

If  we  choose  any  cell  and  label  it  with  code  0,  and  if  we  also  choose  an 
orientation  so  that  its  neighbors  are  n,  e,  s,  w , and  o in  clockwise  order,  rules 
(150)  and  (151)  will  assign  consistent  labels  to  every  cell  of  the  pentagrid.  (See 
exercise  160.)  For  example,  the  vicinity  of  a cell  labeled  1000  will  look  like  this: 


(152) 


The  code  labels  do  not,  however,  identify  cells  uniquely,  because  infinitely 
many  cells  receive  the  same  label.  (Indeed,  we  clearly  have  0n  = 0S  = 0 and 
= lo  = 1-)  To  get  a unique  identifier,  we  attach  a second  coordinate  so  that 
each  cell’s  full  name  has  the  form  (a,  y),  where  y is  an  integer.  When  y is  constant 
and  a ranges  over  all  negaFibonacci  codes,  the  cells  (a,  y)  form  a more-or-less 
hook-shaped  strip  whose  edges  take  a 90°  turn  next  to  cell  (0,  y).  In  general,  the 
five  neighbors  of  (a,y)  are  (a,y)n  = (an,  y + Sn(a)),  ( a,y)t  = (a„y  + 6„(a)), 
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(a,y)e  - ( ae,y  + 5e(a )),  (a,  y)w  = (aw,y  + 5w(a)),  and  ( a,y)0  = (a0,y  + S0(a)), 
where 

(5„(a)  = [a  = 0],  <5s(a)  = -[a  = 0],  <5e(a)  = 0,  Sw(a)  = -[a  = 1]; 

d-  in\  _ f sign(ao  - an)[a0  & an  = 0],  if  ex  & 1 = 1; 

\ sign(a0  — aw)[a0  & aw  — 0],  if  a & 1 = 0. 

(See  the  illustration  below.)  Bitwise  operations  now  allow  us  to  surf  the  entire 
hyperbolic  plane  with  ease.  On  the  other  hand,  we  could  also  ignore  the  y 
coordinates  as  we  move,  thereby  wrapping  around  a “hyperbolic  cylinder”  of 
pentagons;  the  a coordinates  define  an  interesting  multigraph  on  the  set  of  all 
negaFibonacci  codes,  in  which  every  vertex  has  degree  5. 


(100001,1) 

(100100,1) 

(100101,1) 

(1010,0) 

(1000,0) 

(1001,0) 

(10,-1) 


(154) 


Bitmap  graphics.  It’s  fun  to  write  programs  that  deal  with  pictures  and  shapes, 
because  they  involve  our  left  and  right  brains  simultaneously.  When  image  data 
is  involved,  the  results  can  be  engrossing  even  if  there  are  bugs  in  our  code. 

The  book  you  are  now  reading  was  typeset  by  software  that  treated  each 
page  as  a gigantic  matrix  of  0s  and  Is,  called  a “raster”  or  “bitmap,”  containing 
millions  of  square  picture  elements  called  “pixels.”  The  rasters  were  transmitted 
to  printing  machines,  causing  tiny  dots  of  ink  to  be  placed  wherever  a 1 appeared 
in  the  matrix.  Physical  properties  of  ink  and  paper  caused  those  small  clusters 
of  dots  to  look  like  smooth  curves;  but  each  pixel’s  basic  squareness  becomes 
evident  if  we  enlarge  the  images  tenfold,  as  in  the  letter  ‘A’  shown  in  Fig.  15(a). 

With  bitwise  operations  we  can  achieve  special  effects  like  “custering,”  in 
which  black  pixels  disappear  when  they’re  surrounded  on  all  sides  (Fig.  15(b)): 


Fig.  15.  The  letter  A, 
before  and  after  custering. 
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This  operation,  introduced  by  R.  A.  Kirsch,  L.  Cahn,  C.  Ray,  and  G.  H.  Urban 
[Proc.  Eastern  Joint  Computer  Conf.  12  (1957),  221-229],  can  be  expressed  as 

custer(X)  = I&~((AV1)&(I»1)&(J«1)&(I^1)),  (155) 

where  ‘XVI’  and  ‘X^l’  stand  respectively  for  the  result  of  shifting  the  bitmap  A' 
down  or  up  by  one  row.  Let  us  write 


XN  = X VL  Xw  = X > 1,  XE  = X « 1,  Xs=Xf\l  (i56) 

for  the  1-pixel  shifts  of  a bitmap  A' . Then,  for  example,  the  symbolic  expression 
An  & (Xs  | XE)  evaluates  to  1 in  those  pixel  positions  whose  northern  neighbor 
is  black,  and  which  also  have  either  a black  neighbor  on  the  south  side  or  a white 
neighbor  to  the  east.  With  these  abbreviations,  (155)  takes  the  form 


custer(A)  = A & ~(XN  & Xw  & XE  & Xs),  (157) 

which  can  also  be  expressed  as  A & (XN  | Aw  | XE  | As). 

Every  pixel  has  four  “rook-neighbors,”  with  which  it  shares  an  edge  at  the 
top,  left,  right,  or  bottom.  It  also  has  eight  “king-neighbors,”  with  which  it 
shares  at  least  one  corner  point.  For  example,  the  king-neighbors  that  lie  to  the 
northeast  of  all  pixels  in  a bitmap  A can  be  denoted  by  XNE,  which  is  equivalent 
to  (XN)E  in  pixel  algebra.  Notice  that  we  also  have  XNE  = (XE)N. 

A 3 x 3 cellular  automaton  is  an  array  of  pixels  that  changes  dynamically 
via  a sequence  of  local  transformations,  all  performed  simultaneously:  The  state 
of  each  pixel  at  time  t + 1 depends  entirely  on  its  state  at  time  t and  the  states 
of  its  king-neighbors  at  that  time.  Thus  the  automaton  defines  a sequence  of 
bitmaps  A(°),  X(1>,  X(2\  . . . that  lead  from  any  given  initial  state  X(0\  where 


v-^)  y(t) 

aNE  5 W 5 


it) 


X {t+1}  = /(414i),I”',A-,A"\A”',AS,Asw,A^J)  (158) 

and  / is  any  bitwise  Boolean  function  of  nine  variables.  Fascinating  patterns 
often  emerge  in  this  way.  For  example,  after  Martin  Gardner  introduced  John 
Conway’s  game  of  Life  to  the  world  in  1970,  more  computer  time  was  probably 
devoted  to  studying  its  implications  than  to  any  other  computational  task  during 
the  next  several  years  although  the  people  paying  the  computer  bills  were 
rarely  told!  (See  exercise  167.) 

There  are  2512  Boolean  functions  of  nine  variables,  so  there  are  2512  different 
3x3  cellular  automata.  Many  of  them  are  trivial,  but  most  of  them  probably 
have  such  complicated  behavior  that  they  are  humanly  impossible  to  understand. 
Fortunately  there  also  are  many  cases  that  do  turn  out  to  be  useful  in  practice  — 
and  much  easier  to  justify  on  economic  grounds  than  the  simulation  of  a game. 

For  example,  algorithms  for  recognizing  alphabetic  characters,  fingerprints, 
or  similar  patterns  often  make  use  of  a “thinning”  process,  which  removes  excess 
black  pixels  and  reduces  each  component  of  the  image  to  an  underlying  skeleton 
that  is  comparatively  simple  to  analyze.  Several  authors  have  proposed  cellular 
automata  for  this  problem,  beginning  with  D.  Rutovitz  [J.  Royal  Stat.  Society 
A129  (1966),  512  -513]  who  suggested  a 4 x 4 scheme.  But  parallel  algorithms 
are  notoriously  subtle,  and  flaws  tended  to  turn  up  after  various  methods  had 
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■ Fig.  16.  Example 

\ results  of  Guo  and 

\ Hall’s  3x3  autom- 

^ aton  for  thinning 

the  components  of  a 
^ bitmap.  (“Hollow” 

\ pixels  were  origi- 

nally  black.) 


been  published.  For  example,  at  least  two  of  the  black  pixels  in  a component  like 
HI  should  be  removed,  yet  a symmetrical  scheme  will  erroneously  erase  all  four. 

A satisfactory  solution  to  the  thinning  problem  was  finally  found  by  Z.  Guo 
and  R.  W.  Hall  [CACM  32  (1989),  359-373,  759],  using  a 3 x 3 automaton  that 
invokes  alternate  rules  on  odd  and  even  steps.  Consider  the  function 

/ (%w > i *^ne ■ xw,x,  xB,xsw , xs , xSE)  = x A—ig(xjiw i ■ xw,xE, . . . , xSE),  (159) 

where  g — 1 only  in  the  following  37  configurations  surrounding  a black  pixel: 

Then  we  use  (158),  but  with  f(xNW,xN,xNB,xw,x,xE,xsw,xs,xSE)  replaced  by 
its  180°  rotation  f(xSE,  xs,  xsw,  xE,  x,  xw,  xNE,  xN,  xNW)  on  even-numbered  steps. 
The  process  stops  when  two  consecutive  cycles  make  no  change. 

With  this  rule  Guo  and  Hall  proved  that  the  3x3  automaton  will  preserve 
the  connectivity  structure  of  the  image,  in  a strong  sense  that  we  will  discuss 
below.  Furthermore  their  algorithm  obviously  leaves  an  image  intact  if  it  is 
already  so  thin  that  it  contains  no  three  pixels  that  are  king-neighbors  of  each 
other.  On  the  other  hand  it  usually  succeeds  in  “removing  the  meat  off  the 
bones”  of  each  black  component,  as  shown  in  Fig.  16.  Slightly  thinner  thinning 
is  obtained  in  certain  cases  if  we  add  four  additional  configurations 

lift  (160) 

to  the  37  listed  above.  In  either  case  the  function  g can  be  evaluated  with  a 
Boolean  chain  of  length  25.  (See  exercises  170-172.) 

In  general,  the  black  pixels  of  an  image  can  be  grouped  into  segments  or 
components  that  are  kingwise  connected , in  the  sense  that  any  black  pixel  can 
be  reached  from  any  other  pixel  of  its  component  by  a sequence  of  king  moves 
through  black  pixels.  The  white  pixels  also  form  components,  which  are  rookwise 
connected:  Any  two  white  cells  of  a component  are  mutually  reachable  via  rook 
moves  that  touch  nothing  black.  It’s  best  to  use  different  kinds  of  connectedness 
for  white  and  black,  in  order  to  preserve  the  topological  concepts  of  “inside”  and 
“outside”  that  are  familiar  from  continuous  geometry  [see  A.  Rosenfeld,  JACM 
17  (1970),  146-160].  If  we  imagine  that  the  corner  points  of  a raster  are  black, 
an  infinitely  thin  black  curve  can  cross  between  pixels  at  a corner,  but  a white 
curve  cannot.  (We  could  also  imagine  white  corner  points,  which  would  lead  to 
rookwise  connectivity  for  black  and  kingwise  connectivity  for  white.) 
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time  = 0 


time  = 1 


time  = 3 


(b)  (c) 

Fig.  17.  The  shrinking  of  a Cheshire  cat 

An  amusing  algorithm  for  shrinking  a picture  while  preserving  its  connec- 
tivity, except  that  isolated  black  or  white  pixels  disappear,  was  presented  by 
S.  Levialdi  in  CACM  15  (1972),  7-10;  an  equivalent  algorithm,  but  with  black 
and  white  reversed,  had  also  appeared  in  T.  Beyer’s  Ph.D.  thesis  (M.I.T.,  1969). 
The  idea  is  to  use  a cellular  automaton  with  the  simple  transition  function 

f{x nw,xn,xne,xw,x,xe,xsw,xs,xse)  = (iA  (xw\/xsw\/xs))  V ( xwAxs ) (161) 

at  each  step.  This  formula  is  actually  a 2 x 2 rule,  but  we  still  need  a 3 x 3 window 
if  we  want  to  keep  track  of  the  cases  when  a one-pixel  component  goes  away. 

For  example,  the  25  x 30  picture  of  a Cheshire  cat  in  Fig.  17(a)  has  seven 
kingwise  black  components:  the  outline  of  its  head,  the  two  earholes,  the  two 
eyes,  the  nose,  and  the  smile.  The  result  after  one  application  of  (161)  is  shown 
m Fig-  17(b):  Seven  components  remain,  but  there’s  an  isolated  point  in  one  ear, 
and  the  other  earhole  will  become  isolated  after  the  next  step.  Hence  Fig.  17(c) 
has  only  five  components.  After  six  steps  the  cat  loses  its  nose,  and  even  the 
smile  will  be  gone  at  time  14.  Sadly,  the  last  bit  of  cat  will  vanish  during  step  46. 

At  most  M + N — 1 transitions  will  wipe  out  any  M x N picture,  because 
the  lowest  visible  northwest-to-southeast  diagonal  line  moves  relentlessly  upward 
each  time.  Exercises  176  and  177  prove  that  different  components  will  never 
merge  together  and  interfere  with  each  other. 

Of  course  this  cubic-time  cellular  method  isn’t  the  fastest  way  to  count  or 
identify  the  components  of  a picture.  We  can  actually  do  that  job  “online,” 
while  looking  at  a large  image  one  row  at  a time,  not  bothering  to  keep  all  of 
the  previously  seen  rows  in  memory  if  we  don’t  wish  to  look  at  them  again. 

While  we’re  analyzing  the  components  we  might  as  well  also  record  the 
relationships  between  them.  Let  s assume  that  only  finitely  many  black  pixels 
are  present.  Then  there’s  an  infinite  component  of  white  pixels  called  the 
background.  Black  components  adjacent  to  the  background  constitute  the  main 
objects  of  the  image.  And  these  objects  may  in  turn  have  holes,  which  may  serve 
as  a background  for  another  level  of  objects,  and  so  on.  Thus  the  connected 
components  of  any  finite  picture  form  a hierarchy  — an  oriented  tree,  rooted  at 
the  background.  Black  components  appear  at  the  odd-numbered  levels  of  this 
tree,  and  white  components  at  the  even-numbered  levels,  alternating  between 
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time  = 5 time  = 10  time  = 20 


by  repeated  application  of  Levialdi’s  transformation. 


kingwise  and  rookwise  connectedness.  Each  component  except  the  background  is 
surrounded  by  its  parent.  Childless  components  are  said  to  be  simply  connected. 

For  example,  here  are  the  Cheshire  cat's  components,  labeled  with  digits  for 
white  pixels  and  letters  for  the  black  ones,  and  the  corresponding  oriented  tree: 
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(162) 


During  the  shrinking  process  of  Fig.  17,  components  disappear  in  the  order 
©■  {©,©,©}  (all  at  time  3),  © ©.  ©.  ©.  (f),  ©. 

Suppose  we  want  to  analyze  the  components  of  such  a picture  by  reading 
one  row  at  a time.  After  we’ve  seen  four  rows  the  result-so-far  will  be 


0000000000000000000000  oho ooooo 
0 0 0 oraaao 0 0 0 0 0 0 0 0 0 0 0 0 0 ohHo 0 0 0 0 0 
0 0 0 0010110  0 0 0 0000000  0 002200  0 0 0 0 
0 0 001  1 llano  0 Q[IM«MIWII0  002  2H0  oooo 


U63) 


and  we’ll  be  ready  to  scan  row  five.  A comparison  of  rows  four  and  five  will 
then  show  that  © and  © should  merge  into  ©,  but  that  new  components  © 
and  (3)  should  also  be  launched.  Exercise  179  contains  full  details  about  an 
instructive  algorithm  that  properly  updates  the  current  tree  as  new  rows  are 
input.  Additional  information  can  also  be  computed  on  the  fly:  For  example,  we 
could  determine  the  area  of  each  component,  the  locations  of  its  first  and  last 
pixels,  the  smallest  enclosing  rectangle,  and/or  its  center  of  gravity. 
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Filling.  Let  s complete  our  quick  tour  of  raster  graphics  by  considering  how 
to  fill  regions  that  are  bounded  by  straight  lines  and/or  simple  curves.  Particu- 
larly efficient  algorithms  are  available  when  the  curves  are  built  up  from  “conic 
sections  circles,  ellipses,  parabolas,  or  hyperbolas,  as  in  classical  geometry. 

In  keeping  with  geometric  tradition,  we  shall  adopt  Cartesian  coordinates 
{x,y)  in  the  following  discussion,  instead  of  speaking  about  rows  or  columns 
of  pixels.  An  increase  of  x will  signify  a move  to  the  right,  while  an  increase 
of  y will  move  upward.  More  significantly,  we  will  focus  on  the  edges  between 
square  pixels,  instead  of  on  the  pixels  themselves.  Edges  run  between  integer 
points  (x,y)  and  (x',y')  of  the  plane  when  \x  - x'\  + | y - y'\  — Each  pixel 

is  bounded  by  the  four  edges  (x,y)  — (x-l,y)  — (x-l,y-l)  — (x,y-l) 

(x,  y).  Experience  has  shown  that  algorithms  for  filling  contours  become  simpler 
and  faster  when  we  concentrate  on  the  edge  transitions  between  white  and  black, 
instead  of  on  the  black  pixels  of  a custerized  boundary.  (See,  for  example,  the 
discussion  by  B.  D.  Ackland  and  N.  Weste  in  IEEE  Trans.  C-30  (1981),  41-48.) 

Consider  a continuous  curve  z(t ) = (x(t),y(t))  that  is  traced  out  as  t varies 
from  0 to  1.  We  assume  that  the  curve  doesn’t  intersect  itself  for  0 < t < 1,  and 
that  ^(O)  = 2(1).  The  famous  Jordan  curve  theorem  [C.  Jordan,  Corns  d analyse 
3 (1887),  587-594;  O.  Veblen,  Trans.  Amer.  Math.  Soc.  6 (1905),  83-98]  states 
that  every  such  curve  divides  the  plane  into  two  regions,  called  the  inside  and 
the  outside.  We  can  “digitize”  z(t)  by  forcing  it  to  travel  along  edges  between 
pixels;  then  we  obtain  an  approximation  in  which  the  inside  pixels  are  black  and 
the  outside  pixels  are  white.  This  digitization  process  essentially  replaces  the 
original  curve  by  the  sequence  of  integer  points 

round(z(t))  = (|®(t)  + |J,  Ll/(t)  + |J),  for  0 < t < 1.  (164) 

The  curve  can  be  perturbed  slightly,  if  necessary,  so  that  z(t)  never  passes  exactly 
through  the  center  of  a pixel.  Then  the  digitized  curve  takes  discrete  steps  along 
pixel  edges  as  t grows;  and  a pixel  lies  inside  the  digitization  if  and  only  if  its 
center  lies  inside  the  original  continuous  curve  { z(t ) | 0 < t < 1}. 

For  example,  the  equations  x(t)  = 20cos2nt  and  y(t)  = 10sin27rt  define  an 
ellipse.  Its  digitization,  round(z(t)),  starts  at  (20,0)  when  t = 0,  then  jumps  to 

(20. 1)  when  t k,  .008  and  10  sin  2nt  = 0.5.  Then  it  proceeds  to  the  points  (20,  2), 

(19.2) ,  (19,3),  (19,4),  (18,4),  ...,  (20,-1),  (20,0),  as  t increases  through  the 
values  .024,  .036,  .040,  .057,  .062,  . . . , .976,  .992: 


(165) 


The  horizontal  edges  of  such  a boundary  are  conveniently  represented  by  bit 
vectors  H(y)  for  each?/;  for  example,  H(  10)  = . . . 000000111111111111000000 
and  H{ 9)  = ...011111000000000000111110...  in  (165).  If  the  ellipse  is  filled 
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with  black  to  obtain  a bitmap  B,  the  H vectors  mark  transitions  between  black 
and  white,  so  we  have  the  symbolic  relation 

H — B (&  (B 1).  (1-66) 

Conversely,  it’s  easy  to  obtain  B when  the  H vectors  are  given: 

B(y)  = H(y  max  ) © ^(j/max-l)  © • ' • © H(y  + 1) 

= H(ymin)  © H(ymin+i)  ® ■ • • ®H(y).  (167) 

Notice  that  H (ymm)  © H (j/min+i)  © • • - ®H(ymax)  is  the  zero  vector,  because  each 
bitmap  is  white  at  both  top  and  bottom.  Notice  further  that  the  analogous  verti- 
cal edge  vectors  V(x)  are  redundant:  They  satisfy  the  formulas  V — B®{B< Cl) 
and  B — 17®  (see  exercise  36),  but  we  need  not  bother  to  keep  track  of  them. 

Conic  sections  are  easier  to  deal  with  than  most  other  curves,  because  we 
can  readily  eliminate  the  parameter  t.  For  example,  the  ellipse  that  led  to  (165) 
can  be  defined  by  the  equation  (x/20)2  + {y/ 10)2  = 1,  instead  of  using  sines 
and  cosines.  Therefore  pixel  (x,  y)  should  be  black  if  and  only  if  its  center  point 
(a '—\,y—\)  lies  inside  the  ellipse,  if  and  only  if  (x  — i)2/400-F(y— 1)2/100—  1 < 0. 

In  general,  every  conic  section  is  the  set  of  points  for  which  F(x,y)  = 0, 
when  F is  an  appropriate  quadratic  form.  Therefore  there’s  a quadratic  form 

Q{x,y)  = F(x-\,y-\ ) = ax2  + bxy  + cy2  + dx  + ey  + f (168) 

that  is  negative  at  the  integer  point  (x,y)  if  and  only  if  pixel  (x,y)  lies  on  a 
given  side  of  the  digitized  curve. 

For  practical  purposes  we  may  assume  that  the  coefficients  (a,  b, . . . , f)  of  Q 
are  not-too-large  integers.  Then  we’re  in  luck,  because  the  exact  value  of  Q{x,  y) 
is  easy  to  compute.  In  fact,  as  pointed  out  by  M.  L.  V.  Pitteway  [Comp.  J. 
10  (1967),  282-289],  there’s  a nice  “three-register  algorithm”  by  which  we  can 
quickly  track  the  boundary  points:  Let  x and  y be  integers,  and  suppose  we’ve  got 
the  values  of  Q(x,  y),  Qx(x,  y),  and  Qy(x,  y)  in  three  registers  (Q,  Qx,  Qy ),  where 

Qx{x,  y)  — 2ax  + by  + d and  Qy(x,y)  = bx  + 2cy  + e (!6g) 

are  -j^Q  and  -§^Q-  We  can  then  move  to  any  adjacent  integer  point,  because 

Q(x±l,y)  = Q(x,y)±Qx(x,y)+a,  Q(x,y±l)  = Q(x,y)±Qv(x,y)+c, 
Qx(x±l,y)  = Qx(x,y)±2a:  Qx(x,y±l)  = Qx{x,y)±b, 

Qy(x±l,y)  = Qy(x,y)±b;  Qy(x,  y±l)  = Qy(x,  y)±2c.  (170) 

Furthermore  we  can  divide  the  contour  into  separate  pieces,  in  each  of  which  x(t) 
and  y(t)  are  both  monotonic.  For  example,  when  the  ellipse  (165)  travels  from 
(20,0)  to  (0, 10),  the  value  of  x decreases  while  y increases;  thus  we  need  only 
move  from  (x,y)  to  (x— 1 ,y)  or  to  (x,y+l).  If  registers  ( Q,R,S ) respectively 
hold  ( Q , Qx—a,Qy+c),  a move  to  (x— 1,  y)  simply  sets  Q Q — R,  R R — 2a, 
and  S i S 6;  a move  to  {x,y+ 1)  is  just  as  quick.  With  care,  this  idea  leads 
to  a blindingly  fast  way  to  discover  the  correctly  digitized  edges  of  almost  any 


conic  curve. 
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For  example,  the  quadratic  form  Q(x,y)  for  ellipse  (165)  is  4a:2  + 16 y2  — 
(4x  + 16V  + 1595),  when  we  integerize  its  coefficients.  We  have  Q(20,0)  = 
F(19.5,  -0.5)  = -75  and  <2(21,0)  = +85;  therefore  pixel  (20,0),  whose  center  is 
(19.5,  -0.5),  is  inside  the  ellipse,  but  pixel  (21,0)  isn’t.  Let’s  zoom  in  closer: 


\ 

245 

405 

-179 

\ 

-35 

117 

277 

-275 

-131 

\ 

\ 

21 

181 

\ 

\ 

-339 

-195 

-43 

117 

-371 

-227 

~75j 

85 

-371 

-227 

-75j 

85 

(21,0) 


(l7l) 


The  boundary  can  be  deduced  without  examining  Q at  very  many  points.  In 
fact,  we  don  t need  to  look  at  <2(21,0),  because  we  know  that  all  edges  between 
(20,0)  and  (0, 10)  must  go  either  upwards  or  to  the  left.  First  we  test  Q(20. 1) 
and  find  it  negative  (-75);  so  we  move  up.  Also  <2(20,  2)  is  negative  (-43)  ’so 
we  go  up  again.  Then  we  test  <2(20, 3),  and  find  it  positive  (21);  so  we  move  left. 
And  so  on.  Only  the  Q values  -75,  -43,  21,  -131,  -35,  93,  -51,  . . . actually 
need  to  be  examined,  if  we’ve  set  the  three-register  method  up  properly. 

Algorithm  T ( Three-register  algorithm  for  conics).  Given  two  integer  points 
(x,y)  and  (x',y'),  and  an  integer  quadratic  form  Q as  in  (168),  this  algorithm 
decides  how  to  digitize  a portion  of  the  conic  section  defined  by  F(x,y)  = 0. 
where  F(x,  y)  = Q{x  + §,  y + §).  It  creates  \x'  - x\  horizontal  edges  and  | y'  - y\ 
vertical  edges,  which  form  a path  from  (x,y)  to  (s'.y').  We  assume  that 

i)  Real-valued  points  (£,77)  and  (£',77')  exist  such  that  F(£,  77)  = +(£',77')  = 0. 

ii)  The  curve  travels  from  (£,77)  to  (£',77')  monotonically  in  both  coordinates, 

m)  x = L£  + , y = (77  + , x'  = [£'  + |J , and  y'  = (77'  + |J . 

iv)  If  we  traverse  the  curve  from  (£,  77)  to  (£',  77'),  we  see  F < 0 on  our  left. 

v)  No  edge  of  the  integer  grid  contains  two  roots  of  Q (see  exercise  183). 

Tl.  [Initialize.]  If  m = x',  go  to  Til;  if  y = y\  g0  to  T10.  li  x < x'  and  y < y' 
set  Q 4-  Q(x+l,y+l),  R 4-  Qx(x+l,y+l)+a,  5 4-  Qy(x+l,y+l)+c,  and 
go  to  T2  If  x<  x'  and  y > y\  set  Q +-  Q(x+ 1,  y),  R 4-  Qx(x+ 1,  y)  + a, 
6 ^ Qvix+hy)  - c,  and  go  to  T3.  If  x > x'  and  y < y' , set  Q 4- 
Q(x ’ Qx(x , 2/+1)  — a,  S 4—  Qy(x,  7/+1)  + c,  and  go  to  T4.  If 

x>x  and  y > y' , set  Q 4-  Q(x,y),  R 4-  Qx(x,y)  - a,  S 4-  Qy(x,y)  - c, 


T2.  [Right  or  up.]  If  Q < 0,  do  T9;  otherwise  do  T6.  Repeat  until  interrupted. 
T3.  [Down  or  right.]  If  Q<  0,  do  T7;  otherwise  do  T9.  Repeat  until  interrupted. 
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T4.  [Up  or  left.]  If  Q < 0.  do  T6;  otherwise  do  T8.  Repeat  until  interrupted. 
T5.  [Left  or  down.]  If  Q < 0,  do  T8;  otherwise  do  T7.  Repeat  until  interrupted. 
T6.  [Move  up.]  Create  the  edge  (x,  y)  — (x,  j/+l),  then  set  y 4-  y+ 1.  Interrupt 
to  T10  if  y = y'\  otherwise  set  Q 4-  Q + S,  R 4-  R + 6,  S 4-  S + 2c. 

T7.  [Move  down.]  Create  the  edge  (x,y)  — (x,y—  1),  then  set  y 4-  y - 1. 

Interrupt  to  T10  if  y = y otherwise  set  Q 4-  Q — S,  R 4-  R-b,  S 4-  S— 2c. 
T8.  [Move  left.]  Create  the  edge  (x,y)  — (x-1  ,y),  then  set  x 4-  x - 1. 

Interrupt  to  Til  if  x = x';  otherwise  set  Q 4-  Q—R,  R 4-  R— 2a,  S 4-  S—b. 
T9.  [Move  right.]  Create  the  edge  (x,y)  — (x+1  ,y),  then  set  x 4-  x + 1. 
Interrupt  to  Til  if  x = x'\  otherwise  set  Q 4-  Q+R,  R 4—  R+ 2a,  S 4-  S+b. 
T10.  [Finish  horizontally.]  While  x < x',  create  the  edge  (x,y)  — (x+l,y)  and 
set  x 4—  x + 1.  While  x > x',  create  the  edge  (x,y)  — (x— 1 ,y)  and  set 
x 4—  x — 1.  Terminate  the  algorithm. 

Til.  [Finish  vertically.]  While  y < y' , create  the  edge  (x,y)  — (x,y+ 1)  and 
set  y 4-  y + 1.  While  y > y' , create  the  edge  (x,y)  — (x,y- 1)  and  set 
y 4—  y — 1.  Terminate  the  algorithm.  | 

For  example,  when  this  algorithm  is  invoked  with  (x,y)  = (20,0),  (x',y')  — 
(0, 10),  and  Q(x,y)  = 4x2  + 16y2  — 4x  — 16 y — 1595,  it  will  create  the  edges 
(20,0)  — (20,1)  — (20,2)  — (19,2)  — (19,3)  — (19,4)  — (18,4)  — 
(18,5)  — (17,5)  — (17,6)  — •••  — (6,9)  — (6,10),  then  make  a beeline 
for  (0, 10).  (See  (165)  and  (171).)  Exercise  182  explains  why  it  works. 

Movement  to  the  left  in  step  T8  is  conveniently  implemented  by  setting 
H(y)  4-  H{y)  ® (1  < (rmax  - x)),  using  the  H vectors  of  (166)  and  (167). 
Movement  to  the  right  is  similar,  but  we  set  x 4-  x + 1 first.  Step  T10  could  set 

H{y)  4-  H(y)  © ((1  < (xmax  - min(x,x')))  - (1<  (xmax  - max(x, x'))));  (172) 

but  one  move  at  a time  might  be  just  as  good,  because  |x'  — x|  is  often  small. 
Movement  up  or  down  needs  no  action,  because  vertical  edges  are  redundant. 

Notice  that  the  algorithm  runs  somewhat  faster  in  the  special  case  when 
6 = 0;  circles  always  belong  to  this  case.  The  even  more  special  case  of  straight 
lines,  when  a = b = c = 0,  is  of  course  faster  yet;  then  we  have  a simple  one- 
register  algorithm  (see  exercise  185). 


Fig.  18.  Pixels  change  from 
white  to  black  and  back  again, 
at  the  edges  of  digitized  circles. 

When  many  contours  are  filled  in  the  same  image,  using  H vectors,  the 
pixel  values  change  between  black  and  white  whenever  we  cross  an  odd  number 
of  edges.  Figure  18  illustrates  a tiling  of  the  hyperbolic  plane  by  equilateral 
45°-45°-45°  triangles,  obtained  by  superimposing  the  results  of  several  hundred 
applications  of  Algorithm  T. 
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Fig.  19.  Squines  that  define 
the  outline  contour  of  an  ‘S’. 


Algorithm  T applies  only  to  conic  curves.  But  that’s  not  really  a limitation 
in  practice,  because  just  about  every  shape  we  ever  need  to  draw  can  be  well  ap- 
proximated by  “piecewise  conics”  called  quadratic  Bezier  splines  or  squines.  For 
example,  Fig.  19  shows  a typical  squine  curve  with  40  points  (z0,  zu...,  z39,  z40). 
where  z40  = z0.  The  even-numbered  points  (z0,  z2, . . . , z4 0)  lie  on  the  curve,’ 
the  others,  (z1}  z3, . . . , z39),  are  called  “control  points,”  because  they  regulate 
local  bending  and  flexing.  Each  section  S(z2j,  z2j+i,  Z2j+2)  begins  at  point  z2j, 
traveling  in  direction  z2j+i  — z2j.  It  ends  at  point  z2j+ 2,  traveling  in  direction 
Z2j+2-z2j+i.  Thus  if  z2j  lies  on  the  straight  line  from  z2j-\  to  z2j+ 1,  the  squine 
passes  smoothly  through  point  z2j  without  changing  direction. 

Exercise  186  defines  S(z2j,  z2j+i,  z2j+2)  precisely,  and  exercise  187  explains 
how  to  digitize  any  squine  curve  using  Algorithm  T.  The  region  inside  the 
digitized  edges  can  then  be  filled  with  black  pixels. 

Incidentally,  the  task  of  drawing  lines  and  curves  on  a bitmap  turns  out 
to  be  much  more  difficult  than  the  task  of  filling  a digitized  contour,  because 
we  want  diagonal  strokes  to  have  the  same  apparent  thickness  as  vertical  and 
horizontal  strokes  do.  An  excellent  solution  to  the  line-drawing  problem  was 
found  by  John  D.  Hobby,  JACM  36  (1989),  209-229. 

*Branchless  computation.  Modern  computers  tend  to  slow  down  when  a 
program  contains  conditional  branch  instructions,  because  an  uncertain  flow 
of  control  can  interfere  with  predictive  lookahead  circuitry.  Therefore  we’ve 
used  MMIX’s  conditional-set  instructions  like  CSNZ  in  programs  like  (56).  Indeed, 
four  instructions  such  as  ‘ADD  z,y,l;  SRt,u,2;  CSNZ  x.q.z;  CSNZ  v,q,t’  are 
probably  faster  than  their  three-instruction  counterpart 

BZ  q,@+12;  ADDx,y,l;  SR  v,u,2  (173) 

when  the  actual  running  time  is  measured  on  a highly  pipelined  machine,  even 
though  the  rule-of-thumb  cost  of  (173)  is  only  3v  according  to  Table  1.3.1 -1. 
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Bitwise  operations  can  help  diminish  the  need  for  costly  branching.  For 
example,  if  MMIX  didn’t  have  a CSNZ  instruction  we  could  write 
NEGU  m,q;  OR  m,m,q;  SR  m,m,63; 

ADD  t ,y , 1 ; XOR  AND  XOR  x,x,t;  (174) 

SR  t ,u,2;  XOR  t,t,v;  AND  XOR  v,v,t; 

here  the  first  line  creates  the  mask  m = — [g  ^ Oj . On  some  computers  these  eleven 
branchless  instructions  would  still  run  faster  than  the  three  instructions  in  (173). 

The  inner  loop  of  a merge  sort  algorithm  provides  an  instructive  example. 
Suppose  we  want  to  do  the  following  operations  repeatedly: 

If  Xi  < yj,  set  Zk  <r-  Xi,  i 4—  * + 1,  and  go  to  x.done  if  i = imax. 
Otherwise  set  Zk  4-  t/y , j 4—  j + 1,  and  go  to  y.done  if  j = jmax. 

Then  set  k 4—  k + 1 and  go  to  z.done  if  k = fcmax. 

If  we  implement  them  in  the  “obvious”  way,  four  conditional  branches  are  in- 
volved, three  of  which  are  active  on  each  path  through  the  loop: 


1H  CMP 

t ,xi ,yj ; BNN  t,2F 

Branch  if  x,  > yj. 

STO 

xi ,zbase ,kk 

Zk  Xj. 

ADD 

ii , ii  ,8 

i 4—  i + 1. 

BZ 

ii , XJDone 

To  X-done  if  i — imax. 

LDO 

xi ,xbase , ii 

Load  Xi  into  register  xi. 

JMP 

3F 

Join  the  other  branch. 

2H  STO 

yj , zbase ,kk 

zk  «-  yj. 

ADD 

jj  .jj  .8 

3 *~j  + 1- 

BZ 

j j , Y_Done 

To  y„done  if  j = jmax. 

LDO 

yj , ybase, j j 

Load  yj  into  register  yj. 

3H  ADD 

kk , kk , 8 

k <—  k + 1 . 

PBNZ 

kk,  IB 

Repeat  if  k / kmax. 

JMP 

Z_Done 

To  z.done.  | 

(Here  ii  = 

— * max  ) ? j j = 8 (j  - 

- jmax),  and  kk  = 8 (k  — femax);  the  factor 

8 is  needed  because  Xj,  yj,  and  Zk 

are  octabytes.)  Those  four  branches  can 

reduced  to  just  one: 

1H  CMP 

t.xi.yj 

t <-  sign(xi  - yj). 

CSN 

yj  »t,xi 

yj  <—  mm{xi,yj). 

STO 

yj , zbase, kk 

Zk<~  yj. 

AND 

t ,t  ,8 

t <-  8[*i  <yj]. 

ADD 

ii,ii,t 

i <-  i + [xi  < yj}. 

LDO 

xi ,xbase , ii 

Load  X,  into  register  xi. 

XOR 

t , t , 8 

t «-  t © 8. 

ADD 

jj.jj.t 

3 *~3  + [*«  >Vj]- 

LDO 

yj  , ybase, jj 

Load  yj  into  register  yj. 

ADD 

kk , kk , 8 

k t—  k + 1. 

AND 

u,ii,  jj  ; AND  u,u,kk 

u <—  ii  & j j & kk. 

PBN 

u,  IB 

Repeat  if  i < imax , j jmax , and  k <C  kmax . 

When  the  loop  stops  in  this  version, 

we  can  readily  decide  whether  to  continue 

x-done,  y.done,  or  zAone.  These  instructions  load  both  x,  and  y:l  from  memory 
each  time,  but  the  redundant  value  will  already  be  present  in  the  cache. 
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*More  applications  of  MOR  and  MXOR.  Let’s  finish  off  our  study  of  bitwise 
manipulation  by  taking  a look  at  two  operations  that  are  specifically  designed  for 
64-bit  work.  MMIX’s  instructions  MOR  and  MXOR,  which  essentially  carry  out  matrix 
multiplication  on  8 x 8 Boolean  matrices,  turn  out  to  be  extremely  flexible  and 
powerful,  both  by  themselves  and  in  combination  with  other  bitwise  operations. 

If  x = (x7  . . . £1X0)256  is  an  octabyte  and  a = (a7  . . . aia0)2  is  a single  byte, 
the  instruction  MOR  t,x,a  sets  t 4-  a7x7  | • • • | arx  1 | a0x0,  while  MXOR  t,x,a  sets 
t 4-  <27X7  © • • • ©aiXi  ©a0x0.  For  example,  MOR  t,x,2  and  MXOR  t,x,2  both  set 
t 4-  £1;  MOR  t ,x,3  sets  t 4-  xx  \ x0;  and  MXOR  t,x,3  sets  t 4-  xx  © x0. 

In  general,  of  course,  MOR  and  MXOR  are  functions  of  octabytes.  When  y = 
(2/7  • • - 2/i 2/0)256  is  a general  octabyte,  the  instruction  MOR  t,x,y  produces  the 
octabyte  t whose  jth  byte  tj  is  the  result  of  MOR  applied  to  x and  yr 

Suppose  x = -1  = #ffffffffffffffff . Then  MOR  t,x,y  computes  the 
mask  t in  which  byte  tj  is  #f  f whenever  yj  ^ 0,  while  tj  is  zero  when  y3  = 0.  This 
simple  special  case  is  quite  useful,  because  it  accomplishes  in  just  one  instruction 
what  we  previously  needed  seven  operations  to  achieve  in  situations  like  (92). 

We  observed  in  (66)  that  two  MORs  will  suffice  to  reverse  the  bits  of  any  64-bit 
word,  and  many  other  important  bit  permutations  also  become  easy  when  MOR 
is  in  a computer’s  repertoire.  Suppose  7r  is  a permutation  of  {0, 1, ... , 7}  that 
takes  0 1— 4 07r,  1 1-4  l7r,  . . . , 7 7n.  Then  the  octabyte  p — (27,r . . . 2l7r207r)256 

corresponds  to  a permutation  matrix  that  makes  MOR  do  nice  tricks:  MOR  t,x,p 
will  permute  the  bytes  of  x,  setting  tj  4-  xjn.  Furthermore,  MOR  u,p,y  will 
permute  the  bits  of  each  byte  of  y,  according  to  the  inverse  permutation;  it  sets 
Uj  4-  (o7  . . . aia0)2  when  y3  = ( a7n  . . . al7ra07r)2. 

With  a little  more  skullduggery  we  can  also  expedite  further  permutations 
such  as  the  perfect  shuffle  (76),  which  transforms  a given  octabyte  z = 2 32x  + y = 
(x3i . . . £1X02/31  • • • 2/12/0)2  into  the  “zippered”  octabyte 

w = xty  = (£312/31 ...  £12/1x02/0)2-  (175) 

With  appropriate  permutation  matrices  p , q , and  r,  the  intermediate  results 
t — (£31X27X30X26 X29X25 £28X242/31 2/27 2/302/26 2/29 2/25  2/28  2/24  • • • 

£7x3x6x2x5x1x4x02/72/32/62/22/52/12/42/0)2,  (176) 

u = (2/272/312/262/302/252/292/242/28X27X31X26X30X25X29X24X28  • ■ • 

^32/72/22/62/12/52/02/4x3x7x2x6x1x5x0x4)2  (177) 

can  be  computed  quickly  via  the  four  instructions 

MOR  t , z ,p ; MOR  t ,q,t ; M0Ru,t,r;  M0Ru,r,u;  (178) 

see  exercise  204.  So  there’s  a mask  m for  which  ‘ PUT  rM , m ; MUX  w , t , u’  completes 
the  perfect  shuffle  in  just  six  cycles  altogether.  By  contrast,  the  traditional 
method  in  exercise  53  requires  30  cycles  (five  (5-swaps). 

The  analogous  instruction  MXOR  is  especially  useful  when  binary  linear  alge- 
bra is  involved.  For  example,  exercise  1.3.1-37  shows  that  X0R  and  MXOR  directly 
implement  addition  and  multiplication  in  a finite  field  of  2k  elements,  for  k < 8. 
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The  problem  of  cyclic  redundancy  checking  provides  an  instructive  example 
of  another  case  where  MXOR  shines.  Streams  of  data  are  often  accompanied  by 
“CRC  bytes"  in  order  to  detect  common  types  of  transmission  errors  [see  W.  W. 
Peterson  and  D.  T.  Brown,  Proc.  IRE  49  (1961),  228-235],  One  popular  method, 
used  for  example  in  MP3  audio  files,  is  to  regard  each  byte  a = (07  . . . 0100)2 
as  if  it  were  the  polynomial 

a(x)  — (a7...a1ao)x  = a^x7  + • • • + aix  + ao-  (479) 

When  transmitting  n bytes  an-i . . . aiot 0,  we  then  compute  the  remainder 

/3  = (an-^i)!8*"-1)  + • • ■ + ai(x)x8  + ao(x))x16  mod p(x),  (180) 

where  p(x)  = x16  + x15  + x2  + 1,  using  polynomial  arithmetic  mod  2,  and  append 
the  coefficients  of  /3  as  a 16-bit  redundancy  check. 

The  usual  way  to  compute  /?  is  to  process  one  byte  at  a time,  according  to 
classical  methods  like  Algorithm  4.6. ID.  The  basic  idea  is  to  define  the  partial 
result  /3m  = (an_ i(x)x8("~1_m)  + • • ■ + am+1(x)x8  + am(x))x16  modp(x)  so 
that  /3n  = 0,  and  then  to  use  the  recursion 

/?m  = ((/3m+i  «8)&  #ff00)  © crcJable[(/3m+1  > 8)  0 am]  (181) 

to  decrease  m by  1 until  m — 0.  Here  crc.table[a]  is  a 16-bit  table  entry  that 
holds  the  remainder  of  a(x)x16,  modulo  p(x)  and  mod  2,  for  0 < a < 256. 
[See  A.  Perez,  IEEE  Micro  3,3  (June  1983),  40-50.] 

But  of  course  we’d  prefer  to  process  64  bits  at  once  instead  of  8.  The  solution 
is  to  find  8x8  matrices  A and  B such  that 

a(x)x64  = (aA)(x)  + (aB)(x)x~8  (modulo  p(x)  and  2),  (182) 

for  arbitrary  bytes  a,  considering  a to  be  a 1 x 8 vector  of  bits.  Then  we  can 
pad  the  given  data  bytes  a„_i . . . aq a0  with  leading  zeros  so  that  n is  a multiple 
of  8,  and  use  the  following  efficient  reduction  method: 

Begin  with  c «-  0,  n «-  n - 8,  and  t <-  (an+7  . . . a„)256- 
While  n > 0,  set  u <r-  t ■ A,  v <-  t ■ B,  n <-  n — 8,  (183) 

t <—  (c*n+ 7 ■ • ■ Qn) 256  ® ti®  (»  > 8)  0 (c<  56),  and  C «—  V & #ff . 

Here  t ■ A and  t ■ B denote  matrix  multiplication  via  MXOR.  The  desired  CRC 
bytes,  (tx16  + cx8)  modp(x),  are  then  readily  obtained  from  the  64-bit  quantity  t 
and  the  8-bit  quantity  c.  Exercise  213  contains  full  details;  the  total  running 
time  for  n bytes  comes  to  only  (p,  + 10w)n/8  + 0(1). 

The  exercises  below  contain  many  more  instances  where  M0R  and  MXOR  lead 
to  substantial  economies.  New  tricks  undoubtedly  remain  to  be  discovered. 

For  further  reading.  The  book  Hacker’s  Delight  by  Henry  S.  Warren,  Jr. 
(Addison- Wesley,  2002)  discusses  bitwise  operations  in  depth,  emphasizing  the 
great  variety  of  options  that  are  available  on  real-world  computers  that  are  not 
as  ideal  as  MMIX. 
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EXERCISES 

► 1.  [15]  What  is  the  net  effect  of  setting  x <-  x ® y,  y «-  y © (x  & m),  x <-  x ® y? 

2.  [16]  (H.  S.  Warren,  Jr.)  Are  any  of  the  following  relations  valid  for  all  integers  x 
and  y?  (i)  * © y < x | y;  (ii)  x & y < x [ y-  (iii)  \x  - y\  < x © y. 

u3'  &-1  ' ■■XlX°h  With  let  xM  = (*»-i  • *1*0)2.  Thns  we 


have  0 


— 1,  0,  1,  0,  3,  2,  1,  0,  7,  6,  ... , if  we  let  0M  = —1.  Prove 


that  (x  ® y)M  < \x  — y\  < x ® y for  all  x,  y > 0. 

► 4.  [Ml 6]  Let  x = x,  xN  = -x,  xs  = x+ 1,  and  xp  = x - 1 denote  the  complement, 
the  negative,  the  successor,  and  the  predecessor  of  an  infinite-precision  integer  x Then 
we  have  a;CC  = xN ” = xsp  = xps  _ x_  What 

are  xCN  and  xNC ? 


5.  [M21]  Prove  or  disprove  the  following  conjectured  laws  concerning  binary  shifts: 

a)  (x  < j)  < k = x « (j  + k); 

b)  ( x > j)  & (y  <C  k)  = ((x  (j  + k))  & y)  <g  k = (x  & (y  <g;  ( j + fc)))  3>  j. 

6.  [M22]  Find  all  integers  x and  y such  that  (a)  x » y = y » X;  (b)  x « y = y « x. 

7^  [M22]  (R.  Schroeppel,  1972.)  Find  a fast  way  to  convert  the  binary  number 
x = {...x2xix0)2  to  its  negabinary  counterpart  x = . .x'2x'lX'o)-2,  and  vice  versa. 

Hint:  Only  two  bitwise  operations  are  needed! 


► 8.  [M2 2]  Given  a finite  set  S of  nonnegative  integers,  the  “minimal  excludant”  of  5 
is  defined  to  be 


mex(S)  = min{  k \ k > 0 and  k £ S }. 

Let  x®  S denote  the  set  {x  ® y \ y e S),  and  let  S ® y denote  {rffii/|i6S).  Prove 
that  if  x = mex(5)  and  y = mex(T)  then  x ® y — mex((S’  ® y)  U (x  ® T )). 

9.  [M26].  ( Nim .)  Two  people  play  a game  with  k piles  of  sticks,  where  there  are  a, 
sttcks  in  pile  j.  If  m = • • • = ak  = 0 when  it  is  a player’s  turn  to  move,  that  player 
loses;  otherwise  the  player  reduces  one  of  the  piles  by  any  desired  amount,  throwing 
away  the  removed  sticks,  and  it  is  the  other  player’s  turn.  Prove  that  the  player  to 
move  can  force  a victory  if  and  only  if  m ® • • ■ ® ak  / 0.  [Hint:  Use  exercise  8.]' 

10.  [HM40]  ( Nimbers , also  known  as  Conway’s  field.)  Continuing  exercise  8,  define 
the  operation  x <g>  y of  “nim  multiplication”  recursively  by  the  formula 


x®  y = mex{(x®i)®  (j  ® y)  ® (i  0 j)  [ 0 < i < x,  0 < j < y} . 

Prove  that  ® and  ® define  a field  over  the  set  of  all  nonnegative  integers.  Prove  also 
that  if  0 < x,y  < 22"  then  * ® y < 22",  and  22"  ® y = 2^y.  (In  particular,  this  field 
contains  subfields  of  size  22"  for  all  n > 0.)  Explain  how  to  compute  * <g>  y efficiently. 

► 11.  [M26]  (H.  W.  Lenstra,  1978.)  Find  a simple  way  to  characterize  all  pairs  of 
positive  integers  (m,  n)  for  which  m®n  = mn  in  Conway’s  field. 

12.  [M26]  Devise  an  algorithm  for  division  of  nimbers.  Hint:  If  x < 22"+1  then  we 

have  2n))  < 22". 

13.  [M32]  ( Second-order  nim.)  Extend  the  game  of  exercise  9 by  allowing  two  kinds 
of  moves:  Either  a,  is  reduced  for  some  j,  as  before;  or  a3  is  reduced  and  a,  is  replaced 
by  an  arbitrary  nonnegative  integer,  for  some  i < j.  Prove  that  the  player  to  move 
can  now  force  a victory  if  and  only  if  the  pile  sizes  satisfy  either  a2  7^  a3  ® • • • ® ak  or 
ai  ^ <23  ® (2®a4)  ® • • • ® ((fc  — 2)  ®afc).  For  example,  when  k = 4 and  (ai , a2,  a3,  a4)  = 
(7,  5,  0, 5),  the  only  winning  move  is  to  (7,  5,  6, 3). 
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14.  [M30]  Suppose  each  node  of  a complete,  infinite  binary  tree  has  been  labeled  with 
0 or  1.  Such  a labeling  is  conveniently  represented  as  a sequence  T = (t,  to,  ti,  too,  toi> 
tw,  tn,  tooo,  ■ ■ ■ ),  with  one  bit  ta  for  every  binary  string  a;  the  root  is  labeled  t,  the 
left  subtree  labels  are  To  = (to,  ioo,  toi,  tooo,  ■ • ■ ),  and  the  right  subtree  labels  are  Ti  = 
(ti,  tio,  tn,  tioo,  ■ ■ • )•  Any  such  labeling  can  be  used  to  transform  a 2-adic  integer 
x = ( . . .X2XiXo)2  into  the  2-adic  integer  y — ( . . .3/22/12/0)2  = T(x)  by  setting  2/0  = t, 
2/i  = txo,  2/2  = tXQX1,  etc.,  so  that  T(x)  = 2TX0  ( |_x/2j ) + t.  (In  other  words,  x defines 
an  infinite  path  in  the  binary  tree,  and  y corresponds  to  the  labels  on  that  path,  from 
right  to  left  in  the  bit  strings  as  we  proceed  from  top  to  bottom  of  the  tree.) 

A branching  function  is  the  mapping  xT  — x © T(x)  defined  by  such  a labeling. 
For  example,  if  toi  = 1 and  all  of  the  other  ta  are  0,  we  have  xT  = x ® 4[x  mod  4 = 2]. 

a)  Prove  that  every  branching  function  is  a permutation  of  the  2-adic  integers. 

b)  For  which  integers  k is  x © (a;  k)  a branching  function? 

c)  Let  x xT  be  a mapping  from  2-adic  integers  into  2-adic  integers.  Prove  that  xT 
is  a branching  function  if  and  only  if  p(x  © y)  = p(xTffi  yT)  for  all  2-adic  x and  y. 

d)  Prove  that  compositions  and  inverses  of  branching  functions  are  branching  func- 
tions. (Thus  the  set  B of  all  branching  functions  is  a permutation  group.) 

e)  A branching  function  is  balanced  if  the  labels  satisfy  ta  = fQoffifc*i  for  all  a.  Show 
that  the  set  of  all  balanced  branching  functions  is  a subgroup  of  B. 

► 15.  [ M26 ] J.  H.  Quick  noticed  that  ((a;  + 2)  © 3)  — 2 = ((x  — 2)  © 3)  + 2 for  all  x.  Find 
all  constants  a and  b such  that  ((x  + a)  © 6)  — a = ((x  — o)  © b)  + a is  an  identity. 

16.  [ M31 } A function  of  x is  called  animating  if  it  can  be  written  in  the  form 

((...((( (x  + ai  ) © 61)  + 612)©  62  ) + •••)+  Urn)  ffi  bm 

for  some  integer  constants  ay  61,  02,  f>2,  • • • , am,  bm,  with  m > 0. 

a)  Prove  that  every  animating  function  is  a branching  function  (see  exercise  14). 

b)  Furthermore,  prove  that  it  is  balanced  if  and  only  if  61  ffi  62  © • • • © bm  = 0.  Hint: 
What  binary  tree  labeling  corresponds  to  the  animating  function  ((xffic)  — 1)  ©c? 

c)  Let  [x]  = xffi(x—  1)  = 2p<-1)+1  — 1.  Show  that  every  balanced  animating  function 
can  be  written  in  the  form 

x ffi  [x  ffi  pi]  ffi  [x  ffi  P2I  ffi  • • • © L®  ® pA > Pi  < P2  < ■ • • < Pi , 

for  some  integers  {pi,P2,  • . . ,pi},  where  l > 0,  and  this  representation  is  unique. 

d)  Conversely,  show  that  every  such  expression  defines  a balanced  animating  function. 

17.  [HM36]  The  results  of  exercise  16  make  it  pos- 
sible to  decide  whether  or  not  any  two  given  ani- 
mating functions  are  equal.  Is  there  an  algorithm 
that  decides  whether  any  given  expression  is  iden- 
tically zero,  when  that  expression  is  constructed 
from  a finite  number  of  integer  variables  and  con- 
stants using  only  the  binary  operations  + and  ffi? 

What  if  we  also  allow  &? 

18.  [M25]  The  curious  pixel  pattern  shown  here 
has  ( x2y  2>  11)  & 1 in  row  x and  column  y , for 
1 < x,  2/  < 256.  Is  there  any  simple  way  to  explain 
some  of  its  major  characteristics  mathematically? 


186  COMBINATORIAL  SEARCHING 


7.1.3 


► 19.  [M37\  ( Paley’s  rearrangement  theorem.)  Given  three  vectors  A = (o0, ....  a2n-i), 
B = (&o,  ■ . . , 62" -i),  and  C = (co, . . . , c2"-i)  of  nonnegative  numbers,  let 


f(A,B,C)  = ajbkCi. 

j©fcffi(= 0 


For  example,  if  n — 2 we  have  f(A,B,C)  — a0b0c0  + a06ici  + ao62c2  + a0b3c3  + ai60ci  + 
a1b1c0  + 0162C3  -I  I-  a363c0;  in  general  there  are  22n  terms,  one  for  each  choice  of 
j and  k.  Our  goal  is  to  prove  that  f(A,B,C)  < f(A*,B*,C*),  where  A*  denotes  the 
vector  A sorted  into  nonincreasing  order:  aj  > aj  > • • • > a2n_1. 

a)  Prove  the  result  when  all  elements  of  A,  B,  and  C are  Os  and  Is. 

b)  Show  that  it  is  therefore  true  in  general. 

c)  Similarly,  f(A,B,C,D)  = £iffifc0iffim=o  a3bkc,dm  < f(A*,B*,C*,D*). 

► 20.  [21]  ( Gosper’s  hack.)  The  following  seven  operations  produce  a useful  function  y 
of  x,  when  a:  is  a positive  integer.  Explain  what  this  function  is  and  why  it  is  useful. 


u- we-z  + u;  y •f-u  + (((w©x)/u)>2). 

21.  [22]  Construct  the  reverse  of  Gosper’s  hack:  Show  how  to  compute  x from  y. 

22.  [21]  Implement  Gosper’s  hack  efficiently  with  MMIX  code,  assuming  that  x < 264, 
without  using  division. 


► 23.  [27]  A sequence  of  nested  parentheses  can  be  represented  as  a binary  number  by 
putting  aim  the  position  of  each  right  parenthesis.  For  example,  ‘ (())()  ’ corresponds 
in  this  way  to  (001101)2,  the  number  13.  Call  such  a number  a parenthesis  trace. 

a)  What  are  the  smallest  and  largest  parenthesis  traces  that  have  exactly  m is? 

b)  Suppose  a:  is  a parenthesis  trace  and  y is  the  next  larger  parenthesis  trace  with 
the  same  number  of  Is.  Show  that  y can  be  computed  from  x with  a short  chain 
of  operations  analogous  to  Gosper’s  hack. 

c)  Implement  your  method  on  MMIX,  assuming  that  ux  < 32. 

► 24.  [M30]  Program  1.3.2'P  instructed  MMIX  to  produce  a table  of  the  first  five  hundred 
prime  numbers,  using  trial  division  to  establish  primality.  Write  an  MMIX  program  that 
uses  the  “sieve  of  Eratosthenes”  (exercise  4.5.4-8)  to  build  a table  of  all  odd  primes  that 
are  less^than  N,  packed  into  octabytes  Q0 , Q 1,  . . . , Qjv/i28-i  as  in  (27).  Assume  that 
N < 2 , and  that  it’s  a multiple  of  128.  What  is  the  running  time  when  N = 3584? 


► 25.  [15]  Four  volumes  sit  side  by  side  on  a bookshelf.  Each  of  them  contains  exactly 
500  pages,  printed  on  250  sheets  of  paper  0.1  mm  thick;  each  book  also  has  a front  and 
back  cover  whose  thicknesses  are  1 mm  each.  A bookworm  gnaws  its  way  from  page  1 
of  Volume  1 to  page  500  of  Volume  4.  How  far  does  it  travel  while  doing  so? 


26.  [22]  Suppose  we  want  random  access  to  a table  of  12  million  items  of  5-bit  data. 
We  could  pack  12  such  items  into  one  64-bit  word,  thereby  fitting  the  table  into  8 
megabytes  of  memory.  But  random  access  then  seems  to  require  division  by  12,  which 
is  rather  slow;  we  might  therefore  prefer  to  let  each  item  occupy  a full  byte,  thus  using 
12  megabytes  altogether. 

Show,  however,  that  there’s  a memory-efficient  approach  that  avoids  division. 


27.  [21]  In  the  notation  of  Eqs.  (32)-(43),  how  would  you  compute  (a)  (al0a016)2? 

(b)  (al0all  )2?  (c)  (a00a01i’)2?  (d)  (0oolla006)2?  (e)  (0°°01o006)2?  (f)  (0°°llall6)2? 

28.  [16]  What  does  the  operation  (z+l)  & x produce? 


29.  [20]  (V.  R.  Pratt.)  Express  the  magic  mask  yk  of  (47)  in  terms  of  nk+1 
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30.  [20]  If  x — 0,  the  MMIX  instructions  (46)  will  set  p «-  64  (which  is  a close  enough 
approximation  to  00).  What  changes  to  (50)  and  (51)  will  produce  the  same  result? 

► 31.  [20]  A mathematician  named  Dr.  L.  I.  Presume  decided  to  calculate  the  ruler 
function  with  a simple  loop  as  follows:  “Set  p <-  0;  then  while  x & 1 = 0,  set  p «-  p + 1 
and  x x 1.”  He  reasoned  that,  when  a:  is  a random  integer,  the  average  number 
of  right  shifts  is  the  average  value  of  p,  which  is  1;  and  the  standard  deviation  is  only 
V2,  so  the  loop  almost  always  terminates  quickly.  Criticize  his  decision. 

32.  [20]  What  is  the  execution  time  for  px  when  (52)  is  programmed  for  MMIX? 

► 33.  [26]  (Leiserson,  Prokop,  and  Randall,  1998.)  Show  that  if  ‘58’  is  replaced  by  ‘49’ 
in  (52),  we  can  use  that  method  to  identify  both  bits  of  the  number  y = 2j  +2k  quickly, 
when  64  > j > k > 0.  (Altogether  (b24)  = 2016  cases  need  to  be  distinguished.) 

34.  [M23]  Let  x and  y be  2-adic  integers.  True  or  false:  (a)  p{x  & y)  = rna x(px,py); 
(b)  p(x  | y)  = min  (px,  py);  (c)  px  = py  if  and  only  if  x ® y = (x  - 1)  © (y  - 1). 

► 35.  [M26]  According  to  Reitwiesner’s  theorem,  exercise  4.1-34,  every  integer  n has  a 
unique  representation  n = n+  -n~  such  that  n+fcn~  = (n+  | n“)&((n+  |n“)»l)  = 0. 
Show  that  n+  and  n can  be  calculated  quickly  with  bitwise  operations.  Hint:  Prove 
the  identity  (x  0 3a;)  & ((x  0 3x)  » 1)  = 0. 

36.  [20]  Given  x = (x63  . . . xix0)2,  suggest  efficient  ways  to  calculate  the  quantities 

i)  x®  = (x®,  . . . x® x® )2 , where  xf  = xk  0 • ■ • 0 Xi  0 x0  for  0 < k < 64; 

ii)  x&  = (x|  . . . xf  x^)2,  where  xjf  = Xk  A • • • A xi  A x0  for  0 < k < 64. 

37.  [16]  What  changes  to  (55)  and  (56)  will  make  A0  come  out  -1? 

38.  [17]  How  long  does  the  leftmost-bit-extraction  procedure  (57)  take  when  imple- 
mented on  MMIX? 

► 39.  [20]  Formula  (43)  shows  how  to  remove  the  rightmost  run  of  1 bits  from  a given 
number  x.  How  would  you  remove  the  leftmost  run  of  1 bits? 

► 40.  [21]  Prove  (58),  and  find  a simple  way  to  decide  if  Ax  < A y,  given  x and  y > 0. 

41.  [M22]  What  are  the  generating  functions  of  the  integer  sequences  (a)  pn,  (b)  An, 
and  (c)  nn? 

42.  [M21]  If  n = 2ei  + ■ ■ ■ + 2er , with  ei  > ■ • ■ > er  > 0,  express  the  sum  0 
in  terms  of  the  exponents  ei,  . . . , er. 

► 43.  [20]  How  sparse  should  x be,  to  make  (63)  faster  than  (62)  on  MMIX? 

► 44.  [23]  (E.  Freed,  1983.)  What’s  a fast  way  to  evaluate  the  weighted  bit  sum  Yhixj'? 

► 45.  [20]  (T.  Rokicki,  1999.)  Explain  how  to  test  if  xR<  yR,  without  reversing  x and  y. 

46.  [22]  Method  (68)  uses  six  operations  to  interchange  two  bits  x,  Xj  of  a register. 
Show  that  this  interchange  can  actually  be  done  with  only  three  MMIX  instructions. 

47.  [10]  Can  the  general  5-swap  (69)  also  be  done  with  a method  like  (67)? 

48.  [M21  ] How  many  different  5-swaps  are  possible  in  an  n-bit,  register?  (When  n = 4, 
a 5-swap  can  transform  1234  into  1234,  1243,  1324,  1432,  2134,  2143,  3214,  3412,  4231.) 

► 49.  [M30]  Let  s(n)  denote  the  fewest  5-swaps  that  suffice  to  reverse  an  n-bit  number. 

a)  Prove  that  s(n)  > [log3  n]  when  n is  odd,  s(n)  > [log3  3n/2]  when  n is  even. 

b)  Evaluate  s(n)  when  n = 3m,  2 ■ 3m,  (3m  + l)/2,  and  (3m  - l)/2. 

c)  What  are  s(32)  and  s(64)?  Hint:  Show  that  s(5n  + 2)  < s(n)  + 2. 

50.  [M3 7]  Continuing  exercise  49,  prove  that  s(n)  = log3  n + O(loglogn). 
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51.  [23]  Let  c be  a constant,  0 < c < 2d.  Find  all  sequences  of  masks  (0O,  61, ...,  6d_x, 
Od-2,  ■ ■ ■ ,&i,  d0)  such  that  the  general  permutation  scheme  (71)  takes  x i-»  X* , where 
the  bit  permutation  n is  defined  by  either  (a)  jn  = j ® c;  or  (b)  jn  = (j  + c)  mod  2d. 
[The  masks  should  satisfy  6 C pd,k  and  §k  C pd,k,  so  that  (71)  corresponds  to  Fig.  12; 
see  (48).  Notice  that  reversal,  x*  = xR,  is  the  special  case  c = 2d  - 1 of  part  (a),  while 
part  (b)  corresponds  to  the  cyclic  right  shift  xn  = (x  » c)  + (x  « ( 2d  - c)).] 

52.  [22]  Find  hexadecimal  constants  (00, 6>i,  02, 03, 04, 6&J4,  03, 02, 61,  0O)  that  cause 
(71)  to  produce  the  following  important  64-bit  permutations,  based  on  the  binary 
representation  j = (jsjijzj^jijo^-  (a)  jn  = (ioj5j4j3j2ji)2;  (b)  jn  = (j2j\jojbjijz)2\ 
(c)  jn  = 0’ijoj5j4j3j2)2;  (d)  jn  = (ioji J2J3 74^5)2.  [Case  (a)  is  the  “perfect  shuffle” 
(175)  that  takes  (*63  ...  £331323:31  ...  21*0)2  into  (x63x3i  . . . x33x1X32Xo)2;  case  (b) 
transposes  an  8 x 8 matrix  of  bits;  case  (c),  similarly,  transposes  a 4 x 16  matrix; 
and  case  (d)  arises  in  connection  with  “fast  Fourier  transforms,”  see  exercise  4.6.4-14.] 

► 53.  [M25]  The  permutations  in  exercise  52  are  said  to  be  “induced  by  a permutation 
of  index  digits,”  because  we  obtain  jn  by  permuting  the  binary  digits  of  j.  Suppose 
3 71  = 0(d— i)v>  • • -1iv>jW)2,  where  ip  is  a permutation  of  {0, 1, . . . , d - 1}.  Prove  that  if 
ip  has  t cycles,  the  2d-bit  permutation  x >->■  x1'  can  be  obtained  with  only  d - t swaps. 
In  particular,  show  that  this  observation  speeds  up  all  four  cases  of  exercise  52. 

54.  [22]  (R.  W.  Gosper,  1985.)  If  an  m x m bit  matrix  is  stored  in  the  rightmost 
m2  bits  of  a register,  show  that  it  can  be  transposed  by  doing  (2 k(m  — l))-swaps  for 
0 < k < [ lg  m] . Write  out  the  method  in  detail  when  m = 7. 

► 55.  [26]  Suppose  annxn  bit  matrix  is  stored  in  the  rightmost  n2  bits  of  an  n3-bit  reg- 
ister. Prove  that  18d  + 2 bitwise  operations  suffice  to  multiply  two  such  matrices,  when 
n = 2 ; the  matrix  multiplication  can  be  either  Boolean  (like  MOR)  or  mod  2 (like  MXOR). 

56.  [24]  Suggest  a way  to  transpose  a 7 x 9 bit  matrix  in  a 64-bit  register. 

57.  [22]  The  network  P(2d)  of  Fig.  12  has  a total  of  (2d  - l)2d~ 1 crossbars.  Prove 
that  any  permutation  of  2d  elements  can  be  realized  by  some  setting  in  which  at  most 
d2d~1  of  them  are  active. 

► 58.  [M32]  The  first  d columns  of  crossbar  modules  in  the  permutation  network  P(2d) 
perform  a 1-swap,  then  a 2-swap,  . . . , and  finally  a 2d  Lswap,  when  the  network’s  wires 
are  stretched  into  horizontal  lines  as  shown  here  for  d = 3.  Let 
N = 2d.  These  N lines,  together  with  the  Nd/2  crossbars,  form  a 
so-called  “Omega  router”  or  “inverse  butterfly.”  The  purpose  of 
this  exercise  is  to  study  the  set  Q of  all  permutations  ip  such  that 
we  can  obtain  (<V,  lip, . . . , (IV—  l)p)  as  outputs  on  the  right  of  an 
Omega  router  when  the  inputs  at  the  left  are  (0, 1, . . . , IV  - 1). 

a)  Prove  that  |fi|  = 2Jvd/2 . (Thus  lg  |fi|  = Nd/2  ~ | lg  IV!.) 

b)  Prove  that  a permutation  p of  {0, 1, . . . , N - 1}  belongs  to  U if  and  only  if 

i mod  2 = j mod  2k  and  ip^$>k  = jp^>k  implies  iip  = jip  (*) 
for  all  0 < i,j  < N and  all  0 < k < d. 

c)  Simplify  condition  (*)  to  the  following,  for  all  0 < i,j  < N: 

Mip  ® jp)  < p(i  ® j)  implies  i = j . 

d)  Let  T be  the  set  of  all  permutations  r of  {0, 1, . . . , N - 1}  such  that  p(i  © j)  = 
p(ir(BjT ) for  all  i and  j.  (This  is  the  set  of  branching  functions  considered  in  exer- 
cise 14,  modulo  2 ; so  it  has  2 members,  2N^2Jrd  1 of  which  are  the  animating 
functions  modulo  2d.)  Prove  that  p E ft  if  and  only  if  rp  £ Q for  all  r € T. 
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e)  Suppose  and  ip  are  permutations  of  Q that  operate  on  different  elements;  that 
is,  jV  / j implies  jip  = j,  for  0 < j < N.  Prove  that  <pip  e fb 

f)  Prove  that  the  permutation  0 ip . . . (N  — \)<p  is  Omega-routable  if  and  only  if  it  is 
sorted  by  Batcher’s  bitonic  sorting  network  of  order  N . (See  Section  5.3.4.) 

59.  [M30]  Given  0 < a < b < N = 2d , how  many  Omega-routable  permutations 
operate  only  on  the  interval  [a . . b]l  (Thus  we  want  to  count  the  number  of  ip  € Q such 
that  jip  / j implies  a < j < b.  Exercise  58(a)  is  the  special  case  a = 0,  b = N — 1 .) 

60.  [HM28]  Given  a random  permutation  of  (0, 1, . . . , 2n— 1},  let  pnk  be  the  proba- 
bility that  there  are  2k  ways  to  set  the  crossbars  in  the  first  and  last  columns  of  the 
permutation  network  P(2n)  when  realizing  this  permutation.  In  other  words,  pnk  is  the 
probability  that  the  associated  graph  has  k cycles  (see  (75)).  What  is  the  generating 
function  ^2k>0  pnkZk?  What  are  the  mean  and  variance  of  2k? 

61.  [46]  Is  it  NP-hard  to  decide  whether  a given  permutation  is  realizable  with  at 
least  one  mask  0j  = 0,  using  the  recursive  method  of  Fig.  12  as  implemented  in  (71)? 

► 62.  [22]  Let  N = 2d . We  can  obviously  represent  a permutation  7r  of  (0.1,...,  N — I } 
by  storing  a table  of  N numbers,  d bits  each.  With  this  representation  we  have  instant 
access  to  y = xn,  given  x\  but  it  takes  Q(N)  steps  to  find  x = yn~  when  y is  given. 

Show  that,  with  the  same  amount  of  memory,  we  can  represent  an  arbitrary 
permutation  in  such  a way  that  xn  and  yn  ~ are  both  computable  in  0(d)  steps. 

63.  [19]  For  what  integers  w,  x,  y,  and  2 does  the  zipper  function  satisfy  (i)  x\y  = 
V t*?  (ii)  (xty)>z  = (a:»f2/2l)t(j/»[«/2j)?  (iii)  {w\x)&(y\z)  = {wk.y)\{x&z)l 

64.  [22]  Find  a “simple”  expression  for  the  zipper-of-sums  (x  + *')  J (y  + y1),  as  a 
function  oi  z = x\y  and  z = x'  \y' . 

65.  [ M16 ] The  binary  polynomial  u(x)  = u0  + uix  H b (mod  2)  can  be 

represented  by  the  integer  u = (un-i  ■ ■ ■ uiUo)?-  If  u(x ) and  v(x)  correspond  to  integers 
u and  v in  this  way,  what  polynomial  corresponds  to  u ( v ? 

► 66.  [M26]  Suppose  the  polynomial  u(x)  has  been  represented  as  an  n-bit  integer  u as 
in  exercise  65,  and  let  v = u © (u  <5)  0 (u  25)  ® (u  35)  ® • • • for  some  integer  5. 

a)  What’s  a simple  way  to  describe  the  polynomial  v(x)'? 

b)  Suppose  n is  large,  and  the  bits  of  u have  been  packed  into  64-bit  words.  How 
would  you  compute  v when  5=1,  using  bitwise  operations  in  64-bit  registers? 

c)  Consider  the  same  question  as  (b),  but  when  5 = 64. 

d)  Consider  the  same  question  as  (b),  but  when  5 = 3. 

e)  Consider  the  same  question  as  (b),  but  when  5 = 67. 

67.  [M31]  If  u(x)  is  a polynomial  of  degree  < n,  represented  as  in  exercise  65,  discuss 

the  computation  of  v(x)  = u(x)2  mod  ( xn  + xm  + 1),  when  0 < m < n and  both  m 
and  n are  odd.  Hint:  This  problem  has  an  interesting  connection  with  perfect  shuffling. 

68.  [20]  What  three  MMIX  instructions  implement  the  5-shift  operation,  (79)? 

69.  [25]  Prove  that  method  (80)  always  extracts  the  proper  bits  when  the  masks  0k 
have  been  set  up  properly:  We  never  clobber  any  of  the  crucial  bits  yj. 

► 70.  [31]  (Guy  L.  Steele  Jr.,  1994.)  What’s  a good  way  to  compute  the  masks  0o,  61, 
■ ■ ■ , 0 d—  1 that  are  needed  in  the  general  compression  procedure  (80),  given  \ / 0? 

71.  [17]  Explain  how  to  reverse  the  procedure  of  (80),  going  from  the  compact  value 
y = {Vr- 1 • • • 2112/0)2  to  a number  z = (z63  ■ ■ ■ 21-2:0)2  that  has  = yt  for  0 < i < r. 

72.  [25]  (Y.  Hilewitz  and  R.  B.  Lee.)  Prove  that  the  gather-flip  operation  (81/)  is 
Omega-routable  in  the  sense  of  exercise  58. 
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73.  [22]  Prove  that  d well-chosen  steps  of  (a)  the  sheep-and-goats  operation  (8i)  or 
(b)  the  gather-flip  operation  (8i')  will  implement  any  desired  2d-bit  permutation. 

74.  [22]  Given  counts  (co,Ci, . . . ,c2d_1)  for  the  Chung  Wong  procedure,  explain  why 
an  appropriate  cyclic  1-shift  can  always  produce  new  counts  (c'0 , c[  . . . c'2d  ) for  which 

c2i+ii  thus  allowing  the  recursion  to  proceed. 

► 75.  [32]  The  method  of  Chung  and  Wong  replicates  bit  l of  a register  exactly  c; 
times,  but  it  produces  results  in  scrambled  order.  For  example,  the  case  (c0, . . . ,c7)  = 
(1,2,0, 2, 0,2, 0,1)  illustrated  in  the  text  produces  (x7X3Xix5X5*3XiXo)2.  In  some 
applications  this  can  be  a disadvantage;  we  might  prefer  to  have  the  bits  retain  their 
original  order,  namely  {x7x5x5x3x3xxx1xo)2  in  that  example. 

Prove  that  the  permutation  network  P(2d)  of  Fig.  12  can  be  modified  to  achieve 
this  goal,  given  any  sequence  of  counts  (c0,  ci, . . . , c^.j),  if  we  replace  the  d ■ 2d~1 
crossbar  modules  m the  right-hand  half  by  general  2x2  mapping  modules.  (A  crossbar 
module  with  inputs  (a,  b)  produces  either  (a,  b)  or  (b,  a)  as  output;  a mapping  module 
can  also  produce  (a, a)  or  (6,6).) 

76.  [4  7]  A mapping  network  is  analogous  to  a sorting  network  or  a permutation 
network,  but  it  uses  2x2  mapping  modules  instead  of  comparators  or  crossbars,  and  it 
is  supposed  to  be  able  to  output  all  nn  possible  mappings  of  its  n inputs.  Exercise  75, 
m conjunction  with  Fig.  12,  shows  that  a mapping  network  for  n = 2d  exists  with  only 
4d-2  levels  of  delay,  and  with  n/2  modules  on  each  level;  furthermore,  this  construction 

needs  general  2x2  mapping  modules  (instead  of  simple  crossbars)  in  only  d of  those 
levels. 

To  within  O(n),  what  is  the  smallest  number  G(n)  of  modules  that  are  sufficient 
to  implement  a general  n-element  mapping  network? 

77.  [26]  (R.  W.  Floyd  and  V.  R.  Pratt.)  Design  an  algorithm  that  tests  whether 
or  not  a given  standard  n-network  is  a sorting  network,  as  defined  in  the  exercises 
of  Section  5.3.4.  When  the  given  network  has  r comparator  modules,  your  algorithm 
should  use  O(r)  bitwise  operations  on  words  of  length  2n. 

78.  [M27]  ( Testing  disjointness.)  Suppose  the  binary  numbers  xi,  x2,  . . . , xm  each 
represent  sets  in  a universe  of  n - k elements,  so  that  each  Xj  is  less  than  2n~k.  J.  H. 
Quick  (a  student)  decided  to  test  whether  the  sets  are  disjoint  by  testing  the  condition 

Xl  I X2  | ’ • ■ | Xm  = (xi  + X2  + • • ■ + Xm)  mod  2". 


Prove  or  disprove:  Quick’s  test  is  valid  if  and  only  if  k > lg(m  - 1). 

► 79.  [20]  If  x ^ 0 and  x C x,  what  is  an  easy  way  to  determine  the  largest  integer 
x,<x  such  that  x,  C X?  (Thus  (x,)'  = (a/),  = *,  in  connection  with  (84).) 

80.  [20]  Suggest  a fast  way  to  find  all  maximal  proper  subsets  of  a set.  More  precisely 
given  x with  vX  = m,  we  want  to  find  all  x C x such  that  ox  = rn  - I. 

81.  [21]  Find  a formula  for  “scattered  difference,”  to  go  with  the  “scattered  sum”  (86). 

82.  [21]  Is  it  easy  to  shift  a scattered  accumulator  to  the  left  by  1,  for  example  to 
change  (2/2*4x32/1X22/0X1X0)2  to  (2/1*43:32/02:20x12:0)2? 

► 83.  [33]  Continuing  exercise  82,  find  a way  to  shift  a scattered  2d-bit  accumulator  to 
the  right  by  1,  given  a and  x,  in  O(d)  steps. 

84.  [25]  Given  n-bit  numbers  * = (*._  1 . . . ziz0)2  and  x = (Xn-i  . • • XiXoh,  explain 
how  to  calculate  the  stretched”  quantities  z v-  x = (z(n-i)^x  ■ . . z1^xz0^.x)2  and 
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^ -»■  X = (z(n- i)-rx  ■ • • Zi^xz0-^x)2,  where 

3 X = maxjfc  | k <j  and  Xk  = 1},  j ->■  X = min{fc  | k > j and  \k  = 1}; 

we  let  Zj^x  = 0 if  \k  = 0 for  0 < k < j,  and  Zj-,x  = 0 if  \k  = 0 for  n > k > j . For 
example,  if  n — 11  and  x = (01101110010)2,  then  2 = {zgzgzaz6z6zbz4ziz1zi0)2 

and  2 —rX  = (0ZgZ8Z8Z6Z5Z4Z4Z4ZlZi)2. 

85.  [22]  (K.  D.  Tocher,  1954.)  Imagine  that  you  have  a vintage  1950s  computer 

with  a drum  memory  for  storing  data,  and  that  you  need  to  do  some  computations 
with  a 32  x 32  x 32  array  a[i,j,k],  whose  subscripts  are  5-bit  integers  in  the  range 
0 < i,  j,  k < 32.  Unfortunately  your  machine  has  only  a very  small  high-speed  memory: 
You  can  access  only  128  consecutive  elements  of  the  array  in  fast  memory  at  any  time. 
Since  your  application  usually  moves  from  a[i,  j,  k]  to  a neighboring  position  a[i',j',k'], 
where  \i  — i'\  + \ j — j'\  + \k  — k'\  = 1,  you  have  decided  to  allocate  the  array  so  that,  if 
* = (24*3*2*1*0)2,  j = (J4J3 j2iUo)2,  and  k = (^4/23^2^1^0)2,  the  array  entry  a[i,j,k]  is 
stored  in  drum  location  ^1*1/20^0*0)2-  By  interleaving  the  bits  in 

this  way,  a small  change  to  *,  j,  or  k will  cause  only  a small  change  in  the  address. 

Discuss  the  implementation  of  this  addressing  function:  (a)  How  does  it  change 
when  *,  j,  or  k changes  by  ±1?  (b)  How  would  you  handle  a random  access  to  a[i,  j,  k], 
given  i,  j,  and  k?  (c)  How  would  you  detect  a “page  fault”  (namely,  the  condition  that 
a new  segment  of  128  elements  must  be  swapped  into  fast  memory  from  the  drum)? 

86.  [M27]  An  array  of  2P  x 2q  x 2r  elements  is  to  be  allocated  by  putting  a[i,j,k\ 
into  a location  whose  bits  are  the  p + q + r bits  of  (*,  j,  k),  permuted  in  some  fashion. 
Furthermore,  this  array  is  to  be  stored  in  an  external  memory  using  pages  of  size  2s. 
(Exercise  85  considers  the  case  p = q = r = 5 and  s = 7.)  What  allocation  strategy 
of  this  kind  minimizes  the  number  of  times  that  a[i,j,k]  is  on  a different  page  from 
a[i' ,j' , k'],  summed  over  all  i,  j,  k,  *',  j',  and  k!  such  that  \i  — i'\  + \j  — j'\  + \k  — k'\  = 1? 

► 87.  [20]  Suppose  each  byte  of  a 64-bit  word  x contains  an  ASCII  code  that  represents 
either  a letter,  a digit,  or  a space.  What  three  bitwise  operations  will  convert  all  the 
lowercase  letters  to  uppercase? 

88.  [20]  Given  x = (xr  . . . *0)256  and  y = (y7  ...  2/0)256,  compute  2 = (27  ...  20)256, 
where  Zj  = (xj  — yj ) mod  256  for  0 < j < 8.  (See  the  addition  operation  in  (87).) 

89.  [23]  Given  x = (*31  . . . xix0)4  and  y = (2/31  . . . 2/11/0)4,  compute  2 = (231  . . . 2120)4, 
where  Zj  = [xj/yj]  for  0 < j < 32,  assuming  that  no  y3  is  zero. 

90.  [20]  The  bytewise  averaging  rule  (88)  always  rounds  downward  when  Xj  + yj  is 
odd.  Make  it  less  biased  by  rounding  to  the  nearest  odd  integer  in  such  cases. 

► 91.  [26]  ( Alpha  channels.)  Recipe  (88)  is  a good  way  to  compute  bytewise  averages, 
but  applications  to  computer  graphics  often  require  a more  general  blending  of  8-bit 
values.  Given  three  octabytes  x = (x7  . . . 10)256,  y = (y7  . . . 2/0)256,  a = (a7  . . . 2*0)256, 
show  that  bitwise  operations  allow  us  to  compute  2 = (27  . . . 20)256,  where  each  byte  Zj 
is  a good  approximation  to  ((255  — aj)xj  +ajyj)/255,  without  doing  any  multiplication. 
Implement  your  method  with  MMIX  instructions. 

► 92.  [21]  What  happens  if  the  second  line  of  (88)  is  changed  to  ‘2  «—  (x  \ y)  — 2’? 

93.  [18]  What  basic  formula  for  subtraction  is  analogous  to  formula  (89)  for  addition? 

94.  [21]  Let  x = (x7  ...  £12:0)256  and  t = (t7  . . . Ufo)256  in  (90).  Can  tj  be  nonzero 
when  Xj  is  nonzero?  Can  tj  be  zero  when  Xj  is  zero? 

95.  [22]  What’s  a bitwise  way  to  tell  if  all  bytes  of  x = (x7  . . . xix0)2S6  are  distinct? 
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96.  [21]  Explain  (93),  and  find  a similar  formula  that  sets  test  flags  tj  <-  128 [xj  < y3}. 

97.  [23]  Leslie  Lamport’s  paper  in  1975  presented  the  following  “problem  taken  from 
an  actual  compiler  optimization  algorithm”:  Given  octabytes  x = (x7  . . . x0)2S6  and  y = 
(y?  ■ ■ • 2/0)266,  compute  f = [t7  ...  f0)256  and  z = (z7  . . . 20)256  so  that  tj  # 0 if  and  only 
ifxj  ^ 0,  xj  / ’*’,  and  Xj  / and  Zj  = (xj  = 0?  Vj:  (Xj  / ’*’  AXj  / y{>.  Xj)). 

98.  [20]  Given  x = (x7  ...  2:0)256  and  y = (2/7  •••  2/0)256,  compute  2 = (27...  20)256 
and  w = (wT  ...  w0)256,  where  Zj  = ma x(xj,yj)  and  Wj  = min  (xj,yj)  for  0 < j < 8. 

► 99.  [28]  Find  hexadecimal  constants  a,  b,  c,  d,  e such  that  the  six  bitwise  operations 
2/  t—  x © a,  t <-  ((((//&  6) + c)  [ 2/)©d)&e 
will  compute  the  flags  t = (f7  . . . f\f0)256<^7  from  any  bytes  x = (x7  . . . xix0)256,  where 

f0  = [Xo=  ’!’],  /j  = [xr#  f2  = [x2  < ’A’],  f3  = [x3>  >z>],  fA  = [x4>  >a>], 

/s  = [xs  € { ’O’,  ’ 1 ’,  . . . , ’9’}],  f 6 = [x6  < 168],  f7  = [X7  € {’<’ , ’ = ’ , ’>  ’ , ’?’}]. 

100.  [25]  Suppose  x = (x15  . . . x^ojie  and  y = (y15  . . . yiy0)16  are  binary-coded  dec- 
imal numbers,  where  0 < Xj,y3  < 10  for  each  j.  Explain  how  to  compute  their  sum 
u - (uis  ...Wiu0)i6  and  difference  v = [v13  . . . Vlv0)i6,  where  0 < Uj,Vj  < 10  and 

(«15  • . . Mi«o)io  = ((X15  . . . XiXo)io  + (2/15  • ■ • 2/12/0)10)  mod  1016, 

0i5  . . . ui?;o)io  = ( (xis  . . . X!Xo)io  - (2/15  • • • 2/12/0)10)  mod  1016, 
without  bothering  to  do  any  radix  conversion. 

► 101.  [22]  Two  octabytes  x and  y contain  amounts  of  time,  represented  in  five  fields 
that  respectively  signify  days  (3  bytes),  hours  (1  byte),  minutes  (1  byte),  seconds 
(1  byte),  and  milliseconds  (2  bytes).  Can  you  add  and  subtract  them  quickly,  without 
converting  from  this  mixed-radix  representation  to  binary  and  back  again? 

102.  [25]  Discuss  routines  for  the  addition  and  subtraction  of  polynomials  modulo  5, 
when  (a)  16  4-bit  coefficients  or  (b)  21  3-bit  coefficients  are  packed  into  a 64-bit  word' 

► 103.  [22]  Sometimes  it’s  convenient  to  represent  small  numbers  in  unary  notation,  so 
that  0,  1,  2,  3,  . . . , k appear  respectively  as  (0)2,  (1)2,  (11)2,  (111)2,  . . . , 2k  - 1 inside 
the  computer.  Then  max  and  min  are  easily  implemented  as  | and  &. 

Suppose  the  bytes  of  x = (x7  . . . x0)256  are  such  unary  numbers,  while  the  bytes 
°f  V = (2/7  • • ■ 2/0)256  are  all  either  0 or  1.  Explain  how  to  “add”  y to  x or  “subtract”  y 
from  x,  giving  u = (u7  . . . u0)256  and  v = (v7  . . . v0)25fi  where 

Uj  = 2min(8'1s(*i+1)+!6)  _ 1 and  v.  _ 2max(o,lg(^+i)-K3-)  _ ! 

104.  [22]  Use  bitwise  operations  to  check  the  validity  of  a date  represented  in  “year- 
month-day  fields  (2/,  m,  d)  as  in  (22).  You  should  compute  a value  t that  is  zero  if  and 
only  if  1900  < y < 2100,  1 < m < 12,  and  1 < d < max-day(m),  where  month  m has 
at  most  max-day(m)  days.  Can  it  be  done  in  fewer  than  20  operations? 

105.  [30]  Given  x — ( x7  . . . xo)256  and  y = (y7  ...  2/0)256,  discuss  bitwise  operations 
that  will  sort  the  bytes  into  order,  so  that  x0  < y0  < ■ ■ ■ < x7  < y7  afterwards. 

106.  [ST].  Explain  the  Fredman-Willard  procedure  (95).  Also  show  that  a simple 
modification  of  their  method  will  compute  2Xx  without  doing  any  left  shifts. 

► 107.  [22]  Implement  Algorithm  B on  MMIX  when  d = 4,  and  compare  it  with  (56). 

108.  [26]  Adapt  Algorithm  B to  cases  where  n does  not  have  the  form  d ■ 2d . 
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109.  [20]  Evaluate  px  for  ?r-bit  numbers  x in  O(loglogn)  broadword  steps. 

► 110.  [30]  Suppose  n = 22  and  0 < x < n.  Show  how  to  compute  1<i  in  O(e) 
broadword  steps,  using  only  shift  commands  that  shift  by  a constant  amount.  (Together 
with  Algorithm  B we  can  therefore  extract  the  most  significant  bit  of  an  n-bit  number 
in  O(loglogn)  such  steps.) 

111.  [23]  Explain  the  01r  pattern  recognizer,  (98). 

112.  [46]  Can  all  occurrences  of  the  pattern  lr0  be  identified  in  0(1)  broadword  steps? 

113.  [23]  A strong  broadword  chain  is  a broadword  chain  of  a specified  width  n that 
is  also  a 2-adic  chain,  for  all  n-bit  choices  of  xo . For  example,  the  2-bit  broadword 
chain  (x0,xi)  with  xi  = x0  + 1 is  not  strong  because  xo  = (11)2  makes  xi  = (00)2. 
But  (£0,2:1, . . . ,£4)  is  a strong  broadword  chain  that  computes  (xo  + 1)  mod  4 for  all 
0 < £0  < 4 if  we  set  £1  = £0  ® 1,  £2  = £0  & 1,  £3  = £2  <S  1,  and  £4  = £1  0 £3. 

Given  a broadword  chain  (£0,  £1, . . . , xr)  of  width  n,  construct  a strong  broadword 
chain  {x'0,x'i, . . . ,x'r,)  of  the  same  width,  such  that  r'  = 0(r)  and  (£o,£i, . . . ,xr)  is  a 
subsequence  of  (x'0,  x\ , , x'r, ) . 

114.  [16]  Suppose  (£o,£i, . . . ,£r)  is  a strong  broadword  chain  of  width  n that  com- 

putes the  value  f(x)  = xr  whenever  an  n-bit  number  x = £0  is  given.  Construct  a 
broadword  chain  (Xo,  Xi, . . . , Xr)  of  width  mn  that  computes  Xr  = (/(£i) . . . /(£m))  2" 
for  any  given  mn-bit  value  Xo  = (£1  where  0 < Cl € m < 2n. 

► 115.  [24]  Given  a 2-adic  integer  x = ( . . .£2£i£o)2,  we  might  want  to  compute  y = 
(...1/22/12/0)2  = /(x)  from  £ by  zeroing  out  all  blocks  of  consecutive  Is  that  (a)  are 
not  immediately  followed  by  two  Os;  or  (b)  are  followed  by  an  odd  number  of  Os 
before  the  next  block  of  Is  begins;  or  (c)  contain  an  odd  number  of  Is.  For  exam- 
ple, if  £ is  (...01110111001101000110)2  then  y is  (a)  ( . . . 00000111000001000110)2; 
(b)  (...00000111000000000110)2;  (c)  (...00000000001100000110)2.  (Infinitely  many 
Os  are  assumed  to  appear  at  the  right  of  xo-  Thus,  in  case  (a)  we  have 

Vj  = Xj  A ((xj-iAXj-2)  V (xj-iAxj-2Axjs)  V (xj-\Axj-2Axj-3AXj-4)  V • • • ) 

for  all  j,  where  x*,  = 0 for  k < 0.)  Find  2-adic  chains  for  y in  each  case. 

116.  [HM30]  Suppose  x = (,..£2£i£o)2  and  y = (..-2/22/12/0)2  = /(x),  where  y is 
computable  by  a 2-adic  chain  having  no  shift  operations.  Let  L be  the  set  of  all  binary 
strings  such  that  yj  = [xj  . . . x\Xo  € L],  and  assume  that  all  constants  used  in  the  chain 
are  rational  2-adic  numbers.  Prove  that  L is  a regular  language.  What  languages  L 
correspond  to  the  functions  in  exercise  115(a)  and  115(b)? 

117.  [HM46]  Continuing  exercise  116,  is  there  any  simple  way  to  characterize  the  reg- 
ular languages  L that  arise  in  shift-free  2-adic  chains?  (The  language  L = 0*(10*10*)* 
does  not  seem  to  correspond  to  any  such  chain.) 

118.  [30]  According  to  Lemma  A,  we  cannot  compute  the  function  x 1 for  all  n- 
bit  numbers  x by  using  only  additions,  subtractions,  and  bitwise  Boolean  operations 
(no  shifts  or  branches).  Show,  however,  that  0(n)  such  operations  are  necessary  and 
sufficient  if  we  include  also  the  “monus”  operator  y — z in  our  repertoire. 

119.  [20]  Evaluate  the  function  fpy(x)  in  (102)  with  four  broadword  steps. 

► 120.  [ M25 ] There  are  2n2  functions  that  take  n-bit  numbers  (xi,...,xm)  into  an 
n-bit  number  f(x i,...,xm).  How  many  of  them  can  be  implemented  with  addition, 
subtraction,  multiplication,  and  nonshift  bitwise  Boolean  operations  (modulo  2n)? 
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► 121.  [M25]  By  exercise  3.1-6,  a function  from  [0  . . 2n)  into  itself  is  eventually  periodic. 

a)  Prove  that  if  / is  any  n-bit  broadword  function  that  can  be  implemented  without 
shift  instructions,  the  lengths  of  its  periods  are  always  powers  of  2. 

b)  However,  for  every  p between  1 and  n,  there’s  an  n-bit  broadword  chain  of  length  3 
that  has  a period  of  length  p. 

122.  [M22]  Complete  the  proof  of  Lemma  B. 

123.  [M23]  Let  aq  be  the  constant  1 + 2q  + 22?  + • ■ • + 2(9~x)9  = ( 2q 2 — l)/(29  — 1) 
Using  (104),  showthat  there  are  infinitely  many  q such  that  the  operation  of  multiplying 
by  aq,  modulo  2q  , requires  Q(log<j)  steps  in  any  n-bit  broadword  chain  with  n > q2 . 

124.  [M38]  Complete  the  proof  of  Theorem  R'  by  defining  an  n-bit  broadword  chain 

(xo,  xi,  and  sets  (U0,  Ui,...,  Uf)  such  that,  for  0 < t < /,  all  inputs  x € Ut  lead 

to  an  essentially  similar  state  Q(x,  t),  in  the  following  sense:  (i)  The  current  instruction 
m Q( x,  t ) does  not  depend  on  x.  (ii)  If  register  r,  has  a known  value  in  Q(x,  t),  it  holds 
xr  for  some  definite  index  j'  < t.  (iii)  If  memory  location  M[z\  has  been  changed  it 
holds  xz„  for  some  definite  index  z"  < t.  (The  values  of  j'  and  z"  depend  on  j,  z , 
and  t,  but  not  on  x.)  Furthermore  \Ut  \ > n/2*~\  and  the  program  cannot  guarantee 
that  n - px  when  t < f.  Hint:  Lemma  B implies  that  a limited  number  of  shift 
amounts  and  memory  addresses  need  to  be  considered  when  t is  small. 

125.  [ MSS ] Prove  Theorem  P'.  Hint:  Lemma  B remains  true  if  we  replace  ‘=  0’  by 
i=  as’  in  (103),  for  any  values  as. 

126.  [M46]  Does  the  operation  of  extracting  the  most  significant  bit,  2Ax,  require 
f l (log  log  n)  steps  in  an  n-bit  basic  RAM?  (See  exercise  110.) 

127.  [HM40]  Prove  that  at  least  H(logn/log  log  n)  broadword  steps  are  needed  to 
compute  the  parity  function,  (ux)  mod  2,  using  the  theory  of  circuit  complexity.  [Hint: 
Every  broadword  operation  is  in  complexity  class  AC0.] 

128.  [M46]  Can  (ux)  mod  2 be  computed  in  0(logn/loglogn)  broadword  steps? 

129.  [M46]  Does  sideways  addition  require  fl(logn)  broadword  steps? 

130.  [M46]  Is  there  an  n-bit  constant  a such  that  the  function  (a<z)  mod  2"  requires 
fl(logn)  n-bit  broadword  steps? 

► 131.  [23]  Write  an  MMIX  program  for  Algorithm  R when  the  graph  is  represented  by 
arc  lists.  Vertex  nodes  have  at  least  two  fields,  called  LINK  and  ARCS,  and  arc  nodes  have 
TIP  and  NEXT  fields,  as  explained  in  Section  7.  Initially  all  LINK  fields  are  zero,  except 
m the  given  set  of  vertices  Q,  which  is  represented  as  a circular  list.  Your  program 
should  change  that  circular  list  so  that  it  represents  the  set  R of  all  reachable  vertices. 

► 132.  [M27]'  A clique  in  a graph  is  a set  of  mutually  adjacent  vertices;  a clique  is 
maximal  if  it’s  not  contained  in  any  other.  The  purpose  of  this  exercise  is  to  discuss 
an  algorithm  due  to  J.  K.  M.  Moody  and  J.  Hollis,  which  provides  a convenient  way 
to  find  every  maximal  clique  of  a not-too-large  graph,  using  bitwise  operations. 

Suppose  G is  a graph  with  n vertices  V = {0, 1, ,,  . , n — 1).  Let  pv  = ^2{2U  \ 
u v or  u = u}  be  row  v of  G’s  reflexive  adjacency  matrix,  and  let  5V  = )T{2U  | 
u ^ v)  = 2 — 1 — 2l.  Every  subset  U C V is  representable  as  an  n-bit  integer 
a(U)  = Eueu  2“;  for  example,  Sv  = a(V  \ v).  We  also  define  the  bitwise  intersection 

t(U)  = & (u  € U7  Pu:  5U). 

0 <u<n 

For  example,  if  n = 5 we  have  r({0, 2})  = p0  & Ai  & p2  & 63  & 64. 
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a)  Prove  that  U is  a clique  if  and  only  if  t(U)  = <j(U). 

b)  Show  that  if  t(U)  = er(T)  then  T is  a clique. 

c)  For  1 < k < n,  consider  the  2k  bitwise  intersections 

ck  = { & (u  c m pu-  $u)  u c {o,  i, . . . , fc  — i}\, 

L o<w<fc  J 

and  let  be  the  maximal  elements  of  Ck.  Prove  that  U is  a maximal  clique  if 
and  only  if  a(U ) € C, J". 

d)  Explain  how  to  compute  Cjf  from  starting  with  Cq  = { 2n  — 1}. 

► 133.  [20]  Given  a graph  G,  how  can  the  algorithm  of  exercise  132  be  used  to  find 
(a)  all  maximal  independent  sets  of  vertices?  (b)  all  minimal  vertex  covers  (sets  that 
hit  every  edge)? 

134.  [15]  Nine  classes  of  mappings  for  ternary  values  appear  in  (119),  (123),  and  (124). 
To  which  class  does  the  representation  (128)  belong,  if  a = 0,  b = *,  c = 1? 

135.  [22]  Lukasiewicz  included  a few  operations  besides  (127)  in  his  three-valued  logic: 
->x  (negation)  interchanges  0 with  1 but  leaves  * unchanged;  ox  (possibility)  is  defined 
as  ~^x  =>  x;  ox  (necessity)  is  defined  as  -lO-ia;;  and  x <4-  y (equivalence)  is  defined  as 
(x  =*>  y)  A (y  =►  x).  Explain  how  to  perform  these  operations  using  representation  (128). 

136.  [29]  Suggest  two-bit  encodings  for  binary  operations  on  the  set  (a,  b,  c}  that  are 
defined  by  the  following  “multiplication  tables” : 

(a  b c\  ( a c b \ ( a b a\ 

b c c J ; (b)  ( c b a ) ; (c)  ( a a c J ■ 

ccc/  V b a c / V a b c J 

137.  [21]  Show  that  the  operation  in  exercise  136(c)  is  simpler  with  packed  vectors 
like  (131)  than  with  the  unpacked  form  (130). 

138.  [24]  Find  an  example  of  three-state-to-two-bit  encoding  where  class  VQ  is  best. 

139.  [25]  If  x and  y are  signed  bits  0,  +1,  or  —1,  what  2-bit  encoding  is  good  for 
calculating  their  sum  (2122)3  = x + y,  where  21  and  22  are  also  required  to  be  signed 
bits?  (This  is  a “half  adder”  for  balanced  ternary  numbers.) 

140.  [27]  Design  an  economical  full  adder  for  balanced  ternary  numbers:  Show  how 
to  compute  signed  bits  u and  v such  that  3u  + v = x + y + z when  x,y,z  € {0,  +1,  — 1}. 

► 141.  [30]  The  Ulam  numbers  (Ui,  C/2 , . . . ) = (1,2,3,4,6,8,11,13,16,18,26,...)  are 
defined  for  n > 3 by  letting  Un  be  the  smallest  integer  > Un-\  that  has  a unique 
representation  U„  = Uj  + Uk  for  0 < j < k < n.  Show  that  a million  Ulam  numbers 
can  be  computed  rapidly  with  the  help  of  bitwise  techniques. 

► 142.  [33]  A subcube  such  as  *10*1*01  can  be  represented  by  asterisk  codes  10010100 
and  bit  codes  01001001,  as  in  (85);  but  many  other  encodings  are  also  possible.  What 
representation  scheme  for  subcubes  works  best,  for  finding  prime  implicants  by  the 
consensus-based  algorithm  of  exercise  7.1.1-31? 

143.  [20]  Let  x be  a 64-bit  number  that  represents  an  8 x 8 chessboard,  with  a 1 bit 
in  every  position  where  a knight  is  present.  Find  a formula  for  the  64-bit  number  f(x) 
that  has  a 1 in  every  position  reachable  in  one  move  by  a knight  of  x.  For  example, 
the  white  knights  at  the  start  of  a game  correspond  to  1 = #42;  then  f(x)  = #a51800. 

144.  [16]  What  node  is  the  sibling  of  node  j in  a sideways  heap?  (See  (134).) 

145.  [17]  Interpret  (137)  when  h is  less  than  the  height  of  j. 
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► 146.  [M20]  Prove  Eq.  (138),  which  relates  the  p and  A functions. 

► 147  [M20]  What  values  of  7 tv,  /3v,  av,  and  rj  occur  in  Algorithm  V when  the  forest  is 

a)  the  empty  digraph  with  vertices  {m, . . and  no  arcs? 

b)  the  oriented  path  vn  — >- >v2  — >v3? 

148.  [M21]  When  preprocessing  for  Algorithm  V,  is  it  possible  to  have  f3x3  — >* 

f,f2  r4*  Pyi  13x1  in  S when  X3  * x?  * xi  — * A and  2/2  — > yi  — > A in 

tne  torest ; (If  so,  two  different  trees  are  “entangled”  in  S.) 

► 149.  [23]  Design  a preprocessing  procedure  for  Algorithm  V. 

► 150.  [25]  Given  an  array  of  elements  A1,...,An,  the  range  minimum  query  problem 

is  to  determine  k(i,j)  such  that  Ak(iJ)  = min(Ai, . . ,,Aj)  for  any  given  indices  i and  j 
with  1 < * < 3 < n.  Prove  that  Algorithm  V will  solve  this  problem,  after  O(n)  steps  of 
preprocessing  on  the  array  A have  prepared  the  necessary  tables  (n,  /?,  a,  r).  Hint:  Con- 
sider the  binary  search  tree  constructed  from  the  sequence  of  keys  (p(l),  p(2)  v(n)) 


where  p is  a permutation  of  {1, 2, . . . , n}  such  that  Ap(1)  < Ap(2)  < ■ ■ ■ < Ap(n) . 

151.  [22]  Conversely,  show  that  any  algorithm  for  range  minimum  queries  can  be  used 
to  find  nearest  common  ancestors,  with  essentially  the  same  efficiency. 

152.  [ M21 ] Prove  that  Algorithm  V is  correct. 


► 153.  [M20]  The  pointers  in  a navigation  pile  like  (144)  can  be  packed  into  a binary 
string  such  as 


0 

1 °i 

4 

1 0 0 

0 

0 0 

1 

0100 

0 0 

0 0 0 

0 

4 1 

0 

8 

10 

12  1 

0 

16 

0 

20 

22| 

24 

At  what  bit  position  (from  the  left)  does  the  pointer  for  node  j end? 

154.  [20]  The  gray  lines  in  Fig.  14  show  how  each  pentagon  is  composed  of  ten 
triangles.  What  decomposition  of  the  hyperbolic  plane  is  defined  by  those  gray  lines 
alone,  without  the  black  pentagon  edges? 

► 155.  [. M21 ] Prove  that  {xej})  mod  1 = (aO)1/0  when  a is  the  negaFibonacci  code  for  x. 

156.  [21]  Design  algorithms  (a)  to  convert  a given  integer  x to  its  negaFibonacci 
code  a,  and  (b)  to  convert  a given  negaFibonacci  code  atoi  = N(a). 

157.  [M21]  Explain  the  recursion  (148)  for  negaFibonacci  predecessor  and  successor. 

158.  [M26]  Let  a = a„...a1  be  the  binary  code  for  F(a0)  = anFn+1  + (-  aiF2 

in  the  standard  Fibonacci  number  system  (146).  Develop  methods  analogous  to  (148) 
and  ( 149)  for  incrementing  and  decrementing  such  codewords. 

159.  [M34]  Exercise  7 shows  that  it’s  easy  to  convert  between  the  negabinary  and 
binary  number  systems.  Discuss  conversion  between  negaFibonacci  codewords  and  the 
ordinary  Fibonacci  codes  in  exercise  158. 


160.  [M29]  Prove  that  (150)  and  (151)  yield  consistent  code  labels  for  the  pentagrid. 

161.  [20]  The  cells  of  a chessboard  can  be  colored  black  and  white,  so  that  neighboring 
cells  have  different  colors.  Does  the  pentagrid  also  have  this  property? 

► 162.  [HM3 7]  Explain  how  to  draw  the  pentagrid,  Fig.  14.  What  circles  are  present? 

163.  [HM41]  Devise  a way  to  navigate  through  the  triangles  in  the  tiling  of  Fig.  18. 

164.  [23]  The  original  definition  of  custerization  in  1957  was  not  (157)  but 

custer'(.Y)  =Xk  ~(XNW  & XN  & ANE  & Xw  k XE  k Xsw  kXsk  XSE). 

Why  is  (157)  preferable? 
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165.  [21 } (R.  A.  Kirsch.)  Discuss  the  computation  of  the  3x3  cellular  automaton  with 

X(f+1)  = custer(X(t))  = ~X(t)  & (XjS,0  | X§  | X'()  | X{st}). 

166.  [M23]  Let  f(M,N ) be  the  maximum  number  of  black  pixels  in  an  M x N 
bitmap  X for  which  X = custer(A').  Prove  that  f(M,  N)  = | MN  + 0(M  + N). 

167.  [24]  (Life.)  If  the  bitmap  X represents  an  array  of  cells  that  are  either  dead  (0) 
or  alive  (1),  the  Boolean  function 

/ (inw,  ...  ,x,  ...  ,Xse)  = [2  < Xnw  +Sn  + Ine  +£w  + ^x-\-Xe  +*sw  + Xs  +£se  < 4] 
can  lead  to  astonishing  life  histories  when  it  governs  a cellular  automaton  as  in  (158). 

a)  Find  a way  to  evaluate  / with  a Boolean  chain  of  26  steps  or  less. 

b)  Let  X*f)  denote  row  j of  X at  time  t.  Show  that  X|,+1)  can  be  evaluated  in 

J • ^ (t)  (t'\ 

at  most  23  broadword  steps,  as  a function  of  the  three  rows  X)_. , X ) , and 
y(t)  3 3 

Xj+ 1- 

► 168.  [23]  To  keep  an  image  finite,  we  might  insist  that  a 3 x 3 cellular  automaton 
treats  a M xN  bitmap  as  a torus,  wrapping  around  seamlessly  between  top  and  bottom 
and  between  left  and  right.  The  task  of  simulating  its  actions  efficiently  with  bitwise 
operations  is  somewhat  tricky:  We  want  to  minimize  references  to  memory,  yet  each 
new  pixel  value  depends  on  old  values  that  lie  on  all  sides.  Furthermore  the  shifting  of 
bits  between  neighboring  words  tends  to  be  awkward,  taxing  the  capacity  of  a register. 

Show  that  such  difficulties  can  be  surmounted  by  maintaining  an  array  of  n-bit 
words  Ajk  for  0 < j < M and  0 < k < N1  = \N/(n  — 2)].  If  j / M and  k / 0,  word  Ajk 
should  contain  the  pixels  of  row  j and  columns  (k  — l)(n  — 2)  through  k(n  — 2)  + 1, 
inclusive;  the  other  words  Amic  and  Aj 0 provide  auxiliary  buffer  space.  (Notice  that 
some  bits  of  the  raster  appear  twice.) 

169.  [22]  Continuing  the  previous  two  exercises,  what  happens  to  the  Cheshire  cat  of 
Fig.  17(a)  when  it  is  subjected  to  the  vicissitudes  of  Life,  in  a 26  x 31  torus? 

► 170.  [21]  What  result  does  the  Guo-Hall  thinning  automaton  produce  when  given  a 
solid  black  rectangle  of  M rows  and  N columns?  How  long  does  it  take? 

171.  [24]  Find  a Boolean  chain  of  length  < 25  to  evaluate  the  local  thinning  function 
g(xN w,xn,xNe,Xw,Xe,xSw,Xs,Xsb)  of  (159),  with  or  without  the  extra  cases  in  (160). 

172.  [M29]  Prove  or  disprove:  If  a pattern  contains  three  black  pixels  that  are  king- 
neighbors  of  each  other,  the  Guo-Hall  procedure  extended  by  (160)  will  reduce  it, 
unless  none  of  those  pixels  can  be  removed  without  destroying  the  connectivity. 

► 173.  [M30]  Raster  images  often  need  to  be  cleaned  up  if  they  contain  noisy  data.  For 
example,  accidental  specks  of  black  or  white  may  well  spoil  the  results  when  a thinning 
algorithm  is  used  for  optical  character  recognition. 

Say  that  a bitmap  A'  is  closed  if  every  white  pixel  is  part  of  a 2 x 2 square  of 
white  pixels,  and  open  if  every  black  pixel  is  part  of  a 2 x 2 square  of  black  pixels.  Let 

XD  = & {Y  | y D A'  and  Y is  closed};  XL  = \ {Y  \ Y C X and  Y is  open}. 

A bitmap  is  called  clean  if  it  equals  XDL  for  some  A'.  We  might,  for  example,  have 

* = XD  = J-.  : XDL  = J|  . 

In  general  XD  is  “darker”  than  A,  while  XL  is  “lighter”:  XD  DID  XL . 

a)  Prove  that  (XDL)DL  = XDL . Hint:  AC  Y implies  XD  C Y°  and  XL  C YL. 

b)  Show  that  XD  can  be  computed  with  one  step  of  a 3 x 3 cellular  automaton. 
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m.  [M46]  (M.  Minsky  and  S.  Papert.)  Is  there  a three-dimensional  shrinking  algo- 
rithm that  preserves  connectivity,  analogous  to  (161)? 

175.  [15]  How  many  rookwise  connected  black  components  does  the  Cheshire  cat  have? 
1 76  [M24 } Let  G be  the  graph  whose  vertices  are  the  black  pixels  of  a given  bitmap  Y 

T “ " W!'en  U and/  are  a kinS  move  aPart.  Let  G'  be  the  corresponding  graph 

after  the  shrinking  transformation  (161)  has  been  applied.  The  purpose  of  this  exercise 
is  to  show  that  the  number  of  connected  components  of  G'  is  the  number  of  components 
ot  G minus  the  number  of  isolated  vertices  of  G. 

north  T {(!’{)’(*_1’/)’(i-1>l  + 1)>(bi+1)}  be  pixel  (i,j)  together  with  its 

north  and/or  east  neighbors.  For  each  v E G let  S(v)  = {v'  E G'  I v'  E Nv ) 

a)  Prove  that  S(v)  is  empty  if  and  only  if  v is  isolated  in  G 
b K u~  v in  G,  u'  E S(u),  and  v'  E S(v),  prove  that  v!  — * v'  in  G' 
c For  each  v'  EG'  let  S'(v')  = {v  E G | 1/  E Nv}.  Is  S'(v')  always  nonempty? 
d If  a — n in  G , u £ S'(u'),  and  v E S'(v'),  prove  that  «_■ • „ in  G. 
e)  Hence  there’s  a one-to-one  correspondence  between  the  nontrivial  components 
01  G and  the  components  of  G' . 

177.  [M22]  Continuing  exercise  176,  prove  an  analogous  result  for  the  white  pixels. 

178.  [20]  If  X is  an  M x IV  bitmap, 
let  X*  be  the  M x (2N  + 1)  bitmap 
X t (-X"  | (-Y  -C  1)).  Show  that  the 
kingwise  connected  components  of 
X are  also  rookwise  connected,  and 
that  bitmap  X * has  the  same  “sur- 
roundedness  tree”  (162)  as  X. 

* 1J9’  M Design  an  algorithm  that  constructs  the  surroundedness  tree  of  a given 
(162)  and  S)’  )SCannmg  the  lmage  °ne  row  at  a time  as  discussed  in  the  text,  (See 

► 180.  [M24]  Digitize  the  hyperbola  y2  = x2  + 13  by  hand,  for  0 < y < 7. 

181.  [HM20]  Explain  how  to  subdivide  a general  conic  (168)  with  rational  coefficients 
into  monotonic  parts  so  that  Algorithm  T applies. 

182.  [M31]  Why  does  the  three-register  method  (Algorithm  T)  digitize  correctly? 

► 183.  [M29]  (G.  Rote.)  Explain  why  Algorithm  T might  fail  if  condition  (v)  is  false. 

* ;r8,4;,,[T]  iFin^  * quadrat!c  form  «'(*-»)  80  that,  when  Algorithm  T is  applied  to 

j ’ and  ^ ’ 14  produces  exactly  the  same  edges  as  it  does  from  (x,  y)  (x'  y') 

and  Q,  but  in  the  reverse  order.  ’ ' ' 

► 185 . [22]  Design  an  algorithm  that  properly  digitizes  a straight  line  from  (f)7?)  to 
K ,?7  j,  when  ?,  0,  f , and  y are  rational  numbers,  by  simplifying  Algorithm  T. 

186.  [HM22]  Given  three  complex  numbers  (z0,z1,z2),  consider  the  curve  traced  out  by 

B(t)  = (1  - t)2z0  + 2(1  - t)tzi  + t2z2,  for  0 < t < 1. 

a)  What  is  the  approximate  behavior  of  B(t)  when  t is  near  0 or  1? 

) Let  S(zo,zuZ2 ) = {B(t)  I 0 < t < 1}.  Prove  that  all  points  of  S(20,zi,z2)  lie 
on  or  inside  the  triangle  whose  vertices  are  z0,  zu  and  22- 

c)  True  or  false?  S(w  + <4z0,  w + {Zl,  w + (z2)  = w + (S(z0,  zu  z2). 

d)  Prove  that  S{z0,Zl,Z2)  is  part  of  a straight  line  if  and  only  if  z0,  zu  and  22  are 
colhnear;  otherwise  it  is  part  of  a parabola. 


7.1.3 


BITWISE  TRICKS  AND  TECHNIQUES  199 


e)  Prove  that  if  0 < 6 < 1,  we  have  the  recurrence 


5(20,21,^2)  = 5(2o,  (1— (?)z0  + Ozi,  B(9))  U S(B(9),  (\—Q)z\  + 0z2,  z2). 


187.  [M29]  Continuing  exercise  186,  show  how  to  digitize  S(z0,  zi,z2)  using  the  three- 
register  method  (Algorithm  T).  For  best  results,  the  digitizations  of  5(22,2i,20)  and 
5(20,21,22)  should  produce  the  same  edges,  but  in  reverse  order. 

► 188.  [25]  Bitmap  images  can  often  be  viewed  conveniently  using  pixels  that  are  shades 
of  gray  instead  of  just  black  or  white.  Such  gray  levels  typically  are  8-bit  values  that 
range  from  0 (black)  to  255  (white);  notice  that  the  black/white  convention  is  tradition- 
ally reversed  with  respect  to  the  1-bit  case.  An  m x n bitmap  whose  resolution  is  600 
dots  per  inch  corresponds  nicely  to  the  (m/8)  x (n/8)  grayscale  image  with  75  pixels 
per  inch  that  is  obtained  by  mapping  each  8x8  subarray  of  1-bit  pixels  into  the  gray 
level  [255(1  — k/QA)1^1  + |J,  where  7 = 1.3  and  k is  the  number  of  Is  in  the  subarray. 

Write  an  MMIX  routine  that  converts  a given  mx  n array  BITMAP  into  the  corre- 
sponding (m/8)  x (n/8)  image  GRAYMAP,  assuming  that  m = 8m'  and  n = 64n'. 

189.  [25]  Given  a 64  x 64  bitmap,  what’s  a good  way  (a)  to  transpose  it,  or  (b)  to 
rotate  it  by  90°,  using  operations  on  64-bit  numbers? 


190.  [23]  A parity  pattern  of  length  m and  width  n is  an  m x n matrix  of  0s  and 


Is  with  the 
example, 


property  that  each  element  is 


1 1 

00, 

1 1 


0011 

0100 

1101’ 

0101 


01010 

11011, 

01010 


the  sum  of  its  rook- neighbors,  mod  2. 


100 

0 1 1 1 0 

1 10 

10101 

101, 

and 

11011 

Oil 

10101 

001 

01  1 10 

For 


are  parity  patterns  of  sizes  3 x 2,  4 x 4,  3 x 5,  5 x 3,  and  5x5. 

a)  If  the  binary  vectors  ou,  a2,  . . . , am  are  the  rows  of  a parity  pattern,  show  that 
q2,  • . . , Qm  can  all  be  computed  from  the  top  row  ai  by  using  bitwise  operations. 
Thus  at  most  one  m x n parity  pattern  can  begin  with  any  given  bit  vector. 

b)  True  or  false:  The  sum  (mod  2)  of  two  m X n parity  patterns  is  a parity  pattern. 

c)  A parity  pattern  is  called  perfect  if  it  contains  no  all-zero  row  or  column.  For 
example,  three  of  the  matrices  above  are  perfect,  but  the  3x2  and  3x5  examples 
are  not.  Show  that  every  m x n parity  pattern  contains  a perfect  parity  pattern 
as  a submatrix.  Furthermore,  all  such  submatrices  have  the  same  size,  m'  x n , 
where  m'  + 1 is  a divisor  of  m + 1 and  n1  + 1 is  a divisor  n + 1 . 

d)  There’s  a perfect  parity  pattern  whose  first  row  is  0011,  but  there  is  no  such 

pattern  beginning  with  01010.  Is  there  a simple  way  to  decide  whether  a given 
binary  vector  is  the  top  row  of  a perfect  parity  pattern?  ^ n~1  ^ 

e)  Prove  that  there’s  a unique  perfect  parity  pattern  that  begins  with  1 0 ...  0. 


191.  [M30]  A wraparound  parity  pattern  is  analogous  to  the  parity  patterns  of  exer- 
cise 190,  except  that  the  leftmost  and  rightmost  elements  of  each  row  are  also  neighbors. 

a)  Find  a simple  relation  between  the  parity  pattern  of  width  n that  begins  with  a 
and  the  wraparound  parity  pattern  of  width  2n  + 2 that  begins  with  0a0aR. 

b)  The  Fibonacci  polynomials  FJ( x)  are  defined  by  the  recurrence 


F0(x)  = 0,  Fi(x)  = 1,  and  Fj+i(x)  = xFj(x)  + Fj-i(x)  for  j > 1. 

Show  that  there’s  a simple  relation  between  the  wraparound  parity  patterns  that 
begin  with  10 . . . 0 (N—  1 zeros)  and  the  Fibonacci  polynomials  modulo  xN  + 1. 
Hint:  Consider  Fj(x~1  + 1 + x),  and  do  arithmetic  mod  2 as  well  as  mod  xN+  1. 
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c)  If  a is  the  binary  string  ai  . . . an,  let  fa(x)  = alX  + ■ ■ ■ + anxn . Show  that 

/(a^.o a?)(x)  = (fa{x)  + fa(x  ))Fj(x  1 + l+i)  mod(xJV+l)  and  mod  2, 

when  N = 2n  + 2 and  a,  is  row  j of  a width-n  parity  pattern  that  begins  with  a. 

d)  Consequently  we  can  compute  a3  from  a in  only  0(n2  log  j)  steps.  Hints:  See  ex- 
ercise 4.6.3-26;  and  use  the  identity  Fm+n(x)  = Fm(x)  Fn+1(x)  + Fm-i(x)Fn(x), 
which  generalizes  Eq.  1.2.8-(6). 

192.  [ HM38 ] The  shortest  parity  pattern  that  begins  with  a given  string  can  be  quite 
long;  for  example,  it  turns  out  that  the  perfect  pattern  of  width  120  whose  first  row  is 
10  ...  0 has  length  36,028,797,018,963,966(1).  The  purpose  of  this  exercise  is  to  consider 
how  to  calculate  the  interesting  function 


C(q)  = l + max{  m | there  exists  a perfect  parity  pattern  of  length  m and  width  q- 1}, 


whose  initial  values  (1, 3, 4, 6,  5, 24, 9, 12,  28)  for  1 < q < 9 are  easy  to  compute  by  hand. 

a)  Characterize  c(q)  algebraically,  using  the  Fibonacci  polynomials  of  exercise  191. 

b)  Explain  how  to  calculate  c(q)  if  we  know  a number  M such  that  c(q)  divides  M, 
and  if  we  also  know  the  prime  factors  of  M . 

c)  Prove  that  c(2  ) = 3 • 2e  1 when  e > 0.  Hint:  F^y)  has  a simple  form,  mod  2. 

d)  Prove  that  when  q is  odd  and  not  a multiple  of  3,  c(q)  is  a divisor  of  22e  - 1, 
where  e is  the  order  of  2 modulo  q.  Hint:  F2=_i(j/)  has  a simple  form,  mod  2. 

e)  What  happens  when  q is  an  odd  multiple  of  3? 

f)  Finally,  explain  how  to  handle  the  case  when  q is 

► 193.  [M21]  If  a perfect  m x n parity  pattern  exists, 
when  m and  n are  odd,  show  that  there’s  also  a perfect 
(2m+l)  x (2n+l)  parity  pattern.  (Intricate  fractals  arise 
when  this  observation  is  applied  repeatedly;  for  example, 
the  5x5  pattern  in  exercise  190  leads  to  Fig.  20.) 

194.  [M24]  Find  all  n < 383  for  which  there  exists  a 
perfect  n X n parity  pattern  with  8-fold  symmetry,  such 
as  the  example  in  Fig.  20.  Hint:  The  diagonal  elements 
of  all  such  patterns  must  be  zero. 


► 195.  [ HM25 } Let  A be  a binary  matrix  having  rows 

ai>  am  °f  length  n.  Explain  how  to  use  bitwise  operations  to  compute  the 
rank  m — r of  A over  the  binary  field  {0, 1},  and  to  find  linearly  independent  binary 
vectors  0X,  . . . , 6r  of  length  m such  that  9jA  = 0 ...  0 for  1 < j < r.  Hint:  See  the 
“triangularization”  algorithm  for  null  spaces,  Algorithm  4.6. 2N. 


196.  [21]  (K.  Thompson,  1992.)  Integers  in  the  range  0 < x < 231  can  be  encoded  as 
a string  of  up  to  six  bytes  a(x)  = on  ...  at  in  the  following  way:  If  x < 27,  set  l 1 and 
“i  Otherwise  let  x = [xh  . . . xlXo)M',  set  l <—  [(Ax)/5l,ai  t-  28 -28~'  +xi_u  and 
ai  = ^ +xi-j  for  2 < j <1.  Notice  that  a(x ) contains  a zero  byte  if  and  only  if  x = 0. 

a)  What  are  the  encodings  of  #a,  #3a3,  #7b97,  and  #ldl41? 

b)  If  x < x , prove  that  a(x)  < a(x')  in  lexicographic  order. 

c)  Suppose  a sequence  of  values  x(1)x(2). . . x(n)  has  been  encoded  as  a byte  string 
a(x  l)a(i*2,)...ci( x^),  and  let  a*,  be  the  fcth  byte  in  that  string.  Show  that 
it’s  easy  to  determine  the  value  x(l)  from  which  ak  came,  by  looking  at  a few  of 
the  neighboring  bytes  if  necessary. 
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197.  [22]  The  Universal  Character  Set  (UCS),  also  known  as  Unicode,  is  a standard 
mapping  of  characters  to  integer  codepoints  * in  the  range  0 < x < 220  + 216.  An 
encoding  called  UTF-16  represents  such  integers  as  one  or  two  wydes  /3(x)  = fli  or 
/3(x)  = flifli,  in  the  following  way:  If  x < 216  then  fl(x)  = *;  otherwise 

Pi  = #d800  + Lj//210J  and  fii  = #dc00  + ( y mod  210),  where  y = x — 216. 

Answer  questions  (a),  (b),  and  (c)  of  exercise  196  for  this  encoding. 

► 198.  [21]  Unicode  characters  are  often  represented  as  strings  of  bytes  using  a scheme 
called  UTF-8,  which  is  the  encoding  of  exercise  196  restricted  to  integers  in  the  range 
0 < x < 220  + 216.  Notice  that  UTF-8  efficiently  preserves  the  standard  ASCII  character 
set  (the  codepoints  with  x < 27),  and  that  it  is  quite  different  from  UTF-16. 

Let  a\  be  the  first  byte  of  a UTF-8  string  a(x).  Show  that  there  are  reasonably 
small  integer  constants  a,  b,  and  c such  that  only  four  bitwise  operations 

(a  3>  ((ai  3>  6)  & c))  & 3 

suffice  to  determine  the  number  l — 1 of  bytes  between  ai  and  the  end  of  a(x). 

► 199.  [23]  A person  might  try  to  encode  #a  as  #c08a  or  #e0808a  or  #f080808a  in 
UTF-8,  because  the  obvious  decoding  algorithm  produces  the  same  result  in  each  case. 
But  such  unnecessarily  long  forms  are  illegal,  because  they  could  lead  to  security  holes. 

Suppose  a.\  and  c*2  are  bytes  such  that  au  > #80  and  #80  < c*2  < #c0.  Find 
a branchless  way  to  decide  whether  ai  and  ai  are  the  first  two  bytes  of  at  least  one 
legitimate  UTF-8  string  a(x). 

200.  [20]  Interpret  the  contents  of  register  $3  after  the  following  three  MMIX  instruc- 
tions have  been  executed:  MOR  $1,$0,#94;  MXOR  $2,$0,#94;  SUBU  $3,$1,$2. 

201.  [20]  Suppose  x = (*15  . . . *i*o)i6  has  sixteen  hexadecimal  digits.  What  one 
MMIX  instruction  will  change  each  nonzero  digit  to  f , while  leaving  zeros  untouched? 

202.  [20]  What  two  instructions  will  change  an  octabyte’s  nonzero  wydes  to  #ffff ? 

203.  [22]  Suppose  we  want  to  convert  a tetrabyte  x = (2:7  . . . £1*0)16  to  the  octabyte 
y = (2/7  •••  1/13/0)256,  where  yj  is  the  ASCII  code  for  the  hexadecimal  digit  Xj.  For 
example,  if  * = #1234abcd,  y should  represent  the  8-character  string  "1234abcd". 
What  clever  choices  of  five  constants  a,  b,  c,  d,  and  e will  make  the  following  MMIX 
instructions  do  the  job? 

M0Rt,x,a;  SLUs,t,4;  X0Rt,s,t;  ANDt,t,b 
ADD  t , t , c ; MOR  s , d , t ; ADD  t , t , e ; ADD  y , t , s 

► 204.  [22]  What  are  the  amazing  constants  p,  q,  r,  m that  achieve  a perfect  shuffle 
with  just  six  MMIX  commands?  (See  (i75)-(i78).) 

► 205.  [22]  How  would  you  perfectly  unshuffle  on  MMIX,  going  from  w in  (175)  back  to  2? 

206.  [20]  The  perfect  shuffle  (175)  is  sometimes  called  an  “outshuffle,”  by  comparison 
with  the  “inshuffle”  that  takes  z nt  j(i  = (3/31*31  •••  2/1*13/0*0)2;  the  outshuffle 
preserves  the  leftmost  and  rightmost  bits  of  z,  but  the  inshuffle  has  no  fixed  points. 
Can  an  inshuffle  be  performed  as  efficiently  as  an  outshuffle? 

207.  [22]  Use  MOR  to  perform  a 3- way  perfect  shuffle  or  “triple  zip,”  taking  (*63  . . . *0)2 
to  (*21*42*63*20  • • • *2*23*44*1*22*43*0)2,  as  well  as  the  inverse  of  this  shuffle. 

► 208.  [23]  What’s  a fast  way  for  MMIX  to  transpose  an  8 X 8 Boolean  matrix? 

► 209.  [21]  Is  the  suffix  parity  operation  *®  of  exercise  36  easy  to  compute  with  MXOR? 
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210.  [22]  A puzzle:  Register  x contains  a number  8 j + k,  where  0 < j,  k < 8.  Registers 
a and  b contain  arbitrary  octabytes  (a7  . . . a1a0)256  and  (b7  . . . b,b0)2b&.  Find  a sequence 
ot  four  MMIX  instructions  that  will  put  clj  & b & into  register  x. 

► 211.  [M25]  The  truth  table  of  a Boolean  function  f(x i, . . . , x6)  is  essentially  a 64-bit 
number  / = (/( 0, 0, 0, 0,  0, 0) ...  /(1, 1, 1, 1, 1, 0)/(1, 1, 1, 1, 1, 1))2.  Show  that  two  M0R 
instructions  will  convert  / to  the  truth  table  of  the  least  monotone  Boolean  function, 
/ , that  is  greater  than  or  equal  to  / at  each  point. 

212.  [M32]  Suppose  a = (a63  ...  aia0)2  represents  the  polynomial 

a(x)  = (a63  • • • ai<io)x  = ae3X63  + • • ■ + a\x  + ao. 

Discuss  using  MX0R  to  compute  the  product  c(x)  = a(x)b(x),  modulo  x64  and  mod  2. 

► 213.  [HM26]  Implement  the  CRC  procedure  (183)  on  MMIX. 

► 214.  [HM28]  (R.  W.  Gosper.)  Find  a short,  branchless  MMIX  computation  that  com- 
putes the  inverse  of  any  given  8x8  matrix  X of  0s  and  Is,  modulo  2,  if  det  X is  odd. 

► 215.  [21]  What’s  a quick  way  for  MMIX  to  test  if  a 64-bit  number  is  a multiple  of  3? 

► 216.  [M26]  Given  n-bit  integers  xu  . . . , xm  > 0,  n > Am,  compute  in  O(m)  steps  the 
least  y > 0 such  that  y <£  {alXl  + - ■ ■+amXm  \ au...,am  € {0,1}},  if  Az  takes  unit  time. 
217.  [40]  Explore  the  processing  of  long  strings  of  text  by  packing  them  in  a “trans- 
posed” or  “sliced”  manner:  Represent  64  consecutive  characters  as  a sequence  of  eight 
octabytes  w0...w7,  where  wk  contains  all  64  of  their  kth  bits. 

► 218.  [M30]  (Hans  Petter  Selasky,  2009.)  For  fixed  d > 3,  design  an  algorithm  to 
compute  a ■ xy  mod  2 , given  integers  a,  x,  and  y,  where  x is  odd,  using  O(d)  additions 
and  bitwise  operations  together  with  a single  multiplication  by  y. 

In  popular  usage,  the  term  BDD  almost  always  refers  to 
Reduced  Ordered  Binary  Decision  Diagram  (ROBDD  in  the  literature 
used  when  the  ordering  and  reduction  aspects  need  to  be  emphasized). 

WIKIPEDIA,  The  Free  Encyclopedia  (7  July  2007) 

7.1.4.  Binary  Decision  Diagrams 

Let’s  turn  now  to  an  important  family  of  data  structures  that  have  rapidly  be- 
come the  method  of  choice  for  representing  and  manipulating  Boolean  functions 
inside  a computer.  The  basic  idea  is  a divide-and-conquer  scheme  somewhat  like 
the  binary  tries  of  Section  6.3,  but  with  several  new  twists. 

Figure  21  shows  the  binary  decision  diagram  for  a simple  Boolean  function 
of  three  variables,  the  median  function  ( x1x2x3 ) of  Eq.  7.1.1-(43).  We  can  un- 
derstand it  as  follows:  The  node  at  the  top  is  called  the  root.  Every  internal  node 
(j),  also  called  a branch  node , is  labeled  with  a name  or  index  j = that 

designates  a variable;  for  example,  the  root  node  @ in  Fig.  21  designates  Xl. 
Branch  nodes  have  two  successors,  indicated  by  descending  lines.  One  of  the 
successors  is  drawn  as  a dashed  line  and  called  LO;  the  other  is  drawn  as  a solid 
line  and  called  HI.  These  branch  nodes  define  a path  in  the  diagram  for  any  values 
of  the  Boolean  variables,  if  we  start  at  the  root  and  take  the  LO  branch  from 
node  (j)  when  Xj  = 0,  the  HI  branch  when  Xj  = 1.  Eventually  this  path  leads 
to  a sink  node,  which  is  either  \T\  (denoting  FALSE)  or  [fj  (denoting  TRUE). 
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Fig.  21.  The  binary  decision  diagram  (BDD) 
for  the  majority  or  median  function  (X1X2X3). 


In  Fig.  21  it's  easy  to  verify  that  this  process  yields  the  function  value  FALSE 
when  at  least  two  of  the  variables  {xi,x2,x3}  are  0,  otherwise  it  yields  TRUE. 

Many  authors  use  [5]  and  [7]  to  denote  the  sink  nodes.  We  use  [X]  and  [t] 
instead,  hoping  to  avoid  any  confusion  with  the  branch  nodes  (o)  and  (T). 

Inside  a computer,  Fig.  21  would  be  represented  as  a set  of  four  nodes  in 
arbitrary  memory  locations,  where  each  node  has  three  fields  | V | L0  | HI  | . 
The  V held  holds  the  index  of  a variable,  while  the  L0  and  HI  fields  each  point 
to  another  node  or  to  a sink: 


With  64-bit  words,  we  might  for  example  use  8 bits  for  V,  then  28  bits  for  L0 
and  the  other  28  bits  for  HI. 

Such  a structure  is  called  a “binary  decision  diagram,”  or  BDD  for  short. 
Small  BDDs  can  readily  be  drawn  as  actual  diagrams  on  a piece  of  paper 
or  a computer  screen.  But  in  essence  each  BDD  is  really  an  abstract  set  of 
linked  nodes,  which  might  more  properly  be  called  a “binary  decision  dag”  — a 
binary  tree  with  shared  subtrees,  a directed  acyclic  graph  in  which  exactly  two 
distinguished  arcs  emanate  from  every  nonsink  node. 

We  shall  assume  that  every  BDD  obeys  two  important  restrictions.  First,  it 
must  be  ordered : Whenever  a LO  or  HI  arc  goes  from  branch  node  (T)  to  branch 
node  (J),  we  must  have  i < j.  Thus,  in  particular,  no  variable  Xj  will  ever  be 
queried  twice  when  the  function  is  evaluated.  Second,  a BDD  must  be  reduced, 
in  the  sense  that  it  doesn’t  waste  space.  This  means  that  a branch  node’s  LO 
and  HI  pointers  must  never  be  equal,  and  that  no  two  nodes  are  allowed  to  have 
the  same  triple  of  values  (V,  L0,HI).  Every  node  should  also  be  accessible  from 
the  root.  For  example,  the  diagrams 


are  not  BDDs,  because  the  first  one  isn’t  ordered  and  the  other  one  isn’t  reduced. 

Many  other  flavors  of  decision  diagrams  have  been  invented,  and  the  liter- 
ature of  computer  science  now  contains  a rich  alphabet  soup  of  acronyms  like 
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EVBDD,  FBDD,  IBDD,  OBDD,  OFDD,  OKFDD,  PBDD,  . . . , ZDD.  In  this 
book  we  shall  always  use  the  unadorned  code  name  “BDD”  to  denote  a binary 
decision  diagram  that  is  ordered  and  reduced  as  described  above,  just  as  we 
generally  use  the  word  “tree”  to  denote  an  ordered  (plane)  tree,  because  such 
BDDs  and  such  trees  are  the  most  common  in  practice. 

Recall  from  Section  7.1.1  that  every  Boolean  function  f(x i,...,xn)  cor- 
responds to  a truth  table , which  is  the  2n-bit  binary  string  that  starts  with 
the  function  value  / (0, . . . , 0)  and  continues  with  /( 0, . . . , 0, 1),  /( 0, . . . , 0, 1,0). 
/(0, ...  ,0, 1, 1),  ... , /( 1, . . . , 1, 1, 1).  For  example,  the  truth  table  of  the  median 
function  (xlX2x3)  is  00010111.  Notice  that  this  truth  table  is  the  same  as  the  se- 
quence of  leaves  in  the  unreduced  decision  tree  of  (2),  with  0 i-a  [X]  and  1 [j] . 

In  fact,  there’s  an  important  relationship  between  truth  tables  and  BDDs,  which 
is  best  understood  in  terms  of  a class  of  binary  strings  called  “beads.” 

A truth  table  of  order  n is  a binary  string  of  length  2".  A bead  of  order  n is 
a truth  table  /3  of  order  n that  is  not  a square;  that  is,  (3  doesn’t  have  the  form 
aa  for  any  string  a of  length  2n~\  (Mathematicians  would  say  that  a bead  is  a 
“primitive  string  of  length  2n.”)  There  are  two  beads  of  order  0,  namely  0 and  1: 
and  there  are  two  of  order  1,  namely  01  and  10.  In  general  there  are  22"  - 22"-1 
beads  of  order  n when  n > 0,  because  there  are  22"  binary  strings  of  length  2n 
and  2 2"  of  them  are  squares.  The  16  - 4 = 12  beads  of  order  2 are 

0001,  0010,  0011,  0100,  0110,  0111,  1000,  1001,  1011,  1100,  1101,  1110;  (3) 

these  are  also  the  truth  tables  of  all  functions  f{xi,x2)  that  depend  on  x1}  in 
the  sense  that  f(0,x2)  is  not  the  same  function  as  /( l,x2). 

Every  truth  table  r is  a power  of  a unique  bead,  called  its  root.  For  if  r has 
length  2 and  isn  t already  a bead,  it’s  the  square  of  another  truth  table  r';  and 
by  induction  on  the  length  of  r,  we  must  have  t'  = (3k  for  some  root  /?.  Hence 
r = /32k,  and  /3  is  the  root  of  r as  well  as  t' . (Of  course  k is  a power  of  2.) 

A truth  table  r of  order  n > 0 always  has  the  form  tqTi  , where  To  and  T\  are 
truth  tables  of  order  n - 1.  Clearly  r represents  the  function  f(x ltx2,  ,xn) 
if  and  only  if  r0  represents  /(0,  x2, . . . , xn)  and  n represents  /(l,  x2,\  . . ’x"). 
These  functions  /( 0,  x2, . . . , xn)  and  /( 1,  x2, . . . , xn ) are  called  subfunctions  of  /; 
and  their  truth  tables,  To  and  ti,  are  called  subtables  of  r. 

Subtables  of  a subtable  are  also  considered  to  be  subtables,  and  a table  is 
considered  to  be  a subtable  of  itself.  Thus,  in  general,  a truth  table  of  order  n 
has  2 subtables  of  order  n — k,  for  0 < k < n,  corresponding  to  2k  possible 
settings  of  the  first  k variables  (aq, . . . , xk).  Many  of  these  subtables  often  turn 
out  to  be  identical;  in  such  cases  we’re  able  to  represent  r in  a compressed  form. 

The  beads  of  a Boolean  function  are  the  subtables  of  its  truth  table  that  hap- 
pen to  be  beads.  For  example,  let’s  consider  again  the  median  function  (. Xlx2x3 ), 
with  its  truth  table  00010111.  The  distinct  subtables  of  this  truth  table  are 
{00010111,0001,0111,00,01,11,0,1};  and  all  of  them  except  00  and  11  are 
beads.  Therefore  the  beads  of  ( xlX2x3 ) are 

{00010111,0001,0111,01,0,1}. 


(4) 
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And  now  we  get  to  the  point:  The  nodes  of  a Boolean  function’s  BDD  are  in 
one-to-one  correspondence  with  its  beads.  For  example,  we  can  redraw  Fig.  21 
by  placing  the  relevant  bead  inside  of  each  node: 

(000 10  111) 

(0001)  (5m).  (5) 

Tj® 

EJ'"  12 

In  general,  a function’s  truth  tables  of  order  n + 1 — k correspond  to  its  sub- 
functions f(ci, . . . , Cfc_ i,  Xk, . . . , xn)  of  that  order;  so  its  beads  of  order  n + 1 — k 
correspond  to  those  subfunctions  that  depend  on  their  first  variable,  Xk . There- 
fore every  such  bead  corresponds  to  a branch  node  (k)  in  the  BDD.  And  if  (X)  is 
a branch  node  corresponding  to  the  truth  table  r'  = t'0t[  , its  LO  and  HI  branches 
point  respectively  to  the  nodes  that  correspond  to  the  roots  of  r('  and  t[  . 

This  correspondence  between  beads  and  nodes  proves  that  every  Boolean 
function  has  one  and  only  one  representation  as  a BDD.  The  individual  nodes 
of  that  BDD  might,  of  course,  be  placed  in  different  locations  inside  a computer. 

If  / is  any  Boolean  function,  let  B(f ) denote  the  number  of  beads  that  it  has. 
This  is  the  size  of  its  BDD  -the  total  number  of  nodes,  including  the  sinks.  For 
example,  B(f)  — 6 when  / is  the  median-of-three  function,  because  (5)  has  size  6. 

To  fix  the  ideas,  let’s  work  out  another  example,  the  “more-or-less  random” 
function  of  7.1.1-(22)  and  7.1.2-(6).  Its  truth  table,  1100100100001111,  is  a 
bead,  and  so  are  the  two  subtables  11001001  and  00001111.  Thus  we  know  that 
the  root  of  its  BDD  will  be  a (T)  branch,  and  that  the  LO  and  HI  nodes  below  the 
root  will  both  be  @s.  The  subtables  of  length  4 are  {1100, 1001,0000, 1111}; 
here  the  first  two  are  beads,  but  the  others  are  squares.  To  get  to  the  next  level, 
we  break  the  beads  in  half  and  carry  over  the  square  roots  of  the  nonbeads, 
identifying  duplicates;  this  leaves  us  with  {11,00,10,01}.  Again  there  are  two 
beads,  and  a final  step  produces  the  desired  BDD: 


(In  this  diagram  and  others  below,  it’s  convenient  to  repeat  the  sink  nodes  [X] 
and  [t)  in  order  to  avoid  excessively  long  connecting  lines.  Only  one  [X|  node 
and  one  [t]  node  are  actually  present;  so  the  size  of  (6)  is  9,  not  13.) 

An  alert  reader  might  well  be  thinking  at  this  point,  “Very  nice,  but  what 
if  the  BDD  is  huge?”  Indeed,  functions  can  easily  be  constructed  whose  BDD  is 
impossibly  large;  we’ll  study  such  cases  later.  But  the  wonderful  thing  is  that  a 
great  many  of  the  Boolean  functions  that  are  of  practical  importance  turn  out 
to  have  reasonably  small  values  of  £?(/).  So  we  shall  concentrate  on  the  good 
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news  first,  postponing  the  bad  news  until  we’ve  seen  why  BDDs  have  proved  to 
be  so  popular. 

BDD  virtues.  If  f(x)  = f(xi,...,xn)  is  a Boolean  function  whose  BDD  is 
reasonably  small,  we  can  do  many  things  quickly  and  easily.  For  example: 

• We  can  evaluate  f(x)  in  at  most  n steps,  given  any  input  vector  x = x\  xni 
by  simply  starting  at  the  root  and  branching  until  we  get  to  a sink. 

• We  can  find  the  lexicographically  smallest  x such  that  /( x)  = 1,  by  start- 
ing at  the  root  and  repeatedly  taking  the  LO  branch  unless  it  goes  directly 
to  [X] . The  solution  has  xj  = 1 only  when  the  HI  branch  was  necessary  at  (Jj . 
For  example,  this  procedure  gives  x1x2x3  = Oil  in  the  BDD  of  Fig.  21.  and 
X1X2X3X4  = 0000  in  (6).  (It  locates  the  value  of  x that  corresponds  to  the 
leftmost  1 in  the  truth  table  for  /.)  Only  n steps  are  needed,  because  every 
branch  node  corresponds  to  a nonzero  bead;  we  can  always  find  a downward 
path  to  [t]  without  backing  up.  Of  course  this  method  fails  when  the  root  itself 
is  |T) . But  that  happens  only  when  / is  identically  zero. 

• We  can  count  the  number  of  solutions  to  the  equation  f(x)  = 1,  using 
Algorithm  C below.  That  algorithm  does  B(f)  operations  on  n-bit  numbers;  so 
its  running  time  is  0(nB(f))  in  the  worst  case. 

• After  Algorithm  C has  acted,  we  can  speedily  generate  random  solutions 
to  the  equation  f(x)  = 1,  in  such  a way  that  every  solution  is  equally  likely. 

• We  can  also  generate  all  solutions  x to  the  equation  f(x)  = 1.  The  algorithm 
in  exercise  16  does  this  in  0(nN)  steps  when  there  are  N solutions. 

• We  can  solve  the  linear  Boolean  programming  problem:  Find  x such  that 


wxxi  -\ (-  wnxn  is  maximum,  subject  to  /(aq, . . .,xn)  = 1,  (7) 

given  constants  (uq, . . . , wn).  Algorithm  B (below)  does  this  in  0(n+B(f))  steps. 

• We  can  compute  the  generating  function  a0  + axz  + b anzn,  where  there 

are  aj  solutions  to  /(aq,  ...,xn)  = l with  Xl  + • • • + xn  = j.  (See  exercise  25.) 

• We  can  calculate  the  reliability  polynomial  F(plf . . . ,pn),  which  is  the  prob- 
ability that  f(x i,...,x„)  = 1 when  each  xj  is  independently  set  to  1 with  a 
given  probability  pj.  Exercise  26  does  this  in  0(B(f))  steps. 

Moreover,  we  will  see  that  BDDs  can  be  combined  and  modified  efficiently.  For 
example,  it  is  not  difficult  to  form  the  BDDs  for  f(x  1,...,xn)  A g{x\, . ,xn) 
and  f{x±, . . . , Xj_i, g(xi, . . . ,xn),Xj+ 1, ...  ,xn)  from  the  BDDs  for  / and  g. 

Algorithms  for  solving  basic  problems  with  BDDs  are  often  described  most 
easily  if  we  assume  that  the  BDD  is  given  as  a sequential  list  of  branch  instruc- 
tions J,-i,  Is- 2,  • • • , h,  7).  where  each  X has  the  form  (cfc?  lk\ hk ) . For  example, 
(6)  might  be  represented  as  a list  of  s = 9 instructions 


h = (I?  7:  6), 
I7  = (2?  5: 4), 
76  = (2?0:1), 


h = (3?  1:0), 
h = (3?  3:  2), 
h = (4?  1:0), 


h = (4?  0:1), 
/i  = (5?l:l), 
Io  = (5?  0:0), 


(8) 


with  v8  = 1,  ls  = 7,  he  =6,  v7  = 2,  l7  = 5,  h7  = 4,  . . . , v0  = 5,  l0  = h0  = 0.  In 
general  the  instruction  hf  means,  “If  xv  = 0,  go  to  otherwise  go  to  Ih” 
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except  that  the  last  cases  I\  and  I0  are  special.  We  require  that  the  LO  and  HI 
branches  lk  and  hk  satisfy 

Ik  < k,  hk  < k,  vtk  > vk,  and  vhk  > vk,  for  s > k > 2;  (g) 

in  other  words,  all  branches  move  downward,  to  variables  of  greater  index.  But 
the  sink  nodes  [t]  and  [X]  are  represented  by  dummy  instructions  I\  and  /0,  in 
which  lk  = hk  = k and  the  “variable  index”  vk  has  the  impossible  value  n + 1. 

These  instructions  can  be  numbered  in  any  way  that  respects  the  topological 
ordering  of  the  BDD,  as  required  by  (9).  The  root  node  must  correspond  to  Is-i, 
and  the  sink  nodes  must  correspond  to  I1  and  J0,  but  the  other  index  numbers 
aren’t  so  rigidly  prescribed.  For  example,  (6)  might  also  be  expressed  as 

/'=(!?  7:  2),  4 = (4?  0: 1),  Z'  = (2?  0: 1), 

4 = (2?  4: 6),  4 = (3?  1:0),  4 = (5?  1:1),  (10) 

4 = (3?3:5),  4 = (4?  1:0),  4 = (5?  0:0), 

and  in  46  other  isomorphic  ways.  Inside  a computer,  the  BDD  need  not  actu- 
ally  appear  in  consecutive  locations;  we  can  readily  traverse  the  nodes  of  any 
acyclic  digraph  in  topological  order,  when  the  nodes  are  linked  as  in  (1).  But 
we  will  imagine  that  they’ve  been  arranged  sequentially  as  in  (8),  so  that  various 
algorithms  are  easier  to  understand. 

One  technicality  is  worth  noting:  If  f(x)  = 1 for  all  x,  so  that  the  BDD 
is  simply  the  sink  node  [t]  , we  let  s = 2 in  this  sequential  representation. 
Otherwise  s is  the  size  of  the  BDD.  Then  the  root  is  always  represented  by  Is—i- 

Algorithm  C ( Count  solutions).  Given  the  BDD  for  a Boolean  function  f(x)  = 
f(x  1, . . . ,xn),  represented  as  a sequence  /s_i,  ...,  Iq  as  described  above,  this 
algorithm  determines  |/|,  the  number  of  binary  vectors  x = x± . . ,xn  such  that 
f(x)  — 1.  It  also  computes  the  table  Co,  ci,  . . . , cs_i,  where  ck  is  the  number 
of  Is  in  the  bead  that  corresponds  to  Ik. 

Cl.  [Loop  over  k]  Set  c0  4-  0,  ci  4-  1,  and  do  step  C2  for  k = 2,  3,  . . . , s - 1. 
Then  return  the  answer  2 ”e“1-1c3_1. 

C2.  [Compute  ck]  Set  l 4-  lk,  h 4-  hk,  and  ck  4-  2v,~Vk~1ci  + 2Vh-Vk~1ch.  \ 
For  example,  when  presented  with  (8),  this  algorithm  computes 

c2  I,  C3  <—  1,  C4  e-  2,  C5  <r~  2,  Cq  i 4,  C7  4—  4,  eg  4-  8; 

the  total  number  of  solutions  to  f(xi,  x2,  x3,  x4)  = 1 is  8. 

The  integers  ck  in  Algorithm  C satisfy 

0 < ck  < 2n+1~Vk,  for  2 <k<s,  (11) 

and  this  upper  bound  is  the  best  possible.  Therefore  multiprecision  arithmetic 
may  be  needed  when  n is  large.  If  extra  storage  space  for  high  precision  is  prob- 
lematic, one  could  use  modular  arithmetic  instead,  running  the  algorithm  several 
times  and  computing  ck  mod  p for  various  single-precision  primes  p\  then  the  final 
answer  would  be  deducible  with  the  Chinese  remainder  algorithm,  Eq.  4.3.2-(24). 
On  the  other  hand,  floating  point  arithmetic  is  usually  sufficient  in  practice. 
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Let’s  look  at  some  examples  that  are  more  interesting  than  (6).  The  BDDs 


represent  functions  of  six  variables  that  correspond  to  subsets  of  vertices  in  the 
cycle  graph  C6.  In  this  setup  a vector  such  as  x1...x6  = 100110  stands  for  the 
subset  {1, 4,  5};  the  vector  000000  stands  for  the  empty  subset;  and  so  on.  On  the 
left  is  the  BDD  for  which  we  have  f(x)  = 1 when  z is  independent  in  C6;  on  the 
right  is  the  BDD  for  maximal  independent  subsets,  also  called  the  kernels  of  C6 
(see  exercise  12).  In  general,  the  independent  subsets  of  Cn  correspond  to  ar- 
rangements of  0s  and  Is  in  a circle  of  length  n,  with  no  two  Is  in  a row;  the  kernels 
correspond  to  such  arrangements  in  which  there  also  are  no  three  consecutive  0s. 

Algorithm  C decorates  a BDD  with  counts  e*: , working  from  bottom  to  top, 
where  Cfc  is  the  number  of  ways  to  go  from  node  k to  [t]  by  choosing  values  for 
xi . . . xn,  if  l is  the  label  of  node  k.  When  we  apply  that  algorithm  to  the  BDDs 
in  (12)  we  get 


hence  Ce  has  18  independent  sets  and  5 kernels. 

These  counts  make  it  easy  to  generate  uniformly  random  solutions.  For 
example,  to  get  a random  independent  set  vector  xx . . ,xe,  we  know  that  13  of 
the  solutions  in  the  left-hand  BDD  have  xx  = 0,  while  the  other  5 have  xx  = 1. 
So  we  set  xx  <-  0 with  probability  13/18,  and  take  the  LO  branch;  otherwise  we 
set  xx  <—  1 and  take  the  HI  branch.  In  the  latter  case,  xx  = 1 forces  x2  <-  0,  but 
then  X3  could  go  either  way. 

Suppose  we’ve  chosen  to  set  xx  «-  1,  x2  <-  0,  x3  <-  0,  and  x4  «-  0;  this  case 
occurs  with  probability  £ • f • § • § = Then  there’s  a branch  from  @ to  ©, 
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so  we  flip  a coin  and  set  x5  to  a completely  random  value.  In  general,  a branch 
from  (J)  to  (j)  means  that  the  j — i — 1 intermediate  bits  Xi+ 1 . . . Xj-\  should 
independently  become  0 or  1 with  equal  probability.  Similarly,  a branch  from 
(t)  to  [t]  should  assign  random  values  to  xl+i . . . xn. 

Of  course  there  are  simpler  ways  to  make  a random  choice  between  18 
solutions  to  a combinatorial  problem.  Moreover,  the  right-hand  BDD  in  (13) 
is  an  embarrassingly  complex  way  to  represent  the  five  kernels  of  C6:  We  could 
simply  have  listed  them,  001001,  010010,  010101,  100100,  101010!  But  the  point 
is  that  this  same  method  will  yield  the  independent  sets  and  kernels  of  Cn  when 
n is  much  larger.  For  example,  the  100-cycle  C100  has  1,630,580,875,002  kernels, 
yet  the  BDD  describing  them  has  only  855  nodes.  One  hundred  simple  steps  will 
therefore  generate  a fully  random  kernel  from  this  vast  collection. 

Boolean  programming  and  beyond.  A bottom-up  algorithm  analogous  to 
Algorithm  C is  also  able  to  find  optimum  weighted  solutions  (7)  to  the  Boolean 
equation  f(x)  = 1.  The  basic  idea  is  that  it’s  easy  to  deduce  an  optimum  solution 
for  any  bead  of  /,  once  we  know  optimum  solutions  for  the  LO  and  HI  beads 
that  lie  directly  below  it. 

Algorithm  B ( Solutions  of  maximum  weight).  Let  Is-i,  ■ ■ ■ , Iq  be  a sequence 
of  branch  instructions  that  represents  the  BDD  for  a Boolean  function  /,  as  in 
Algorithm  C,  and  let  (uq, . . . , wn)  be  an  arbitrary  sequence  of  integer  weights. 

This  algorithm  finds  a binary  vector  x — Xi  . . . xn  such  that  uqaq  + b wnxn 

is  maximum,  over  all  x with  f(x)  = 1.  We  assume  that  s > 1;  otherwise  /( x) 
is  identically  0.  Auxiliary  integer  vectors  mi . . . ms_i  and  W\  . . . Wn+i  are  used 
in  the  calculations,  as  well  as  an  auxiliary  bit  vector  £2  • • 

Bl.  [Initialize.]  Set  Wn+1  <—  0 and  Wj  4—  Wj+ 1 + max(wj,0)  for  n > j > 1. 

B2.  [Loop  011  A-.]  Set  mi  4—  0 and  do  step  B3  for  2 < k < s.  Then  do  step  B4. 
B3.  [Process  Ik.)  Set  v 4-  vk,  l 4-  lk,  h «-  hk,  tk  4-  0.  If  l / 0,  set  mk  f- 
mi  + Wv+i  - WVl.  Then  if  h ^ 0,  compute  m 4~  mh  + Wv+ 1 - WVh  + wv- 
and  if  l — 0 or  m > mk , set  mk  4-  m and  tk  «—  1. 

B4.  [Compute  the  x’s.]  Set  j t—  0,  k <—  s — 1,  and  do  the  following  operations 
until  j = n:  While  j < vk  - 1,  set  j «-  j + 1 and  Xj  t-  [wj  > 0];  if  k > 1, 
set  j 4—  j + 1 and  Xj  <—  tk  and  k (tk  = 0?  lk:  hk).  | 

A simple  case  of  this  algorithm  is  worked  out  in  exercise  18.  Step  B3  does  tech- 
nical maneuvers  that  may  look  a bit  scary,  but  their  net  effect  is  just  to  compute 

mk  4-  max(m;  + Wv+1  - WVl,  mh  + Wv+1  - WVh  + wv),  (14) 

and  to  record  in  tk  whether  l or  h is  better.  In  fact,  vi  and  vh  are  usually  both 
equal  to  v + 1;  then  the  calculation  simply  sets  mk  4-  ma x(mj,m/,  + wv),  cor- 
responding to  the  cases  xv  = 0 and  xv  = 1.  Technicalities  arise  only  because  we 
want  to  avoid  fetching  mo,  which  is  —00,  and  because  vi  or  vk  might  exceed  t>+ 1. 

With  this  algorithm  we  can,  for  example,  quickly  find  an  optimum  set  of  ker- 
nel vertices  in  an  n-cycle  Cn,  using  weights  based  on  the  “Thue-Morse”  sequence, 

Wj  = (— 1)"J;  (15) 
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here  vj  denotes  sideways  addition,  Eq.  7.1.3-(5g).  In  other  words,  wj  is  -1  or 
+1,  depending  on  whether  j has  odd  parity  or  even  parity  when  expressed  as 
a binary  number.  The  maximum  of  wiXi  + • • • + wnxn  occurs  when  the  even- 
parity  vertices  3,  5,  6,  9,  10,  12,  15,  . . . most  strongly  outnumber  the  odd-parity 
vertices  1,  2,  4,  7,  8,  11,  13,  . . . that  appear  in  a kernel.  It  turns  out  that 


{1, 3, 6, 9, 12, 15, 18, 20,  23, 25,  27,  30,  33,  36,  39, 41, 43,  46, 48, 

51,  54, 57, 60, 63, 66,  68,  71,  73,  75,  78, 80, 83,  86, 89,  92,  95,  97,  99}  (16) 

is  an  optimum  kernel  in  this  sense  when  n = 100;  only  five  vertices  of  odd  parity, 
namely  {1, 25, 41,  73,  97},  need  to  be  included  in  this  set  of  38  to  satisfy  the  kernel 
conditions,  hence  max(w1xx  + - • --l-wioo^ioo)  = 28.  Thanks  to  Algorithm  B,  a few 
thousand  computer  instructions  are  sufficient  to  select  (16)  from  more  than  a tril- 
lion possible  kernels,  because  the  BDD  for  all  those  kernels  happens  to  be  small. 

Mathematically  pristine  problems  related  to  combinatorial  objects  like  cycle 
kernels  could  also  be  resolved  efficiently  with  more  traditional  techniques,  which 
are  based  on  recurrences  and  induction.  But  the  beauty  of  BDD  methods  is  that 
they  apply  also  to  real-world  problems  that  don’t  have  any  elegant  structure.  For 
example,  let’s  consider  the  graph  of  49  “united  states”  that  appeared  in  7 (17) 
and  7— (61).  The  Boolean  function  that  represents  all  the  maximal  independent 
sets  of  that  graph  (all  the  kernels)  has  a BDD  of  size  780  that  begins  as  follows: 


Algorithm  B quickly  discovers  the  following  kernels  of  minimum  and  maximum 
weight,  when  each  state  vertex  is  simply  weighted  according  to  the  sum  of  letters 
in  its  postal  code  (wCA  = 3 + 1,  wDC  = 4 + 3,  . . , , WvY  = 23  + 25): 


This  graph  has  266,137  kernels;  but  with  Algorithm  B,  we  needn’t  generate  them 
all.  In  fact,  the  right-hand  example  in  (18)  could  also  be  obtained  with  a smaller 
BDD  of  size  428,  which  characterizes  the  independent  sets,  because  all  weights 
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are  positive.  (A  kernel  of  maximum  weight  is  the  same  thing  as  an  independent 
set  of  maximum  weight,  in  such  cases.)  There  are  211,954,906  independent  sets 
in  this  graph,  many  more  than  the  number  of  kernels;  yet  we  can 
find  an  independent  set  of  maximum  weight  more  quickly  than 
a kernel  of  maximum  weight,  because  the  BDD  is  smaller. 


Fig.  22.  The  grid  P3  □ P3 , and 
a BDD  for  its  connected  subgraphs. 

A quite  different  sort  of  graph-related  BDD  is  shown  in 
Fig.  22.  This  one  is  based  on  the  3x3  grid  P3aP3;  it  characterizes 
the  sets  of  edges  that  connect  all  vertices  of  the  grid  together.  Thus, 
it’s  a function  f(x  12,2:13, . . . ,x89)  of  the  twelve  edges  1 — 2,  1 — 3,  . . . , 

8 — 9 instead  of  the  nine  vertices  {1, . . . , 9}.  Exercise  55  describes  one  way  to 
construct  it.  When  Algorithm  C is  applied  to  this  BDD,  it  tells  us  that  exactly 
431  of  the  212  = 4096  spanning  subgraphs  of  P:>  aP3  are  connected. 

A straightforward  extension  of  Algorithm  C (see  exercise  25)  will  refine  this 
total  and  compute  the  generating  function  of  these  solutions,  namely 


G{z)  = £y*/(*)  = 1922s  + 164z9  + 62z10  + 12zu 


„12 


(19) 


Thus  P3  □ P>  has  192  spanning  trees,  plus  164  spanning  subgraphs  that  are 
connected  and  have  nine  edges,  and  so  on.  Exercise  7.2.1.6-106(a)  gives  a formula 
for  the  number  of  spanning  trees  in  Pm  □ Pn  for  general  m and  n;  but  the 
full  generating  function  G(z)  contains  considerably  more  information,  and  it 
probably  has  no  simple  formula  unless  min(m,  n)  is  small. 

Suppose  each  edge  u — v is  present  with  probability  puv,  independent  of 
all  other  edges  of  P3nP3.  What  is  the  probability  that  the  resulting  subgraph 
is  connected?  This  is  the  reliability  polynomial,  which  also  goes  by  a variety 
of  other  names  because  it  arises  in  many  different  applications.  In  general,  as 
discussed  in  exercise  7.1.1-12,  every  Boolean  function  f(xx,. . . , xn)  has  a unique 
representation  as  a polynomial  F(x\, . . . , xn ) with  the  properties  that 

i)  F(x  1, . . . ,xn)  = f(xi, . . . ,xn)  whenever  each  Xj  is  0 or  1; 

ii)  F(xi, . . . , xn)  is  multilinear:  Its  degree  in  Xj  is  < 1 for  all  j. 

This  polynomial  F has  integer  coefficients  and  satisfies  the  basic  recurrence 


F(x  1,. ..,£„)  = (1  - xi)F0(x2,  ■ ■ ■ ,xn)  + x1F1(x2,  ■ ■ ■ ,xn),  (20) 

where  F0  and  iq  are  the  integer  multilinear  representations  of  /( 0,x2, . . . ,xn) 
and  /(l,  X2,  ■ ■ * , xn).  Indeed,  (20)  is  George  Boole’s  “law  of  development.” 

Two  important  things  follow  from  recurrence  (20).  First,  F is  precisely 
the  reliability  polynomial  F(pi, . . . ,pn ) mentioned  earlier,  because  the  reliability 
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polynomial  obviously  satisfies  the  same  recurrence.  Second,  F is  easily  calculated 
from  the  BDD  for  /,  working  upward  from  the  bottom  and  using  (20)  to  compute 
the  reliability  of  each  bead.  (See  exercise  26.) 

The  connectedness  function  for  an  8 x 8 grid  P3oP3  is,  of  course,  much  more 
complicated  than  the  one  for  P3aP3;  it  is  a Boolean  function  of  112  variables  and 
its  BDD  has  43790  nodes,  compared  to  only  37  in  Fig.  22.  Still,  computations 
with  this  BDD  are  quite  feasible,  and  in  a second  or  two  we  can  compute 


G(z)  = 126231322912498539682594816z63 

+ 100661 1 14003541 1062600761344z64 

+ • ■ • + 6212z110  + 112zlu  + z112, 

as  well  as  the  probability  F(p)  of  connectedness 
and  its  derivative  F'(p ),  when  each  of  the  edges  is 
present  with  probability  p (see  exercise  29): 


F(P): 


F'(p): 


(21) 


0 P 1 0 p 1 

*A  sweeping  generalization.  Algorithms  B and  C and  the  algorithms  we’ve 
been  discussing  for  bottom-up  BDD  scanning  are  actually  special  cases  of  a much 
more  general  scheme  that  can  be  exploited  in  many  additional  ways.  Consider 
an  abstract  algebra  with  two  associative  binary  operators  o and  • , satisfying  the 
distributive  laws 


a*  (£07)  = (<*•/?)  o (0.7),  (/3  o 7)  • a = (/?  • a)  o (7  • a).  (22) 

Every  Boolean  function  /(aq,  . . . ,x„)  corresponds  to  a fully  elaborated  truth  table 
involving  the  symbols  o,  •,  _L,  and  T,  together  with  x3  and  x3  for  1 < j < n,  in 
a way  that’s  best  understood  by  considering  a small  example:  When  n = 2 and 
when  the  ordinary  truth  table  for  / is  0010,  the  fully  elaborated  truth  table  is 

(xx  • x2  • -L)  o (xj  • x2  • 1)  o ( xi  • x2  • T)  o (xj  • x2  • -L).  (23) 

The  meaning  of  such  an  expression  depends  on  the  meanings  that  we  attach  to 
the  symbols  o,  •,  _L,  T,  and  to  the  literals  x3  and  x}:  but  whatever  the  expression 
means,  we  can  compute  it  directly  from  the  BDD  for  /. 

For  example,  let’s  return  to  Fig.  21,  the  BDD  for  (x1x2x3).  The  elaborations 
of  nodes  [T]  and  [t]  are  a±  = _L  and  aT  = T,  respectively.  Then  the  elaboration 
of  (3)  is  a3  = ( x3  • ax)  o (x3  • ax);  the  elaborations  of  the  nodes  labeled  (2)  are 
«2  = (x2»{x3ox3)»a±)o(x2»a3)  on  the  left  and  ar2  = 0r2»(*3)o(x2*(x3ox3)«aT) 
on  the  right;  and  the  elaboration  of  node  (l)  is  oq  = (Tq  • al2)  o (x,  • a2). 
(Exercise  31  discusses  the  general  procedure.)  Expanding  these  formulas  via  the 
distributive  laws  (22)  leads  to  a full  elaboration  with  2"  = 8 “terms”: 

07  = 0ri«i2*S3*-L)  ° (x^x^x^L)  o (xx •x2*x3»_L)  o (x!*X2*X3*T) 

° (xi •x2*x3*_L)  o (x1»X2«X3«T)  o (xi»x2»x3»T)  o (xi*x2*x3*T).  (24) 
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Algorithm  C is  the  special  case  where  ‘o’  is  addition,  V is  multiplication, 
‘X’  is  0,  ‘T’  is  1,  ‘xf  is  1,  and  lXjJ  is  also  1.  Algorithm  B arises  when  ‘o’  is  the 
maximum  operator  and  V is  addition;  the  distributive  laws 

a + max(/3, 7)  = max(a+/3,  a+7),  max(/3, 7)  + a = max(/3+a,  7+a)  (25) 

are  easily  checked.  We  interpret  ‘X’  as  —00,  ‘T’  as  0,  ‘at,-’  as  0,  and  lXj ’ as  Wj. 
Then,  for  example,  (24)  becomes 

max(— 00,  -00,  -00,  w2  + w 3,  -00,  wi  + w3,  u>i  + w2,  Wi  + w2  + w3); 

and  in  general  the  full  elaboration  under  this  interpretation  is  equivalent  to  the 
expression  lnaxftiqxi  + • • • + wnxn  | }(x  1, . . . , xn)  = 1}. 

Friendly  functions.  Many  families  of  functions  are  known  to  have  BDDs  of 
modest  size.  If  / is,  for  example,  a symmetric  function  of  n variables,  it’s  easy 
to  see  that  B(f ) = 0(n2).  Indeed,  when  n = 5 we  can  start  with  the  triangular 
pattern 


(26) 


and  set  the  leaves  to  [X]  or  [t]  depending  on  the  respective  values  of  / when  the 
value  of  vx  = X\+-  ■ 5 equals  0,  1,  2,  3,  4,  or  5.  Then  we  can  remove  redundant 

or  equivalent  nodes,  always  obtaining  a BDD  whose  size  is  ("j1)  + 2 or  less. 

Suppose  we  take  any  function  f(x  1, . . . , xn)  and  make  two  adjacent  variables 
equal: 

§{xi , . . . , Xn)  f(.X\i . . . , Xk— 1 , X]$ , Xk , Xk-\-2i  • • • , Xn) • (27) 

Exercise  40  proves  that  B(g)  < B(f).  And  by  repeating  this  condensation 
process,  we  find  that  a function  such  as  /( Xi,Xi,x2,X3,X3,xq)  has  a small  BDD 
whenever  B(f)  is  small.  In  particular,  the  threshold  function  [2xi  + 3x3  + xq  > t] 
must  have  a small  BDD  for  any  value  of  t , because  it’s  a condensed  version  of 

the  symmetric  function  f(x  1, . . . , x6)  = [aq  + 1 - xe>t}.  This  argument  shows 

that  any  threshold  function  with  nonnegative  integer  weights, 

f(x  i,x2,...,xn)  = [uqxi  + w2x2  4 h wnxn>t],  (28) 

can  be  obtained  by  condensing  a symmetric  function  of  wi  + w2  + ■ ■ ■ + wn 
variables,  so  its  BDD  size  is  0(w  1 + w2  + • • • + wn)2. 

Threshold  functions  often  turn  out  to  be  easy  even  when  the  weights  grow 
exponentially.  For  example,  suppose  t = (tit2  . . .tn)2  and  consider 


ft(xi,x2,...,xn)  - [ 2n  1x1+2n  2x2  H 1-  xn  > t\. 


(29) 
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This  function  is  true  if  and  only  if  the  binary  string  X\X2  . ■ . xn  is  lexicographically 
greater  than  or  equal  to  tit2  . . . tn,  and  its  BDD  always  has  exactly  n + 2 nodes 
when  tn  = 1.  (See  exercise  170.) 

Another  kind  of  function  with  small  BDD  is  the  2m-way  multiplexer  of 
Eq.  7.1.2-(3i),  a function  of  n = m -f  2m  variables: 

Afm(xi, . . . , xm\ xm+i, . . . , xn)  = xm+1+(Xl ...Xm)2.  (30) 

Its  BDD  begins  with  2k-1  branch  nodes  @ for  1 < k < m.  But  below  that  com- 
plete binary  tree,  there’s  just  one  ® for  each  x k in  the  main  block  of  variables 
with  m < k < n.  Hence  B(Mm ) = 1 + 2 H 1-  2m_1  + 2m  + 2 = 2m+1  + 1 < 2n. 

A linear  network  model  of  computation,  illustrated  in  Fig.  23,  helps  to 
clarify  the  cases  where  a BDD  is  especially  efficient.  Consider  an  arrangement 
of  computational  modules  Mi,  M2 , . . . , Mn,  in  which  the  Boolean  variable  xk 
is  input  to  module  Mk;  there  also  are  wires  between  neighboring  modules,  each 
carrying  a Boolean  signal,  with  ak  wires  from  Mk  to  Mk+1  and  bk  wires  from 
Mk+ 1 to  Mk  for  1 < k < n.  A special  wire  out  of  Mn  contains  the  output  of 
the  function,  f(x  1, . . .,xn).  We  define  a0  = b0  - bn  - 0 and  an  = 1,  so  that 
module  Mk  has  exactly  ck  = 1 + ak-i+bk  input  ports  and  exactly  dk  = ak  + bk_1 
output  ports  for  each  k.  It  computes  dk  Boolean  functions  of  its  ck  inputs. 

The  individual  functions  computed  by  each  module  can  be  arbitrarily  com- 
plicated, but  they  must  be  well  defined  in  the  sense  that  their  joint  values  are 
completely  determined  by  the  ads:  Every  choice  of  (xi,...,x„)  must  lead  to 
exactly  one  way  to  set  the  signals  on  all  the  wires,  consistent  with  all  of  the 
given  functions. 

Theorem  M.  If  f can  be  computed  by  such  a network , then  B(f)  < )T”=0  2ak2>,k. 

Proof.  We  will  show  that  the  BDD  for  / has  at  most  2ak-l2bk~ 1 branch  nodes 
® > for  1 < k < n.  This  is  clear  if  bk_i  = 0,  because  at  most  2ak~1  subfunctions 
are  possible  when  x\  through  xk-i  have  any  given  values.  So  we  will  show  that 
any  network  that  has  ak_i  forward  wires  and  6fc_i  backward  wires  between  Mk_i 
and  Mk  can  be  replaced  by  an  equivalent  network  that  has  afc_i  2bk~1  forward 
wires  and  none  that  run  backward. 

For  convenience,  let’s  consider  the  case  k = 4 in  Fig.  23,  with  a3  = 4 and 
b3  = 2;  we  want  to  replace  those  6 wires  by  16  that  run  only  forward.  Suppose 
Alice  is  in  charge  of  M3  and  Bob  is  in  charge  of  M4.  Alice  sends  a 4-bit  signal,  a, 
to  Bob  while  he  sends  a 2-bit  signal,  b , to  her.  More  precisely,  for  any  fixed 
value  of  (xi, . . .,xn),  Alice  computes  a certain  function  A and  Bob  computes  a 
function  B,  where 

A(b)  = a and  B(a)  = b.  (31) 

Alice  s function  A depends  on  (x±,  x2,  x3),  so  Bob  doesn’t  know  what  it  is;  Bob’s 
function  B is,  similarly,  unknown  to  Alice,  since  it  depends  on  (x4, . . . , xn). 
But  those  unknown  functions  have  the  key  property  that,  for  every  choice  of 
(xi, . . . ,xn),  there’s  exactly  one  solution  (a,  b)  to  the  equations  (31). 
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Fig.  23.  A generic  network  of  Boolean  modules  for  which  Theorem  M is  valid. 


So  Alice  changes  the  behavior  of  module  M3:  She  sends  Bob  four  4- bit 
values,  A(00),  A(01),  A(10),  and  A(ll),  thereby  revealing  her  A function.  And 
Bob  changes  the  behavior  of  M4.  Instead  of  sending  any  feedback,  he  looks  at 
those  four  values,  together  with  his  other  inputs  (namely  X4  and  the  64  bits 
received  from  M5),  and  discovers  the  unique  a and  b that  solve  (31).  His  new 
module  uses  this  value  of  a to  compute  the  04  bits  that  he  outputs  to  M5.  | 

Theorem  M says  that  the  BDD  size  will  be  reasonably  small  if  we  can 
construct  such  a network  with  small  values  of  ak  and  bk-  Indeed,  B(f)  will  be 
0(n)  if  the  a' s and  b's  are  bounded,  although  the  constant  of  proportionality 
might  be  huge.  Let’s  work  an  example  by  considering  the  three-in- a-row  function, 

f(x  I,...,xn)  — X1X2X3  V X2X3X4  V ■ • ■ VX„_2Xn_iXn  Vln-rXnXi  VxnXiX2,  (32) 

which  is  true  if  and  only  if  a circular  necklace  labeled  with  bits  xx,  . . . , xn  has 
three  consecutive  Is.  One  way  to  implement  it  via  Boolean  modules  is  to  give  Mk 
three  inputs  (uk,vk,Wk)  from  Mk_x  and  two  inputs  (yk,zk)  from  Mk+ 1,  where 

Uk—Xk-  1;  Vk  = Xk-2Xk-l,  Wk  = Xn_iX„Xi  V • ■ • V Xfc_3Xfe_2Xfc_1; 

Vk  — •En—l'Eri' 

Here  subscripts  are  treated  modulo  n,  and  appropriate  changes  are  made  at  the 
left  or  right  when  k=l  ox  k>n  — 1.  Then  Mk  computes  the  functions 

uk+ i=xk,  vk+i  = ukXk,  Wfc+i  = wk  V vkxk,  yk-i=yk,  zk_x  = zk  (34) 

for  nearly  all  values  of  k\  exercise  45  has  the  details.  With  this  construction  we 
have  ak  < 3 and  bk  < 2 for  all  k,  hence  Theorem  M tells  us  that  B(f)  < 2 12n  = 
4096n.  In  fact,  the  truth  is  much  sweeter:  B(f)  is  actually  < 9n  (see  exercise  46). 

Shared  BDDs.  We  often  want  to  deal  with  several  Boolean  functions  at  once, 
and  related  functions  often  have  common  subfunctions.  In  such  cases  we  can 
work  with  the  “BDD  base”  for  {/i(xi, . . . , xn), . . . , /m(xi, . . . , xn)},  which  is 
a directed  acyclic  graph  that  contains  one  node  for  every  bead  that  occurs 
within  the  truth  tables  of  any  of  the  functions.  The  BDD  base  also  has  m 
“root  pointers,”  F3,  one  for  each  function  fy,  the  BDD  for  fj  is  then  the  set  of 
all  nodes  reachable  from  node  Fj.  Notice  that  node  F}  itself  is  reachable  from 
node  Fi  if  and  only  if  fj  is  a subfunction  of  /*. 

For  example,  consider  the  problem  of  computing  the  n + 1 bits  of  the  sum 
of  two  n-bit  numbers, 

(/n+l/n/n~l  • • • /l)2  = (xjX3  . . . X2„-l)2  + (x2X4  . . . X2n)2.  (35) 
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The  BDD  base  for  those  n + 1 bits  looks  like  this  when  n = 4: 

Fa 


xxx3x5x7 
+ X2X4X6XS 

/5/4/3/2/1 


The  way  we’ve  numbered  the  ads  in  (35)  is  important  here  (see  exercise  51).  In 
general  there  are  exactly  B(fi, . . . , /n+1)  = 9n  — 5 nodes,  when  n > 1.  The  node 
just  to  the  left  of  Fj,  for  1 < j < n,  represents  the  subfunction  for  a carry  Cj  out 
of  the  jth  bit  position  from  the  right;  the  node  just  to  the  right  of  Fj  represents 
the  complement  of  that  carry,  Cj;  and  node  Fn+1  represents  the  final  carry  cn. 

Operations  on  BDDs.  We’ve  been  talking  about  lots  of  things  to  do  when  a 
BDD  is  given.  But  how  do  we  get  a BDD  into  the  computer  in  the  first  place? 

One  way  is  to  start  with  an  ordered  binary  decision  diagram  such  as  (26)  or 
the  right-hand  example  in  (2),  and  to  reduce  it  so  that  it  becomes  a true  BDD. 
The  following  algorithm,  based  on  ideas  of  D.  Sieling  and  I.  Wegener  [Information 
Processing  Letters  48  (1993),  139-144],  shows  that  an  arbitrary  TV- node  binary 
decision  diagram  whose  branches  are  properly  ordered  can  be  reduced  to  a BDD 
in  0(N  + n)  steps  when  there  are  n variables. 

Of  course  we  need  some  extra  memory  space  in  order  to  decide  whether 
two  nodes  are  equivalent,  when  doing  such  a reduction.  Having  only  the  three 
fields  (V,  L0,HI)  in  each  node,  as  in  (1),  would  give  us  no  room  to  maneuver. 
Fortunately,  only  one  additional  pointer-size  field,  called  AUX,  is  needed,  together 
with  two  additional  state  bits.  We  will  assume  for  convenience  that  the  state  bits 
are  implicitly  present  in  the  signs  of  the  LO  and  AUX  fields,  so  that  the  algorithm 
needs  to  deal  with  only  four  fields:  (V, LO, HI,  AUX).  The  fact  that  the  sign  is 
preempted  does  mean  that  a 28-bit  LO  field  will  accommodate  only  227  nodes  at 
most  about  134  million  — instead  of  228.  (O11  a computer  like  MMIX,  we  might 
prefer  to  assume  that  all  node  addresses  are  even,  and  to  add  1 to  a field  instead 
of  complementing  it  as  done  here.) 

Algorithm  R ( Reduction  to  a BDD).  Given  a binary  decision  diagram  that 
is  ordered  but  not  necessarily  reduced,  this  algorithm  transforms  it  into  a valid 
BDD  by  removing  unnecessary  nodes  and  rerouting  all  pointers  appropriately. 
Each  node  is  assumed  to  have  four  fields  (V,  LO,  HI,  AUX)  as  described  above,  and 
ROOT  points  to  the  diagram’s  top  node.  The  AUX  fields  are  initially  irrelevant,  ex- 
cept that  they  must  be  nonnegative;  they  will  again  be  nonnegative  at  the  end  of 
the  process.  All  deleted  nodes  are  pushed  onto  a stack  addressed  by  AVAIL,  linked 
together  by  the  HI  fields  of  its  nodes.  (The  LO  fields  of  these  nodes  will  be  neg- 
ative; their  complements  point  to  equivalent  nodes  that  have  not  been  deleted.) 
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The  V fields  of  branch  nodes  are  assumed  to  run  from  V(ROOT)  up  to  umax, 
in  increasing  order  from  the  top  downwards  in  the  given  dag.  The  sink  nodes  [X] 
and  [t]  are  assumed  to  be  nodes  0 and  1,  respectively,  with  nonnegative  LO  and 
HI  fields.  They  are  never  deleted;  in  fact,  they  are  left  untouched  except  for  their 
AUX  fields.  An  auxiliary  array  of  pointers,  HEAD[z>]  for  V(ROOT)  < v < i>max,  is 
used  to  create  temporary  lists  of  all  nodes  that  have  a given  value  of  V. 

Rl.  [Initialize.]  Terminate  immediately  if  ROOT  < 1.  Otherwise,  set  AUX(O)  4— 

AUX  ( 1 ) 4-  AUX  (ROOT)  4 1,  and  HEAD  [n]  4 1 for  V(ROOT)  < v < vmax. 

(We  use  the  fact  that  — 1 = ~0  is  the  bitwise  complement  of  0.)  Then  set 
s 4 — ROOT  and  do  the  following  operations  while  s ^ 0: 

Set  p 4-  s,  s 4 AUX  (p)  , AUX  (p)  4-  HEAD  [V  (p)  ] , HEAD  [V  (p)  ] 4 p. 

If  AUX  (LO  (p)  ) > 0,  set  AUX  (LO  (p)  ) 4 sandsf-  LO  (p)  . 

If  AUX  (HI  (p)  ) > 0,  set  AUX  (HI  (p)  ) 4- ~s  and  s 4-  HI  (p)  . 

(We’ve  essentially  done  a depth-first  search  of  the  dag,  temporarily  marking 
all  nodes  reachable  from  ROOT  by  making  their  AUX  fields  negative.) 

R2.  [Loop  on  v.]  Set  AUX(O)  4—  AUX(l)  4—  0,  and  v 4—  nmax. 

R3.  [Bucket  sort.]  (At  this  point  all  remaining  nodes  whose  V field  exceeds  v 
have  been  properly  reduced,  and  their  AUX  fields  are  nonnegative.)  Set 
p 4—  ^HEAD  [v] , s 4—  0,  and  do  the  following  steps  while  p / 0: 

Set  p'  4—  ^AUX(p)  . 

Set  q 4—  HI  (p) ; if  L0  (g)  < 0,  set  HI  (p)  4—  ~L0  (g)  . 

Set  q 4—  L0  (p) ; if  L0  (g)  < 0,  set  L0  (p)  4—  ~L0  ( q ) and  q 4—  L0  (p)  . 

If  q = HI  (p) , set  L0(p)  4 q , HI(p)  4-  AVAIL,  AUX(p)  4-  0,  AVAIL  4 -p; 

otherwise  if  AUX(g)  > 0,  set  AUX(p)  4—  s,  s 4—  ~g,  and  AUX(g)  4—  ~p; 

otherwise  set  AUX(p)  4—  AUX(~AUX(g))  and  AUX(~AUX(g))  4— p. 

Then  set  p 4—  p'. 

R4.  [Clean  up.]  (Nodes  with  L0  = x / HI  have  now  been  linked  together  via 
their  AUX  fields,  beginning  with  ~AUX(a;).)  Set  r 4—  ~s,  s 4—  0,  and  do  the 
following  while  r > 0: 

Set  q 4—  ~AUX(r)  and  AUX(r)  4—  0. 

If  s — 0 set  s 4 — g;  otherwise  set  AUX(p)  4—  q. 

Set  p 4—  g;  then  while  AUX(p)  > 0,  set  p 4—  AUX(p) . 

Set  r 4—  ~AUX(p) . 

R5.  [Loop  on  p.]  Set  p 4—  s.  Go  to  step  R9  if  p = 0.  Otherwise  set  g 4—  p. 

R6.  [Examine  a bucket.]  Set  s 4—  L0(p).  (At  this  point  p = g.) 

R7.  [Remove  duplicates.]  Set  r 4—  HI(g).  If  AUX(r)  > 0,  set  AUX(r)  4—  ~g; 
otherwise  set  L0(g)  4—  AUX(r),  HI(g)  4—  AVAIL,  and  AVAIL  4—  g.  Then  set 
g 4—  AUX(g).  If  g / 0 and  L0(g)  = s,  repeat  step  R7. 

R8.  [Clean  up  again.]  If  L0(p)  > 0,  set  AUX  (HI  (p))  4—  0.  Then  set  p 4—  AUX(p), 
and  repeat  step  R8  until  p ~ q. 

R9.  [Done?]  If  p / 0,  return  to  R6.  Otherwise,  if  v > V(R00T),  set  v 4—  v — 1 
and  return  to  R3.  Otherwise,  if  L0(R00T)  < 0,  set  ROOT  4—  ~L0(R00T) . | 
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The  intricate  link  manipulations  of  Algorithm  R are  easier  to  program  than  to 
explain,  but  they  are  highly  instructive  and  not  really  difficult.  The  reader  is 
urged  to  work  through  the  example  in  exercise  53. 

Algorithm  R can  also  be  used  to  compute  the  111)1)  for  any  restriction  of  a 
given  function,  namely  for  any  function  obtained  by  “hardwiring”  one  or  more 
variables  to  a constant  value.  The  idea  is  to  do  a little  extra  work  between  steps 
R1  and  R2,  setting  HI(p)  <—  L0(p)  if  variable  V(p)  is  supposed  to  be  fixed  at  0, 
or  L0(p)  4-  HI  (p)  if  V(p)  is  to  be  fixed  at  1.  We  also  need  to  recycle  all  nodes 
that  become  inaccessible  after  restriction.  Exercise  57  fleshes  out  the  details. 

Synthesis  of  BDDs.  We’re  ready  now  for  the  most  important  algorithm  on 
binary  decision  diagrams,  which  takes  the  BDD  for  one  function,  /,  and  combines 
it  with  the  BDD  for  another  function,  g,  in  order  to  obtain  the  BDD  for  further 
functions  such  as  / A g or  / © g.  Synthesis  operations  of  this  kind  are  the 
principal  way  to  build  up  the  BDDs  for  complex  functions,  and  the  fact  that 
they  can  be  done  efficiently  is  the  main  reason  why  BDD  data  structures  have 
become  popular.  We  will  discuss  several  approaches  to  the  synthesis  problem, 
beginning  with  a simple  method  and  then  speeding  it  up  in  various  ways. 

The  basic  notion  that  underlies  synthesis  is  a product  operation  on  BDD 
structures  that  we  shall  call  melding.  Suppose  a = ( v,l,h ) and  a'  = (v',l',h') 
are  BDD  nodes,  each  containing  the  index  of  a variable  together  with  LO  'and 
HI  pointers.  The  meld”  of  a and  a',  written  aoa ',  is  defined  as  follows  when 
a and  a ’ are  not  both  sinks: 

( ( v , lol',  hoti),  if  v = v'; 
aoa’  = < (v,  loa',  ho  a'),  iiv<v'\  (37) 

[ ( v' , aol',  a oh'),  if  v > v' . 

For  example,  Fig.  24  shows  how  two  small  but  typical  BDDs  are  melded.  The 
one  on  the  left,  with  branch  nodes  (a,(3,-y,  S),  represents  f(xi , x2,  x3,  x4)  = 
(xx  V x2)  A (xz  V x4);  the  one  in  the  middle,  with  branch  nodes  (u,  ip,  <p,  v,  r), 
represents  g(x i,x2,  xz,x4)  = (x\  © x2)  V {xz  ®x4).  Nodes  6 and  r are  essentially 
the  same,  so  we  would  have  S = t if  / and  g were  part  of  a single  BDD  base;  but 
melding  can  be  applied  also  to  BDDs  that  do  not  have  common  nodes.  At  the 
right  of  Fig.  24,  a o oj  is  the  root  of  a decision  diagram  that  has  eleven  branch 
nodes,  and  it  essentially  represents  the  ordered  pair  ( f,g ). 
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Fig.  24.  Two  BDDs  can  be  melded  together  with  the  o operation  (37). 
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An  ordered  pair  of  two  Boolean  functions  can  be  visualized  by  placing  the 
truth  table  of  one  above  the  truth  table  of  the  other.  With  this  interpretation, 
aoui  stands  for  the  ordered  pair  o??oiliiinioiio  > and  I3  0 X stands  for 
etc.  The  melded  BDD  of  Fig.  24  corresponds  to  the  diagram 


which  is  analogous  to  (5)  except  that  each  node  denotes  an  ordered  pair  of 
functions  instead  of  a single  function.  Beads  and  subtables  are  defined  on  ordered 
pairs  just  as  before.  But  now  we  have  four  possible  sinks  instead  of  two,  namely 

±o_L,  loT,  Tol,  and  ToT,  (3g) 

corresponding  to  the  ordered  pairs  ° , ° , J , and  J . 

To  compute  the  conjunction  / A g,  we  AND  together  the  truth  tables  of  / 
and  g.  This  operation  corresponds  to  replacing  °,  J,  and  \ by  0,  0,  0,  and  1, 
respectively;  so  we  get  the  BDD  for  / A g from  fog  by  replacing  the  respective 
sink  nodes  of  (39)  by  [X] , [X] , [X] , and  [j] , then  reducing  the  result.  Similarly, 
the  BDD  for  / ® g is  obtained  if  we  replace  the  sinks  (39)  by  (X|,  [t],  pp] , 
and  \T\ . (In  this  particular  case  / © g turns  out  to  be  the  symmetric  function 
S ia(xi,x2,X3,x4:),  as  computed  in  Fig.  9 of  Section  7.1.2.)  The  melded  diagram 
fog  contains  all  the  information  needed  to  compute  any  Boolean  combination 
of  / and  g ; and  the  BDD  for  every  such  combination  has  at  most  B(fog)  nodes. 

Clearly  B(f  o g)  < B(f)B(g ),  because  each  node  of  fog  corresponds  to 
a node  of  / and  a node  of  g.  Therefore  the  meld  of  small  BDDs  cannot  be 
extremely  large.  Usually,  in  fact,  melding  produces  a result  that  is  considerably 
smaller  than  this  worst-case  upper  bound,  with  something  like  B(f)  + B(g) 
nodes  instead  of  B(f)B(g).  Exercise  60  discusses  a sharper  bound  that  sheds 
some  light  on  why  melds  often  turn  out  to  be  small.  But  exercises  59(b)  and  63 
present  interesting  examples  where  quadratic  growth  does  occur. 

Melding  suggests  a simple  algorithm  for  synthesis:  We  can  form  an  array  of 
B{f)B(.9)  nodes,  with  node  a o a'  in  row  a and  column  a!  for  every  a in  the 
BDD  for  / and  every  a'  in  the  BDD  for  g.  Then  we  can  convert  the  four  sink 
nodes  (39)  to  [X]  or  [t]  as  desired,  and  apply  Algorithm  R to  the  root  node 
fog.  Voila  — we’ve  got  the  BDD  for  f Ag  or  f(Bg  or  fVg  or  whatever. 

The  running  time  of  this  algorithm  is  clearly  of  order  B(f)B(g).  We  can 
reduce  it  to  order  B(f  o g),  because  there’s  no  need  to  fill  in  all  of  the  matrix 
entries  aoa';  only  the  nodes  that  are  reachable  from  fog  are  relevant,  and  we  can 
generate  them  on  the  fly  when  necessary.  But  even  with  this  improvement  in  the 
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running  time,  the  simple  algorithm  is  unsatisfactory  because  of  the  requirement 
for  B(f)B(g)  nodes  in  memory.  When  we  deal  with  BDDs,  time  is  cheap  but 
space  is  expensive:  Attempts  to  solve  large  problems  tend  to  fail  more  often 
because  of  spaceout  than  because  of  “timeout.”  That’s  why  Algorithm  R was 
careful  to  perform  its  machinations  with  only  one  auxiliary  link  field  per  node. 

The  following  algorithm  solves  the  synthesis  problem  with  working  space  of 
order  B(f  og);  in  fact,  it  needs  only  about  sixteen  bytes  per  element  of  the  BDD 
for  fog.  The  algorithm  is  designed  to  be  used  as  the  main  engine  of  a “Boolean 
function  calculator,”  which  represents  functions  as  BDDs  in  compressed  form  on 
a sequential  stack.  The  stack  is  maintained  at  the  lower  end  of  a large  array 
called  the  pool.  Each  BDD  on  the  stack  is  a sequence  of  nodes,  which  each  have 
three  fields  (V,  L0,  HI).  The  rest  of  the  pool  is  available  to  hold  temporary  results 
called  templates,  which  each  have  four  fields  (L,  H,  LEFT,  RIGHT).  A node  typically 
occupies  one  octabyte  of  memory,  while  a template  occupies  two. 

The  purpose  of  Algorithm  S is  to  examine  the  top  two  Boolean  functions 
on  the  stack,  / and  g,  and  to  replace  them  by  the  Boolean  combination  fog. 
where  o is  one  of  the  16  possible  binary  operators.  This  operator  is  identified  by 
its  4-bit  truth  table,  op.  For  example,  Algorithm  S will  form  the  BDD  for  f ®g 
when  op  is  (0110)2  = 6;  it  will  deliver  / A g when  op  = 1. 

When  the  algorithm  begins,  operand  / appears  in  locations  [f0-.go)  of 
the  pool,  and  operand  g appears  in  locations  [g0  . .NTOP).  All  higher  locations 
[NTOP  . .P00LSIZE)  are  available  for  storing  the  templates  that  the  algorithm 
needs.  Those  templates  will  appear  in  locations  [TBOT . .P00LSIZE)  at  the  high 
end  of  the  pool;  the  boundary  markers  NTOP  and  TBOT  will  change  dynamically 
as  the  algorithm  proceeds.  The  resulting  BDD  for  fog  will  eventually  be  placed 
in  locations  [/0  . . NTOP),  taking  over  the  space  formerly  occupied  by  / and  g.  We 
assume  that  a template  occupies  the  space  of  two  nodes.  Thus,  the  assignments 
“t  4-  TBOT  - 2,  TBOT  4-  t”  allocate  space  for  a new  template,  pointed  to  by  t;  the 
assignments  up  4-  NTOP,  NTOP  4-  p + 1”  allocate  a new  node  p.  For  simplicity  of 
exposition,  Algorithm  S does  not  check  that  the  condition  NTOP  < TBOT  remains 
valid  throughout  the  process;  but  of  course  such  tests  are  essential  in  practice. 
Exercise  69  remedies  this  oversight. 

The  input  functions  / and  g are  specified  to  Algorithm  S as  sequences  of 
instructions  (Is_1, . . . , Ix,  I0)  and  (I's,_1, /q),  as  in  Algorithms  B and  C 
above.  The  lengths  of  these  sequences  are  s — B+(f)  and  s'  = B+(g),  where 

B+(f)  = B(f)  + [/  is  identically  1]  (40) 


is  the  number  of  BDD  nodes  when  the  sink  [X]  is  forced  to  be  present.  For 
example,  the  two  BDDs  at  the  left  of  Fig.  24  could  be  specified  by  the  instructions 


h = (I?  4:  3), 
I4  = (2?  0:  3), 

as  usual,  I1,  I0,  I[ 
nodes,  so  that  if  I k 


h = (3?  2:  1), 
I2  = (4?  0:  1); 


It  = (1?  5:  6),  /'  = (3?  2:  3), 

I*  = (2?  1:  4),  I'3  = (4?  1:  0),  (41) 

/'  - (2?  4:  1),  7'  = (4?  0:  1); 


, and  I'q  are  the  sinks.  These  instructions  are  packed  into 
= («fc?  h-  hk)  we  have  V(/0  + k)  = vk,  L0(/0  + k)  = lk,  and 
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HI(/o  + fc)  = /ifc  for  2 < k < s when  Algorithm  S begins.  Similar  conventions 
apply  to  the  instructions  I'k  that  define  g.  Furthermore 

V(/o)  = V(/o  + 1)  = V(9o)  = V(<?0  + 1)  = Umax  + 1,  (42) 

where  we  assume  that  / and  g depend  only  on  the  variables  xv  for  1 < v < umax. 

Like  the  simple  but  space-hungry  algorithm  described  earlier,  Algorithm  S 
proceeds  in  two  phases:  First  it  builds  the  BDD  for  fog,  constructing  templates 
so  that  every  important  meld  a o a'  is  represented  as  a template  t for  which 

LEFT(f)  = a,  RIGHT(f)  = a',  L(t)  =LO(aoa'),  H(t)  =HI(aoa').  (43) 

(The  L and  H fields  point  to  templates,  not  nodes.)  Then  the  second  phase 
reduces  these  templates,  using  a procedure  similar  to  Algorithm  R;  it  changes 
template  t from  (43)  to 

LEFT(t)  = RIGHT(f)  = r(f), 

L(t)  =r(LO(a«a')),  H(t)  =r(HI(a«a')), 

where  r(t)  is  the  unique  template  to  which  t has  been  reduced,  and  where  n{t) 
is  the  “clone”  of  t if  r(t)  = t.  Every  reduced  template  t corresponds  to  an 
instruction  node  in  the  BDD  of  / o g,  and  rift)  is  the  index  of  this  node  relative 
to  position  f0  in  the  stack.  (Setting  LEFT(f)  to  ~«(t)  instead  of  n{t)  is  a sneaky 
trick  that  makes  steps  S7-S10  run  faster.)  Special  overlapping  templates  are 
permanently  reserved  for  sinks  at  the  bottom  of  the  pool,  so  that  we  always  have 

LEFT(O)  = ~0,  RIGHT  (0)  = 0,  LEFT(l)  = ~1,  RIGHT  (1)  = 1,  (45) 

in  accord  with  the  conventions  of  (42)  and  (44). 

We  needn’t  make  a template  for  a o a'  when  the  value  of  a o a'  is  obviously 
constant.  For  example,  if  we’re  computing  / A g,  we  know  that  a o a'  will 
eventually  reduce  to  [X]  if  a = 0 or  a'  = 0.  Such  simplifications  are  discovered 
by  a subroutine  called  findJevel  (/,  g),  which  returns  the  positive  integer  j if  the 
root  of  fog  begins  with  the  branch  (j) , unless  fog  clearly  has  a constant  value; 
in  the  latter  case,  findJevel(f,g)  returns  the  value  — (/  o g),  which  is  0 or  — 1. 
The  procedure  is  slightly  technical,  but  simple,  using  the  global  truth  table  op: 

Subroutine  findJevel  (f , g),  with  local  variable  t: 

If  / < 1 and  g < 1,  return  —{{op  (3  — 2/  — g))  & 1),  which  is  — (/  o g). 

If  / < 1 and  g > 1,  set  t t—  (/?  op  & 3:  op  2);  return  0 if  t = 0,  — 1 if  t = 3. 

If  / > 1 and  g < 1,  set  t,  <—  ( g ? op:  op  1)  & 5;  return  0 if  t = 0,  — 1 if  t — 5. 

Otherwise  return  min(V(/0  + /),V(g0  + <?))•  (46) 

The  main  difficulty  that  faces  us,  when  generating  a template  for  a descen- 
dant of  q 0 ct'  according  to  (37),  is  to  decide  whether  or  not  such  a template 
already  exists  — and  if  so,  to  link  to  it.  The  best  way  to  solve  such  problems  is 
usually  to  use  a hash  table;  but  then  we  must  decide  where  to  put  such  a table, 
and  how  much  extra  space  to  devote  to  it.  Alternatives  such  as  binary  search 
trees  would  be  much  easier  to  adapt  to  our  purposes,  but  they  would  add  an 
unwanted  factor  of  log  B(f  o g)  to  the  running  time.  The  synthesis  problem  can 
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actually  be  solved  in  worst-case  time  and  space  0(B(f  o g ))  by  using  a bucket 
sort  method  analogous  to  Algorithm  R (see  exercise  72);  but  that  solution  is 
complicated  and  somewhat  awkward. 

Fortunately  there’s  a nice  way  out  of  this  dilemma,  requiring  almost  no  extra 
memory  and  only  modestly  complex  code,  if  we  generate  the  templates  one  level 
at  a time.  Before  generating  the  templates  for  level  /,  we’ll  know  the  number 
Ni  of  templates  to  be  requested  on  that  level.  So  we  can  temporarily  allocate 
space  for  2b  templates  at  the  top  of  the  currently  free  area,  where  b = [lgN;], 
and  put  new  templates  there  while  hashing  into  the  same  area.  The  idea  is  to 
use  chaining  with  separate  lists,  as  in  Fig.  38  of  Section  6.4;  the  H and  L fields  of 
our  templates  and  potential  templates  play  the  roles  of  heads  and  links  in  that 
illustration,  while  the  keys  appear  in  (LEFT,  RIGHT).  Here’s  the  logic,  in  detail: 

Subroutine  make.template(f,g),  with  local  variable  t: 

Set  h 4-  HBASE  + 2(((314159257/  + 271828171g)  mod  2d)  > (d  - b)),  where  d 
is  a convenient  upper  bound  on  the  size  of  a pointer  (usually  d = 32).  Then 
set  t 4-  H(/i.) . While  1 / A and  either  LEFT(f)  / / or  RIGHT(f)  ^ g , set 
t 4-  L(t).  If  t = A,  set  t 4-  TBOT  - 2,  TBOT  4-  t,  LEFT  (t)  4-  /,  RIGHT  (t)  4-  g. 
L(t)  4—  H(/i) , and  H(/i)  4-  t.  Finally,  return  the  value  t.  (qy) 

The  calling  routine  in  steps  S4  and  S5  ensures  that  NTOP  < HBASE  < TBOT. 

This  breadth-first,  level-at-a-time  strategy  for  constructing  the  templates 
has  an  added  payoff,  because  it  promotes  “locality  of  reference”:  Memory  ac- 
cesses tend  to  be  confined  to  nearby  locations  that  have  recently  been  seen,  hence 
controlled  in  such  a way  that  cache  misses  and  page  faults  are  significantly 
reduced.  Furthermore,  the  eventual  BDD  nodes  placed  on  the  stack  will  also 
appear  in  order,  so  that  all  branches  on  the  same  variable  appear  consecutively. 

Algorithm  S (Breadth- first  synthesis  of  BDDs).  This  algorithm  computes  the 
BDD  for  fog  as  described  above,  using  subroutines  (46)  and  (47).  Auxiliary 
arrays  LSTART [1]  , LCOUNT [/]  . LLIST[/]  , and  HLIST[Z]  are  used  for  0 < l < umax. 

51.  [Initialize.]  Set  / 4-  g0  - 1 - f0,  g 4-  NTOP  - 1 - g0,  and  l 4-  firuLlevel{f,  g). 
See  exercise  66  if  l < 0.  Otherwise  set  LSTART  [1  - 1]  4-  P00LSIZE,  and 
LLIST  [A;]  4-  HLISTEfc]  «-  A,  LCOUNT  [fc]  0 for  l < k < vmaJ  Set 
TBOT  <-  P00LSIZE  - 2,  LEFT  (TBOT)  4-  /,  and  RIGHT  (TBOT)  4-  g. 

52.  [Scan  the  level-1  templates.]  Set  LSTART  [/]  4-  TBOT  and  1 4-  LSTART  [I  - 1]  . 
While  t > TBOT,  schedule  requests  for  future  levels  by  doing  the  following: 

Set  t 4-  t—2,  f 4—  LEFT(t),  g 4-  RIGHT (t),  vf  4-  V(/0+/),  vg  4-  V(g0+g), 
ll  4 find-level  ((vf  < vg?  LO(/0  + /):  f),(vf  > vg?  L0  (g0+g):  g )), 

Ih  <-  fimUevel ((vf  < vg?  HI(/0  + /):  f),(vf  > vg?  HI (g0  + g ):  g)). 

If  ll<  0,  set  L (t)  i ll;  otherwise  set  L«)  4-  LLIST [/l],  LLISTf/1]  4-  t, 

LC0UNT[//]  4—  LC0UNT[1I]  -|-  1.  If  lh,  < 0,  set  H(t)  4—  —Ih;  otherwise  set 
H(f)  4-  HLIST[Zft],  HLIST Uhl  4 - t,  LCOUNT [Ife]  4-  LC0UNT[//i]  + 1. 

53.  [Done  with  phase  one?]  Go  to  S6  if  l = umax.  Otherwise  set  l 4—  l + 1,  and 
return  to  S2  if  LCOUNT  [1]  = 0. 
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54.  [Initialize  for  hashing.]  Set  b 4—  [lgLCOUNT[Z]],  HBASE  4—  TBOT  — 26+1, 
and  H (HBASE  + 2k)  4-  A for  0 < fc  < 2b. 

55.  [Make  the  level-/  templates.]  Set  t,  4—  LLIST[/].  While  A / A,  set  s 4— 
L (A),  / 4-  LEFT  (A) , g <-  RIGHT(A),  vf  4-  V(/0  + /),  vg  4-  V(5o+fl), 
L (A)  4—  make.template ((vf  < vg ? LO(/o+/):  f),(vf  > vg ? LO(go+3):  ff)), 
A 4—  s.  (We’re  half  done.)  Then  set  A 4—  HLISTf/].  While  A / A,  set 
s 4—  H (A) , / <-  LEFT(A),  g 4-  RIGHT  (A),  vf  4-  V(/0  + /),  w/  4-  V(5o+5), 
H ( A)  4—  make-template  ((vf  < vg ? HI(/o+/):  f),(vf  > w;?  HlCgo+ff):  </))j 
A 4—  s.  (Now  the  other  half  is  done.)  Go  back  to  step  S2. 

56.  [Prepare  for  phase  two.]  (At  this  point  it’s  safe  to  obliterate  the  nodes  of  / 
and  g , because  we’ve  built  all  the  templates  (43).  Now  we’ll  convert  them 
to  form  (44).  Note  that  V(/0)  = V(/0  + 1)  = nmax  + 1.)  Set  NTOP  4—  f0  + 2. 

57.  [Bucket  sort.]  Set  t 4—  LSTART[Z  — 1]  . Do  the  following  while  t > LSTART[Z]  : 

Set  A -< — A — 2,  L (A)  4-  RIGHT (L  (A)  ) , and  H (A)  4-  RIGHT (H (A)  ) . 

If  L ( A)  = H(A),  set  RIGHT  (A)  4—  L(A).  (This  branch  is  redundant.) 

Otherwise  set  RIGHT  (A)  4 1,  LEFT  (A)  t-  LEFT  (L  (A)),  LEFT  (L  (A))  <-  A. 

58.  [Restore  clone  addresses.]  If  A = LSTART  [/  — 1]  , set  A «—  LSTART  [/]  — 2 
and  go  to  S9.  Otherwise,  if  LEFT  (A)  < 0,  set  LEFT  (L  (A) ) e-  LEFT  (A) . Set 
A «—  A + 2 and  repeat  step  S8. 

59.  [Done  with  level?]  Set  A «—  A + 2.  If  A = LSTART  [/  — 1]  , go  to  S12.  Otherwise, 
if  RIGHT  (A)  > 0 repeat  step  S9. 

510.  [Examine  a bucket.]  (Suppose  L(Ai)  = LO2)  = L ( A3 ) , where  Ai  > A2  > 
A3  = A and  no  other  templates  on  level  / have  this  L value.  Then  at  this  point 
we  have  LEFT(A3)  = A2,  LEFT(A2)  = A1;  LEFT  (Ax)  < 0,  and  RIGHT  (Ax)  = 
RIGHTO2)  — RIGHT(A3)  = —1.)  Set  s •<—  A.  While  s > 0,  do  the  following: 
Set  r H(s) , RIGHT (s)  <-  LEFT(r);  if  LEFT (r)  < 0,  set  LEFT(r)  4-  s;  and 
set  s 4—  LEFT(s).  Finally  set  s 4—  A again. 

511.  [Make  clones.]  If  s < 0,  go  back  to  step  S9.  Otherwise  if  RIGHT(s)  > 0, 
set  s 4—  LEFT(s).  Otherwise  set  r 4—  LEFT(s),  LEFT(H(s))  4—  RIGHT (s), 
RIGHT (s)  4-  s,  q 4-  NTOP,  NTOP  4-  q + 1,  LEFT(s)  4-  ~(g  - f0),  L0(g)  4- 
~LFFT(L(s)  ) , HI  (g)  4-  ~LEFT(H(s)),  V(q)  4-  /,  s 4-  r.  Repeat  step  Sll. 

512.  [Loop  on  /.]  Set  l 4-  l - 1.  Return  to  S7  if  LSTART  [/]  < P00LSIZE. 

Otherwise,  if  RIGHT(P00LSIZE  - 2)  = 0,  set  NTOP  4-  NTOP  - 1 (because 
/ o g is  identically  0).  | 

As  usual,  the  best  way  to  understand  an  algorithm  like  this  is  to  trace  through 
an  example.  Exercise  67  discusses  what  Algorithm  S does  when  it  is  asked  to 
compute  / A g,  given  the  BDDs  in  (41). 

Algorithm  S can  be  used,  for  example,  to  construct  the  BDDs  for  interesting 
functions  such  as  the  “monotone- function  function”  pn(x i,...,X2«),  which  is 
true  if  and  only  if  x\  . . . x2»  is  the  truth  table  of  a monotone  function: 

pn(xx,...,x^)=  [\  [xi+i<Xj+i\. 

0<iCj<2n 


(48) 
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Fig.  25.  ^2(xi,X3,X5,X7)  A ti2(x2,X4,X6,Xs)  /\Ga(x 
as  computed  by  Algorithm  S. 


Starting  with  no(xi)  — 1,  this  function  satisfies  the  recursion  relation 
Mn  (X1  , • • • > ) = 

x3, . . . , X2n_i')  A 2,  X4, . . . , x2n)  A G2™  (a^i, . . . , X2<» ),  (49) 

where  ^(xx,...,^)  = [an  < x2]  A [x3  < x4]  A • • • A [a^-i  < x2~].  So  its 
BDD  is  easy  to  obtain  with  a BDD  calculator  like  Algorithm  S:  The  BDDs  for 
Mn— 1 (Ti , X3, . . . ,X2«_i)  and  nn- 1(2:2, £4, . . . ,X2»)  are  simple  variants  of  the  one 
for  fin- i(xi,X2, . . . ,x2n-i),  and  G2n  has  an  extremely  simple  BDD  (see  Fig.  25). 

Repeating  this  process  six  times  will  produce  the  BDD  for  //6.  which  has 
103,924  nodes.  There  are  exactly  7,828,354  monotone  Boolean  functions  of  six 
variables  (see  exercise  5.3.4—31);  this  BDD  nicely  characterizes  them  all,  and  we 
need  only  about  4.8  million  memory  accesses  to  compute  it  with  Algorithm  S. 
Furthermore,  6.7  billion  mems  will  suffice  to  compute  the  BDD  for  //7,  which 
has  155,207,320  nodes  and  characterizes  2,414,682,040,998  monotone  functions. 

We  must  stop  there,  however;  the  size  of  the  next  case,  B(ns),  turns  out  to 
be  a whopping  69,258,301,585,604  (see  exercise  77). 

Synthesis  in  a BDD  base.  Another  approach  is  called  for  when  we’re  dealing 
with  many  functions  at  once  instead  of  computing  a single  BDD  on  the  fly. 
The  functions  of  a BDD  base  often  share  common  subfunctions,  as  in  (36). 
Algorithm  S is  designed  to  take  disjoint  BDDs  and  to  combine  them  efficiently, 
afterwards  destroying  the  originals;  but  in  many  cases  we  would  rather  form 
combinations  of  functions  whose  BDDs  overlap.  Furthermore,  after  forming  a 
new  function  / At/,  say,  we  might  want  to  keep  / and  g around  for  future  use; 
indeed,  the  new  function  might  well  share  nodes  with  / or  g or  both. 

Let  s therefore  consider  the  design  of  a general-purpose  toolkit  for  manip- 
ulating a collection  of  Boolean  functions.  BDDs  are  especially  attractive  for 
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this  purpose  because  most  of  the  necessary  operations  have  a simple  recursive 
formulation.  We  know  that  every  nonconstant  Boolean  function  can  be  written 

f{xi  , X2 , • . - , Xn)  — ( Xv ? fl  • fh),  (50) 

where  v — fv  indexes  the  first  variable  on  which  / depends,  and  where  we  have 

fl  — fi. 0, . . . , 0,  xv+i , ■ • • , xn) ; fh  /(l, . . . , 1,  xv+i, . . • , xn).  (51) 

This  rule  corresponds  to  branch  node  (w)  at  the  top  of  the  BDD  for  /;  and 
the  rest  of  the  BDD  follows  by  using  (50)  and  (51)  recursively,  until  we  reach 
constant  functions  that  correspond  to  [±]  or  [t)  • A similar  recursion  defines  any 
combination  of  two  functions,  fog-.  For  if  / and  g aren’t  both  constant,  we  have 

f(xi,...,xn)  = (x„?  fi\  fh)  and  g{xu . . . , xn)  = (x„?  gv.  gh),  (52) 

where  v = min (fv,gv)  and  where  /;,  fh,  gv  gh  are  given  by  (51).  Then,  presto, 

fog  = (x„?  fi  ogf.  fh  ogh).  (53) 


This  important  formula  is  another  way  of  stating  the  rule  by  which  we  defined 
melding,  Eq.  (37). 

Caution:  The  notations  above  need  to  be  understood  carefully,  because  the 
subfunctions  fi  and  fh  in  (50)  might  not  be  the  same  as  the  /;  and  fh  in  (52). 
Suppose,  for  example,  that  f — x 2 V13  while  g — x 1 © x3.  Then  Eq.  (50)  holds 
with  fv  = 2 and  / = (x2?  fv-  fh),  where  fi  — x3  and  fh  = 1-  We  also  have 
gv  = 1 and  g = (xi?  x3:  x3).  But  in  (52)  we  use  the  same  branch  variable  xv  for 
both  functions,  and  v — min (fv,gv)  = 1 in  our  example;  so  Eq.  (52)  holds  with 
/ = (xi?  fi : fh)  and  /;  = fh  = x2  V x3. 

Every  node  of  a BDD  base  represents  a Boolean  function.  Furthermore,  a 
BDD  base  is  reduced;  therefore  two  of  its  functions  or  subfunctions  are  equal 
if  and  only  if  they  correspond  to  exactly  the  same  node.  (This  convenient 
uniqueness  property  was  not  true  in  Algorithm  S.) 

Formulas  (5i)-(53)  immediately  suggest  a recursive  way  to  compute  / A g: 


AND  (f,g)=  { 


( If  / A g has  an  obvious  value,  return  it. 

Otherwise  represent  / and  g as  in  (52); 
compute  ri  4-  AND (fi,gi)  and  rh  4-  XND(fh,gh)', 
l return  the  function  (xv?  rf.  r^). 


(54) 


(Recursions  always  need  to  terminate  when  a sufficiently  simple  case  arises.  The 
“obvious”  values  in  the  first  line  correspond  to  the  terminal  cases  / A 1 = /, 
lAg  = </,/A0  = 0Ag  = 0,  and  / A g = / when  / = g.)  When  / and  g are 
the  functions  in  our  example  above,  (54)  reduces  / A g to  the  computation  of 
(x2Vx3)Ax3  and  (x2Vx3)Ax3.  Then  (x2Vx3)Ax3  reduces  to  x3Ax3  and  1Ax3;  etc. 

But  (54)  is  problematic  if  we  simply  implement  it  as  stated,  because  every 
nonterminal  step  launches  two  more  instances  of  the  recursion.  The  computation 
explodes,  with  2fc  instances  of  AND  when  we’re  k levels  deep! 

Fortunately  there’s  a good  way  to  avoid  that  blowup.  Since  f has  only  B(f) 
different  subfunctions,  at  most  B(f)B(g)  distinctly  different  calls  of  AND  can 
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arise.  To  keep  a lid  on  the  computations,  we  just  need  to  remember  what  we’ve 
done  before,  by  making  a memo  of  the  fact  that  / A g = r just  before  returning 
r as  the  computed  value.  Then  when  the  same  subproblem  occurs  later,  we 
can  retrieve  the  memo  and  say,  “Hey,  we’ve  already  been  there  and  done  that.” 
Previously  solved  cases  thereby  become  terminal;  only  distinct  subproblems  can 
generate  new  ones.  (Chapter  8 will  discuss  this  memoization  technique  in  detail.) 

The  algorithm  in  (54)  also  glosses  over  another  problem:  It’s  not  so  easy  to 
“return  the  function  (xv?  77:  rh),"  because  we  must  keep  the  BDD  base  reduced. 
If  r[  — r^,  we  should  return  the  node  77 ; and  if  77  yf  , we  need  to  decide 
whether  the  branch  node  (x„?  77:  rh)  already  exists,  before  creating  a new  one. 

Thus  we  need  to  maintain  additional  information,  besides  the  BDD  nodes 
themselves.  We  need  to  keep  memos  of  problems  already  solved;  we  also  need 
to  be  able  to  find  a node  by  its  content,  instead  of  by  its  address.  The  search 
algorithms  of  Chapter  6 now  come  to  our  rescue  by  telling  us  how  to  do  both  of 
these  things,  for  example  by  hashing.  To  record  a memo  that  / A g = r,  we  can 
hash  the  key  ‘(/,  A,  </)’  and  associate  it  with  the  value  r;  to  record  the  existence 
of  an  existing  node  (V,LO,HI),  we  can  hash  the  key  ‘(V,LO,HI)’  and  associate 
it  with  that  node’s  memory  address. 

The  dictionary  of  all  existing  nodes  (V,  LO,  HI)  in  a BDD  base  is  traditionally 
called  the  unique  table , because  we  use  it  to  enforce  the  all-important  uniqueness 
criterion  that  forbids  duplication.  Instead  of  putting  all  that  information  into 
one  giant  dictionary,  however,  it  turns  out  to  be  better  to  maintain  a collection 
of  smaller  unique  tables,  one  for  each  variable  V.  With  such  separate  tables  we 
can  efficiently  find  all  nodes  that  branch  on  a particular  variable. 

The  memos  are  handy,  but  they  aren’t  as  crucial  as  the  unique  table  entries. 
If  we  happen  to  forget  the  isolated  fact  that  f Ag  = r,  we  can  always  recompute 
it  again  later.  Exponential  blowup  won’t  be  worrisome,  if  the  answers  to  the 
subproblems  fi  A gi  and  fh  A gh  are  still  remembered  with  high  probability. 
Therefore  we  can  use  a less  expensive  method  to  store  memos,  designed  to  do 
a pretty-good-but-not-perfect  job  of  retrieval:  After  hashing  the  key  ‘(/,  A,  g)' 
to  a table  position  p , we  need  look  for  a memo  only  in  that  one  position,  not 
bothering  to  consider  collisions  with  other  keys.  If  several  keys  all  share  the  same 
hash  address,  position  p will  record  only  the  most  recent  relevant  memo.  This 
simplified  scheme  will  still  be  adequate  in  practice,  as  long  as  the  hash  table  is 
large  enough.  We  shall  call  such  a near-perfect  table  the  memo  cache , because 
it  is  analogous  to  the  hardware  caches  by  which  a computer  tries  to  remember 
significant  values  that  it  has  dealt  with  in  relatively  slow  storage  units. 

Okay,  let  s flesh  out  algorithm  (54)  by  explicitly  stating  how  it  interacts  with 
the  unique  tables  and  the  memo  cache: 


and(/,5) 


r If  / a g has  an  obvious  value,  return  it. 

Otherwise,  if  f Ag  = r is  in  the  memo  cache,  return  r. 
Otherwise  represent  / and  g as  in  (52); 
compute  77  <-  AND (fh9l)  and  rh  <-  AND (fh,gh); 
set  r <—  UNIQUE (v,  ri,  rh),  using  Algorithm  U; 

, put  f A g = r into  the  memo  cache,  and  return  r. 
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Algorithm  U ( Unique  table  lookup).  Given  ( v,p , q),  where  v is  an  integer  while 
p and  q point  to  nodes  of  a BDD  base  with  variable  rank  > v,  this  algorithm  re- 
turns a pointer  to  a node  UNIQUE(w,p,  q)  that  represents  the  function  (xv?  p:  q). 
A new  node  is  added  to  the  base  if  that  function  wasn’t  already  present. 

Ul.  [Easy  case?]  If  p = q,  return  p. 

U2.  [Check  the  table.]  Search  variable  xv’s  unique  table  using  the  key  ( p,q ).  If 
the  search  successfully  finds  the  value  r,  return  r. 

U3.  [Create  a node.]  Allocate  a new  node  r,  and  set  V(r)  <—  v,  L0(r)  <—  p, 
HI(r)  q.  Put  r into  a^’s  unique  table  using  the  key  ( p , q).  Return  r.  | 
Notice  that  we  needn’t  zero  out  the  memo  cache  after  finishing  a top-level 
computation  of  AND  (/,<?)•  Each  memo  that  we  have  made  states  a relationship 
between  nodes  of  the  structure;  those  facts  are  still  valid,  and  they  might  be 
useful  later  when  we  want  to  compute  AND  (f,g)  for  new  functions  / and  g. 

A refinement  of  (55)  will  enhance  that  method  further,  namely  to  swap 
/ «-)•  g if  we  discover  that  f > g when  / A g isn’t  obvious.  Then  we  won’t  have 
to  waste  time  computing  / A g when  we’ve  already  computed  g A /. 

With  simple  changes  to  (55),  the  other  binary  operators  OR(/,  g),  XOR (/,  g), 
BUTNOT(/,  g),  NOR(/,  g).  . . . can  also  be  computed  readily;  see  exercise  81. 

The  combination  of  (55)  and  Algorithm  U looks  considerably  simpler  than 
Algorithm  S.  Thus  one  might  well  ask,  why  should  anybody  bother  to  learn  the 
other  method?  Its  breadth-first  approach  seems  quite  complex  by  comparison 
with  the  “depth- first”  order  of  computation  in  the  recursive  structure  of  (55);  yet 
Algorithm  S is  able  to  deal  only  with  BDDs  that  are  disjoint,  while  Algorithm  U 
and  recursions  like  (55)  apply  to  any  BDD  base. 

Appearances  can,  however,  be  deceiving:  Algorithm  S has  been  described 
at  a low  level,  with  every  change  to  every  element  of  its  data  structures  spelled 
out  explicitly.  By  contrast,  the  high-level  descriptions  in  (55)  and  Algorithm  U 
assume  that  a substantial  infrastructure  exists  behind  the  scenes.  The  memo 
cache  and  the  unique  tables  need  to  be  set  up,  and  their  sizes  need  to  be  carefully 
adjusted  as  the  BDD  base  grows  or  contracts.  When  all  is  said  and  done,  the 
total  length  of  a program  that  implements  Algorithms  (55)  and  U properly  “from 
scratch”  is  roughly  ten  times  the  length  of  a similar  program  for  Algorithm  S. 

Indeed,  the  maintenance  of  a BDD  base  involves  interesting  questions  of 
dynamic  storage  allocation,  because  we  want  to  free  up  memory  space  when 
nodes  are  no  longer  accessible.  Algorithm  S solves  this  problem  in  a last-in-first- 
out  manner,  by  simply  keeping  its  nodes  and  templates  on  sequential  stacks,  and 
by  making  do  with  a single  small  hash  table  that  can  easily  be  integrated  with 
the  other  data.  A general  BDD  base,  however,  requires  a more  intricate  system. 

The  best  way  to  maintain  a dynamic  BDD  base  is  probably  to  use  reference 
counters,  as  discussed  in  Section  2.3.5,  because  BDDs  are  acyclic  by  definition. 
Therefore  let’s  assume  that  every  BDD  node  has  a REF  field,  in  addition  to  V,  LO, 
and  HI.  The  REF  field  tells  us  how  many  references  exist  to  this  node,  either 
from  LO  or  HI  pointers  in  other  nodes  or  from  external  root  pointers  Fj  as  in  (36). 
For  example,  the  REF  fields  for  the  nodes  labeled  (3)  in  (36)  are  respectively  4, 
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1 and  2;  and  all  of  the  nodes  labeled  © or  @ or  © in  that  example  have 
REF  - 1.  Exercise  82  discusses  the  somewhat  tricky  issue  of  how  to  increase 
and  decrease  REF  counts  properly  in  the  midst  of  a recursive  computation. 

A node  becomes  dead  when  its  reference  count  becomes  zero.  When  that 
happens,  we  should  decrease  the  REF  fields  of  the  two  nodes  below  it;  and  then 
they  too  might  die  in  the  same  manner,  recursively  spreading  the  plague. 

But  a dead  node  needn’t  be  removed  from  memory  immediately.  It  still 
represents  a potentially  useful  Boolean  function,  and  we  might  discover  that  we 
need  that  function  again  as  our  computation  proceeds.  For  example,  we  might 
find  a dead  node  in  step  U2,  because  pointers  from  the  unique  table  don’t  get 
counted  as  references.  Likewise,  in  (55),  we  might  accidentally  stumble  across  a 
cache  memo  telling  us  that  f Ag  = r,  when  r is  currently  dead.  In  such  cases, 
node  r comes  back  to  life.  (And  we  must  increase  the  REF  counts  of  its  LO  and 
HI  descendants,  possibly  resurrecting  them  recursively  in  the  same  fashion.) 

Periodically,  however,  we  will  want  to  reclaim  memory  space  by  removing 
the  deadbeats.  Then  we  must  do  two  things:  We  must  purge  all  memos  from 
the  cache  for  which  either  /,  g,  or  r is  dead;  and  we  must  remove  all  dead 
nodes  from  memory  and  from  their  unique  tables.  See  exercise  84  for  typical 
heuristic  strategies  by  which  an  automated  system  might  decide  when  to  invoke 
such  cleanups  and  when  to  resize  the  tables  dynamically. 

Because  of  the  extra  machinery  that  is  needed  to  support  a BDD  base, 
Algorithm  U and  top-down  recursions  like  (55)  cannot  be  expected  to  match  the 
efficiency  of  Algorithm  S on  one-shot  examples  such  as  the  monotone-function 
unction  nn  m (49).  The  running  time  is  approximately  quadrupled  when  the 
more  general  approach  is  applied  to  this  example,  and  the  memory  requirement 
grows  by  a factor  of  about  2.4. 

But  a BDD  base  really  begins  to  shine  in  numerous  other  applications, 
uppose,  for  example,  that  we  want  the  formulas  for  each  bit  of  the  product 
of  two  binary  numbers, 


(zl-.-zm+n)2  - (x1...xm)2  X(y1...yn)2.  (56) 

Clearly  z±  . . . zm  — 0 ...  0 when  n = 0,  and  the  simple  recurrence 

(*!"•  Xm^2  X (yi  ■ ■ ■ y^Vn+lh  = (zi  • • • zm+n 0)2  + (a*  . . . Xm)2yn+1  (57) 

allows  us  to  increase  n by  1.  This  recurrence  is  easy  to  code  for  a BDD  base. 
Here  s what  we  get  when  m = n = 3,  with  subscripts  chosen  to  match  the 
analogous  diagram  for  binary  addition  in  (36): 
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Clearly  multiplication  is  much  more  complicated  than  addition,  bitwise.  (Indeed, 
if  it  weren’t,  factorization  wouldn’t  be  so  hard.)  The  corresponding  BDD  base 
for  binary  multiplication  when  m = n = 16  is  huge,  with  jB(/i,  • ■ ■ , /32)  = 
136,398,751  nodes.  It  can  be  found  after  doing  about  56  gigamems  of  calculation 
with  Algorithm  U,  in  6.3  gigabytes  of  memory-  including  some  1.9  billion 
invocations  of  recursive  subroutines,  with  hundreds  of  dynamic  resizings  of  the 
unique  tables  and  the  memo  cache,  plus  dozens  of  timely  garbage  collections. 
A similar  calculation  with  Algorithm  S would  be  almost  unthinkable,  although 
the  individual  functions  in  this  particular  example  do  not  share  many  common 
subfunctions:  It  turns  out  that  B(fi)  + • ■ • + B(f3?)  = 168,640,131,  with  the 
maximum  occurring  at  the  “middle  bit,”  B(fie)  = 38,174,143. 

*Ternary  operations.  Given  three  Boolean  functions  / = /(xi, . . . ,xn),  g — 
g(xi, . . . , x„),  and  h = h(x i, . . . , xn),  not  all  constant,  we  can  generalize  (52)  to 

/ — (xv?  fi-  fh)  and  g = (xv?gi:gh)  and  h = (x„?  ht:  hh),  (59) 
by  taking  v = min (fv,gv,hv).  Then,  for  example,  (53)  generalizes  to 

( fgh ) = (xv?  ( figihi ):  ( fh9hhh))\  (60) 


and  similar  formulas  hold  for  any  ternary  operation  on  /,  g , and  h,  including 


(/?  9 ■ h)  = (x„?  (/;?  gi:  hi):  (//,?  gh:  hh)).  (61) 

(The  reader  of  these  formulas  will  please  forgive  the  two  meanings  of  lh'  in  "hh  ■) 
Now  it’s  easy  to  generalize  (55)  to  ternary  combinations  like  multiplexing: 


MU  X(f,g,h)  = { 


If  (/?  g:  h)  has  an  obvious  value,  return  it. 

Otherwise,  if  (/?  g:  h)  = r is  in  the  memo  cache,  return  r. 

Otherwise  represent  /,  g , and  h as  in  (59); 

compute  n MUX(/(,  <7/,  hi)  and  rh  <-  MU X(fh,gh,hh); 

set  r <—  UNIQUE (n,  r^r^),  using  Algorithm  U; 

put  ‘(/?  g:  h)  = r’  into  the  memo  cache,  and  return  r. 


(62) 


(See  exercises  86  and  87.)  The  running  time  is  0(B{f)B(g)B(h)).  The  memo 
cache  must  now  be  consulted  with  a more  complex  key  than  before,  including 
three  pointers  (/,  g , h)  instead  of  two,  together  with  a code  for  the  relevant 
operation.  But  each  memo  (op,  f,  g,  h,  r)  can  still  be  represented  conveniently  in, 
say,  two  octabytes,  if  the  number  of  distinct  pointer  addresses  is  at  most  231. 

The  ternary  operation  / A g A h is  an  interesting  special  case.  We  could 
compute  it  with  two  invocations  of  (55),  either  as  AND(/,  AND(g,  h))  or  as 
AND(g,  AND(/i, /))  or  as  AND(/i,  AND(/,  g));  or  we  could  use  a ternary  sub- 
routine, ANDAND (f,g,h),  analogous  to  (62).  This  ternary  routine  first  sorts 
the  operands  so  that  the  pointers  satisfy  / < g < h.  Then  if  / = 0,  it  returns  0; 
if  / = 1 or  / = g,  it  returns  AND((7,  /i);  if  g = h it  returns  AND(/,  <7);  otherwise 
1 < / < g < h and  the  operation  remains  ternary  at  the  current  level  of  recursion. 

Suppose,  for  example,  that  / = /x5(x i,X3, . . . ,x63),  g — 7ts(x2,X4, . . . ,X64), 
and  h,  — G64(xi,  . . . , x64),  as  in  Eq.  (49).  The  computation  AND(/,  AND(g,  h)) 
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costs  0.2  + 6.8  = 7.0  megamems  in  the  author’s  experimental  implementation; 
AND(g,  AND(ft,  /))  costs  0.1  + 7.0  = 7.1;  AND  (ft,  AND(/,  g))  costs  24.4  + 5.6  = 
30.0(1);  and  ANDAND(/,  g,  ft)  costs  7.5.  So  in  this  instance  the  all-binary 
approach  wins,  if  we  don’t  choose  a bad  order  of  computation.  But  sometimes 
ternary  ANDAND  beats  all  three  of  its  binary  competitors  (see  exercise  88). 

*Quantifiers.  If  / = f(x\, . . . ,xn)  is  a Boolean  function  and  1 < j < n,  logicians 
traditionally  define  existential  and  universal  quantification  by  the  formulas 


^Xj  f(xi, . . . ,xn)  — /0  V fi  and  \fxj  f(xx, . . . , xn)  = f0  A /x , (63) 

where  fc  — /(aq, . . . , xj-i,  c,  Xj+i, . . . , xn).  Thus  the  quantifier  ‘3xj\  pro- 
nounced there  exists  Xj,  changes  / to  the  function  of  the  remaining  variables 
(xi, . . . , Xj  — i,  Xj+ 1, . . . , xn)  that  is  true  if  and  only  if  at  least  one  value  of  xj 
satisfies  f(x x,  — , ac„);  the  quantifier  ‘Vx/,  pronounced  “for  all  a changes  / 
to  the  function  that  is  true  if  and  only  if  both  values  of  Xj  satisfy  /. 

Several  quantifiers  are  often  applied  simultaneously.  For  example,  the  for- 
mula 3x23x33x6f(x1,...,xn)  stands  for  the  OR  of  eight  terms,  representing 
the  eight  functions  of  (aq,  x4,  x5,  x7, . . . , xn)  that  are  obtained  when  we  plug  the 
values  0 or  1 into  the  variables  X2,  X3,  and  x<;  in  all  possible  ways.  Similarly, 
Vx3  Vx6  f(xi , . . . ,xn)  stands  for  the  AND  of  those  same  eight  terms. 

One  common  application  arises  when  the  function  f{ii,...,if,j1,...,jm) 
denotes  the  value  in  row  (n  . . . q)2  and  column  (jd  . . . jm)2  of  a 2l  x 2m  Boolean 
matrix  F.  Then  the  function  ft(*i, . . . , q;  ki, . . . , kn)  given  by 


3ji  • • • 3jm  (/(ii,  1, . . . ,jm)  A g{ju. . . , jm- ku  . . . , kn))  (64) 

represents  the  matrix  H that  is  the  Boolean  product  F G. 

A convenient  way  to  implement  multiple  quantification  in  a BDD  base  has 
been  suggested  by  R.  L.  Rudell:  Let  g = Xjl  A • • • A xjm  be  a conjunction  of 
positive  literals.  Then  we  can  regard  3xh  . . . 3xJm  f as  the  binary  operation 
/ E <7,  implemented  by  the  following  variant  of  (55): 


EXISTS  (f,g) 


' If  / E g has  an  obvious  value,  return  it. 

Otherwise  represent  / and  g as  in  (52); 
if  v / fvi  return  EXISTS)/, g^). 

Otherwise,  if  / E g = r is  in  the  memo  cache,  return  r. 
Otherwise,  r,  <-  EXISTS (/,,  gh)  and  rh  i-  EXISTS (fh,gh);  ^ 
if  r*  / 9v,  set  r <—  UNIQUE(u,  rt,  rh)  using  Algorithm  U, 
otherwise  compute  r <-  OR (ri,rh); 

■ Put  / E g — r'  into  the  memo  cache,  and  return  r. 


(See  exercise  94.)  The  E operation  is  undefined  when  g does  not  have  the  stated 
form.  Notice  how  the  memo  cache  nicely  remembers  existential  computations 
that  have  gone  before. 

The  running  time  of  (65)  is  highly  variable  — not  like  (55)  where  we  know 
that  0{B{f)B{g))  is  the  worst  possible  case-  because  m OR  operations  are 
invoked  when  g specifies  m-fold  quantification.  The  worst  case  now  can  be  as 
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bad  as  order  B(f)2m , if  all  of  the  quantification  occurs  near  the  root  of  the  BDD 
for  /;  this  is  only  0(B(f)2)  if  m = 1,  but  it  might  become  unbearably  large  as  m 
grows.  On  the  other  hand,  if  all  of  the  quantification  occurs  near  the  sinks,  the 
running  time  is  simply  0(B(f)),  regardless  of  the  size  of  m.  (See  exercise  97.) 

Several  other  quantifiers  are  worthy  of  note,  and  equally  easy,  although  they 
aren’t  as  famous  as  3 and  V.  The  Boolean  difference  and  the  yes/no  quantifiers 
are  defined  by  formulas  analogous  to  (63): 

Qxj  / = /()©  /1;  A:r,  / = /o  A /1;  NX,  / = /o  A /1.  (66) 

The  Boolean  difference,  Q,  is  the  most  important  of  these:  Q xj  f is  true  for 
all  values  of  {aq, . . . , Xj_i,  Xj+i, . . . , xn}  such  that  / depends  on  xj.  If  the 
multilinear  representation  of  / is  / = ( xjg  + h)  mod  2,  where  g and  h are 
multilinear  polynomials  in  { x\ , . . . , aq-i,  Xj+ 1, . . . , xn},  then  Qxj  } — g mod  2. 
(See  Eq.  7.1.1-(ig).)  Thus  0 acts  like  a derivative  in  calculus,  over  a finite  field. 

A Boolean  function  /(aq, . . . ,xn)  is  monotone  (nondecreasing)  if  and  only 
if  \/”=  1 Naq  / = 0,  which  is  the  same  as  saying  that  Naq  / = 0 for  all  j.  However, 
exercise  105  presents  a faster  way  to  test  a BDD  for  monotonicity. 

Let’s  consider  now  a detailed  example  of  existential  quantification  that  is 
particularly  instructive.  If  G is  any  graph,  we  can  form  Boolean  functions  IND(x) 
and  KER(x)  for  its  independent  sets  and  kernels  as  follows,  where  a:  is  a bit  vector 
with  one  entry  xv  for  each  vertex  v of  G: 

IND(x)  = ->  \J  (xuAx„); 

U V 

KER(x)  = IND(x)  A f\(xv  V \J  xu ) 

V u — V 

We  can  form  a new  graph  Q whose  vertices  are  the  kernels  of  G,  namely  the 
vectors  x such  that  KER(x)  = 1.  Let’s  say  that  two  kernels  x and  y are  adjacent 
in  Q if  they  differ  in  just  the  two  entries  for  u and  v,  where  (xu,  xv ) = (1,0)  and 
{Vu^Vv)  — (0, 1),  in  which  case  we’ll  also  have  u — v.  Kernels  can  be  considered 
as  certain  ways  to  place  markers  on  vertices  of  G;  moving  a marker  from  one 
vertex  to  a neighboring  vertex  produces  an  adjacent  kernel.  Formally  we  define 

ADJ  (x,  y ) = [i/(x  © y)  = 2]  A KER(x)  A KER(y).  (69) 

Then  x — y in  Q if  and  only  if  ADJ  (x,y)  = 1. 

Notice  that,  if  x = aq  . . .xn,  the  function  |V(x)  = 2]  is  the  symmetric  func- 
tion 5 2(24, . . . ,xn ).  Furthermore  f(x  © y)  has  at  most  3 times  as  many  nodes 
as  /(x),  if  we  interleave  the  variables  zipperwise  so  that  the  branching  order  is 
(xi,  yi, . . . , xn,yn).  So  B(ADJ)  won’t  be  extremely  large  unless  B(KER)  is  large. 

Quantification  now  makes  it  easy  to  express  the  condition  that  x is  an 
isolated  vertex  of  Q (a  vertex  of  degree  0,  a kernel  without  neighbors): 

ISO(x)  = KER(x)  A -<3y  ADJ  (x,  y).  (70) 

For  example,  suppose  G is  the  graph  of  contiguous  states  in  the  USA,  as 
in  (18).  Then  each  kernel  vector  x has  49  entries  xv  for  v G {ME,  NH, . . . , CA}. 


(67) 

(68) 
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The  graph  Q has  266,137  vertices,  and  we  have  observed  earlier  that  the  BDD 
sizes  for  IND(ic)  and  KER(x)  are  respectively  428  and  780  (see  (17)).  In  this  case 
ADJ(x,  V ) in  (69)  has  a BDD  of  only  7260  nodes,  even  though  it’s  a function  of 
98  Boolean  variables.  The  BDD  for  3y  ADi(x,y),  which  describes  all  kernels  z 
of  G that  have  at  least  one  neighbor,  turns  out  to  have  842  nodes;  and  the  one 
for  ISO  (a:)  has  only  77.  We  find  that  G has  exactly  three  isolated  kernels,  namely 


(71) 


and  another  that  is  a blend  of  these  two.  Using  the  algorithms  above,  this  entire 
calculation,  starting  from  a list  of  the  vertices  and  edges  of  G (not  £),  can  be 
carried  out  with  a total  cost  of  about  4 megamems,  in  about  1.6  megabytes  of 
memory;  that’s  only  about  15  memory  accesses  per  kernel  of  G. 

In  a similar  fashion  we  can  use  BDDs  to  work  with  other  “implicit  graphs,” 
which  have  more  vertices  than  could  possibly  be  represented  in  memory,  if  those 
vertices  can  be  characterized  as  the  solution  vectors  of  Boolean  functions.  When 
the  functions  aren’t  too  complicated,  we  can  answer  queries  about  those  graphs 
that  could  never  be  answered  by  representing  the  vertices  and  arcs  explicitly. 

* Functional  composition.  The  piece  de  resistance  of  recursive  BDD  algorithms 
is  a general  procedure  to  compute  f(g1 , g2, . . . , gn),  where  / is  a given  function  of 
{xi,X2,  . . . ,xn}  and  so  is  each  argument  gj.  Suppose  we  know  a number  m > 0 
such  that  gj  = Xj  for  m < j < n;  then  the  procedure  can  be  expressed  as  follows: 


COMPOSE  (f,9l,...,gn) 


' If  / = 0 or  / = 1,  return  /. 

Otherwise  suppose  / = (xv?  fr.  fh),  as  in  (50); 
if  v > m,  return  /;  otherwise,  if  f(g1, . . . ,gn)  = r 
is  in  the  memo  cache,  return  r. 

Compute  ri  «-  COMPOSE  ...,qn)  '72 

and  rh^  COMPOSE^, gu...,gn)- 
set  r «-  MVX(gv,ri,rh)  using  (62); 

■ put  l/(ffi;  • • • )5n)  = r'  into  the  cache,  and  return  r. 


The  representation  of  cache  memos  like  ‘/(^l , - ■■,gn)  = r'  in  this  algorithm  is  a 
bit  tricky;  we  will  discuss  it  momentarily. 

Although  the  computations  here  look  basically  the  same  as  those  we’ve  been 
seeing  in  previous  recursions,  there  is  in  fact  a huge  difference:  The  functions  rt 
and  rh  in  (72)  can  now  involve  all  variables  {xu . . .,xn},  not  just  the  x’s  near 
the  bottom  of  the  BDDs.  So  the  running  time  of  (72)  might  actually  be  huge. 
But  there  also  are  many  cases  when  everything  works  together  harmoniously  and 
efficiently.  For  example,  the  computation  of  [u(x  © y)  = 2]  in  (69)  is  no  problem. 
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The  key  of  a memo  like  ‘ f(gi,  ■ ■ ■ , gn)  = r’  should  not  be  a completely 
detailed  specification  of  (/, gq, . . . , <;„),  because  we  want  to  hash  it  efficiently. 
Therefore  we  store  only  ‘/[G]  = r’,  where  G is  an  identification  number  for  the 
sequence  of  functions  (gi, . . . , gn).  Whenever  that  sequence  changes,  we  can  use  a 
new  number  G;  and  we  can  remember  the  G’s  for  special  sequences  of  functions 
that  occur  repeatedly  in  a particular  computation,  as  long  as  the  individual 
functions  gj  don’t  die.  (See  also  the  alternative  scheme  in  exercise  102.) 

Let’s  return  to  the  graph  of  contiguous  states  for  one  more  example.  That 
graph  is  planar;  suppose  we  want  to  color  it  with  four  colors.  Since  the  colors 
can  be  given  2-bit  codes  {00,01, 10, 11},  it’s  easy  to  express  the  valid  colorings 
as  a Boolean  function  of  98  variables  that  is  true  if  and  only  if  the  color  codes 
ab  are  different  for  each  pair  of  adjacent  states: 

COLOR(aMEi  ^mei  • • • , «cai  ^ca)  — 

IND(aHE  A &me,  • • • j ttcA  A bck)  A IND(aME  A bK E, . . . , ac A A 6ca)  (73) 

A IND(aHE  A &mei  . . . , acA  A hCA)  A IND(aME  A bHE, . . . , aCA  A bCA). 

Each  of  the  four  INDs  has  a BDD  of  854  nodes,  which  can  be  computed  via  (72) 
with  a cost  of  about  70  kilomems.  The  COLOR  function  turns  out  to  have  only 
25579  BDD  nodes.  Algorithm  C now  quickly  establishes  that  the  total  number 
of  ways  to  4-color  this  graph  is  exactly  25,623,183,458,304  — or,  if  we  divide 
by  4!  to  remove  symmetries,  about  1.1  trillion.  The  total  time  needed  for  this 
computation,  starting  from  a description  of  the  graph,  is  less  than  3.5  megamems, 
in  2.2  megabytes  of  memory.  (We  can  also  find  random  4-colorings,  etc.) 

Nasty  functions.  Of  course  there  also  are  functions  of  98  variables  that  aren’t 
nearly  so  nice  as  COLOR.  Indeed,  the  total  number  of  98-variable  functions  is 
2298;  exercise  108  proves  that  at  most  2246  of  them  have  a BDD  size  less  than 
a trillion,  and  that  almost  all  Boolean  functions  of  98  variables  actually  have 
B(f)  fa  298/98  i=s  3.2  x 1027.  There’s  just  no  way  to  compress  298  bits  of  data 
into  a small  space,  unless  that  data  happens  to  be  highly  redundant. 

What’s  the  worst  case?  If  / is  a Boolean  function  of  n variables,  how  large 
can  B(f)  be?  The  answer  isn’t  hard  to  discover,  if  we  consider  the  profile  of 
a given  BDD,  which  is  the  sequence  (bo, . . . ,bn-i,bn)  when  there  are  bk  nodes 
that  branch  on  variable  Xk+ 1 and  bn  sinks.  Clearly 

B(f)  = bo  + • • • + 5n- 1 + bn . (74) 

We  also  have  bo  < 1,  fq  < 2,  i>2  < 4.  63  < 8,  and  in  general 

h < 2fc,  (75) 

because  each  node  has  only  two  branches.  Furthermore  bn  = 2 whenever  / isn’t 
constant;  and  bn- 1 < 2,  because  there  are  only  two  legal  choices  for  the  LO  and 
HI  branches  of  (n).  Indeed,  we  know  that  bk  is  the  number  of  beads  of  order 
n — k in  the  truth  table  for  /,  namely  the  number  of  distinct  subfunctions  of 
(xk+ 1,  ■ ■ ■ ,xn)  that  depend  on  Xk+ 1 after  the  values  of  (aq, . . . ,Xk)  have  been 
specified.  Only  22”‘  — 22”‘  beads  of  order  m are  possible,  so  we  must  have 

bk  < 22  — 22  , for  0 < k < n.  (76) 
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When  n = 11,  for  instance,  (75)  and  (76)  tell  us  that  (60, . . . , 6U)  is  at  most 

(1,  2,  4,  8,  16,  32,  64,  128,  240,  12,  2,  2).  (77) 

Thus  13(f)  < 1 + 2 + • • • + 128  + 240  + • • ■ + 2 = 255  + 256  = 511  when  n = 11. 
This  upper  bound  is  in  fact  obtained  with  the  truth  table 


00000000  00000001  00000010...  11111110  11111111,  (78) 

or  with  any  string  of  length  2U  that  is  a permutation  of  the  256  possible  8-bit 
bytes,  because  all  of  the  8-bit  beads  are  clearly  present,  and  because  all  of  the 
subtables  of  lengths  16,  32,  . . . , 211  are  clearly  beads.  Similar  examples  can  be 
constructed  for  all  n (see  exercise  110).  Therefore  the  worst  case  is  known: 

Theorem  U.  Every  Boolean  function  f(xu.  has  B(f)  < Un,  where 

n—  1 

Un  = 2+  ^min(2fc,22"'  - 22"-'1-1)  = 2n~A("-An)  + 22Mn~Xn)_  p (7g) 

k= 0 


Furthermore,  explicit  functions  fn  with  B(fn)  = Un  exist  for  all  n.  | 


If  we  replace  A by  lg,  the  right-hand  side  of  (79)  becomes  2 n/(n  - lgn)  + 
2"/n  - 1.  In  general,  Un  is  un  times  2 n/n,  where  the  factor  un  lies  between  1 
and  2 + 0(- 1^).  A BDD  with  about  2n+1/n  nodes  needs  about  n + 1 - lg  n bits 
for  each  of  two  pointers  in  every  node,  plus  lg  n bits  to  indicate  the  variable  for 
branching.  So  the  total  amount  of  memory  space  taken  up  by  the  BDD  for  any 
function  f(x  1,  ...,xn)  is  never  more  than  about  2"+2  bits,  which  is  four  times 
the  number  of  bits  in  its  truth  table,  even  if  / happens  to  be  one  of  the  worst 
possible  functions  from  the  standpoint  of  BDD  representation. 

The  average  case  turns  out  to  be  almost  the  same  as  the  worst  case,  if  we 
choose  the  truth  table  for  / at  random  from  among  all  22’*  possibilities.  Again  the 
calculations  are  straightforward:  The  average  number  of  (A+ l)  nodes  is  exactly 


h = (22"-fc—  22"-*-1)  (22"—  (22"-k—  l)2*1) /22 


(80) 


because  there  are  22"  - 22'‘  * 1 beads  of  order  n - k and  (22”  - l)2"  truth 
tables  in  which  any  particular  bead  does  not  occur.  Exercise  112  shows  that  this 
complicated-looking  quantity  bk  always  lies  extremely  close  to  the  worst-case 
estimate  min(2fc,  22"  - 22”“''~1),  except  for  two  values  of  A.  The  exceptional 
levels  occur  when  k « 2n~k  and  the  “min”  has  little  effect.  For  example,  the 
average  profile  (b0, . . . , bn_i,  bn)  when  n = 11  is  approximately 


(1.0,  2.0,  4.0,  8.0,  16.0,  32.0,  64.0,  127.4,  151.9,  12.0,  2.0,  2.0)  (81) 

when  rounded  to  one  decimal  place,  and  these  values  are  virtually  indistinguish- 
able from  the  worst  case  (77)  except  when  A:  = 7 or  8. 

A related  concept  called  a quasi-BDD , or  “QDD,”  is  also  important.  Every 
function  has  a unique  QDD,  which  is  similar  to  its  BDD  except  that  the  root 
node  is  always  (T),  and  every  (A)  node  for  A < n branches  to  two  (A+l)  nodes; 
thus  every  path  from  the  root  to  a sink  has  length  n.  To  make  this  possible, 
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we  allow  the  LO  and  HI  pointers  of  a QDD  node  to  be  identical.  But  the  QDD 
must  still  be  reduced,  in  the  sense  that  different  nodes  cannot  have  the  same  two 
pointers  (LO,  HI).  For  example,  the  QDD  for  (aqa^a^)  is 


it  has  two  more  nodes  than  the  corresponding  BDD  in  Fig.  21.  Notice  that  the 
V fields  are  redundant  in  a QDD,  so  they  needn’t  be  present  in  memory. 

The  quasi-profile  of  a function  is  (go,  • • • , gn-i,  g«),  where  qk-i  is  the  number 
of  (fc)  nodes  in  the  QDD.  It’s  easy  to  see  that  q k is  also  the  number  of  distinct 
subtables  of  order  n — k in  the  truth  table,  just  as  bk  is  the  number  of  distinct 
beads.  Every  bead  is  a subtable,  so  we  have 

qk  > bk,  for  0 < k < n.  (83) 

Furthermore,  exercise  115  proves  that 

qk  < 1 + b0  4 + bk- 1 and  qk  < bk  + b bni  for  0 <k<  n.  (84) 

Consequently  each  element  of  the  quasi-profile  is  a lower  bound  on  the  BDD  size: 

B(f)  > 2qk  - 1,  for  0 < k < n.  (85) 

Let  Q(f)  — qo  + • ■ • + qn-i  + g«  be  the  total  size  of  the  QDD  for  /.  We 
obviously  have  Q(f)  > B(f ),  by  (83).  On  the  other  hand  Q(f)  can’t  be  too 
much  bigger  than  £?(/),  because  (84)  implies  that 

ft  _1_  1 

Q(f)  < -±- (£(/)  + I)-  (86) 

Exercises  116  and  117  explore  other  basic  properties  of  quasi-profiles. 

The  worst-case  truth  table  (78)  actually  corresponds  to  a familiar  function 
that  we’ve  already  seen,  the  8-way  multiplexer 

M3(x9,  xio,xu-,xi,...,x8)  = xi+(X9xloXll)2-  (87) 

But  we’ve  renumbered  the  variables  perversely  so  that  the  multiplexing  now 
occurs  with  respect  to  the  last  three  variables  (*9,  aqo,  in),  instead  of  the  first 
three  as  in  Eq.  (30).  This  simple  change  to  the  ordering  of  the  variables  raises 
the  BDD  size  of  M3  from  17  to  511;  and  an  analogous  change  when  n — 2m  + m 
would  cause  B(Mm ) to  make  a colossal  leap  from  2 n — 2m  + 1 to  2n-m+1  — 1. 

R.  E.  Bryant  has  introduced  an  interesting  “navel-gazing”  multiplexer  called 
the  hidden  weighted  bit  function , defined  as  follows: 

hn[x  i,...,:rn)  — — f-  xn  Xisxi  (88) 

with  the  understanding  that  *0  = 0.  For  example,  X2,  £3,  £4)  has  the  truth 

table  0000  011110011011.  He  proved  [IEEE  Trans.  C-40  (1991),  208-210]  that 
hn  has  a large  BDD,  regardless  of  how  we  might  try  to  renumber  its  variables. 
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With  the  standard  ordering  of  variables,  the  profile  (b0,  ...,bn)  of  hn  is 

(1,2,4,8,15,27,46,40,18,7,2,2);  (89) 

hence  B(hn)  = 172.  The  first  half  of  this  profile  is  actually  the  Fibonacci  se- 
quence in  slight  disguise,  with  bk  = Fk+4  -k-  2.  In  general,  hn  always  has  this 
value  of  bk  for  k < n/2;  thus  its  initial  profile  counts  grow  with  order  <fik  instead  of 
the  worst-case  rate  of  2fc.  This  growth  rate  slackens  after  k surpasses  n/2,  so  that, 
for  example,  B(h32)  is  only  a modest  86,636.  But  exponential  growth  eventually 
takes  over,  and  B(h100)  is  out  of  sight:  17,530,618,296,680.  (When  n = 100,  the 
maximum  profile  element  is  b59  = 2,947,635,944,748,  which  dwarfs  60  + - • -+649  = 
139,583,861,115.)  Exercise  125  proves  that  B(hn)  is  asymptotically  cXn  + 0(n2), 
where  

_ \/27-  1/62l+  (/27  + V62l 

W 

= 1.32471 79572  44746  02596  09088  54478  09734  07344+  (90) 

is  the  so-called  “plastic  constant,”  the  positive  root  of  x3  = X + 1-  and  the 
coefficient  c is  7X  - 1 + 14/(3  + 2X)  ss  10.75115. 

On  the  other  hand  we  can  do  substantially  better  if  we  change  the  order 
in  which  the  variables  are  tested  in  the  BDD.  If  f{xu.  ..,*„)  is  any  Boolean 
function  and  if  n is  any  permutation  of  {1, . . . , n},  let  us  write 

/ (xi,...,xn)  — f(x4n,  . . . , Xnn).  (gx) 

For  example,  if  f(xi,x 2,x3,a:4)  = (x3  V (sq  A x4))  A (x2  V x4)  and  if  ( 1 7r,  27t, 
37t,47t)  = (3,2,4, 1),  then  fn(x4,  x2,  x3,  x4)  = (x4  V (x3  Aq))  A (x2  Vq);  and 
we  have  B(f)  = 10,  B(f x)  = 6 because  the  BDDs  are 


The  BDD  for  f*  corresponds  to  a BDD  for  / that  has  a nonstandard  ordering, 
m which  a branch  is  permitted  from  @ to  @ only  if  in  < jn: 


(93) 


The  root  is  @,  where  i = In  is  the  index  for  which  in  = 1.  When  the  branch 
variables  are  listed  from  the  top  down,  we  have  (An,  2n,  In,  3n)  = (1,2,  3, 4). 
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Applying  these  ideas  to  the  hidden  weighted  bit  function,  we  have 

• • • > %n)  f- -xn)m  ( 94) 

with  the  understanding  that  07r  = 0 and  xo  = 0.  For  example,  hf  (0, 0, 1)  = 1 if 
(l7r,  27r,  37t)  = (3, 1,2),  because  x^Xl+X2+X3')„  — x3  — 1.  (See  exercise  120.) 

Element  q *.  of  the  quasi-profile  counts  the  number  of  distinct  subfunctions 
that  arise  when  the  values  of  x\  through  x are  known.  Using  (94),  we  can 
represent  all  such  subfunctions  by  means  of  a slate  of  options  [ro, . . . , rn_fc], 
where  rj  is  the  result  of  the  subfunction  when  Xfc+ 1 + • • • + xn  — j.  Suppose 

Xi  = ci,  . . . , xk  = Cfe,  and  let  s = ci  H h Cfc.  Then  rj  = C(s+j)v  if  (s  + j)ir  < k; 

otherwise  rj  = X(s+j)„.  However,  we  set  ro  <—  0 if  sir  > k,  and  r„_fc  1 if 
(s  + n — k)rr  > k , so  that  the  first  and  last  options  of  every  slate  are  constant. 

For  example,  calculations  show  that  the  following  permutation  l7r  . . . 1007T 
reduces  the  BDD  size  of  /i10o  from  17.5  trillion  to  B(hf00)  = 1,124,432,105: 

2 4 6 8 10  12  14  16  18  20  97  57  77  37  87  47  67  27  92  52 

72  32  82  42  62  22  100  60  80  40  90  50  70  30  95  55  75  35  85  45 

65  25  98  58  78  38  88  48  68  28  93  53  73  33  83  43  63  23  99  59  (95) 

79  39  89  49  69  29  94  54  74  34  84  44  64  24  96  56  76  36  86  46 

66  26  91  51  71  31  81  41  61  21  19  17  15  13  11  9 7 5 3 1 

Such  calculations  can  be  based  on  an  enumeration  of  all  slates  that  can  arise,  for 
0 < s < k < n.  Suppose  we’ve  tested  x 1 , . . . , x83  and  found  that  Xj  = [ j < 42], 
say,  for  1 < j < 83.  Then  s = 42;  and  the  subfunction  of  the  remaining  17 
variables  (x84, . . . ,xi0o)  is  given  by  the  slate  [r0, . . . ,r17]  = [c25,  x98,  c58,  c78,  c38, 
Xss , c48 , c68 , c28 , X93 , C53 , c73 , C33 , c83 , C43 , c63 , C23 , X9g] , which  reduces  to 

[1,  x98 , 0,  0, 1,  x88, 0, 0, 1,  x93, 0, 0, 1, 0, 0, 0, 1, 1],  (96) 

This  is  one  of  the  214  subfunctions  counted  by  g83  when  s = 42.  Exercise  124 
explains  how  to  deal  similarly  with  the  other  values  of  k and  s. 

We’re  ready  now  to  prove  Bryant’s  theorem: 

Theorem  B.  The  BDD  size  of  exceeds  2 Lrl/5 1 ; for  nj}  permutations  n. 

Proof.  Observe  first,  that  two  subfunctions  of  h £ are  equal  if  and  only  if  they 
have  the  same  slate.  For  if  [r0, . . . , rn_k ] / [r’0, . . . , r'n_k\,  suppose  r • / r' . If 
both  Tj  and  r'  are  constant,  the  subfunctions  differ  when  x^+i  + • • • + x„  = j. 
If  rj  is  constant  but  r'  = xi)  we  have  0 < j < n — k;  the  subfunctions  differ 
because  Xfc+i  + ■ ■ ■ + xn  can  equal  j with  Xi  / rj.  And  if  rj  = Xj  but  r'  = xit 

with  i ^ i',  we  can  have  Xfc+1  H 1-  xn  — j with  Xj  / x^ . (The  latter  case  can 

arise  only  when  the  slates  correspond  to  different  offsets  s and  s'.) 

Therefore  qk  is  the  number  of  different  slates  [r0, . . . , rn Exercise  123 
proves  that  this  number,  for  any  given  fc,  n,  and  s as  described  above,  is  exactly 

<"> 

where  w is  the  number  of  indices  j such  that  s<j<s  + n — k and  j7r  < k. 

Now  consider  the  case  k = [3n/5j  + 1,  and  let  s = k — [n/2] , s'  = \n/2\  + 1. 
(Think  of  n = 100,  k = 61,  s = 11,  s'  — 51.  We  may  assume  that  n > 10.)  Then 
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w + w'  = k - w",  where  w"  counts  the  indices  with  jtt  < k and  either  j < s 
or  j > s'  + n — k.  Since  w"  < (s  — 1)  + (k  — s')  — 2k  — 2 — n,  we  must  have 
w + w'  > n + 2 - k = |"2n/5]  + 1.  Hence  either  w > \n/ 5j  or  w’  > [n/5j;  and 
in  both  cases  (97)  exceeds  2l”/5-l-i.  The  theorem  follows  from  (85).  | 

Conversely,  there’s  always  a permutation  tt  such  that  B(h £)  = o(20-2029")5 
although  the  constant  hidden  by  O-notation  is  quite  large.  This  result  was  proved 
by  B.  Bollig,  M.  Lobbing,  M.  Sauerhoff,  and  I.  Wegener,  Theoretical  Informatics 
and  Applications  33  (1999),  103-115,  using  a permutation  like  (95):  The  first 
indices,  with  j tt  < n/5,  come  alternately  from  j > 9n/10  and  j < n/ 10;  the 
others  are  ordered  by  reading  the  binary  representation  of  On/ 1 0 - j from  right 
to  left  ( colex  order). 

Let’s  also  look  briefly  at  a much  simpler  example,  the  permutation  function 
Pm{x  1, . . . ,xm2 ),  which  equals  1 if  and  only  if  the  binary  matrix  with  x^i_i')rn+J 
in  row  i and  column  j is  a permutation  matrix: 

m 

Pm(xi,  . . . ,xm2)  = ^l(^(*-l)m+l^(«-l)m+25  • • • ,#(i-i)m+m) 

2=1 

m 

A f\  Si  (xj,  xm+j , . . . , xm2_rn+j).  (98) 
3= 1 

In  spite  of  its  simplicity,  this  function  cannot  be  represented  with  a small  BDD, 
under  any  reordering  of  its  variables: 

Theorem  K.  The  BDD  size  of  exceeds  m2m~1,  for  all  permutations  tt. 

Proof.  [See  I.  Wegener,  Branching  Programs  and  Binary  Decision  Diagrams 
(SIAM,  2000),  Theorem  4.12.3.]  Given  the  BDD  for  Pfn,  notice  that  each  of  the 
m!  vectors  x such  that  P^x)  = 1 traces  a path  of  length  n = m2  from  the  root 
to  [jj;  every  variable  must  be  tested.  Let  vk(x)  be  the  node  from  which  the 
path  for  x takes  its  fct.h  HI  branch.  This  node  branches  on  the  value  in  row  i and 
column  j of  the  given  matrix,  for  some  pair  (j,  j)  = (ik(x),  jk(x)). 

Suppose  vk(x)  = vk/(x'),  where  x / x’ . Construct  x"  by  letting  it  agree 
with  x up  to  vk(x)  and  with  x'  thereafter.  Then  P/^^x")  = 1;  consequently  we 
must  have  k — k! . In  fact,  this  argument  shows  that  we  must  also  have 

{*!(*)>  *2(a;)>  • • • ,*fc-i(a;)}  = {h(x'),  t2(aj')>  • • • > *fc- l(ar')} 

and  {ji(a:),j2(as), . . . ,jk-i{x)}  - {ji(x'),  j2(x'), . . . ,jk-i{x')}.  (99) 

Imagine  m colors  of  tickets,  with  m!  tickets  of  each  color.  Place  a ticket  of 
color  k on  node  vk(x),  for  all  k and  all  x.  Then  no  node  gets  tickets  of  different 
colors;  and  no  node  of  color  k gets  more  than  (k  - 1)!  (m  - k)\  tickets  altogether, 
by  Eq.  (99).  Therefore  at  least  m!/((fc  - 1)!  (m  - k)!)  = k(™)  different  nodes 
must  receive  tickets  of  color  k.  Summing  over  k gives  m2m_1  non-sink  nodes.  | 

Exercise  184  shows  that  B{Pm)  is  less  than  m2m+1,  so  the  lower  bound  in 
Theorem  K is  nearly  optimum  except  for  a factor  of  4.  Although  the  size  grows 
exponentially,  the  behavior  isn  t hopelessly  bad,  because  m - \fn.  For  example, 
B(P2o)  is  only  38,797,317,  even  though  P20  is  a Boolean  function  of  400  variables. 


7.1.4 


BINARY  DECISION  DIAGRAMS  239 


^Optimizing  the  order.  Let  -Bm;n(/)  and  Bmax(f)  denote  the  smallest  and 
largest  values  of  B(fn),  taken  over  all  permutations  7r  that  can  prescribe  an 
ordering  of  the  variables.  We’ve  seen  several  cases  where  Bm ;n  and  Bmax  are 
dramatically  different;  for  example,  the  2m-way  multiplexer  has  Bm\n(Mm)  sa  2 n 
and  Bmax(Mm)  ss  2 n/n,  when  n — 2m  + m.  And  indeed,  simple  functions  for 
which  a good  ordering  is  crucial  are  not  at  all  unusual.  Consider,  for  instance, 

f(x  i,x2,...,xn)  = (xi  V x2)  A (x3  V x4)  A ■ • ■ A (x„_i  V xn),  n even;  (loo) 

this  is  the  important  subset  function  [xix3  . . . xn_i  C X2X4  . . . xn],  and  we  have 
B(f)  — Bmm(f)  = n + 2.  But  the  BDD  size  explodes  to  B(fn)  = -Bmax(/)  = 
2"/2+1  when  n is  “organ-pipe  order,”  namely  the  ordering  for  which 

r{x  i,x2,...,x„)  = (xr  V xn)  A (x2  Vxn_i)  A • • • A (x„/2  V xn/2+i).  (101) 

And  the  same  bad  behavior  occurs  for  the  ordering  [xi  . . . x„/2  C xn/2+i . . . xn\. 
In  these  orderings  the  BDD  must  “remember”  the  states  of  n/2  variables,  while 
the  original  formulation  (100)  needs  very  little  memory. 

Every  Boolean  function  / has  a master  profile  chart,  which  encapsulates  the 
set  of  all  its  possible  sizes  B(fn).  If  / has  n variables,  this  chart  has  2n  vertices, 
one  for  each  subset  of  the  variables;  and  it  has  n2n~1  edges,  one  for  each  pair  of 
subsets  that  differ  in  just  one  element.  For  example,  the  master  profile  chart  for 
the  function  in  (92)  and  (93)  is 


(102) 


Every  edge  has  a weight,  illustrated  here  by  the  number  of  lines;  for  example, 
the  weight  between  {1,  2}  and  {1, 2, 3}  is  3.  The  chart  has  the  following  interpre- 
tation: If  X is  a subset  of  k variables , and  if  x ^ X,  then  the  weight  between  X 
and  XU  x is  the  number  of  subfunctions  of  f that  depend  on  x when  the  variables 
of  X have  been  replaced  by  constants  in  all  2fc  possible  ways.  For  example,  if 
X = {1,2},  we  have  /(0,0,x3,x4)  = x3,  /( 0,  l,x3,x4)  = /( 1,  l,x3,x4)  = x3Ax4, 
and  /( l,0,x3,x4)  = x3  V x4;  all  three  of  these  subfunctions  depend  on  x3,  but 
only  two  of  them  depend  on  x4,  as  shown  in  the  weights  below  {1,2}. 

There  are  n!  paths  of  length  n from  0 to  { 1, . . . , n},  and  we  can  let  the  path 
0 — > {a4}  — >■  {ai,a2}  —»•••—>■  {a4,...,an}  correspond  to  the  permutation  n 
if  ai7r  = 1,  a2n  = 2,  . . . , an7r  = n.  Then  the  sum  of  the  weights  on  path  7r  is 
B(fn),  if  we  add  2 for  the  sink  nodes.  For  example,  the  path  0 -A  {4}  — > {2, 4}  -A 
{1,2,4}  -A  {1,2,  3, 4}  yields  the  only  way  to  achieve  B(fn)  — 6 as  in  (93). 
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Notice  that  the  master  profile  chart  is  a familiar  graph,  the  n-cube,  whose 
edges  have  been  decorated  so  that  they  count  the  number  of  beads  in  various  sets 
of  subfunctions.  The  graph  has  exponential  size,  n2n_1;  yet  it  is  much  smaller 
than  the  total  number  of  permutations,  n\.  When  n is,  say,  25  or  less,  exercise  138 
shows  that  the  entire  chart  can  be  computed  without  great  difficulty,  and  we  can 
find  an  optimum  permutation  for  any  given  function.  For  example,  the  hidden 
weighted  bit  function  turns  out  to  have  Blnin(h25)  = 20  90  and  Bmax(h25)  = 
35441;  the  minimum  is  achieved  with  (In, . . ,,25n)  = (3,  5,  7,  9,  11.  13,  15.  17. 
25,  24,  23,  22,  21,  20,  19,  18,  16,  14,  12,  10,  8,  6,  4,  2,  1),  while  the  maximum 
results  from  a strange  permutation  (22,  19,  17,  25,  15,  13,  11,  10,  9,  8,  7,  24,  6. 
5,  4,  3,  2,  12,  1,  14,  23,  16,  18,  20,  21)  that  tests  many  “middle”  variables  first. 

Instead  of  computing  the  entire  master  profile  chart,  we  can  sometimes  save 
time  by  learning  just  enough  about  it  to  determine  a path  of  least  weight.  (See 
exercise  140.)  But  when  n grows  and  functions  get  more  weird,  we  are  unlikely 
to  be  able  to  determine  Bmin(f)  exactly,  because  the  problem  of  finding  the  best 
ordering  is  NP-complete  (see  exercise  137). 

We’ve  defined  the  profile  and  quasi-profile  of  a single  Boolean  function  /,  but 
the  same  ideas  apply  also  to  an  arbitrary  BDD  base  that  contains  m functions 
{/i>  ■ • • j fm}-  Namely,  the  profile  is  (b0, when  there  are  bk  nodes  on 
level  k,  and  the  quasi-profile  is  (q0, . . . , qn)  when  there  are  qk  nodes  on  level  k of 
the  corresponding  QDD  base;  the  truth  tables  of  the  functions  have  bk  different 
beads  of  order  n-k,  and  qk  different  subtables.  For  example,  the  profile  of  the 
(4  + 4)-bit  addition  functions  {/1,  f2,  /3,  /4,  /5}  in  (36)  is  (2, 4,  3, 6,  3,  6, 3,  2,  2), 
and  the  quasi-profile  is  worked  out  in  exercise  144.  Similarly,  the  concept  of 
master  profile  chart  applies  to  m functions  whose  variables  are  reordered  simul- 
taneously; and  we  can  use  it  to  find  Bmin(fu  . . . , fm)  and  Bmax(/1, . . . , fm),  the 
minimum  and  maximum  of  b0  H b bn  taken  over  all  profiles. 

Local  reordering.  What  happens  to  a BDD  base  when  we  decide  to  branch 
on  x2  first,  then  on  x1;  x3,  xn?  Figure  26  shows  that  the  structure  of  the 
top  two  levels  can  change  dramatically,  but  all  other  levels  remain  the  same. 

A closer  analysis  reveals,  in  fact,  that  this  level-swapping  process  isn’t 
difficult  to  understand  or  to  implement.  The  (T)  nodes  before  swapping  can 
be  divided  into  two  kinds,  “tangled”  and  “solitary,”  depending  on  whether  they 
have  (2)  nodes  as  descendants;  for  example,  there  are  three  tangled  nodes  at 
the  left  of  Fig.  26,  pointed  to  by  si,  s2,  and  S3,  while  S4  points  to  a solitary 
node.  Similarly,  the  (2)  nodes  before  swapping  are  either  “visible”  or  “hidden,” 
depending  on  whether  they  are  independent  source  functions  or  accessible  only 
from  (T)  nodes;  all  four  of  the  (|)  nodes  at  the  left  of  Fig.  26  are  hidden. 

After  swapping,  the  solitary  (T)  nodes  simply  move  down  one  level;  but 
the  tangled  nodes  are  transmogrified  into  (2)s,  according  to  a process  that  we 
shall  explain  shortly.  The  hidden  (2)  nodes  disappear,  if  any,  and  the  visible  ones 
simply  move  up  to  the  top  level.  Additional  nodes  might  also  arise  during  the 
transmogrification  process;  such  nodes,  labeled  @,  are  called  “newbies.”  For  ex- 
ample, two  newbies  appear  above  t2  at  the  right  of  Fig.  26.  This  process  decreases 
the  total  number  of  nodes  if  and  only  if  the  hidden  nodes  outnumber  the  newbies. 
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S1  *2  *3  *4  *1  *2  *3  *4 


Fig.  26.  Interchanging  the  top  two  levels  of  a BDD  base.  Here  (si,  s2,  S3,  s4)  are  source 
functions;  (fi,  <2,  £3,  £4)  are  target  nodes,  representing  subfunctions  at  lower  levels. 

The  reverse  of  a swap  is,  of  course,  the  same  as  a swap,  but  with  the  roles  of 
(T)  and  (2)  interchanged.  If  we  begin  with  the  diagram  at  the  right  of  Fig.  26, 
we  see  that  it  has  three  tangled  nodes  (labeled  (2))  and  one  that’s  visible  (la- 
beled (T));  two  of  its  nodes  are  hidden,  none  are  solitary.  The  swapping  process 
in  general  sends  (tangled,  solitary,  visible,  hidden)  nodes  into  (tangled,  visible, 
solitary,  newbie)  nodes,  respectively — - after  which  newbies  would  become  hidden 
in  a reverse  swap,  and  the  originally  hidden  nodes  would  reappear  as  newbies. 

Transmogrification  is  easiest  to  understand  if  we  treat  all  nodes  below  the 
top  two  levels  as  if  they  were  sinks,  having  constant  values.  Then  every  source 
function  f(x  i,x2)  depends  only  on  x\  and  x2\  hence  it  takes  on  four  values 
a = /( 0,0),  b = /( 0,1),  c = /(1,0),  and  d = /( 1,1),  where  a,  b,  c,  and  d 
represent  sinks.  We  may  suppose  that  there  are  q sinks,  [T] , [2]  , . . . , [9] , and 
that  1 < a,b,c,d  < q.  Then  f(x i,x2)  is  fully  described  by  its  extended  truth 
table , /(0, 0)/(0, 1)/(1,  0)/(l,  1)  = abed.  And  after  swapping,  we’re  left  with 
f{x2,x  1),  which  has  the  extended  truth  table  aebd.  For  example,  Fig.  26  can  be 
redrawn  as  follows,  using  extended  truth  tables  to  label  its  nodes: 

1224  2324  1324  3344  1224  2234  1234  3434 


Fig.  27.  Another  way  to  represent  the  transformations  in  Fig.  26. 

In  these  terms,  the  source  function  abed  points  to  a solitary  node  when  a — b / 
c = d,  and  to  a visible  node  when  a = c ^ b — d\  otherwise  it  points  to  a tangled 
node  (unless  a — b — c = d,  when  it  points  directly  to  a sink).  The  tangled  node 
abed  usually  has  LO  = ab  and  HI  = cd.  unless  a = b or  c = d;  in  the  exceptional 
cases,  LO  or  HI  is  a sink.  After  transmogrification  it  will  have  LO  — ac  and 
HI  = bd  in  a similar  way,  where  latter  nodes  will  be  either  newbies  or  visibles 
or  sinks  (but  not  both  sinks).  One  interesting  case  is  1224,  whose  children  12 
and  24  on  the  left  are  hidden  nodes,  while  the  12  and  24  on  the  right  are  newbies. 

Exercise  147  discusses  an  efficient  implementation  of  this  transformation, 
which  was  introduced  by  Richard  Rudell  in  IEEE/ACM  International  Conf. 
Computer-Aided  Design  CAD-93  (1993),  42-47.  It  has  the  important  property 
that  no  pointers  need  to  change,  except  within  the  nodes  on  the  top  two  levels: 
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All  source  nodes  Sj  still  point  to  the  same  place  in  computer  memory,  and  all 
sinks  retain  their  previous  identity.  We  have  described  it  as  a swap  between  (T)s 
and  @s,  but  in  fact  the  same  transformation  will  swap  (J)s  and  (k) s whenever 
the  variables  Xj  and  xk  correspond  to  branching  on  adjacent  levels.  The  reason 
is  that  the  upper  levels  of  any  BDD  base  essentially  define  source  functions  for 
the  lower  levels,  which  constitute  a BDD  base  in  their  own  right. 

We  know  from  our  study  of  sorting  that  any  reordering  of  the  variables  of 
a BDD  base  can  be  produced  by  a sequence  of  swaps  between  adjacent  levels. 
In  particular,  we  can  use  adjacent  swaps  to  do  a “jump-up”  transformation, 
which  brings  a given  variable  xk  to  the  top  level  without  disturbing  the  relative 
order  of  the  other  variables.  It’s  easy,  for  instance,  to  jump  x4  up  to  the  top- 
We  simply  swap  @ 44  ©,  then  @ 44  then  @ 44  Q,  because  x4  will  be 
adjacent  to  x\  after  it  has  jumped  past  x2. 

Since  repeated  swaps  can  produce  any  ordering,  they  are  sometimes  able 
to  make  a BDD  base  grow  until  it  is  too  big  to  handle.  How  bad  can  a single 
swap  be?  If  exactly  (s,t,v,h,  v)  nodes  are  solitary,  tangled,  visible,  hidden,  and 
newbie,  the  top  two  levels  end  up  with  s + t + v + v nodes;  and  this  is  at  most 
m+is  < m+2t  when  there  are  m source  functions,  because  m > s+t+v.  Thus  the 
new  size  of  those  levels  can’t  exceed  twice  the  original,  plus  the  number  of  sources. 

If  a single  swap  can  double  the  size,  a jump-up  for  xk  threatens  to  increase 
the  size  exponentially,  because  it  does  k - 1 swaps.  Fortunately,  however,  jump- 
ups  are  no  worse  than  single  swaps  in  this  regard: 

Theorem  J+.  B(f[ , . . . , /£)  < m + 2B(fi, . . . , fm)  after  a jump-up  operation. 

Proof.  Let  a1a2  . ■ . a2k_1a2k  be  the  extended  truth  table  for  a source  function 
f(x i>  • • • i xk )>  with  lower-level  nodes  regarded  as  sinks.  After  the  jump-up  the 
extended  truth  table  for  /* (xu  . . . , xk)  = f(xln, . . . , xkn)  = f(x2, . . . , xk,  xx)  is 
aia3  . . . a2k_1a2a4  ...a2k.  Thus  we  can  see  that  each  bead  on  level  j of  fn  is 
derived  from  some  bead  on  level  j-  1 of  /,  for  1 < j < h.  but  every  bead  on 
level  j - 1 of  / spawns  at  most  two  beads,  of  half  the  size,  in  fn.  Therefore 
if  the  respective  profiles  of  {h, ... , fm}  and  {/f , are  (bo,  and 

("O’  • • • - *>„).  we  must  have  b'0  < m,  b[  < 2 b0,  ...,  b,k_1  < 2 bk_2,  b’k  = bk 

b’n  = bn.  The  total  is  therefore  < m + B(fu  . . . , fm)  + b0  + ■ ■ ■ + bk~2  - bk_4.  | 

The  opposite  of  a jump-up  is  a ‘jump-down,”  which  demotes  the  topmost 
variable  by  k - 1 levels.  As  before,  this  operation  can  be  implemented  with  k-  1 
swaps.  But  we  have  to  settle  for  a much  weaker  upper  bound  on  the  resulting  size: 

Theorem  J . B(ff , . . . , fft)  < B(f4, . . . , /m)2  after  a jump-down  operation. 

Proof.  Now  the  extended  truth  table  in  the  previous  proof  changes  from  a4 . . . a2k 
to  a4 . . . a2k~i  \ a2k-i+i . . . a2k  = a4a2k~i+1 . . .a2k-\a2k,  the  “zipper  function” 
7- 1 -3— (76) . In  this  case  we  can  identify  every  bead  after  the  jump  with  an 
ordered  pair  of  original  subfunctions,  as  in  the  melding  operation  (37)  and  (38). 
For  example,  when  k = 3 the  truth  table  12345678  becomes  15263748,  whose 
bead  1526  can  be  regarded  as  the  meld  12  o 56.  | 
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This  proof  indicates  why  quadratic  growth  might  occur.  If,  for  example, 

/(xi , . . . , xn)  Xi?  Afm  (x2, . . . , xm_)_i  5 X2m-j-2>  • • • , xn): 

1 ■ ■ ■ ^2m+2?  ■ ■ ■ * Xn),  (103) 

where  n = 1 + 2m  + 2m,  a jump-down  of  2m  levels  changes  B(f)  = 4 n — 8m  — 3 
to  B(fn)  = 2n2  — 8 m(n  — m)  — 2(n  — 2m)  + 1 sa  |.B(/)2. 

Since  jump-up  and  jump-down  are  inverse  operations,  we  can  also  use  Theo- 
rems J+  and  J~  in  reverse:  A jump-up  operation  might  conceivably  decrease  the 
BDD  size  to  something  like  its  square  root,  but  a jump-down  cannot  reduce  the 
size  to  less  than  about  half.  That’s  bad  news  for  fans  of  jump-down,  although 
they  can  take  comfort  from  the  knowledge  that  jump-downs  are  sometimes  the 
only  decent  way  to  get  from  a given  ordering  to  an  optimum  one. 

Theorems  J"*"  and  J-  are  due  to  B.  Bollig,  M.  Lobbing,  and  I.  Wegener,  Inf. 
Processing  Letters  59  (1996),  233-239.  (See  also  exercise  149.) 

* Dynamic  reordering.  In  practice,  a natural  way  to  order  the  variables  often 
suggests  itself,  based  on  the  modules-in-a-row  perspective  of  Fig.  23  and  Theo- 
rem M.  But  sometimes  no  suitable  ordering  is  apparent,  and  we  can  only  hope 
to  be  lucky;  perhaps  the  computer  will  come  to  our  rescue  and  find  one.  Fur- 
thermore, even  if  we  do  know  a good  way  to  begin  a computation,  the  ordering 
of  variables  that  works  best  in  the  first  stages  of  the  work  might  turn  out  to  be 
unsatisfactory  in  later  stages.  Therefore  we  can  get  better  results  if  we  don’t 
insist  on  a fixed  ordering.  Instead,  we  can  try  to  tune  up  the  current  order  of 
branching  whenever  a BDD  base  becomes  unwieldy. 

For  example,  we  might  try  to  swap  i o Xj  in  the  order,  for  1 < j < n, 
undoing  the  swap  if  it  increases  the  total  number  of  nodes  but  letting  it  ride 
otherwise;  we  could  keep  this  up  until  no  such  swap  makes  an  improvement. 
That  method  is  easy  to  implement,  but  unfortunately  it’s  too  weak;  it  doesn’t 
give  much  of  a reduction.  A much  better  reordering  technique  was  proposed  by 
Richard  Rudell  at  the  same  time  as  he  introduced  the  swap-in-place  algorithm  of 
exercise  147.  His  method,  called  “sifting,”  has  proved  to  be  quite  successful.  The 
idea  is  simply  to  take  a variable  Xk  and  to  try  jumping  it  up  or  down  to  all  other 
levels  — that  is,  essentially  to  remove  Xk  from  the  ordering  and  then  to  insert  it 
again,  choosing  a place  for  insertion  that  keeps  the  BDD  size  as  small  as  possible. 
All  of  the  necessary  work  can  be  done  with  a sequence  of  elementary  swaps: 

Algorithm  J ( Sifting  a variable).  This  algorithm  moves  variable  Xk  into  an 
optimum  position  with  respect  to  the  current  ordering  of  the  other  variables 
{xi, . . . , Xfc_ i,  Xfc+!, . . . , xn}  in  a given  BDD  base.  It  works  by  repeatedly  calling 
the  procedure  of  exercise  147  to  swap  adjacent  variables  Xj-i  «-»  Xj.  Throughout 
this  algorithm,  S denotes  the  current  size  of  the  BDD  base  (the  total  number  of 
nodes);  the  swapping  operation  usually  changes  S. 

Jl.  [Initialize.]  Set  p <-  0,  j «-  k,  and  s <-  S.  If  k > n/2,  go  to  J5. 

J2.  [Sift  up.]  While  j > 1,  swap  Xj- 1 4->  Xj  and  set  j j - 1,  s t—  min(S,  s). 
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J3.  [End  the  pass.]  If  p = 1,  go  to  J4.  Otherwise,  while  j / k,  set  j j + 1 and 
swap  i Xj ; then  set  p f-  1 and  go  to  J5. 

J4.  [Finish  downward.]  While  s^S,  set  j <- j + 1 and  swap  Xj-i  Xj.  Stop. 

J5.  [Sift  down.]  While  j<n,  set  j <-j  + 1,  swap  Xj-i++Xj,  and  set  s«-min(S,  s). 

J6.  [End  the  pass.]  If  p = 1,  go  to  J7.  Otherwise,  while  j / k , swap  x,j_1  Xj 
and  set  j t—  j — 1;  then  set  p f-  1 and  go  to  J2. 

J7.  [Finish  upward.]  While  s/S,  swap  xJ_1  o Xj  and  set  j <-  j - 1.  Stop.  | 

Whenever  Algorithm  J swaps  Xj_x  the  original  variable  xfc  is  currently 

called  either  Xj_x  or  Xj . The  total  number  of  swaps  varies  from  about  n to 
about  2.5ra,  depending  on  k and  the  optimum  final  position  of  xk.  But  we  can 
improve  the  running  time  substantially,  without  seriously  affecting  the  outcome, 
if  steps  J2  and  J5  are  modified  to  proceed  immediately  to  J3  and  J6,  respectively, 
whenever  S becomes  larger  than,  say,  1.2s  or  even  1.1s  or  even  1.05s.  In  such 
cases,  further  sifting  in  the  same  direction  is  unlikely  to  decrease  s. 

Rudell’s  sifting  procedure  consists  of  applying  Algorithm  J exactly  n times, 
once  for  each  variable  that  is  present;  see  exercise  151.  We  could  continue  sifting 
again  and  again  until  there  is  no  more  improvement;  but  the  additional  gain  is 
usually  not  worth  the  extra  effort. 

Let  s look  at  a detailed  example,  in  order  to  make  these  ideas  concrete. 
We’ve  observed  that  when  the  contiguous  United  States  are  arranged  in  the  order 

ME  NH  VT  MA  RI  CT  NY  NJ  PA  DE  MD  DC  VA  NC  SC  GA  FL  AL  TN  KY  WV  OH  MI  IN 
IL  WI  MN  IA  MO  AR  MS  LA  TX  OK  KS  NE  SD  ND  MT  WY  CO  NM  AZ  UT  ID  WA  OR  NV  CA  (104) 

as  in  (17),  they  lead  to  a BDD  of  size  428  for  the  independent-set  function 

~ '((^AL  A Xfl)  v (Xal  a Xqa)  V (x/u,  A Xfls)  V ■ • ■ V (X(JT  A Xyy)  V (xyA  A Xyy)).  (1O5) 

The  author  chose  the  ordering  (104)  by  hand,  starting  with  the  historical/geo- 
graphical  listing  of  states  that  he  had  been  taught  as  a child,  then  trying  to 
minimize  the  size  of  the  boundary  between  states-already-listed  and  states-to- 
corne,  so  that  the  BDD  for  (105)  would  not  need  to  “remember”  too  many  partial 
results  at  any  level.  The  resulting  size,  428,  is  pretty  good  for  a function  of  49 
variables;  but  sifting  is  able  to  make  it  even  better.  For  example,  consider  WV: 
Some  of  the  possibilities  for  altering  its  position,  with  varying  sizes  S,  are 

j RI | CT | NY | NJ | PA | DE | MD | DC | VA  j NC I SC  I GA I FL  j AL  j TN 1 KY I OH  1 MI  I IN  j IL 1 

424  422  417  415  414  412  411  410  412  412  415  420  421  426  425  427  428  428  436  442  453 

so  we  can  save  428  - 410  = 18  nodes  by  jumping  WV  up  to  a position  between  MD 
and  DC.  By  using  Algorithm  J to  sift  on  all  the  variables  — first  on  ME,  then  on 
NH,  then  . . . , then  on  CA  — we  end  up  with  the  ordering 

VT  MA  ME  NH  CT  RI  NY  NJ  DE  PA  MD  WV  VA  DC  KY  OH  NC  GA  SC  AL  FL  MS  TN  IN 
IL  MI  AR  TX  LA  OK  MO  IA  WI  MN  CO  NE  KS  MT  ND  WY  SD  UT  AZ  NM  ID  CA  OR  WA  NV  (106) 

and  the  BDD  size  has  been  reduced  to  345(1).  That  sifting  process  involves  a 
total  of  4663  swaps,  requiring  less  than  4 megamems  of  computation  altogether. 
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Instead  of  choosing  an  ordering  carefully,  let’s  consider  a lazier  alternative: 
We  might  begin  with  the  states  in  alphabetic  order 

AL  AR  AZ  CA  CO  CT  DC  DE  FL  GA  IA  ID  IL  IN  KS  KY  LA  MA  MD  ME  MI  MN  MO  MS 
MT  NC  ND  NE  NH  NJ  NM  NV  NY  OH  OK  OR  PA  RI  SC  SD  TN  TX  UT  VA  VT  WA  WI  WV  WY 

and  proceed  from  there.  Then  the  BDD  for  (105)  turns  out  to  have  306,214 
nodes;  it  can  be  computed  either  via  Algorithm  S (with  about  380  megamems  of 
machine  time)  or  via  (55)  and  Algorithm  U (with  about  565  megamems).  In  this 
case  sifting  makes  a dramatic  difference:  Those  306,214  nodes  become  only  2871, 
at  a cost  of  430  additional  megamems.  Furthermore,  the  sifting  cost  goes  down 
from  430  Mp  to  210  M/x  if  the  loops  of  Algorithm  J are  aborted  when  S > 1.1s. 
(The  more  radical  choice  of  aborting  when  S > 1.05s  would  reduce  the  cost  of 
sifting  to  155  M/r;  but  the  BDD  size  would  be  reduced  only  to  2946  in  that  case.) 

And  we  can  actually  do  much,  much  better,  if  we  sift  the  variables  while  eval- 
uating (105),  instead  of  waiting  until  that  whole  long  sequence  of  disjunctions  has 
been  entirely  computed.  For  example,  suppose  we  invoke  sifting  automatically 
whenever  the  BDD  size  surpasses  twice  the  number  of  nodes  that  were  present 
after  the  previous  sift.  Then  the  evaluation  of  (105),  starting  from  the  alphabetic 
ordering  (107),  runs  like  a breeze:  It  automatically  churns  out  a BDD  that  has 
only  419  nodes,  after  only  about  60  megamems  of  calculation!  Neither  human 
ingenuity  nor  “geometric  understanding”  are  needed  to  discover  the  ordering 

NV  OR  ID  WA  AZ  CA  UT  NM  WY  CO  MT  OK  TX  NE  M0  KS  LA  AR  MS  TN  IA  ND  MN  SD 
GA  FL  AL  NC  SC  KY  WI  MI  IL  OH  IN  WV  MD  VA  DC  PA  NJ  DE  NY  CT  RI  NH  ME  VT  MA  ''1°  ' 

which  beats  the  author’s  (104).  For  this  one,  the  computer  just  decided  to  invoke 
autosifting  39  times,  on  smaller  BDDs. 

What  is  the  best  ordering  of  states  for  the  function  (105)?  The  answer  to 
that  question  will  probably  never  be  known  for  sure,  but  we  can  make  a pretty 
good  guess.  First  of  all,  a few  more  sifts  of  (108)  will  yield  a still-better  ordering 

OR  ID  NV  WA  AZ  CA  UT  NM  WY  CO  MT  SD  MN  ND  IA  NE  OK  KS  TX  M0  LA  AR  MS  TN 
GA  FL  AL  NC  SC  KY  WI  MI  IL  OH  IN  WV  MD  DC  VA  PA  NJ  DE  NY  CT  RI  NH  ME  VT  MA  '109’ 

with  BDD  size  354.  Sifting  will  not  improve  (109)  further;  but  sifting  has  only 
limited  power,  because  it  explores  only  (n  - l)2  alternative  orderings,  out  of 
n!  possibilities.  (Indeed,  exercise  134  exhibits  a function  of  only  four  variables 
whose  BDD  cannot  be  improved  by  sifting,  even  though  the  ordering  of  its 
variables  is  not  optimum.)  There  is,  however,  another  arrow  in  our  quiver:  We 
can  use  master  profile  charts  to  optimize  every  window  of,  say,  16  consecutive 
levels  in  the  BDD.  There  are  34  such  windows;  and  the  algorithm  of  exercise  139 
optimizes  each  of  them  rather  quickly.  After  about  9.6  gigamems  of  computation, 
that  algorithm  discovers  a new  champion 

OR  ID  NV  WA  AZ  CA  UT  NM  WY  CO  MT  SD  MN  ND  IA  NE  OK  KS  TX  M0  LA  AR  MS  WI 
KY  MI  IN  IL  AL  TN  FL  NC  SC  GA  WV  OH  MD  DC  VA  PA  NJ  DE  NY  CT  RI  NH  ME  VT  MA 


(110) 


by  cleverly  rearranging  16  states  within  (109).  This  ordering,  for  which  the  BDD 
size  is  only  339,  might  well  be  optimum,  because  it  cannot  be  improved  either 
by  sifting  or  by  optimizing  any  window  of  width  25.  However,  such  a conjecture 


246  COMBINATORIAL  SEARCHING 


7.1.4 


rests  on  shaky  ground:  The  ordering 

AL  GA  FL  TN  NC  SC  VA  MS  AR  TX  LA  OK  KY  MO  NM  WV  MD  DC  PA  NJ  DE  OH  IL  MI 
IN  IA  NE  KS  WI  SD  WY  ND  MN  MT  UT  CO  ID  CA  AZ  OR  WA  NV  NY  CT  RI  NH  ME  VT  MA  t111) 

also  happens  to  be  unimprovable  by  sifting  and  by  width-25  window  optimiza- 
tion, yet  its  BDD  has  606  nodes  and  is  far  from  optimum. 

With  the  improved  ordering  (no),  the  98- variable  COLOR  function  of  (73) 
needs  only  22037  BDD  nodes,  instead  of  25579.  Sifting  reduces  it  to  16098. 

*Read-once  functions.  Boolean  functions  such  as  (xx  D x2)  © ((x3  = x4)  A £5), 
which  can  be  expressed  as  formulas  in  which  each  variable  occurs  exactly  once, 
form  an  important  class  for  which  optimum  orderings  of  variables  can  easily  be 
computed.  Formally,  let  us  say  that  /(x  1, . . . , xn)  is  a read- once  function  if  either 
(i)  n = 1 and  f(xx)  = xx;  or  (ii)  /(xx,  ...,xn)  = a(xx,  ...,xk)o  h(xk+1, ...,  xn), 
where  o is  one  of  the  binary  operators  {A,  V,  A,  V,  D,  C,  D,  C,  ©,  =}  and  where 
both  g and  h are  read-once  functions.  In  case  (i)  we  obviously  have  £?(/)  = 3. 
And  in  case  (ii),  exercise  163  proves  that 

B(f)  = iB(9)  + B(h)-2,  ifoe  {A,V,A,V,D,C,D,C}; 

l B{g)  + B(h,h)  - 2,  ifoe{©,s}.  (ll2) 

In  order  to  get  a recurrence,  we  also  need  the  similar  formulas 

( 4,  if  n = 1; 

BifJ)  = < 2B(g)  + B(h,h)  - 4,  if  o <E  {A,  V,  A,  V,  3,  C,  5,  C};  (113) 

{ B(g,g)  + B(h,h)  -2,  if  oe  {©,=}. 

A particularly  interesting  family  of  read-once  functions  arises  when  we  define 
U™+1  (*1  > • • • . X2™+1  )=vm(xi,...,  x2>» ) A vm  (x2m+1 , . . . , X2m  + 1 ) , 

%+l  (*1 , • . • 1 ) = (^1 , . . • I *2”  ) ffi  (l2™+l , . . . , l2m+l  ),  ^ 

and  uq (xi ) = v0(Xl)  = ari;  for  example,  u3(x1( . . . , a*)  = ((xx  Ai2)  0 (x3  Ax4))  A 
((x5Ax6)0(x7Ax8)).  Exercise  165  shows  that  the  BDD  sizes  for  these  functions, 
calculated  via  (112)  and  (113),  involve  Fibonacci  numbers: 

B{u2m)  - 2mF2m+2  + 2,  B(u2m+1)  = 2 m+1F2m+2  + 2; 

B(v2m)  = 2mF2m+2  + 2,  B(y2rn+x)  = 2mF2m+4  + 2.  (ll5) 

Thus  um  and  vm  are  functions  of  ti  = 2m  variables  whose  BDD  sizes  grow  as 

Q(2m/2(f>m)  — @(n0),  where /3  = 1/2  + lg0  w 1.19424.  (116) 

In  fact,  the  BDD  sizes  in  (115)  are  optimum  for  the  u and  v functions, 
under  all  permutations  of  the  variables,  because  of  a fundamental  result  due  to 
M.  Sauerhoff,  I.  Wegener,  and  R.  Werchner: 

Theorem  W.  If  f{xu ...  ,xn)  = g(xu  . . . ,xk)  o h(xk+l, . . . ,xn)  is  a read- 
once  function,  there  is  a permutation  n that  minimizes  B(f1T)  and  B(fn,fn ) 
simultaneously,  and  in  which  the  variables  { x 1 , . . . ,xk}  occur  either  first  or  last. 
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Proof.  Any  permutation  (l7r, . . . , rnr)  leads  naturally  to  an  “unshufiled”  per- 
mutation (lcr, . . . , no)  in  which  the  first  k elements  are  {1, . . . , k}  and  the  last 
n — k elements  are  {A:  + 1, . . . , n},  retaining  the  7r  order  within  each  group.  For 
example,  if  fc  = 7,  n = 9,  and  (l7r, . . . , 97r)  = (3, 1, 4, 5, 9, 2, 6,  8, 7),  we  have 
(lcr, . . . , 9cr ) = (3, 1, 4,  5,  2,  6,  7, 9,  8).  Exercise  166  proves  that,  in  appropriate 
circumstances,  we  have  B(fa)  < B(fn)  and  B(fa,fIT)  < i?(/7r,/7r).  | 

Using  this  theorem  together  with  (112)  and  (113),  we  can  readily  optimize 
the  ordering  of  variables  for  the  BDD  of  any  given  read-once  function.  Consider, 
for  example,  (xiVx2)©(x3Ax4Ax5)  = <?(xi,  X2)©h(x3,  X4,  X5).  We  have  B(g)  = 4 
and  B(g,  g)  = 6;  B(h)  — 5 and  B(h , h)  = 8.  For  the  overall  formula  f — g ® h, 
Theorem  W says  that  there  are  two  candidates  for  a best  ordering  (l7r, . . . , 57r), 
namely  (1,2,3,  4,5)  and  (4,  5,1,2,  3).  The  first  of  these  gives  B(fn)  — B(g)  + 
B{h , h)  — 2 = 10;  the  other  one  excels,  with  B(fn)  — B(h)  + B(g , g)  — 2 = 9. 

The  algorithm  in  exercise  167  finds  an  optimum  n for  any  read-once  function 
f(xi, . . . , x„)  in  O(n)  steps.  Moreover,  a careful  analysis  proves  that  B(f'K)  — 
0(nP)  in  the  best  ordering,  where  /?  is  the  constant  in  (116).  (See  exercise  168.) 

*Multiplication.  Some  of  the  most  interesting  Boolean  functions,  from  a math- 
ematical standpoint,  are  the  m + n bits  that  arise  when  an  m-bit  number  is 
multiplied  by  an  n-bit  number: 

(xm  ..  .X2Xi)2  X (yn  ..  ■ 2/22/1)2  = {Zm+n  ■ ■ ■ Z2Zi)2-  (117) 

In  particular,  the  “leading  bit”  zTO+n,  and  the  “middle  bit”  zn  when  m = n,  are 
especially  noteworthy.  To  remove  the  dependence  of  this  notation  on  m and  n, 
we  can  imagine  that  m — n = 00  by  letting  x*  = yj  — 0 for  all  i > m and  j > n; 
then  each  Zk  is  a function  of  2k  variables,  z*,  — Zk{x\, . . . , Xk\ yi,  ■ ■ ■ , 2/fc),  namely 
the  middle  bit  of  the  product  (x^  . . . xx)2  x ( yk  ■ ■ ■ Vi)?- 

The  middle  bit  turns  out  to  be  difficult,  BDDwise,  even  when  y is  constant. 
Let  Zn<a(x  1, . . . ,x„)  = Zn(xi,. . . ,xn;  ai, . . . ,a„),  where  a = (an  . . .ai)2. 

Theorem  X.  There  is  a constant  a such  that  Bmm(Znta ) > • 2 L”/2-!  — 2. 

Proof.  [P.  Woelfel,  J.  Computer  and  System  Sci.  71  (2005),  520-534.]  We  may 

assume  that  n — 2t  is  even,  since  i?2t+i,2a  = Z2t, a-  Let  x = (xn...xi)2  and 
m — (\mr  < f]  . . . [In  < t])2 - Then  x = p + q,  where  q — x & m represents  the 
“known”  bits  of  x after  t branches  have  been  taken  in  a BDD  for  Zn  a with  the 
ordering  n,  and  p = x & fn  represents  the  bits  yet  unknown.  Let 

P = {x  &;  m | 0 < x < 2"}  and  Q — {x  & m \ 0 < x < 2"}.  (118) 

For  any  fixed  a,  the  function  ZnA  has  2l  subfunctions 

fgip)  = {(pa  + qa)  » (n  - 1))  & 1,  q&Q.  (119) 

We  want  to  show  that  some  n-bit  number  a will  make  many  of  these  subfunctions 
differ;  in  other  words  we  want  to  find  a large  subset  Q*  C Q such  that 

q G Q*  and  q'  G Q*  and  q / q'  implies  fq(p)  / fq'{p)  for  some  p G P.  (120) 
Exercise  176  shows  in  detail  how  this  can  be  done.  | 
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Table  1 

BEST  AND  WORST  ORDERINGS  FOR  THE  MIDDLE  BIT  zn  OF  MULTIPLICATION 


®na;iocC9a;7X8a;6a;i3a;i5 
X Xi6Xi4Xl2XsX4X3X2Xi 
Bmin(Zs)  = 756 

X24X20XlsXl6XgXSX10XliX7Xi2Xl4X2l 

X X22X1gX17Xl5X6X5X4X3X2XiXl3X23 
■Bmin  (Z12)  = 21931 


Xi0XliXgX8X7Xi6X6Xl5 
X XsX4X3Xl2X13X2XlX14 
^max  (Z8)  = 6791 

Xl6Xl7XlSXl4X24X13X12Xi1X2oX1oXgX23 

x X8a:7a:6a;5a;i8a:4*22a:3a:2Xi9a:ia;2i 
^max  (Z  i2)  = 866283 


Table  2 

BEST  AND  WORST  ORDERINGS  FOR  ALL  BITS  {zu . . . , zm+n}  OF  MULTIPLICATION 


xn  £16X15X14X13X12X10X9 
X £8a:7£6a:5£4a;3a;2a:i 
Bmin(Zg  g , . . . , Zg  g ^ ) = 9700 

Xl5XnX24X23X22X21X20Xl9XisX1eXl4Xl3 
X X1X2X3X4X5XeX7X8XgXl0X11X12 

Bmin(Z12^12,  • ■ • , ^12,12)  = 648957 

£i7£i6£io£9£ii£i2  • ■ ■ Xi5£i8£i9£24£23  • • . £20 
X X\X2X3X4X5X8X7X8 
7knin(Zj6^g,  • • • , Z^g  g ) = 157061 


xioX8£9Xi3£2£i£n£7 

x xiex5xi5xex4xi4x3xi2 
Bmax(Zg*g, . . . , zffi)  = 28678 

Xl7X22X14X13XieX10X20X3X2XiXl9Xi2 
x X24Xi&X9X8X2iX7XeXiiX23X5X4Xl8 
Bmax(Z i2^i2>  • ■ • , ^12^12)  = 4224195 

x 13X14X12X15X13X17X22X10X3X7X13X9X2X1X19X6 
x X24£ii£2i£5X4£23£3£20 
BmaxiZ^g  g, . . . , z[g4g)  = 1236251 


A good  upper  bound  for  the  BDD  size  of  the  middle  bit  function  when 
neither  operand  is  constant  has  been  found  by  K.  Amano  and  A.  Maruoka, 
Discrete  Applied  Math.  155  (2007),  1224-1232: 

Theorem  A.  Let  f(xu  . . . , x2n)  = Zn(Xl,x3, . . . , x2n_i;  x2,  x4, . . . , x2n).  Then 

B(f)  < Q(f)  < ^2rw5i.  (l2l) 

Proof.  Consider  two  n-bit  numbers  x = 2kxh  + xt  and  y = 2 kyh  + yu  with  n-k 
unknown  bits  in  each  of  their  high  parts  ( xh,yh ),  while  their  fc-bit  low  parts 
( xUVi ) are  both  known.  Then  the  middle  bit  of  xy  is  determined  by  adding 
together  three  (n  — A;)-bit  quantities  when  k > n/2,  namely  xkyi  mod2n~k , 
xtyh  mod  2n~k,  and  ( xm  > k)  mod  2n~k . Hence  level  2k  of  the  QDD  needs  to 
remember  only  the  least  significant  n — k bits  of  each  of  the  prior  quantities 
XU  Vu  and  xiyi  3>  k,  a total  of  3n  — 3k  bits,  and  we  have  q2k  < 23"-3fc  in  /’s 
quasi-profile.  Exercise  177  completes  the  proof.  | 

^ Amano  and  Maruoka  also  discovered  another  important  upper  bound.  Let 
Zm,n{xi i • • ■ i xm  \ Vi > • • • > 2/n)  denote  the  pth  bit  zp  of  the  product  (117). 

Theorem  Y.  For  all  constants  (am  . . . ai)2  and  for  all  p,  the  BDD  and  QDD 
for  the  function  Zm]n(ai, . . . , am;  xx, . . . , xn)  have  fewer  than  3 • 2n/2  nodes. 

Proof  Exercise  180  proves  that  qk  < 2n+1~k  for  this  function.  The  theorem 
follows  when  we  combine  that  result  with  the  obvious  upper  bound  qk  < 2k . | 
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Theorem  Y shows  that  the  lower  bound  of  Theorem  X is  best  possible,  except 
for  a constant  factor.  It  also  shows  that  the  BDD  base  for  all  m + n product 
functions  Zm]n{x i, . . . , xm;  xm+i, . . . , xm+n)  is  not  nearly  as  large  as  ©(2m+n), 
which  we  get  for  almost  all  instances  of  m + n functions  ofm+n  variables: 

Corollary  Y.  If  m < n,  B(Z{^n, Z^n))  < 3 (m  + n)2m+(”+1)/2.  | 

The  best  orderings  of  variables  for  the  middle-bit  function  Zn  and  for  the 
complete  BDD  base  remain  mysterious,  but  empirical  results  for  small  m and  n 
give  reason  to  conjecture  that  the  upper  bounds  of  Theorem  A and  Corollary  Y 
are  not  far  from  the  truth;  see  Tables  1 and  2.  Here,  for  example,  are  the 
optimum  results  of  Zn  when  n < 12: 

n = 1 2 3 4 5 6 7 8 9 10  11  12 

Bmin(Zn)  = 4 8 14  31  63  136  315  756  1717  4026  9654  21931 

26"/5  « 2 5 12  28  64  147  338  776  1783  4096  9410  21619 

The  ratios  Bmax/Bmin  with  respect  to  the  full  BDD  base  {■Z’m.n, . . . , Z^nU'> } 
are  surprisingly  small  in  Table  2.  Therefore  all  orderings  for  that  problem  might 
turn  out  to  be  roughly  equivalent. 


Zero-suppressed  BDDs:  A combinatorial  alternative.  When  BDDs  are 
applied  to  combinatorial  problems,  a glance  at  the  data  in  memory  often  reveals 
that  most  of  the  HI  fields  simply  point  to  (X).  In  such  cases,  we’re  better 
off  using  a variant  data  structure  called  a zero-suppressed,  binary  decision  dia- 
gram, or  “ZDD”  for  short,  introduced  by  Shin-ichi  Minato  [ACM/IEEE  Design 
Automation  Conf.  30  (1993),  272-277].  A ZDD  has  nodes  like  a BDD,  but  its 
nodes  are  interpreted  differently:  When  an  (T)  node  branches  to  a (J)  node  for 
j > j+1,  it  means  that  the  Boolean  function  is  false  unless  aq+i  = • • • = Xj_ i = 0. 

For  example,  the  BDDs  for  independent  sets  and  kernels  in  (12)  have  many 
nodes  with  HI  = [X] . Those  nodes  go  away  in  the  corresponding  ZDDs,  although 
a few  new  nodes  must  also  be  added: 


Notice  that  we  might  have  LO  = HI  in  a ZDD,  because  of  the  new  conventions. 
Furthermore,  the  example  on  the  left  shows  that  a ZDD  need  not  contain  [X]  at 
all!  About  40%  of  the  nodes  in  (12)  have  been  eliminated  from  each  diagram. 
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One  good  way  to  understand  a ZDD  is  to  regard  it  as  a condensed  repre- 
sentation of  a family  of  sets.  Indeed,  the  ZDDs  in  (122)  represent  respectively 
the  families  of  all  independent  sets  and  all  kernels  of  C6.  The  root  node  of  a 
ZDD  names  the  smallest  element  that  appears  in  at  least  one  of  the  sets;  its  HI 
and  LO  branches  represent  the  residual  subfamilies  that  do  and  don’t  contain  that 
element;  and  so  on.  At  the  bottom,  [X]  represents  the  empty  family  ‘0’,  and  Ff] 
represents  ‘{0}’.  For  example,  the  rightmost  ZDD  in  (122)  represents  the  fam- 
ily {{1,3,5},  {1,4},  {2,4,6},  {2,5},  {3,6}},  because  the  HI  branch  of  the  root 
represents  {{3,5},  {4}}  and  the  LO  branch  represents  {{2,4,6},  {2,5},  {3,6}}. 

Every  Boolean  function  f(x is,  of  course,  equivalent  to  a fam- 
ily of  subsets  of  {1, . . . , n},  and  vice  versa.  But  the  family  concept  gives  us 
a different  perspective  from  the  function  concept.  For  example,  the  family 
{ { 1 ’ 31 1 {2},  {2,  5}}  has  the  same  ZDD  for  all  n > 5;  but  if,  say,  n = 7.  the 
BDD  for  the  function  f{xu...,x7)  that  defines  this  family  needs  additional 
nodes  to  ensure  that  xA  = x6  = x7  = 0 when  f(x)  = 1. 

Almost  every  notion  that  we’ve  discussed  for  BDDs  has  a counterpart  in  the 
theory  of  ZDDs,  although  the  actual  data  structures  are  often  strikingly  different. 
We  can,  for  example,  take  the  truth  table  for  any  given  function  /(aq, . . . , xn)  and 
construct  its  unique  ZDD  in  a straightforward  way,  analogous  to  the  construction 
of  its  BDD  as  illustrated  in  (5).  We  know  that  the  BDD  nodes  for  / correspond 
to  the  beads  of  /’ s truth  table;  the  ZDD  nodes,  similarly,  correspond  to  zeads , 
which  are  binary  strings  of  the  form  a/3  with  |a|  = |/3|  and  /?  / 0 . . . 0,  or  with 
M = 1^1  ~ Any  binary  string  corresponds  to  a unique  zead,  obtained  by 
lopping  off  the  right  half  repeatedly,  if  necessary,  until  the  string  either  has  odd 
length  or  its  right  half  is  nonzero. 

Dear  reader,  please  take  a moment  now  to  work  exercise  187.  (Really.) 

The  z-profile  of  /(aq, . . . , xn)  is  (z0, . . . , zn),  where  Zk  is  the  number  of  zeads 
of  order  n-k  in  /’s  truth  table,  for  0 < k < n,  namely  the  number  of  (A+l)  nodes 

m the  ZDD;  also  zn  is  the  number  of  sinks.  We  write  Z(f)  = z0 -\ \-  zn  for 

the  total  number  of  nodes.  For  example,  the  functions  in  (122)  have  z-profiles 
(1,1, 2, 2,  2,1, 1)  and  (1, 1,2,2, 1, 1,2),  respectively,  so  Z(f)  = 10  in  each  case. 

The  basic  relations  (83)-(8s)  between  profiles  and  quasi-profiles  hold  true 
also  for  z-profiles,  but  with  qk  counting  only  nonzero  subtables  of  order  n — k: 

Qk  — zk,  for  0 < A < n;  (123) 

q'k  < 1 + *o + • ••+**_!  and  q'k  < z*  + • • • + z„,  for  0 < A-  < n\  (124) 
ZU)  > 2q'k  - 1,  for  0 < k < n.  (125) 

Consequently  the  BDD  size  and  the  ZDD  size  can  never  be  wildly  different: 

zif)  < (£(/)  + 1)  + 1 and  B(f)  < ~(Z(f)  + l)  +2.  (126) 

On  the  other  hand,  a factor  of  50  when  n = 100  is  nothing  to  sneeze  at. 

When  ZDDs  are  used  to  find  independent  sets  and  kernels  of  the  contiguous 
USA,  using  the  original  order  of  (17),  the  BDD  sizes  of  428  and  780  go  down  to 
177  and  385,  respectively.  Sifting  reduces  these  ZDD  sizes  to  160  and  335.  Is  any- 
body sneezing?  That’s  amazingly  good,  for  complicated  functions  of  49  variables. 
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When  we  know  the  ZDDs  for  / and  g , we  can  synthesize  them  to  obtain 
the  ZDDs  for  / A g,  f V 9,  f 0 <?,  etc.,  using  algorithms  that  are  very  much  like 
the  methods  we’ve  used  for  BDDs.  Furthermore  we  can  count  and/or  optimize 
the  solutions  of  /,  with  analogs  of  Algorithms  C and  B;  in  fact,  ZDD-based 
techniques  for  counting  and  optimization  turn  out  to  be  a bit  easier  than  the 
corresponding  BDD-based  algorithms  are.  With  slight  modifications  of  BDD 
methods,  we  can  also  do  dynamic  variable  reordering  via  sifting.  Exercises  197- 
209  discuss  the  nuts  and  bolts  of  all  the  basic  ZDD  procedures. 

In  general,  a ZDD  tends  to  be  better  than  a BDD  when  we’re  dealing  with 
functions  whose  solutions  are  sparse,  in  the  sense  that  vx  tends  to  be  small 
when  f(x)  — 1.  And  if  f(x)  itself  happens  to  be  sparse,  in  the  sense  that  it  has 
comparatively  few  solutions,  so  much  the  better. 

For  example,  ZDDs  are  well  suited  to  exact  cover  problems,  defined  by  an 
m x n matrix  of  Os  and  Is:  We  want  to  find  all  ways  to  choose  rows  that  sum  to 
(1,1,...,!).  Our  goal  might  be,  say,  to  cover  a chessboard  with  32  dominoes,  like 


This  is  an  exact  cover  problem  whose  matrix  has  8 x 8 = 64  columns,  one  for 
each  cell;  there  are  2x7x8=112  rows,  one  for  each  pair  of  adjacent  cells: 

/II  0 0 00000000. .. 0000000000  0\ 

1 0000000  1 000. ..00000000000 
01  1000000000. ..00000000000 
0 1 0000000  1 00. ..00000000000 


000000  000000...  0000000  1 1 00 
000000000000. ..000000001  10 
VOOOOOOOOOOOO... 000000000  1 1/ 


Let  variable  Xj  represent  the  choice  (or  not)  of  row  j.  Thus  the  three  so- 
lutions in  (127)  have  (aq,  x2,  x3,  a:4, . . . , aq10,  aqn,  aq12)  = (1,0, 0,0, . . . , 1,0, 1), 
(1, 0, 0, 0, . . . , 1, 0, 1),  and  (0, 1, 0, 1, . . . , 1, 0, 0),  respectively.  In  general,  the  so- 
lutions to  an  exact  cover  problem  are  represented  by  the  function 

n n 

f(x1,...,xm)  = /\  Si(X,)  = y\  [vX]  = 1],  (129) 

j=i  i=i 

where  Xj  — {xi  | = 1}  and  (a^)  is  the  given  matrix. 

The  dominoes-on-a-chessboard  ZDD  turns  out  to  have  only  Z(f)  = 2300 
nodes,  even  though  / has  m — 112  variables  in  this  case.  We  can  use  it  to  prove 
that  there  are  exactly  12,988,816  coverings  such  as  (127). 
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Similarly,  we  can  investigate  more  exotic  kinds  of  covering.  In 


(130) 


for  instance,  a chessboard  has  been  covered  with  monominoes,  dominoes,  and/or 
trommoes  — that  is,  with  rookwise-connected  pieces  that  each  have  either  one, 
two,  or  three  cells.  There  are  exactly  92,109,458,286,284,989,468,604  ways  to 
do  this( ! ) ; and  we  can  compute  that  number  almost  instantly,  doing  only  about 
75  megamems  of  calculation,  by  forming  a ZDD  of  size  512,227  on  468  variables. 

A special  algorithm  could  be  devised  to  find  the  ZDD  for  any  given  exact 
cover  problem;  or  we  can  synthesize  the  result  using  (129).  See  exercise  212. 

Incidentally,  the  problem  of  domino  covering  as  in  (127)  is  equivalent  to 
finding  the  perfect  matchings  of  the  grid  graph  P8  a Ps,  which  is  bipartite.  We 
will  see  m Section  7.5.1  that  efficient  algorithms  are  available  by  which  perfect 
matchings  can  be  studied  on  graphs  that  are  far  too  large  to  be  treated  with 
BDD/ZDD  techniques.  In  fact,  there’s  even  an  explicit  formula  for  the  number 
of  domino  coverings  of  an  m x n grid.  By  contrast,  general  coverings  such  as 
(130)  fall  into  a wider  category  of  hypergraph  problems  for  which  polynomial- 
time methods  are  unlikely  to  exist  as  m,  n ->  00. 

An  amusing  variant  of  domino  covering  called  the  “mutilated 
chessboard”  was  considered  by  Max  Black  in  his  book  Critical 
Thinking  (1946),  pages  142  and  394:  Suppose  we  remove  opposite 
corners  of  the  chessboard,  and  try  to  cover  the  remaining  cells 
with  31  dominoes.  It’s  easy  to  place  30  of  them,  for  example 
as  shown  here;  but  then  we’re  stuck.  Indeed,  if  we  consider  the 
corresponding  108  x 62  exact  cover  problem,  but  leave  out  the  last 
two  constraints  of  (129),  we  obtain  a ZDD  with  1224  nodes  from  which  we  can 

deduce  that  there  are  324,480  ways  to  choose  rows  that  sum  to  (1, 1 1, 1,  *,  *) 

But  each  of  those  solutions  has  at  least  two  Is  in  column  61;  therefore  the  ZDD 
reduces  to  [T]  after  we  AND  in  the  constraint  [vX61  = lj.  (“Critical  thinking” 
explains  why;  see  exercise  213.)  This  example  reminds  us  that  (i)  the  size  of  the 
final  ZDD  or  BDD  in  a calculation  can  be  much  smaller  than  the  time  needed 
to  compute  it;  and  (ii)  using  our  brains  can  save  oodles  of  computer  cycles. 

ZDDs  as  dictionaries.  Let’s  switch  gears  now,  to  note  that  ZDDs  are  advanta- 
geous also  in  applications  that  have  an  entirely  different  flavor.  We  can  use  them, 
for  instance,  to  represent  the  five-letter  words  of  English , the  set  W0RDS(5757) 
bom  the  Stanford  GraphBase  that  is  discussed  near  the  beginning  of  this  chapter. 
One  way  to  do  this  is  to  consider  the  function  f(x  1, . . .,x25)  that  is  defined  to 
be  1 if  and  only  if  the  five  numbers  (an  . . .x5)2,  (x6  . ..x10)2,  . . . , {x21 . . .x25)2 
encode  the  letters  of  an  English  word,  where  a = (00001)2,  . . . , z = (11010)2. 
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For  example,  /(0, 0, 1, 1, 1, 0, 1, 1, 1, 1, 0, 1, 1, 1, 1, 0, 0, 1, 1, 0, 1, 1, 0,0,x25)  = x25. 
This  function  of  25  variables  has  Z(f)  = 6233  nodes  — which  isn’t  bad,  since  it 
represents  5757  words. 

Of  course  we’ve  studied  many  other  ways  to  represent  5757  words,  in  Chap- 
ter 6.  The  ZDD  approach  is  no  match  for  binary  trees  or  tries  or  hash  tables, 
when  we  merely  want  to  do  simple  searches.  But  with  ZDDs  we  can  also  retrieve 
data  that  is  only  partially  specified,  or  data  that  is  only  supposed  to  match  a 
key  approximately;  many  complex  queries  can  be  handled  with  ease. 

Furthermore,  we  don’t  need  to  worry  very  much  about  having  lots  of  vari- 
ables when  ZDDs  are  being  used.  Instead  of  working  with  the  25  variables  xj 
considered  above,  we  can  also  represent  those  five-letter  words  as  a sparse  func- 
tion F(ai,. . . , z\,  a2, . . • , z2, . . . , a5, . . . , z5)  that  has  26x5  = 130  variables,  where 
variable  a 2 (for  example)  controls  whether  the  second  letter  is  ‘a’.  To  indicate 
that  crazy  is  a word,  we  make  F true  when  Ci  — r2  = 03  = Z4  = y$  — 1 and 
all  other  variables  are  0.  Equivalently,  we  consider  F to  be  a family  consisting 
of  the  5757  subsets  {w1;h2,  3.3,  c4,h5},  {ti, h2,  e3,  r4,  e5},  etc.  With  these  130 
variables  the  ZDD  size  Z(F)  turns  out  to  be  only  5020  instead  of  6233. 

Incidentally,  B(F)  is  46,189  — more  than  nine  times  as  large  as  Z(F).  But 
B(f)/Z(f)  is  only  8870/6233  « 1.4  in  the  25-variable  case.  The  ZDD  world  is 
different  from  the  BDD  world  in  many  ways,  in  spite  of  having  similar  algorithms 
and  a similar  theory. 

One  consequence  of  this  difference  is  a need  for  new  primitive  operations  by 
which  complex  families  of  subsets  can  readily  be  constructed  from  elementary 
families.  Notice  that  the  simple  subset  { f 1 , u2 , n3 , n4 , ys  } is  actually  an  extremely 
long-winded  Boolean  function: 

5i  A • • ■ A ei  A /1  A A ■ ■ ■ A f2  A u2  A »2  A • ■ • A A 3/5  A z5,  (131) 

a minterm  of  130  Boolean  variables.  Exercise  203  discusses  an  important  family 
algebra , by  which  that  subset  is  expressed  more  naturally  as  ‘f  iUu2Un3Un4Uy5’. 
With  family  algebra  we  can  readily  describe  and  compute  many  interesting 
collections  of  words  and  word  fragments  (see  exercise  222). 

ZDDs  to  represent  simple  paths.  An  important  connection  between  arbi- 
trary directed,  acyclic  graphs  (dags)  and  a special  class  of  ZDDs  is  illustrated  in 
Fig.  28.  When  every  source  vertex  of  the  dag  has  out-degree  1 and  every  sink 
vertex  has  in-degree  1,  the  ZDD  for  all  oriented  paths  from  a source  to  a sink 
has  essentially  the  same  “shape”  as  the  original  dag.  The  variables  in  this  ZDD 
are  the  arcs  of  the  dag,  in  a suitable  topological  order.  (See  exercise  224.) 


Fig.  28.  A dag,  and  the  ZDD  for  its 
source-to-sink  paths.  Arcs  of  the  dag 
correspond  to  vertices  of  the  ZDD.  All 
branches  to  [X]  have  been  omitted  from 
the  ZDD  in  order  to  show  the  structural 
similarities  more  clearly. 
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We  can  also  use  ZDDs  to  represent  simple  paths  in  an  undirected  graph. 
For  example,  there  are  12  ways  to  go  from  the  upper  left  corner  of  a 3 x 3 
grid  to  the  lower  right  corner,  without  visiting  any  point  twice: 


These  paths  can  be  represented  by  the  ZDD  shown  at  the  right,  which  charac- 
terizes all  sets  of  suitable  edges.  For  example,  we  get  the  first  path  by  taking 
the  HI  branches  at  @,  (36),  (68),  and  (89)  of  the  ZDD.  (As  in  Fig.  28, 
this  diagram  has  been  simplified  by  omitting  all  of  the  uninteresting 
LO  branches  that  merely  go  to  [X].)  Of  course  this  ZDD  isn’t  a truly 
great  way  to  represent  (132),  because  that  family  of  paths  has  only  12 
members.  But  on  the  larger  grid  P8  aP8,  the  number  of  simple  paths 
from  corner  to  corner  turns  out  to  be  789,360,053,252;  and  they  can  all 
be  represented  by  a ZDD  that  has  at  most  33580  nodes.  Exercise  225 
explains  how  to  construct  such  a ZDD  quickly. 

A similar  algorithm,  discussed  in  exercise  226,  constructs  a ZDD 
that  represents  all  cycles  of  a given  graph.  With  a ZDD  of  size  22275, 
we  can  deduce  that  P8aP8  has  exactly  603,841,648,931  simple  cycles. 

This  ZDD  may  well  provide  the  best  way  to  represent  all  of  those  cycles  within 
a computer,  and  the  best  way  to  generate  them  systematically  if  desired. 

The  same  ideas  work  well  with  graphs  from  the  “real  world”  that  don’t 
have  a neat  mathematical  structure.  For  example,  we  can  use  them  to  answer 
a question  posed  to  the  author  in  2008  by  Randal  Bryant:  “Suppose  I wanted 
to  take  a driving  tour  of  the  Continental  U.S.,  visiting  all  of  the  state  capitols, 
and  passing  through  each  state  only  once.  What  route  should  I take  to  minimize 
the  total  distance?”  The  following  diagram  shows  the  shortest  distances  between 
neighboring  capital  cities,  when  restricted  to  local  itineraries  that  each  cross  only 
one  state  boundary: 


The  problem  is  to  choose  a subset  of  these  edges  that  form  a Hamiltonian  path 
of  smallest  total  length. 
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Every  Hamiltonian  path  in  this  graph  must  clearly  either  start  or  end 
at  Augusta,  Maine  (ME).  Suppose  we  start  in  Sacramento,  California  (CA). 
Proceeding  as  above,  we  can  find  a ZDD  that  characterizes  all  paths  from  CA 
to  ME;  this  ZDD  turns  out  to  have  only  7850  nodes,  and  it  quickly  tells  us  that 
exactly  437,525,772,584  simple  paths  from  CA  to  ME  are  possible.  In  fact,  the 
generating  function  by  number  of  edges  turns  out  to  be 

4 z11  + 124z12  + 1539z13  + ■ • • + 33385461z46  + 2707075.Z47;  (134) 


so  the  longest  such  paths  are  Hamiltonian,  and  there  are  exactly  2,707,075  of 
them.  Furthermore,  exercise  227  shows  how  to  construct  a smaller  ZDD,  of  size 
4726,  which  describes  just  the  Hamiltonian  paths  from  CA  to  ME. 

We  could  repeat  this  experiment  for  each  of  the  states  in  place  of  California. 
(Well,  the  starting  point  had  better  be  outside  of  New  England,  if  we  are  going 
to  get  past  New  York,  which  is  an  articulation  point  of  this  graph.)  For  example, 
there  are  483,194  Hamiltonian  paths  from  NJ  to  ME.  But  exercise  228  shows  how 
to  construct  a single  ZDD  of  size  28808  for  the  family  of  all  Hamiltonian  paths 
from  ME  to  any  other  final  state  — of  which  there  are  68,656,026.  The  answer  to 
Bryant’s  problem  now  pops  out  immediately,  via  Algorithm  B.  (The  reader  may 
like  to  try  finding  a minimum  route  by  hand,  before  turning  to  exercise  230  and 
discovering  the  absolutely  optimum  answer.) 

*ZDDs  and  prime  implicants.  Finally,  let’s  look  at  an  instructive  application 
in  which  BDDs  and  ZDDs  are  both  used  simultaneously. 

According  to  Theorem  7.1.1Q,  every  monotone  Boolean  function  / has  a 
unique  shortest  two-level  representation  as  an  OR  of  ANDs,  called  its  “disjunctive 
prime  form”  — the  disjunction  of  all  of  its  prime  implicants.  The  prime  impli- 
cants correspond  to  the  minimal  points  where  f(x)  = 1,  namely  the  binary 
vectors  x for  which  we  have  f(x')  = 1 and  x'  C x if  and  only  if  x'  = x.  If 

f(xi,x2,x3)  = xx  V (x2  Ax3),  (135) 


for  example,  the  prime  implicants  of  / are  aq  and  X2  A £3,  while  the  minimal 
solutions  are  xlx2x3  = 100  and  Oil.  These  minimal  solutions  can  also  be 
expressed  conveniently  as  e\  and  e2  Ue3,  using  family  algebra  (see  exercise  203). 

In  general,  Xi1  A • • • A aqs  is  a prime  implicant  of  a monotone  function  / if 
and  only  if  eq  U • ■ • U els  is  a minimal  solution  of  /.  Thus  we  can  consider  /’ s 
prime  implicants  PI(/)  to  be  its  family  of  minimal  solutions.  Notice,  however, 
that  Xi1  A • • • A Xis  C Xj1  A • ■ • A Xjt  if  and  only  if  eq  U • • • LI  e,s  D ej1  U • • • U ejt ; so 
it’s  confusing  to  say  that  one  prime  implicant  “contains”  another.  Instead,  we 
say  that  the  shorter  one  “absorbs”  the  longer  one. 

A curious  phenomenon  shows  up  in  example  (135):  The  diagram  m dj'  ti 
is  not  only  the  BDD  for  /,  it’s  also  the  ZDD  for  PI(/)!  Similarly,  Fig.  21  at  the 
beginning  of  this  section  illustrates  not  only  the  BDD  for  ( xix2x3 ) but  also  the 
ZDD  for  Pl^aqaqrrs)).  On  the  other  hand,  let  g = (aq  Ax3)\/x2.  Then  the  BDD 


for  g is 


but  the  ZDD  for  PI(</)  is 


What’s  going  on  here? 


IB 
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The  key  to  resolving  this  mystery  lies  in  the  recursive  structure  on  which 
BDDs  and  ZDDs  are  based.  Every  Boolean  function  can  be  represented  as 

f(xi>  ■ • • ’ xn)  = (Si?  fo-  f\)  = (Si  A /0)  V (xi  A fi),  (136) 

where  fc  is  the  value  of  / when  x±  is  replaced  by  c.  When  / is  monotone  we  also 
have  f = f0V(x!  A fi),  because  f0  C fc.  If  f0  / fu  the  BDD  for  / is  obtained 
by  creating  a node  (T)  whose  LO  and  HI  branches  point  to  the  BDDs  for  /0 
and  / 1 • Similarly,  it  s not  difficult  to  see  that  the  prime  implicants  of  / are 

PI(/)  = PI(/o)u(e1U(PI(/1)\Pl(/0))).  (137) 

(See  exercise  253.)  This  is  the  recursion  that  defines  the  ZDD  for  PI(/),  when 
we  add  the  termination  conditions  for  constant  functions:  The  ZDDs  for  PI(0) 
and  PI(1)  are  [X]  and  pf]. 

Let’s  say  that  a Boolean  function  / is  sweet  if  it  is  monotone  and  if  the  ZDD 
for  PI(/)  is  exactly  the  same  as  the  BDD  for  /.  Constant  functions  are  clearly 
sweet.  And  nonconstant  sweetness  is  easily  characterized: 

Theorem  S.  A Boolean  function  that  depends  on  x,  is  sweet  if  and  only  if  its 
piime  implicants  are  P U (ar  U Q),  where  P and  Q are  sweet  and  independent 
of  xi,  and  every  member  of  P is  absorbed  by  some  member  ofQ. 

Proof.  See  exercise  246.  (To  say  that  “P  and  Q are  sweet”  means  that  they 
each  are  families  of  prime  implicants  that  define  a sweet  Boolean  function.)  | 

Corollary  S.  The  connectedness  function  of  any  graph  is  sweet. 

Proof.  The  prime  implicants  of  the  connectedness  function  / are  the  spanning 
trees  of  the  graph.  Every  spanning  tree  that  does  not  include  arc  xx  has  at  least 
one  subtree  that  will  be  spanning  when  arc  xx  is  added  to  it.  Furthermore,  all 
subfunctions  of  / are  the  connectedness  functions  of  smaller  graphs.  | 

Thus,  for  example,  the  BDD  in  Fig.  22,  which  defines  all  431  of  the  connected 
subgraphs  of  P3  [HP3,  also  is  the  ZDD  that  defines  all  192  of  its  spanning  trees. 

Whether  / is  sweet  or  not,  we  can  use  (137)  to  compute  the  ZDD  for  PI(/) 
whenever  / is  monotone.  When  we  do  this  we  can  actually  let  the  BDD  nodes  and 
the  ZDD  nodes  coexist  in  the  same  big  base  of  data:  Two  nodes  with  identical 
(V,  LO,  HI)  fields  might  as  well  appear  only  once  in  memory,  even  though  they 
might  have  complete  different  meanings  in  different  contexts.  We  use  one  routine 
to  synthesize  / A g when  / and  g point  to  BDDs,  and  another  routine  to  form 
f \ g when  / and  g point  to  ZDDs;  no  trouble  will  arise  if  these  routines  happen 
to  share  nodes,  as  long  as  the  variables  aren’t  being  reordered.  (Of  course  the 
cache  memos  must  distinguish  BDD  facts  from  ZDD  facts  when  we  do  this.) 

For  example,  exercise  7.1.1-67  defines  an  interesting  class  of  self-dual  func- 
tions called  the  Y functions,  and  the  BDD  for  Y12  (which  is  a function  of  91 
variables)  has  748,416  nodes.  This  function  has  2,178,889,774  prime  implicants; 
yet  Z(PI(F12))  is  only  217,388.  (We  can  find  this  ZDD  with  a computational 
cost  of  about  13  gigamems  and  660  megabytes.) 


7.1.4 


BINARY  DECISION  DIAGRAMS  257 


A brief  history.  The  seeds  of  binary  decision  diagrams  were  implicitly  planted 
by  Claude  Shannon  [Trans.  Amer.  Inst.  Electrical  Engineers  57  (1938),  713—723] , 
in  his  illustrations  of  relay-contact  networks.  Section  4 of  that  paper  showed  that 
any  symmetric  Boolean  function  of  n variables  has  a BDD  with  at  most  ("t1) 
branch  nodes.  Shannon  preferred  to  work  with  Boolean  algebra;  but  C.  Y.  Lee,  in 
Bell  System  Tech.  J.  38  (1959),  985-999,  pointed  out  several  advantages  of  what 
he  called  “binary-decision  programs,”  because  any  n-variable  function  could  be 
evaluated  by  executing  at  most  n branch  instructions  in  such  a program. 

S.  Akers  coined  the  name  “binary  decision  diagrams”  and  pursued  the  ideas 
further  in  IEEE  Trans.  C-27  (1978),  509-516.  He  showed  how  to  obtain  a 
BDD  from  a truth  table  by  working  bottom-up,  or  from  algebraic  subfunctions 
by  working  top-down.  He  explained  how  to  count  the  paths  from  a root  to  [t] 
or  [X] , and  observed  that  these  paths  partition  the  n-cube  into  disjoint  subcubes. 

Meanwhile  a very  similar  model  of  Boolean  computation  arose  in  theoret- 
ical studies  of  automata.  For  example,  A.  Cobham  [FOCS  7 (1966),  78-87] 
related  the  minimum  sizes  of  branching  programs  for  a sequence  of  functions 
fn(x i,...,xn)  to  the  space  complexity  of  nonuniform  Turing  machines  that 
compute  this  sequence.  More  significantly,  S.  Fortune,  J.  Hopcroft,  and  E.  M. 
Schmidt  [ Lecture  Notes  in  Comp.  Sci.  62  (1978),  227-240]  considered  “free  13- 
schemes,”  now  known  as  FBDDs,  in  which  no  Boolean  variable  is  tested  twice 
on  any  path  (see  exercise  35).  Among  other  results,  they  gave  a polynomial-time 
algorithm  to  test  whether  / = g,  given  FBDDs  for  / and  g,  provided  that  at 
least  one  of  those  FBDDs  is  ordered  consistently  as  in  a BDD.  The  theory  of 
finite-state  automata,  which  has  intimate  connections  to  BDD  structure,  was  also 
being  developed;  thus  several  researchers  worked  on  problems  that  are  equivalent 
to  analyzing  the  size,  B(f),  for  various  functions  /.  (See  exercise  261.) 

All  of  this  work  was  conceptual,  not  implemented  in  computer  programs, 
although  programmers  had  found  good  uses  for  binary  tries  and  Patrician  trees  — 
which  are  similar  to  BDDs  except  that  they  are  trees  instead  of  dags  (see  Sec- 
tion 6.3).  But  then  Randal  E.  Bryant  discovered  that  binary  decision  diagrams 
are  significantly  important  in  practice  when  they  are  required  to  be  both  reduced 
and  ordered.  His  introduction  to  the  subject  [IEEE  Trans.  C-35  (1986),  677-691] 
became  for  many  years  the  most  cited  paper  in  all  of  computer  science,  because 
it  revolutionized  the  data  structures  used  to  represent  Boolean  functions. 

In  his  paper,  Bryant  pointed  out  that  the  BDD  for  any  function  is  essentially 
unique  under  his  conventions,  and  that  most  of  the  functions  encountered  in 
practice  had  BDDs  of  reasonable  size.  He  presented  efficient  algorithms  to 
synthesize  the  BDDs  for  f Ag  and  /©<?,  etc.,  from  the  BDDs  for  / and  g.  He  also 
showed  how  to  compute  the  lexicographically  least  x such  that  /(*)  = 1,  etc. 

Lee,  Akers,  and  Bryant  all  noted  that  many  functions  can  profitably  co- 
exist in  a BDD  base,  sharing  their  common  subfunctions.  A high-performance 
“package”  for  BDD  base  operations,  developed  by  K.  S.  Brace,  R.  L.  Rudell, 
and  R.  E.  Bryant  [ACM/IEEE  Design  Automation  Conf.  27  (1990),  40-45],  has 
strongly  influenced  all  subsequent  implementations  of  BDD  toolkits.  Bryant 
summarized  the  early  uses  of  BDDs  in  Computing  Surveys  24  (1992),  293-318. 
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Shin-ichi  Minato  introduced  ZDDs  in  1993,  as  noted  above,  to  improve 
performance  in  combinatorial  work.  He  gave  a retrospective  account  of  early 
ZDD  applications  in  Software  Tools  for  Technology  Transfer  3 (2001),  156-170 
The  use  of  Boolean  methods  in  graph  theory  was  pioneered  by  K.  Maghout 
[Comptes  Rendus  Acad.  Sci.  248  (Paris,  1959),  3522-3523],  who  showed  how 
to  express  the  maximal  independent  sets  and  the  minimal  dominating  sets  of 
any  graph  or  digraph  as  the  prime  implicants  of  a monotone  function.  Then 
R.  Fortet  [Cahiers  du  Centre  d' Etudes  Recherche  Operationelle  1,  4 (1959),  5-36] 
considered  Boolean  approaches  to  a variety  of  other  problems;  for  example,  he 
introduced  the  idea  of  4-coloring  a graph  by  assigning  two  Boolean  variables  to 
each  vertex,  as  we  have  done  in  (73).  P.  Camion,  in  that  same  journal  [2  (1960), 
234-289],  transformed  integer  programming  problems  into  equivalent  problems 
m Boolean  algebra,  hoping  to  resolve  them  via  techniques  of  symbolic  logic.  This 
work  was  extended  by  others,  notably  P.  L.  Hammer  and  S.  Rudeanu,  whose  book 
Boolean  Methods  in  Operations  Research  (Springer,  1968)  summarized  the  ideas. 
Unfortunately,  however,  their  approach  foundered,  because  no  good  techniques 
for  Boolean  calculation  were  available  at  the  time.  The  proponents  of  Boolean 
methods  had  to  wait  until  the  advent  of  BDDs  before  the  general  Boolean 
programming  problem  (7)  could  be  resolved,  thanks  to  Algorithm  B.  The  special 
case  of  Algorithm  B in  which  all  weights  are  nonnegative  was  introduced  by 
B.  Lin  and  F.  Somenzi  [ International  Conf.  Computer-Aided  Design  CAD-90 
(IEEE,  1990),  88-91],  S.  Minato  [Formal  Methods  in  System  Design  10  (1997), 
221-242]  developed  software  that  automatically  converts  linear  inequalities  be- 
tween integer  variables  into  BDDs  that  can  be  manipulated  conveniently,  some- 
what as  the  researchers  of  the  1960s  had  hoped  would  be  possible. 

The  classic  problem  of  finding  a minimum  size  DNF  for  a given  function  also 
became  spectacularly  simpler  when  BDD  methods  became  understood.  The 
latest  techniques  for  that  problem  are  beyond  the  scope  of  this  book,  but  Olivier 
Coudert  has  given  an  excellent  overview  in  Integration  17  (1994),  97-140. 

A fine  book  by  Ingo  Wegener,  Branching  Programs  and  Binary  Decision 
Diagrams  (SIAM,  2000),  surveys  the  vast  literature  of  the  subject,  develops  the 
mathematical  foundations  carefully,  and  discusses  many  ways  in  which  the  basic 
ideas  have  been  generalized  and  extended. 

Caveat.  We’ve  seen  dozens  of  examples  in  which  the  use  of  BDDs  and/or 
ZDDs  has  made  it  possible  to  solve  a wide  variety  of  combinatorial  problems 
with  amazing  efficiency,  and  the  exercises  below  contain  dozens  of  additional 
examples  where  such  methods  shine.  But  BDD  and  ZDD  structures  are  by  no 
means  a panacea;  they’re  only  two  of  the  weapons  in  our  arsenal.  They  apply 
chiefly  to  problems  that  have  more  solutions  than  can  readily  be  examined  one  by 
one,  problems  whose  solutions  have  a local  structure  that  allows  our  algorithms 
to  deal  with  only  relatively  few  subproblems  at  a time.  In  later  sections  of  The 
Art  of  Computer  Programming  we  shall  be  studying  additional  techniques  by 
which  other  kinds  of  combinatorial  problems  can  be  tamed. 
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EXERCISES 

► 1.  [20]  Draw  the  BDDs  for  all  16  Boolean  functions  f(x i,X2).  What  are  their  sizes? 

► 2.  [21]  Draw  a planar  dag  with  sixteen  vertices,  each  of  which  is  the  root  of  one  of 

the  16  BDDs  in  exercise  1. 

3.  [16]  How  many  Boolean  functions  f(xi, . . . , x„)  have  BDD  size  3 or  less? 

4.  [21  ] Suppose  three  fields  | V | LO  | HI  ] have  been  packed  into  a 64-bit  word  x, 
where  V occupies  8 bits  and  the  other  two  fields  occupy  28  bits  each.  Show  that  five 
bitwise  instructions  will  transform  x H > x' , where  x'  is  equal  to  x except  that  a LO  or 
HI  value  of  0 is  changed  to  1 and  vice  versa.  (Repeating  this  operation  on  every  branch 
node  a;  of  a BDD  for  / will  produce  the  BDD  for  the  complementary  function,  /.) 

5.  [20]  If  you  take  the  BDD  for  f(xi, . . . , x.n)  and  interchange  the  LO  and  HI  pointers 
of  every  node,  and  if  you  also  swap  the  two  sinks  [X]  o [T] , what  do  you  get? 

6.  [10]  Let  g(xi,X2,X3,X4 ) = f(xi,X3,x 2,2:1),  where  / has  the  BDD  in  (6).  What 

is  the  truth  table  of  g,  and  what  are  its  beads? 

7.  [21]  Given  a Boolean  function  f(xi, ...,  xn),  let 

gic(x0,xi, . . . ,xn)  = f(x0, . . . ,Xk-2,Xk-i  V Xfc,Xfc+i, . ..,x„)  for  1 < k < n. 

Find  a simple  relation  between  (a)  the  truth  tables  and  (b)  the  BDDs  of  / and 

8.  [22]  Solve  exercise  7 with  Xk-i  © xjt  in  place  of  Xk- 1 V Xk- 

9.  [16]  Given  the  BDD  for  a function  f(x)  = f(x  1, . . . , xn),  represented  sequentially 
as  in  (8),  explain  how  to  determine  the  lexicographically  largest  x such  that  f(x)  = 0. 

► 10.  [21]  Given  two  BDDs  that  define  Boolean  functions  / and  /',  represented  sequen- 
tially as  in  (8)  and  (10),  design  an  algorithm  that  tests  / = /'. 

11.  [20]  Does  Algorithm  C give  the  correct  answer  if  it  is  applied  to  a binary  decision 
diagram  that  is  (a)  ordered  but  not  reduced?  (b)  reduced  but  not  ordered? 

► 12.  [ M21 ] A kernel  of  a digraph  is  a set  of  vertices  K such  that 

v € K implies  v -/-»  u for  all  u £ K\ 
v ^ K implies  v — >u  for  some  »6A'. 

a)  Show  that  when  the  digraph  is  an  ordinary  graph  (that  is,  when  u — >v  if  and  only 
if  v — >u),  a kernel  is  the  same  as  a maximal  independent  set. 

b)  Describe  the  kernels  of  the  oriented  cycle  Cn. 

c)  Prove  that  an  acyclic  digraph  has  a unique  kernel. 

13.  [M15]  How  is  the  concept  of  a graph  kernel  related  to  the  concept  of  (a)  a maximal 
clique?  (b)  a minimal  vertex  cover? 

14.  [M24]  How  big,  exactly,  are  the  BDDs  for  (a)  all  independent  sets  of  the  cycle 
graph  Cn,  and  (b)  all  kernels  of  C„,  when  n > 3?  (Number  the  vertices  as  in  (12).) 

15.  [M23]  How  many  (a)  independent  sets  and  (b)  kernels  does  Cn  have,  when  n > 3? 

► 16.  [22]  Design  an  algorithm  that  successively  generates  all  vectors  x\  . . . xn  for  which 
f(x  1, . . . , xn)  = 1,  when  a BDD  for  / is  given. 

17.  [32]  If  possible,  improve  the  algorithm  of  exercise  16  so  that  its  running  time  is 
0(B(f))  + O(N)  when  there  are  N solutions. 

18.  [13]  Play  through  Algorithm  B with  the  BDD  (8)  and  (»i, . . . , W4)  = (1,-2,— 3, 4). 
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19.  [20]  What  are  the  largest  and  smallest  possible  values  of  variable  mk  in  Algo- 
rithm B,  based  only  on  the  weights  (wi, . . . , wn),  not  on  any  details  of  the  function  /? 

20.  [15]  Devise  a fast  way  to  compute  the  Time  -Morse  weights  (15)  for  1 < j < n. 

21.  [05]  Can  Algorithm  B minimize  wlXl  + ■ ■ ■ + wnxn,  instead  of  maximizing  it? 

►22.  [M21]  Suppose  step  B3  has  been  simplified  so  that  lWv+1  — WVl  ’ and  ‘IF„+1  — Wv  ’ 
are  eliminated  from  the  formulas.  Prove  that  the  algorithm  will  still  work,  when  applied 
to  BDDs  that  represent  kernels  of  graphs. 

► 23  [M20]  All  paths  from  the  root  of  the  BDD  in  Fig.  22  to  \T\  have  exactly  eight 

solid  arcs.  Why  is  this  not  a coincidence? 

24.  [M2  2]  Suppose  twelve  weights  (w12,w13, w89)  have  been  assigned  to  the  edges 
o the  grid  in  Fig.  22.  Explain  how  to  find  a minimum  spanning  tree  in  that  graph 
(namely,  a spanning  tree  whose  edges  have  minimum  total  weight),  by  applying  Algo- 
rithm B to  the  BDD  shown  there. 


25.  [M20]  Modify  Algorithm  C so  that  it  computes  the  generating  function  for  the  so- 
lutions to  f(xu.  = 1,  namely  G(z)  = ^1=0  • • ■ £*b=0  *Xl+-+“" /(*,,. . . ,x„). 

26.  [M20]  Modify  Algorithm  C so  that  it  computes  the  reliability  polynomial  for  given 
probabilities,  namely 


F(Pi,‘--,Pn)  = Y2  (1 

xi=0 


p^~XipV 


£n  =0 


.(1  -Pn)1  X"Pnnf(x  l,...,Xn). 


►27.  [M26]  Suppose  F(Pl,...,Pn)  and  G(Pl,...,Pn)  are  the  reliability  polynomials 
for  Boolean  functions /(xr,  and  g(Xl,. . where  / jt  g.  Let  q be  a prime 

number,  and  choose  independent  random  integers  qi,  ...,  qn.  uniformly  distributed 
in  the  range  0 < qk  < q.  Prove  that  F(qu.  ,.,qn)  mod  q # G(qi,.  ..,qn)  mod  q with 
probability  > (1  - \/q)n.  (In  particular,  if  n = 1000  and  q = 231  - 1,  different  functions 
lead  to  different  “hash  values”  under  this  scheme  with  probability  at  least  0.9999995.) 

28.  [Ml  6]  Let  F(p)  be  the  value  of  the  reliability  polynomial  F(pu  ...,Pn)  when  Pl  = 
■■■  =Pn=p.  Show  that  it’s  easy  to  compute  F(P)  from  the  generating  function  G(z). 

29.  [HM20]  Modify  Algorithm  C so  that  it  computes  the  reliability  polynomial  F(p) 
of  exercise  28  and  also  its  derivative  F'(p ),  given  p and  the  BDD  for  /. 

►30.  [M21]  The  reliability  polynomial  is  the  sum,  over  all  solutions  to /(xi  x ) = l 

of  contributions  from  all  “minterms”  (1  -pif-ip?  . . . (1  Explain  how 

to  find  a solution  . . . x„  whose  contribution  to  the  total  reliability  is  maximum,  given 
a BDD  for  / and  a sequence  of  probabilities  (pi,...  ,pn). 

31.  [M21]  Modify  Algorithm  C so  that  it  computes  the  fully  elaborated  truth  table 
of  /,  formalizing  the  procedure  by  which  (24)  was  obtained  from  Fig.  21. 

► 32.  [M20]  What  interpretations  of  ‘o’,  V,  ‘_L’,  ‘T\  ‘x/,  and  ‘x/  will  make  the  general 
algorithm  of  exercise  31  specialize  to  the  algorithms  of  exercises  25,  26,  29,  and  30? 

► 33.  [M22]  Specialize  exercise  31  so  that  we  can  efficiently  compute 


(uiixi  + • 

f(x)=l 


and 


T UJnXn) 

from  the  BDD  of  a Boolean  function  f(x)  = f(x  1, . ,.,x„) 


(WlXl  + • • ■ + i»„x„)! 

/(*)  = ! 
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34.  [M25]  Specialize  exercise  31  so  that  we  can  efficiently  compute 

max{  max  (wxXi  H wk_1xk_1  + w'kxk  + wk+1xk+1  \-wnxn+wk)  \ f(x)  = l} 


from  the  BDD  of  /,  given  3 n arbitrary  weights  (wj, . . . , wn,w[, . . . , w'n,w", . . . , w '„). 

► 35.  [22]  A free  binary  decision  diagram  (FBDD)  is  a binary  decision  diagram  such  as 


where  the  branch  variables  needn’t  appear  in  any  particular  order,  but  no  variable  is 
allowed  to  occur  more  than  once  on  any  downward  path  from  the  root.  (An  FBDD  is 
“free”  in  the  sense  that  every  path  in  the  dag  is  possible:  No  branch  constrains  another.) 

a)  Design  an  algorithm  to  verify  that  a supposed  FBDD  is  really  free. 

b)  Show  that  it’s  easy  to  compute  the  reliability  polynomial  F(pi, . . . ,pn)  of  a Bool- 
ean function  f(x i, . . . ,xn),  given  (pi, . . . ,pn)  and  an  FBDD  that  defines  /,  and 
to  compute  the  number  of  solutions  to  f(x i, . . . , xn ) = 1. 

36.  [25]  By  extending  exercise  31,  explain  how  to  compute  the  elaborated  truth  table 
for  any  given  FBDD,  if  the  abstract  operators  o and  • are  commutative  as  well  as 
distributive  and  associative.  (Thus  we  can  find  optimum  solutions  as  in  Algorithm  B,  or 
solve  problems  such  as  those  in  exercises  30  and  33,  with  FBDDs  as  well  as  with  BDDs.) 

37.  [M20]  (R.  L.  Rivest  and  ,1.  Vuillemin,  1976.)  A Boolean  function  f(x i, . . . ,x„)  is 
called  evasive  if  every  FBDD  for  / contains  a downward  path  of  length  n.  Let  G(z)  be 
the  generating  function  for  /,  as  in  exercise  25.  Prove  that  / is  evasive  if  G(  — 1)  ^ 0. 

► 38.  [27]  Let  Is-i,  . ..,  Ia  be  branch  instructions  that  define  a nonconstant  Boolean 
function  f(x i, . . . , xn)  as  in  (8)  and  (10).  Design  an  algorithm  that  computes  the  status 
variables  t\  . . . tn,  where 


{+1,  if  /( xi, . . . ,xn)  = 1 whenever  Xj  = 1; 
— 1,  if  f(x i, . . . ,xn)  = 1 whenever  Xj  = 0; 
0,  otherwise. 


(If  ti  . . . tn  / 0 ...  0,  the  function  / is  therefore  canalizing  as  defined  in  Section  7.1.1.) 
The  running  time  of  your  algorithm  should  be  0(n  + s). 


39.  [M20]  What  is  the  size  of  the  BDD  for  the  threshold  function  [x\  + • ■ ■ + x„  > fc]? 

► 40.  [22]  Let  g be  the  “condensation”  of  / obtained  by  setting  xk+i  <—  xk  as  in  (27). 

a)  Prove  that  B(g)  < B(f).  [Hint:  Consider  subtables  and  beads.] 

b)  Suppose  h is  obtained  from  / by  setting  xk+2  «—  xk.  Is  B(h)  < B(f)7 


41.  [M25] 
tions  (a)  (x 


Assuming  that  n > 4,  find  the  BDD  size  of  the  Fibonacci  threshold  func- 


F1  tF2 
1 **"} 


2xn-ilxnn  2 > and  (b)  (x„1x^2_1  . . . x["  2x: 


2>- 


42.  [22]  Draw  the  BDD  base  for  all  symmetric  Boolean  functions  of  3 variables. 


► 43.  [22]  What  is  B(f)  when  (a)  x2n)  = [x\  4 hi„  = i„+H ^ x2n]^ 

(b)  f(x  1,  . . . ,X2n)  = [xi  + x3  + h X2n-1  —X2  + X4  -t 1-  x2n\? 


► 44.  [M32]  Determine  the  maximum  possible  size,  ZJn,  of  B(f)  when  / is  a symmetric 
Boolean  function  of  n variables. 
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45.  [22]  Give  precise  specifications  for  the  Boolean  modules  that  compute  the  three- 
m-a-row  function  as  in  (33)  and  (34),  and  show  that  the  network  is  well  defined. 

46.  [M23]  What  is  the  true  BDD  size  of  the  three-in-a-row  function? 


47.  [M21]  Devise  and  prove  a converse  of  Theorem  M:  Every  Boolean  function  / with 
a small  BDD  can  be  implemented  by  an  efficient  network  of  modules. 

48.  [M22]  Implement  the  hidden  weighted  bit  function  with  a network  of  modules 
like  Fig.  23,  using  a*,  = 2 + Xk  and  bk  = 1 + A (n  — k)  connecting  wires  for  1 < k < n. 
Conclude  from  Theorem  B that  the  upper  bound  in  Theorem  M cannot  be  improved 
to  ELo  2p(“*"6*)  for  any  polynomial  p. 

49.  [20]  Draw  the  BDD  base  for  the  following  sets  of  symmetric  Boolean  functions: 
(a)  {S>k(x  i,x2,x3,x4)  | 1 < k < 4};  (b)  {S*,(a;x,  x2,  X3,  x4)  | 0 < k < 4}. 

50.  [22]  Draw  the  BDD  base  for  the  functions  of  the  ^-segment  display  (7.1.2-(42)). 

51.  [22]  Describe  the  BDD  base  for  binary  addition  when  the  input  bits  are  numbered 
from  right  to  left,  namely  (/B+x/B/„_i  . . . /x)2  = (*2„_x  . . . x3Xl)2  + (x2n  . . . x4x2)2, 
instead  of  from  left  to  right  as  in  (35)  and  (36). 


52.  [20]  There’s  a sense  in  which  the  BDD  base  for  m functions  {/i,...,/m}  isn’t 
really  very  different  from  a BDD  with  just  one  root:  Consider  the  junction  function 
J(ui, . . . , un;  Vi, . . . , vn)  = ( u\ ? v±:  t/2?  v^\  • • • unl  vn : 0),  and  let 

/(fl,  • • • ,fm+l,XX,  . • ■ ,xn)  = J(tx, . . . ,tm+x;  fl(xi,  . ..  ,x„), ...  1), 

where  (fi, . . . , tm+x)  are  new  “dummy”  variables,  placed  ahead  of  (*x,  ...,*„)  in  the  or- 
dering. Show  that  B(f)  is  almost  the  same  as  the  size  of  the  BDD  base  for  {/x, . . . , /m}. 

► 53.  [23]  Play  through  Algorithm  R,  when  it  is  applied  to  the  binary  decision  diagram 
with  seven  branch  nodes  in  (2). 


54.  [17]  Construct  the  BDD  of  f(Xl,.  ,.,xn)  from  /’ s truth  table,  in  0(2")  steps. 

55.  [M30]  Explain  howto  construct  the  “connectedness  BDD”  of  a graph  (like  Fig.  22). 

56.  [20]  Modify  Algorithm  R so  that,  instead  of  pushing  any  unnecessary  nodes  onto 
an  AVAIL  stack,  it  creates  a brand  new  BDD,  consisting  of  consecutive  instructions 
Is-i,  ...,  h , 7o  that  have  the  compact  form  (va,  ? 1^ : hk ) assumed  in  Algorithms  B 
and  C.  (The  original  nodes  input  to  the  algorithm  can  then  all  be  recycled  en  masse.) 

57.  [25]  Specify  additional  actions  to  be  taken  between  steps  R1  and  R2  when  Algo- 
rithm R is  extended  to  compute  the  restriction  of  a function.  Assume  that  FIX  [e]  = 
t e {0, 1}  if  variable  v is  to  be  given  the  fixed  value  t;  otherwise  FIX[v]  < 0. 


58.  [20]  Prove  that  the  “melded”  diagram  defined  by  recursive  use  of  (37)  is  reduced. 

► 59.  [M28]  Let  h(Xl,  ...,*„)  be  a Boolean  function.  Describe  the  melded  BDD  fog  in 
terms  of  the  BDD  for  h,  when  (a)  f(xu. . . , x2n)  = h(xu. . . ,x„)  and  g(xu. . . , x2n)  = 
h(x„+1,...,x2n);  (b)  f(xi,x2, . . . , x2n)  = h(x1,x3,...,x2n-1)  and  g(xi,  x2, . . . , x2n)  = 
h{x2,x 4, . . . ,x2n).  [In  both  cases  we  obviously  have  B(f)  = B(g)  = B(h).] 

60.  [Jim]  Suppose  /(*x,.  ..,*„)  and  g(Xl, . ..,*,)  have  the  profiles  (60, . . . ,bn)  and 
(&o,...,6„),  respectively,  and  let  their  respective  quasi-profiles  be  ( q0,...,qn ) and 
(q'o,  • • . , q'n).  Show  that  their  meld  fog  has  B(fog)  < EJ=oMx  + b.q’  - b^b'f)  nodes. 

► 61.  [M2 7]  If  a and  B are  nodes  of  the  respective  BDDs  for  / and  g,  prove  that 


in-degree(a  o (3)  < in-degree(o)  • in-degree(/3) 
in  the  melded  BDD  fog.  (Imagine  that  the  root  of  a BDD  has  in-degree  1.) 
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► 62.  [M21]  If  f(x)  = V}=/12J(»2j-i  t\x2j)  and  g(x)  = (xi  Axn)  V V}^21  1 (x2j  f\x2j+i), 
what  are  the  asymptotic  values  of  B(f),  B(g),  B(f  o g),  and  B(f  V g)  as  n — >•  oo? 

63.  [M27]  Let  f(xi, ...  ,xn)  = Mm(xi  ® *^2  5 *^3  ® 3^4  ? • • • 5 ^2m-l  ® %2m  5 *E2m+l  5 • • • ? 3?n) 
and  g(xi, . . . ,xn)  = Mm(x 2 © X3, . . . , x2rn-2  © £2m-i,  X2m!  £2m+i, . . . , xn),  where  n = 
2m  + 2m.  What  are  B(/),  B(g),  and  B(f  A g)? 

64.  [M21  ] We  can  compute  the  median  (/1/2/3)  of  three  Boolean  functions  by  forming 

A = A v /2,  A = A a /2,  A = A a /4,  /7  = /5  v /6. 

Then  B(/4)  = 0(B(fx)B(fa)),  B(h)  = 0(B(/1)B(/2)),  S(/6)  - 0(B(f3)  B(f4))  = 
0(B(f1)B(f2)B(f3));  therefore  S(/7)  = 0(B(/5)B(/6))  = 0(B{fi)2 B(f2)2 B(f3)). 
Prove,  however,  that  S(/7)  is  actually  only  0(B(fi)B(f2)B(f3)),  and  the  running 
time  to  compute  it  from  /5  and  /6  is  also  0(B(fi) B(f2) B(f3)). 

► 65.  [ M25 } If  h(xi, . . . ,xn)  = f(xx, ...,  Xj-i,  g(xi> ...,  xn),  xj+i, ...,  xn),  prove  that 
B(h)  = 0(B(f)2  B(g)).  Can  this  upper  bound  be  improved  to  0(B(f)B(g))  in  general? 

66.  [20]  Complete  Algorithm  S by  explaining  what  to  do  in  step  SI  if  / o g turns  out 
to  be  trivially  constant. 

67.  [24]  Sketch  the  actions  of  Algorithm  S when  (41)  defines  / and  g,  and  op  = 1. 

68.  [20]  Speed  up  step  S10  by  streamlining  the  common  case  when  LEFT(t)  < 0. 

69.  [21  ] Algorithm  S ought  to  have  one  or  more  precautionary  instructions  such  as 
“if  NTOP  > TBOT,  terminate  the  algorithm  unsuccessfully,”  in  case  it  runs  out  of  room. 
Where  are  the  best  places  to  insert  them? 

70.  [21]  Discuss  setting  b to  [lg  LCOUNT  [Z]  J instead  of  fig  LCOUNT  [Z]  ] in  step  S4. 

71.  [20]  Discuss  how  to  extend  Algorithm  S to  ternary  operators. 

72.  [25]  Explain  how  to  eliminate  hashing  from  Algorithm  S. 

► 73.  [25]  Discuss  the  use  of  “virtual  addresses”  instead  of  actual  addresses  as  the  links 
of  a BDD:  Each  pointer  p has  the  form  7r(p)2e  + er(p),  where  7r(p)  = p » e is  p’s  “page” 
and  <r(p)  = p mod  2e  is  p’s  “slot”;  the  parameter  e can  be  chosen  for  convenience.  Show 
that,  with  this  approach,  only  two  fields  (L0,HI)  are  needed  in  BDD  nodes,  because 
the  variable  identifier  V (p)  can  be  deduced  from  the  virtual  address  p itself. 

► 74.  [M23]  Explain  how  to  count  the  number  of  self-dual  monotone  Boolean  functions 
of  n variables,  by  modifying  (49). 

75.  [M20]  Let  pn{x\, . . . ,x2n)  be  the  Boolean  function  that  is  true  if  and  only  if 
xi  . . ,x2n  is  the  truth  table  of  a regular  function  (see  exercise  7.1.1-110).  Show  that 
the  BDD  for  pn  can  be  computed  by  a procedure  similar  to  that  of  pn  in  (49). 

► 76.  [M22]  A “clutter”  is  a family  S of  mutually  incomparable  sets;  in  other  words, 
S 2 S'  whenever  S and  5'  are  distinct  members  of  S.  Every  set  S C {0, 1, . . . , n — 1} 
can  be  represented  as  an  n-bit  integer  s = X!{2e  I e £ S}\  so  every  family  of  such  sets 
corresponds  to  a binary  vector  x0xi  . . . x2"-i,  with  xs  = 1 if  and  only  if  s represents  a 
set  of  the  family. 

Show  that  the  BDD  for  the  function  ‘[xoxi  . . . X2n-i  corresponds  to  a clutter]’  has 
a simple  relation  to  the  BDD  for  the  monotone-function  function  pn(x\, . . . ,£2"). 

► 77.  [M30]  Show  that  there’s  an  infinite  sequence  (b0, 61,  b2, . . . ) = (1,  2, 3,  5, 6, . . . ) 
such  that  the  profile  of  the  BDD  for  p„  is  (Z>0,  61, . . . , b2n-i_1,  b2n-i  _x, . . . , bi,  b0, 2). 
(See  Fig.  25.)  How  many  branch  nodes  of  that  BDD  have  LO  = |~X~|  ? 
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► 78.  [25]  Use  BDDs  to  determine  the  number  of  graphs  on  12  labeled  vertices  for  which 
the  maximum  vertex  degree  is  d,  for  0 < d < 11. 

79.  [20]'  For  0 <d<  11,  compute  the  probability  that  a graph  on  vertices  {1, . . . , 12} 
has  maximum  degree  d,  if  each  edge  is  present  with  probability  1/3. 

80.  [23]  The  recursive  algorithm  (55)  computes  / A g in  a depth-first  manner,  while 
Algorithm  S does  its  computation  breadth-first.  Do  both  algorithms  encounter  the  same 
subproblems  /'  A g'  as  they  proceed  (but  in  a different  order),  or  does  one  algorithm 
consider  fewer  cases  than  the  other? 

► 81.  [20]  By  modifying  (55),  explain  how  to  compute  / © g in  a BDD  base. 

► 82.  [25]  When  the  nodes  of  a BDD  base  have  been  endowed  with  REF  fields,  explain 
how  those  fields  should  be  adjusted  within  (55)  and  within  Algorithm  U. 

83.  [ M20 ] Prove  that  if  / and  g both  have  reference  count  1,  we  needn’t  consult  the 
memo  cache  when  computing  AND (f,g)  by  (55). 

84.  [24]  Suggest  strategies  for  choosing  the  size  of  the  memo  cache  and  the  sizes  of 
the  unique  tables,  when  implementing  algorithms  for  BDD  bases.  What  is  a good  way 
to  schedule  periodic  garbage  collections? 

85.  [16]  Compare  the  size  of  a BDD  base  for  the  32  functions  of  16  x 16-bit  binary  mul- 
tiplication with  the  alternative  of  just  storing  a complete  table  of  all  possible  products. 

► 86.  [21]  The  routine  MUX  in  (62)  refers  to  “obvious”  values.  What  are  they? 

87.  [20]  If  the  median  operator  ( fgh ) is  implemented  with  a recursive  subroutine 
analogous  to  (62),  what  are  its  “obvious”  values? 

► 88.  [M25]  Find  functions  /,  g,  and  h for  which  the  recursive  ternary  computation  of 
f Ag  Ah  outperforms  any  of  the  binary  computations  (f  Ag)  Ah,  [g  Ah)  A/,  (hAf)Ag. 

89.  [15]  Are  the  following  quantified  formulas  true  or  false?  (a)  3x^X2  f = 3x23xif. 
(b)  VzrVxs/  = Vx2Vxi /.  (c)  VXl3x2f  < 3x2Vxi/.  (d)  \/Xl3x2f  > 3x2\/x1f. 

90.  [M20]  When  l = m = n = 3,  Eq.  (64)  corresponds  to  the  MOR  operation  of  MMIX. 
Is  there  an  analogous  formula  that  corresponds  to  MXOR  (matrix  multiplication  mod  2)? 

► 91.  [26]  In  practice  we  often  want  to  simplify  a Boolean  function  / with  respect  to  a 
“care  set”  g,  by  finding  a function  / with  small  B(f)  such  that 


f(x)Ag(x)  < f(x)  < f(x)Vg(x)  for  all  x. 

In  other  words,  f(x)  must  agree  with  f(x)  whenever  x satisfies  g(x)  = 1,  but  we 
don)t  care  what  value  f(x)  assumes  when  g(x)  = 0.  An  appealing  candidate  for  such 
an  / is  provided  by  the  function  fig,  “f  constrained  by  g”  defined  as  follows:  If  g(x)  is 
identically  0,  / i g = 0.  Otherwise  (f  i g)(x)  = f(y),  where  y is  the  first  element  of 

the  sequence  x,  x ® 1,  x © 2,  . . . , such  that  g(y)  = 1.  (Here  we  think  of  x and  y as 

n-bit  numbers  (Xl  . . . xn)2  and  (yi  . . . yn)2.  Thus  x®l=x®0...01=xi...  xn_,xn; 
x ® 2 = x ® 0 . . . 010  = x\  ...  Xn-2xn-ixn;  etc.) 

a)  What  are  / 4 1,  / 4.  xh  and  / i xj  ? 

b)  Prove  that  (/  A /')  4 9 = (/  i g)  A (/'  4 g). 

c)  True  or  false:  f ig  = f ig. 

d)  Simplify  the  formula  f(x  1, . . . , xn)  4 ( x2  Ax3Ax5A  x6 ). 

e)  Simplify  the  formula  f(x  1, . . . , xn)  4 (xx  ® x2  ® • • • ® xn). 

f)  Simplify  the  formula  /(an, . . . ,xn)  4 ((Xl  A • • • A xn)  V (x\  A • • • A xn)). 

g)  Simplify  the  formula  f(x1,...,xn)i(x1  A g(x2,  ...,*„)). 
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h)  Find  functions  f(x  1,2:2)  and  g( X\,X2)  such  that  B(f  l g)  > B(f). 

i)  Devise  a recursive  way  to  compute  fig,  analogous  to  (55). 

92.  [M2  7]  The  operation  fig  in  exercise  91  sometimes  depends  on  the  ordering  of  the 
variables.  Given  g = g(xi, . . . ,x„),  prove  that  (/lr  ig lr)  = [f  igY  for  all  permutations 
7r  of  {l,...,n}  and  for  all  functions  / = f(x i,...,xn)  if  and  only  if  g = 0 or  g is  a 
subcube  (a  conjunction  of  literals). 

93.  [36]  Given  a graph  G on  the  vertices  {l,...,n},  construct  Boolean  functions  / 
and  g with  the  property  that  an  approximating  function  / exists  as  in  exercise  91  with 
small  B(f)  if  and  only  if  G can  be  3-colored.  (Hence  the  task  of  minimizing  B(f)  is 
NP-complete.) 

94.  [21]  Explain  why  (65)  performs  existential  quantification  correctly. 

► 95.  [20]  Improve  on  (65)  by  testing  if  n = 1 before  computing  r/,. 

96.  [20]  Show  how  to  achieve  (a)  universal  quantification  / = / Ag,  and 

(b)  differential  quantification  Qxj1  . . . Qxjm  f = f D g,  by  modifying  (65). 

97.  [M20]  Prove  that  it’s  possible  to  compute  arbitrary  bottom-of-the-BDD  quantifi- 
cations such  as  3xn-YJ  xn-^Qxn-z3xn-2  ^xn-\i  xn  f (x\, . . . ,x„)  in  0(B(f))  steps. 

► 98.  [22]  In  addition  to  (70),  explain  how  to  define  the  vertices  ENDPT(x)  of  Q that 
have  degree  < 1.  Also  characterize  PAIR(x,  y),  the  components  of  size  2. 

99.  [20]  (R.  E.  Bryant,  1984.)  Every  4-coloring  of  the  US  map  considered  in  the  text 
corresponds  to  24  solutions  of  the  COLOR  function  (73),  under  permutation  of  colors. 
What’s  a good  way  to  remove  this  redundancy? 

► 100.  [24]  In  how  many  ways  is  it  possible  to  4-color  the  contiguous  USA  with  exactly 
12  states  of  each  color?  (Eliminate  DC  from  the  graph.) 

101.  [20]  Continuing  exercise  100,  with  colors  (1,2, 3, 4},  find  such  a coloring  that 
maximizes  (state  weight)  x (state  color),  where  states  are  weighted  as  in  (18). 

102.  [23]  Design  a method  to  cache  the  results  of  functional  composition  using  the  fol- 
lowing conventions:  The  system  maintains  at  all  times  an  array  of  functions  [gi, . . . ,(/„], 
one  for  each  variable  Xj.  Initially  g}  is  simply  the  projection  function  Xj,  for  1 < j < n. 
This  array  can  be  changed  only  by  the  subroutine  NEWG(/,g),  which  replaces  <7,  by  g. 
The  subroutine  COMPOSE (/)  always  performs  functional  composition  with  respect  to 
the  current  array  of  replacement  functions. 

► 103.  [20]  Mr.  B.  C.  Dull  wanted  to  evaluate  the  formula 

3yi  ■ • • 3ym{(yi  = fi{xu . . . ,xn))  A ■ • ■ A (ym  = fm(xi,  ■ ■ ■ ,%„))  A <7(2/1, . . -,ym)), 

for  certain  functions  /1,  . . . , fm,  and  g.  But  his  fellow  student,  J.  H.  Quick,  found  a 
much  simpler  formula  for  the  same  problem.  What  was  Quick’s  idea? 

► 104.  [21]  Devise  an  efficient  way  to  decide  whether  / < g or  / > g or  / j|  g,  where 
/ ||  g means  that  / and  g are  incomparable,  given  the  BDDs  for  / and  g. 

105.  [25]  A Boolean  function  f(x  1, . . . , xn)  is  called  unate  with  polarities  (yi, . . . , yn ) 
if  the  function  h(x  1, . . . ,xn)  = f(x  1 ® 3/1,  ...,*„  ® yn ) is  monotone. 

a)  Show  that  / can  be  tested  for  unateness  by  using  the  A and  N quantifiers. 

b)  Design  a recursive  algorithm  to  test  unateness  in  at  most  0(B(f)2)  steps,  given 
the  BDD  for/.  If  / is  unate,  your  algorithm  should  also  find  appropriate  polarities. 
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106.  [25]  Let  f$g$h  denote  the  relation  “/(*)  = g(y)  = 1 implies  h(x  Ay)  = 1,  for 
all  x and  j/.”  Show  that  this  relation  can  be  evaluated  in  at  most  0(B(f)B(g)B(h)) 
steps.  [ Motivation : Theorem  7.1.1H  states  that  / is  a Horn  function  if  and  only  if 
/$/$/;  thus  we  can  test  Horn-ness  in  0(B(f)3)  steps.] 

107.  [26]  Continuing  exercise  106,  show  that  it’s  possible  to  determine  whether  or  not 
/ is  a Krom  function  in  0(B(/)4)  steps.  [Hint:  See  Theorem  7.1. IS.] 

108.  [HM24]  Let  b(n,  s ) be  the  number  of  n-variable  Boolean  functions  with  B(f)  < s. 
Prove  that  (a  - 3)\b(n,s)  < ( n(a  - I)2)*-2  when  a > 3,  and  explore  the  ramifications 
of  this  inequality  when  a = |2n/(n  + 1/ln  2)J.  Hint:  See  the  proof  of  Theorem  7.1.2S. 

► 109.  [HM17]  Continuing  exercise  108,  show  that  almost  all  Boolean  functions  of  n var- 
iables have  B(D  > 2 n/(n  + 1/ln 2),  for  all  permutations  jr  of  {1, . . . ,n},  as  n ->  oo. 

110.  [25]  Construct  explicit  worst-case  functions  /„  with  B(f„)  = Un  in  Theorem  U. 

111.  [M21]  Verify  the  summation  formula  (79)  in  Theorem  U. 

112.  [HM23]  Prove  that  min(2*,22"“  - 22"'*-1)  - bk  is  very  small,  where  bk  is  the 
number  defined  in  (80),  except  when  n — lgn  — 1 < fc  < n — lgn+1. 


113.  [20 ] Instead  of  having  two  sink  nodes,  one  for  each  Boolean  constant,  we  could 
have  2 sinks,  one  for  each  Boolean  function  of  four  variables.  Then  a BDD  could  stop 
four  levels  earlier,  after  branching  on  xn-4 . Would  this  be  a good  idea? 


114.  [20]  Is  there  a function  with  profile  (1, 1, 1, 1, 1, 2)  and  quasi-profile  (1, 2, 3, 4, 3, 2)? 

► 115.  [M22]  Prove  the  quasi-profile  inequalities  (84)  and  (124). 

116.  [M21]  What  is  the  (a)  worst  case  (b)  average  case  of  a random  quasi-profile? 

117.  [M20]  Compare  Q(f)  to  B(f)  when  / = Mm(xu  . . . , xm;  xm+1, . . . , xm+2m). 

118.  [M23]  Show  that,  from  the  perspective  of  Section  7.1.2,  the  hidden  weighted  bit 
function  has  cost  C(hn ) = 0(n).  What  is  the  exact  value  of  C(/i4)? 

119.  [20]  True  or  false:  Every  symmetric  Boolean  function  of  n variables  is  a special 
case  of  h2„+i.  (For  example,  xx  ® x2  = h5(0, 1, 0, aq, x2).) 

120.  [15]  Explain  the  hidden-permuted-weighted-bit  formula  (94). 

► 121.  [M2 2]  li  f(xi, ...  ,xn)  is  any  Boolean  function,  its  dual  fD  is  f(xi, . . . , xn),  and 
its  reflection  fR  is  f(xn  ...,Xl).  Notice  that  fDD  = fRR  = f and  fDR  = fRD. 

a)  Show  that  h%R(  xu...,xn)  = hn(x2, . . . ,xn,Xl). 

b)  Furthermore,  the  hidden  weighted  bit  function  satisfies  the  recurrence 


hi(xi)  xu  hn+ i(xi, . . . , xn+i)  — (xn+1?  hn(x  2, ...  ,xn,xi):  hn(x  1, . . . ,xn)). 

c)  Define  xip,  a permutation  on  the  set  of  all  binary  strings  x,  by  the  recursive  rules 

eip  = e,  (Xl  . . .xnQ)ip  = (xi  . . .xn%p)Q,  (Xl  . . . xnl)ip  = (x2  . . . xnxi)il;l. 

For  example,  1101^  = (101^)1  = (01^)11  = (0^)111  = (t/>)0111  = 0111;  and  we 
also  have  0111)/  = 1101.  Is  ip  an  involution? 

d)  Show  that  h„{ x)  = hn(xip),  where  the  function  hn  has  a very  small  BDD. 

122.  [27]  Construct  an  FBDD  for  hn  that  has  fewer  than  n2  nodes,  when  n > 1. 

123.  [M20]  Prove  formula  (97),  which  enumerates  all  slates  of  offset  a. 

► 124.  [27]  Design  an  efficient  algorithm  to  compute  the  profile  and  quasi-profile  of  h„, 
given  a permutation  tt.  Hint:  When  does  the  slate  [r0, . . . , r„._fc]  correspond  to  a bead? 
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► 125.  [HMSJf } Prove  that  B(hn)  can  be  expressed  exactly  in  terms  of  the  sequences 


126.  [HM42]  Analyze  B(h for  the  organ-pipe  permutation  n — (2,4, . . . , n, . . . , 3, 1). 

127.  [46]  Find  a permutation  n that  minimizes  B(hf 00). 

► 128.  [25]  Given  a permutation  7r  of  {l,...,m  + 2m},  explain  how  to  compute  the 
profile  and  quasi-profile  of  the  permuted  2m-way  multiplexer 

(xi, . . . 

1 'Em  j •Em-^l  5 • • • > %m-\-2Tn  ) — -^^m(^l7r?  • • • 5 xmn]  %(m+l)ir  y • • • ? *E(rn+2m)7r  ) • 

129.  [M25]  Define  Qm(x  1, . . . ,xm2)  to  be  1 if  and  only  if  the  0-1  matrix  (x(i_i )m+j) 
has  no  all-zero  row  and  no  all-zero  column.  Prove  that  B(Qfn)  = 0(2 m/m2)  for  all  n. 

130.  [HM31  ] The  adjacency  matrix  of  an  undirected  graph  G on  vertices  {1, . . . , m} 
consists  of  (™)  variable  entries  xuv  = [it  — v in  G],  for  1 < u < v < m.  Let  Cm,k 
be  the  Boolean  function  [G  has  a fc-clique],  for  some  ordering  of  those  ("']  variables. 

a)  If  1 < k < s/m,  prove  that  B(Cmtk)  > where  s = (2)  — 1 and  t — m + 2 — k2 . 

b)  Consequently  B(Cmjm/2 1)  = 0(2 m^3/y/m),  regardless  of  the  variable  ordering. 

131.  [M28]  (The  covering  function.)  The  Boolean  function 


G(xi , X2 , • • • , Xp,  y\  1,2/12,  ■ ■ ■ , yiq , 1/21 , ■ * * , I/2q,  - * * , yPl  , Vp2 , • * * , Vpq  ) 

= ((*iAj/n)V(x2Ay2i)V  • • • V (xpAj/pi))  A • • • A ((xiAyi9)V(x2A2/2,)V  • • • V(xpA ypq)) 


is  true  if  and  only  if  all  columns  of  the  matrix  product 


X Y = (X\X2  . . . xp 


/ y n 

i/12  • 

■■  yiq\ 

2/21 

J/22  • 

••  2/2, 

. 

V Vp  1 

yP2  ■ 

ypq  ) 

are  positive,  i.e.,  when  the  rows  of  Y selected  by  x “cover”  every  column  of  that  matrix. 
The  reliability  polynomial  of  C is  important  in  the  analysis  of  fault-tolerant  systems, 
a)  When  a BDD  for  C tests  the  variables  in  the  order 


®l}2/ll?t/l2>  • • • 5 Vlq  > *^2  1 2/21 5 2/22  5 • • • ? Vlq  ? • • • 5 -Epi  2/pl  1 ?/p2  5 • • • 5 2/pg  5 

show  that  the  number  of  nodes  is  asymptotically  pq2q~l  for  fixed  q as  p — ► oo. 

b)  Find  an  ordering  for  which  the  size  is  asymptotically  pq2p~1  for  fixed  p as  q — > oo. 

c)  Prove,  however,  that  Bm-m(C)  = n(2mm(p’,3)/2)  in  general. 

132.  [32]  What  Boolean  functions  f(x i, x2, X3, x4, xs)  have  the  largest  Bmin(/)? 

133.  [20]  Explain  how  to  compute  Bmln(f)  and  Bmax(/)  from  /’ s master  profile  chart. 

134.  [24]  Construct  the  master  profile  chart,  analogous  to  (102),  for  the  Boolean 
function  xi  © ((x2  © (xi  V (x2  A X3)))  A (X3  © x4)).  What  are  Bmin(/)  and  Bmax(/)? 
Hint:  The  identity  /(xi,  x2,  X3,  x4)  = /(xi,  x2,  x4,  X3)  saves  about  half  the  work. 

135.  [M2 7]  For  all  n > 4,  find  a Boolean  function  6n(x  1, . . . , xn)  that  is  uniquely  thin, 
in  the  sense  that  B(O^)  = n + 2 for  exactly  one  permutation  7r.  (See  (93)  and  (102).) 
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► 136.  [M3 4]  What  is  the  master  profile  chart  of  the  median-of-medians  function 


((*11*12  ■ • • *ln)  (*21*22  • ■ • *2 n)  • . . (*ml*m2  ■ • • *mn)), 


when  m and  n are  odd  integers?  What  is  the  best  ordering?  (There  are  mn  variables.) 

137.  [M38]  Given  a graph,  the  optimum  linear  arrangement  problem  asks  for  a permu- 
tation t r of  the  vertices  that  minimizes  \utv-vit\.  Construct  a Boolean  function  / 

for  which  this  minimum  value  is  characterized  by  the  optimum  BDD  size  Bmin(f). 

► 138.  [M36]  The  purpose  of  this  exercise  is  to  develop  an  attractive  algorithm  that 
computes  the  master  profile  chart  for  a function  /,  given  f's  QDD  (not  its  BDD). 

a)  Explain  how  to  find  ("+1)  weights  of  the  master  profile  chart  from  a single  QDD. 

b)  Show  that  the  jump-up  operation  can  be  performed  easily  in  a QDD,  without 
garbage  collection  or  hashing.  Hint:  See  the  "bucket  sort”  in  Algorithm  R. 

c)  Consider  the  2n~1  orderings  of  variables  in  which  the  ( i + l)st  is  obtained  from 
the  ith  by  a jump- up  from  depth  pi  -\-  ui  to  depth  ui  — 1.  For  example,  we  get 

12345  21345  32145  31245  43125  41325  42135  42315  54231  52431  53241  53421  51342  51432  51243  51234 

when  n = 5.  Show  that  every  fc-element  subset  of  {1, . . . , n}  occurs  at  the  top  k 
levels  of  one  of  these  orderings. 

d)  Combine  these  ideas  to  design  the  desired  chart-construction  algorithm. 

e)  Analyze  the  space  and  time  requirements  of  your  algorithm. 

139.  [22]  Generalize  the  algorithm  of  exercise  138  so  that  (i)  it  computes  a common 
profile  chart  for  all  functions  of  a BDD  base,  instead  of  a single  function:  and  (ii)  it 
restricts  the  chart  to  variables  {*a,*a+i,  • -.,*&},  preserving  {jei,  . . . , *a_i}  at  the  top 
and  {*6+i at  the  bottom. 


140.  [27]  Explain  how  to  find  Bmin(f)  without  knowing  all  of  f’s  master  profile  chart. 

141.  [30]  True  or  false:  If  Xi,  X2,  . . . , Xm  are  disjoint  sets  of  variables,  then  an  opti- 
mum BDD  ordering  for  the  variables  of  g(hi(X1),  h2(X2),  ■ . . ,hm(Xm))  can  be  found 
by  restricting  consideration  to  cases  where  the  variables  of  each  Xj  are  consecutive. 

^ ^ 42.  [HM32]  The  representation  of  threshold  functions  by  BDDs  is  surprisingly  myste- 
rious. Consider  the  self-dual  function  f(x)  = (x™1  ...*“"),  where  each  Wj  is  a positive 

integer  and  wi-\ \-wn  is  odd.  We  observed  in  (28)  that  B(f)  = 0(wi~\ \-wn)2;  and 

B(f)  is  often  O(n)  even  when  the  weights  grow  exponentially,  as  in  (29)  or  exercise  41. 

a)  Prove  that  when  = 1,  wk  = 2fc~2  for  1 < k < m,  and  wk  = 2m  - 2n~k  for 
m < k < 2m  = n,  B(f)  grows  exponentially  as  n ->  00,  but  Bmin(f)  = 0(n2). 

b)  Find  weights  {wi,. . ,,wn}  for  which  Bmin(f)  = fI(2v/"/2). 


143.  [24]  Continuing  exercise  142(a),  find  an  optimum  ordering  of  variables  for  the 
function  (*i*2*i*l*l*|6*?2*i4*|28*?S6*n2*Ii8*?l6*?f*fI2*{g082:}?16a:^20a:11^22a;i°23). 


144.  [16]  What  is  the  quasi-profile  of  the  addition  functions  {fi,f2,  /3,  /4,  /5}  in  (36)? 

145.  [24]  Find  -Bmin(/i,  /2,  /3,  /4,  /s)  and  -Bmax(/i,  /2,  /3,  A,  /s)  of  those  functions. 

► 146.  [M22]  Let  (bo, . . . , bn)  and  ( go , • ■ - , gn)  be  a BDD  base  profile  and  quasi-profile. 

a)  Prove  that  60  < min(<jo,  (bi  + q2)(bi  + q2  — 1)),  bi  < min(b0  + <?o,  92 (<?2  — 1)),  and 
bo  + bi  > qo  — q2- 

b)  Conversely,  if  60,  &i,  q0,  and  q2  are  nonnegative  integers  that  satisfy  those  in- 
equalities, there  is  a BDD  base  with  such  a profile  and  quasi-profile. 


► 147.  [27]  Flesh  out  the  details  of  Rudell’s  swap-in-place  algorithm,  using  the  conven- 
tions of  Algorithm  U and  the  reference  counters  of  exercise  82. 
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148.  [M21]  True  or  false:  B(ff, ...,  /£)  < 2B(fi, ...,  fm),  after  swapping  (T)  @ . 

149.  [M20]  (Bollig,  Lobbing,  and  Wegener.)  Show  that,  in  addition  to  Theorem  J~, 
we  also  have  B(ff  , . . . , f £,)  < ( 2k  — 2)bo  + B(f\, . . . , /m)  after  a jump-down  operation 
of  k — 1 levels,  when  (bo, . . . , b„)  is  the  profile  of  {/i, . . . , fm}- 

150.  [ 30]  When  repeated  swaps  are  used  to  implement  jump-up  or  jump-down,  the 
intermediate  results  might  be  much  larger  than  the  initial  or  final  BDD.  Show  that 
variable  jumps  can  actually  be  done  more  directly,  with  a method  whose  worst-case 
running  time  is  0(B(fx,. . . , fm)  + B(ff , ...,  /£)). 

151.  [20]  Suggest  a way  to  invoke  Algorithm  J so  that  each  variable  is  sifted  just  once. 

152.  [25]  The  hidden  weighted  bit  function  h\o o has  more  than  17.5  trillion  nodes 
in  its  BDD.  By  how  much  does  sifting  reduce  this  number?  Hint:  Use  exercise  124, 
instead  of  actually  constructing  the  diagrams. 

153.  [30]  Put  the  tic-tac-toe  functions  {3/1,..., 3/9}  of  exercise  7.1.2-65  into  a BDD 
base.  How  many  nodes  are  present  when  variables  are  tested  in  the  order  x\,  X2,  . . . , xg, 
01,  o2,  . . . , o 9,  from  top  to  bottom?  What  is  Bmin(yi, ...  ,3/9)? 

154.  [20]  By  comparing  (104)  to  (106),  can  you  tell  how  far  each  state  was  moved 
when  it  was  sifted? 

► 155.  [25]  Let  /1  be  the  independent-set  function  (105)  of  the  contiguous  USA,  and 
let  be  the  corresponding  kernel  function  (see  (68)).  Find  orderings  7r  of  the  states 
so  that  (a)  B(/J)  and  (b)  B(ff , f%)  are  as  small  as  you  can  make  them.  (Note  that 
the  ordering  (110)  gives  B(ff)  = 339,  -B(/2  ) = 795,  and  B(ff  , /J")  = 1129.) 

156.  [30]  Theorems  J+  and  J“  suggest  that  we  could  save  reordering  time  by  only 
jumping  up  when  sifting,  not  bothering  to  jump  down.  Then  we  could  eliminate  steps 
J3,  J5,  J6,  and  J7  of  Algorithm  J.  Would  that  be  wise? 

157.  [M24]  Show  that  if  the  m + 2m  variables  of  the  2m-way  multiplexer  Mm  are  ar- 
ranged in  any  order  such  that  B(Mfn)  > 2m+1  + l,  then  sifting  will  reduce  the  BDD  size. 

158.  [M24]  When  a Boolean  function  f(x\, . . . , xn)  is  symmetrical  in  the  variables 
{xi, . . . ,xp},  it’s  natural  to  expect  that  those  variables  will  appear  consecutively  in  at 
least  one  of  the  reorderings  /"(x  1, . . . ,xn)  that  minimize  B(fn).  Show,  however,  that  if 

f(xi,  ...,Xn)  [Xi  -f-  * * * T Xp  — LP/3J]  + [xi  + • • • + Xp  — ( 2p/3]  ] g(Xp+ 1 , , Xp+rn] , 

where  p = n — m and  9(3/1, . . . ,3/m)  is  any  nonconstant  Boolean  function,  then  B(flr)  = 
\n2  +0(n)  as  n — > 00  when  (xi, . . . , xp}  are  consecutive  in  n,  but  B(fn)  = |n2  + 0(n) 
when  7r  places  about  half  of  those  variables  at  the  beginning  and  half  at  the  end. 

159.  [20]  John  Conway’s  basic  rule  for  Life,  exercise  7.1.3-167,  is  a Boolean  function 
L(x nw,xn,xne,Xw,x,xe,Xsw,Xs,Xse).  What  ordering  of  those  nine  variables  will 
make  the  BDD  as  small  as  possible? 

► 160.  [24]  ( Chess  Life.)  Consider  an  8 x 8 matrix  X = ( xtj ) of  Os  and  Is,  bordered  by 
infinitely  many  Os  on  all  sides.  Let  Lij(X)  = L(x(i_1)(J_1), . . . ,Xij, . . . , X(i+i)(j+i))  be 
Conway’s  basic  rule  at  position  (i,j).  Call  X “tame”  if  Lij(X)  = 0 whenever  i £ [1 . . 8] 
or  j ^ [1 . . 8];  otherwise  X is  “wild,”  because  it  activates  cells  outside  the  matrix. 

a)  How  many  tame  configurations  X vanish  in  one  Life  step,  making  all  Lij(X)  = 0? 

b)  What  is  the  maximum  weight  xSJ  among  all  such  solutions? 

c)  How  many  wild  configurations  vanish  within  the  matrix  after  one  Life  step? 

d)  What  are  the  minimum  and  maximum  weight,  among  all  such  solutions? 

e)  How  many  configurations  X make  Lij(X)  = 1 for  I < i.j  < 8? 
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f)  Investigate  the  tame  8x8  predecessors  of  the  following  patterns: 


(1) 


(Here,  as  in  Section  7.1.3,  black  cells  denote  Is  in  the  matrix.) 

161.  [28]  Continuing  exercise  160,  write  L(X)  = Y — (yt])  if  X is  a tame  matrix  such 
that  Lij(X)  = yij  for  1 < i,j  < 8. 

a)  How  many  X’s  satisfy  L(X)  = X (“still  Life”)? 

b)  Find  an  8 x 8 still  Life  with  weight  35. 

c)  A “flip-flop”  is  a pair  of  distinct  matrices  with  L(X)=Y,  L(Y)=X.  Count  them. 

d)  Find  a flip-flop  for  which  X and  Y both  have  weight  28. 

► 162.  [30]  ( Caged  Life.)  If  X and  L(X)  are  tame  but  L(L(X))  is  wild,  we  say  that  X 
escapes  its  cage  after  three  steps.  How  many  6x6  matrices  escape  their  6x6  cage 
after  exactly  k steps,  for  k = 1,  2,  . . . ? 


163.  [23]  Prove  formulas  (112)  and  (113)  for  the  BDD  sizes  of  read-once  functions. 

► 164.  [M27]  What  is  the  maximum  of  B(f),  over  all  read-once  functions  f(x  1 , . . . , xn)? 

165.  [ M21 ] Verify  the  Fibonacci-based  formulas  (115)  for  B(um)  and  B(vm). 

166.  [M29]  Complete  the  proof  of  Theorem  W. 

167 ‘w  [21]  Design  an  efficient  algorithm  that  computes  a permutation  n for  which  both 
B(fw)  and  B(f7' , fn)  are  minimized,  given  any  read-once  function  f(x  1, . . . , xn). 

► 168.  [HM^O]  Consider  the  following  binary  operations  on  ordered  pairs  z = [x . y ) : 


zo  z — (x,  y)  o (*',  y ) = (x  + x , min(x  + y , x + y)); 
z • z = (x,  y)  • (x  , y ) = (x  + x + min(y,  y'),  ma x(y,  y )). 


(These  operations  are  associative  and  commutative.)  Let  Si  = {(1,0)},  and 


► 


S„ 


\J{zoz' 


n—  1 

2 € Sk,  z £ Sn_fc}  U \^J  {z  • z1  \ z £ Sk,  z £ Sn-fc}  for  n > 1. 


fc=1  k=i 

Thus  S2  = {(2, 0),  (2, 1)};  S3  = {(3,  0),  (3, 1),  (3,  2)};  S4  = {(4, 0), . . . , (4, 3),  (5, 1)};  etc. 

a)  Prove  that  there  exists  a read-once  function  f(xu...,xn)  for  which  we  have 
min,r  B(fn)  = c and  min^  B(f7r , fn)  = c'  if  and  only  if  (Ic  — 1,  c—  |c'  — 1)  £ Sn. 

b)  True  or  false:  0 < y < x for  all  (2:,  y)  £ Sn. 

c)  If  zT  = (x  + y,x  - y)/V2,  show  that  zT  a z'T  = ( z»z')T  and  zT  • z'T  = (zoz')T. 

d)  Prove  that  x2  + y2  < n2p  for  all  (*,  y)  £ Sn,  if  /3  is  the  constant  in  (116).  Hints: 
Let  \z\2  = x2+y2-  it  suffices  to  prove  that  \z%z'[  < 2&  = s/2<f  whenever  0 < y < x, 
0<y'  < x',  \z\  =r  = (1— <5)^,  \z'\  = r’  = (1+J)*,  and  0 < 5 < 1.  If  also  y = y'\ 
z»z'  lies  inside  the  ellipse  (a  cos  9 + b sin  0,  b sin  0),  where  a = r + r'  and  b = Vrr7. 

169.  [M46]  lsmm7rB(f7r)<B(v2m+i)  for  every  read-once  function/of  22m+1  variables? 

170.  [M25]  Let’s  say  that  a Boolean  function  is  “skinny”  if  its  BDD  involves  all  the 
variables  in  the  simplest  possible  way:  A skinny  BDD  has  exactly  one  branch  node  (j) 
for  each  variable  Xj . and  either  LO  or  HI  is  a sink  node  at  every  branch. 

a)  How  many  Boolean  functions  f(xlt . . . ,xn)  are  skinny  in  this  sense? 

b)  How  many  of  them  are  monotone? 

c)  Show  that  ft(x  1, . . . , xn)  = [(xi  . . . xn)2  > t]  is  skinny  when  0 < t < 2n  and  t is  odd. 
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d)  What  is  the  dual  of  the  function  ft  in  part  (c)? 

e)  Explain  how  to  find  the  shortest  CNF  and  DNF  formulas  for  ft,  given  t. 

171.  [M26]  Continuing  exercise  170,  show  that  a function  is  read-once  and  regular  if 
and  only  if  it  is  skinny  and  monotone. 

172.  [ M27 ] How  many  skinny  functions  f(x i, . . . ,xn ) are  also  Horn  functions?  How 
many  of  them  have  the  property  that  / and  / both  satisfy  Horn’s  condition? 

► 173.  [ HM28 } Exactly  how  many  Boolean  functions  f(x\, . . . , xn)  are  skinny  after  some 
reordering  of  the  variables,  f(x\-K , . . . , xn7r)? 

► 174.  [M39]  Let  Sn  be  the  number  of  Boolean  functions  f{xi, . . . ,xn)  whose  BDD  is 
“thin”  in  the  sense  that  it  has  exactly  one  node  labeled  (J)  for  1 < j < n.  Show 
that  Sn  is  also  the  number  of  combinatorial  objects  of  the  following  types: 

a)  Dellac  permutations  of  order  2 n (namely,  permutations  pip2---P2n  such  that 
\k/2]  < pk  < n + \k/ 2]  for  1 < k < 2 n). 

b)  Genocchi  derangements  of  order  2n  + 2 (namely,  permutations  q\q2  ■ ■ ■ q2n+2  such 
that  qu  > k if  and  only  if  k is  odd,  for  1 < k < 2n  + 2;  also  qk  / k in  a 
derangement). 

c)  Irreducible  Dumont  pistols  of  order  2n  + 2 (namely,  sequences  rir2  • • • r2n+2  such 
that  k < rk  < 2n  + 2 for  1 < k < 2n+2  and  {ri,r2, . . . ,r2n+ 2}  = (2,4,6, ... , 
2n,2n  + 2},  with  the  special  property  that  2k  6 (ri, . . . ,r2fc-i)  for  1 < k < n). 

d)  Paths  from  (1,0)  to  (2 n + 2,0)  in  the  directed  graph 

(7,3)  * (8,3)  * 

t 4 

(5,2) — > (6,2) — > (7,2) — > (8,2) — > 

(4(4 

(3,1) — ► (4,1) — ► (5,1) — ► (6,1) — ► (7,1) — ► (8,1) — ► 

t 4 t 4 t 4 

(1,0)  ► (2,0) — *■  (3,0) — > (4,0) — * (5,0) — * (6,0) — * (7,0) — * (8,0) — *■ 

(Notice  that  objects  of  type  (d)  are  very  easy  to  count.) 

175.  [M30]  Continuing  exercise  174,  find  a way  to  enumerate  the  Boolean  functions 
whose  BDD  contains  exactly  bj-  1 nodes  labeled  (7),  given  a profile  (bo, . . . ,bn-i,bn). 

176.  [M35]  To  complete  the  proof  of  Theorem  X,  we  will  use  exercise  6.4-78,  which 
states  that  {ha,b  \ a £ A and  b £ B}  is  a universal  family  of  hash  functions  from  n bits 
to  l bits,  when  ha,b(x)  = ((ax  + b)  i£>  (n  — l ))  mod  2l , A = {a  | 0 < a < 2n,  a odd},  B = 
(6  | 0 < b < 2n~1},  and  0 < l < n.  Let  / = (ha,6(p)  | p € P}  and  J = {ha,b(q)  \ q £ Q}- 

a)  Show  that  if  2l  — 1 < 2t-1e/(l  — e),  there  are  constants  a € A and  b £ B for 
which  |/|  > (1  — e)2*  and  |J|  > (1  — e)2(. 

b)  Given  such  an  a,  let  J = {ji,  ■ ■ ■ , j\j\}  where  0 = j\  < ■ ■ ■ < jj/|,  and  choose 
Q'  = {qt, . . . , q|j| } C Q so  that  hUtb(qk ) = jk  for  1 < k < \ J\.  Let  g(q)  denote  the 
middle  l — 1 bits  of  aq,  namely  (aq^>(n  — l + l))  mod  2,_1.  Prove  that  g(q)  / g(q') 
whenever  q and  q are  distinct  elements  of  the  set  Q"  = {qi,qz,  ■ ■ ■ , n /2"|  — 1 } • 

c)  Prove  that  the  following  set  Q*  satisfies  condition  (120),  when  l > 3 and  y = a: 

Q * = {q  | q £ Q" , g(q ) is  even,  and  g(p ) + g(q ) = 2(_1  for  some  p £ P}. 

d)  Finally,  show  that  |Q*|  is  large  enough  to  prove  Theorem  X. 

177.  [ M22 ] Complete  the  proof  of  Theorem  A by  bounding  the  entire  quasi-profile. 

178.  [M24]  (Amano  and  Maruoka.)  Improve  the  constant  in  (121)  by  using  a better 
variable  ordering:  Zn(x2n-i,  aq,  x3, . . . , x2n-3;  x2n,  x2,  x4,  ■ ■ ■ ,x2n-2)- 


179.  [M4 7]  Does  the  middle  bit  of  multiplication  satisfy  Bmm(Zn)  = 0(26n,/5)? 

180.  [M2 7]  Prove  Theorem  Y,  using  the  hint  given  in  the  text. 

181.  [M21]  Let  Lm,n  be  the  leading  bit  function  Z^£n\xi, ...  ,*m;  r/i, . . ,,y„).  Prove 
that  Bmin (Bm,rP)  -■  0(2  n]  when  m ^ n. 

182.  [M38]  (I.  Wegener.)  Does  Bmin(L„tn)  grow  exponentially  as  n — > oo? 

► 183.  [M25]  Draw  the  first  few  levels  of  the  BDD  for  the  “limiting  leading  bit  function” 

[(.*1133:5  • • . )2  • (.*2*4*6  • • . )2  > 5], 

which  has  inflnitely  many  Boolean  variables.  How  many  nodes  bk  are  there  on  level  fc? 
(We  don’t  allow  (.*1*3*5  . . . )2  or  (.*2*4*6  . . . )2  to  end  with  infinitely  many  Is.) 

184.  [M23]  What  are  the  BDD  and  ZDD  profiles  of  the  permutation  function  Pm  ? 

185.  [M25]  How  large  can  Z(f)  be,  when  / is  a symmetric  Boolean  function  of 
n variables?  (See  exercise  44.) 

186.  [10]  What  Boolean  function  of  {*1, *2, *3, *4, *5, *e}  has  the  ZDD  ‘J”L’? 

► 187.  [20]  Draw  the  ZDDs  for  all  16  Boolean  functions  /(* i,*2)  of  two  variables. 

188.  [16]  Express  the  16  Boolean  functions  /(* i,*2)  as  families  of  subsets  of  {1,2}. 

189.  [18]  What  functions  /(*  1, . . . , *n)  have  a ZDD  equal  to  their  BDD? 

190.  [20]  Describe  all  functions  / for  which  (a)  Q(f)  = B(f);  (b)  Q(f ) = Z(f). 

► 191.  [HM25]  How  many  functions  /(*i, . . . , *„)  have  no  □ in  their  ZDD? 

192.  [M20]  Define  the  ^-transform  of  binary  strings  as  follows:  ez  = e,  0Z  = 0, 
lz  = 1,  and 

{azaz , if  |a|  = n and  (3  = 0n; 

az0n,  if  |q|  = n and  P — a; 

azpz,  if  jaj  = \p\  - 1,  or  if  |a|  = \p\  = n and  a / /3  / 0n. 

a)  What  is  110010010000111 1 lz? 

b)  True  or  false:  ( tz)z  = r for  all  binary  strings  t. 

c)  If  f(x  1, . . • , *n)  is  a Boolean  function  with  truth  table  r,  let  fz(x  1, . . . ,*„)  be 
the  Boolean  function  whose  truth  table  is  rz . Show  that  the  profile  of  / is  almost 

identical  to  the  z-profile  of  fz,  and  vice  versa.  (Therefore  Theorem  U holds  for 

ZDDs  as  well  as  for  BDDs,  and  statistics  such  as  (80)  are  valid  also  for  z-profiles.) 

193.  [M21]  Continuing  exercise  192,  what  is  Sz  (*1, . . . ,*n)  when  0 < k < n? 

194.  [M25]  How  many /(*!,...,*„)  have  the  z-profile  (1,. ..,  1)?  (See  exercise  174.) 

195.  [24]  Find  Z(A42),  ^min(A72),  and  ^max(iH2),  where  M2  is  the  4-way  multiplexer. 

196.  [M21]  Find  a function  /(*  1,. . . ,*„)  for  which  Z(f ) = O(n)  and  Z(f)  = 0(n2). 

197.  [25]  Modify  the  algorithm  of  exercise  138  so  that  it  computes  the  “master  z- 
profile  chart”  of  /.  (Then  Zmin(f)  and  ZmSiX(f)  can  be  found  as  in  exercise  133.) 

► 198.  [23]  Explain  how  to  compute  AND(/, g)  with  ZDDs  instead  of  BDDs  (see  (55)). 

199.  [21]  Similarly,  implement  (a)  OR(/,j),  (b)  XOR (f,g),  (c)  BUTNOT(/, g). 

200.  [21]  And  similarly,  implement  MUX(/,  g,  h)  for  ZDDs  (see  (62)). 

201.  [22]  The  projection  functions  x3  each  have  a simple  3-node  BDD,  but  their  ZDD 
representations  are  more  complicated.  What’s  a good  way  to  implement  these  functions 
in  a general-purpose  ZDD  toolkit? 
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202.  [24]  What  changes  are  needed  to  the  swap-in-place  algorithm  of  exercise  147, 
when  levels  (u)  -<->  (X)  are  being  interchanged  in  a ZDD  base  instead  of  a BDD  base? 

► 203.  [M24]  ( Family  algebra.)  The  following  algebraic  conventions  are  useful  for  deal- 
ing with  finite  families  of  finite  subsets  of  positive  integers,  and  with  their  representation 
as  ZDDs.  The  simplest  such  families  are  the  empty  family , denoted  by  0 and  represented 
by  [Xj;  the  unit  family  {0},  denoted  by  e and  represented  by  [T];  and  the  elementary 
families  {{/}}  for  j > 1,  denoted  by  ej  and  represented  by  a branch  node  (j)  with 
LO  = [X]  and  HI  = [T].  (Exercise  186  illustrates  the  ZDD  for  e3.) 

Two  families  / and  g can  be  combined  with  the  usual  set  operations: 

• The  union  f U g = {a  | a £ / or  a G g}  is  implemented  by  OR (/,  <7); 

• The  intersection  f fl  g = (a  j a £ / and  a € g}  is  implemented  by  AND  (/,  <7); 

• The  difference  f \ g = {a  |a€  / and  a ^ g}  is  implemented  by  BUTNOT(/,  <7); 

• The  symmetric  difference  f ® g = (/  \ g)  U (g  \ f)  is  implemented  by  XOR(/,  g). 
And  we  also  define  three  new  ways  to  construct  families  of  subsets: 

• The  join  f\Jg  = {a  Uf3\a€f  and  6 <?},  sometimes  written  just  fg\ 

• The  meet  fng  = {aC\^\aEf  and  /3  € g}; 

• The  delta  /EB<7  = {a©/3|aG/  and  /3  G g}. 

All  three  are  commutative  and  associative:  / U g = <7  U /,  f U (g  L)  h)  = (f  U g)  U h,  etc. 

a)  Suppose  / = {0,  {1, 2},  {1, 3}}  = e U (ei  U (e2  U e3))  and  g = {{1,  2},  {3}}  = 

(ei  U e2)  U e3.  What  are  / U g and  (/  n g)  \ (/  EE  ei)? 

b)  Any  family  / can  also  be  regarded  as  a Boolean  function  f(x  1,2:2,...),  where 
a € / <=>■  /([l  G a],  [2  G a], . . . ) = 1.  Describe  the  operations  U,  n,  and  EB  in 
terms  of  Boolean  logical  formulas. 

c)  Which  of  the  following  formulas  hold  for  all  families  /,  <7,  and  h ? (i)  fU(gUh)  = 
(fUg)U(fUh)-,  (ii)  fn(gUh)  = (fng)D(fnh)-,  (iii)  fU(gnh)  = ( fUg)H(fUh ); 

(iv)  / U (g  U h)  = (/  U g)  U (/  U h);  (v)  / ffl  0 = 0 n g - h U 0;  (vi)  / n e = e. 

d)  We  say  that  / and  g are  orthogonal,  written  / i j,  ifafl/3  = 0 for  all  a G / 

and  all  /3  G g.  Which  of  the  following  statements  is  true  for  all  families  / and  <7? 
(i)  / -L  9 <==>  / n g = e;  (ii)  f ±g  =>  \f  U g\  = |/|  |ff|;  (iii)  \f  U g\  = \ f\  \g\  ==> 
f -Lg;  (iv)  fig  fUg  = fmg. 

e)  Describe  all  families  / for  which  the  following  statements  hold:  (i)  / U g = g for 
all  g-,  (ii)  / U g - g for  all  g-,  (iii)  / n g = g for  all  g\  (iv)  / U (ei  U e2)  = /; 

(v)  /U(eiUe2)  = /;  (vi)  / ffl  ((ei  U e2)  U e3)  = /;  (vii)  / BB  / = e;  (viii)  fnf  = f. 

► 204.  [M25]  Continuing  exercise  203,  two  further  operations  are  also  important: 

• the  quotient  f / g = {a  \ a U ft  G / and  a fl  /3  = 0,  for  all  /3  G g}. 

• the  remainder  f mod  g = f \ (g  U (/ /<?))• 

The  quotient  is  sometimes  also  called  the  “cofactor”  of  / with  respect  to  g. 

a)  Prove  that  // [g  U h)  = (/ / g)  C (f/h). 

b)  Suppose  / = {{1,2},  {1,3},  {2},  {3},  {4}}.  What  are  f/e2  and  //(//e2)? 

c)  Simplify  the  expressions  //0,  f/e,  f/f,  and  (/  mod  g)/g,  for  arbitrary  / and  g. 

d)  Show  that  f/g  = f / (/ / (/ / g)).  Hint:  Start  with  the  relation  g C f/(f/g). 

e)  Prove  that  f/g  can  also  be  defined  as  |J  {h  \ g U h C / and  g 1 h}. 

f)  Given  / and  j,  show  that  / has  a unique  representation  (ejUg)Uh  with  ej l(gLSh). 

g)  True  or  false:  (fUg)  mod  ej  = (/  mod  ej)  U (g  mod  ej);  (fng)/ej  = (f/ej)n(g/ej). 

205.  [M25]  Implement  the  five  basic  operations  of  family  algebra,  namely  (a)  / U g, 
(b)  /llg,  (c)  / EE  g,  (d)  f/g,  and  (e)  / mod  g,  using  the  conventions  of  exercise  198. 
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206.  [M46]  What  are  the  worst-case  running  times  of  the  algorithms  in  exercise  205? 

► 207.  [M25]  When  one  or  more  projection  functions  Xj  are  needed  in  applications,  as 
m exercise  201,  the  following  “symmetrizing”  operation  turns  out  to  be  very  handy: 

(e,j  U a2  U ■ ■ • U e,( ) § k = S/c(xi1,Xi2, . . . ,Xit),  integer  k > 0. 

For  example,  ej  § 1 =xf,  ej  § 0 = Xj;  (e,  U ej)  § 1 = x*  © Xji  (e2  U e3  U e5)  § 2 = 
(x2  A x3  A x5)  V (x2  A x3  A x5)  V (x2  A x3  A x5).  Show  that  it’s  easy  to  implement  this 
operation.  (Notice  that  etl  U • • • U e,,  has  a very  simple  ZDD  of  size  l + 2,  when  l > 0.) 

► 208.  [16]  By  modifying  Algorithm  C,  show  that  all  solutions  of  a Boolean  function 
can  readily  be  counted  when  its  ZDD  is  given  instead  of  its  BDD. 

209.  [M21]  Explain  how  to  compute  the  fully  elaborated  truth  table  of  a Boolean 
function  from  its  ZDD  representation.  (See  exercise  31.) 

► 210.  [23]  Given  the  ZDD  for  /,  show  how  to  construct  the  ZDD  for  the  function 

dix)  = [f{x)  = 1 and  ux  = ma x{vy  \ f(y)  = 1}]. 

211.  [M20]  When  / describes  the  solutions  to  an  exact  cover  problem,  is  Z(f)  < B(f)7 

► 212.  [25]  What’s  a good  way  to  compute  the  ZDD  for  an  exact  cover  problem? 

213.  [16]  Why  can’t  the  mutilated  chessboard  be  perfectly  covered  with  dominoes? 

► 214.  [21]  When  some  shape  is  covered  by  dominoes,  we  say  that  the  covering  is 
faultfree  if  every  straight  line  that  passes  through  the  interior  of  the  shape  also  passes 
through  the  interior  of  some  domino.  For  example,  the  right-hand  covering  in  (127) 
is  faultfree,  but  the  middle  one  isn’t;  and  the  left-hand  one  has  faults  galore. 

How  many  domino  coverings  of  a chessboard  are  faultfree? 

215.  [21  ] Japanese  tatami  mats  are  1 x 2 rectangles  that  are  traditionally  used  to  cover 
rectangular  floors  in  such  a way  that  no  four  mats  meet  at  any  corner.  For  example, 
Fig.  29(a)  shows  a 6 x 5 pattern  from  the  1641  edition  of  Mitsuyoshi  Yoshida’s  Jinkoki. 
a book  first  published  in  1627. 

Find  all  domino  coverings  of  a chessboard  that  are  also  tatami  tilings. 


Fig.  29.  Two  nice  examples: 

(a)  A 17th-century  tatami  tiling;  (a) 

(b)  a tricolored  domino  covering. 


► 216.  [30]  Figure  29(b)  shows  a chessboard  covered  with  red,  white,  and  blue  domi- 
noes, in  such  a way  that  no  two  dominoes  of  the  same  color  are  next  to  each  other. 

a)  In  how  many  ways  can  this  be  done? 

b)  How  many  of  the  12,988,816  domino  coverings  are  3-colorable? 

217.  [29]  The  monomino/domino/tromino  covering  illustrated  in  (130)  happens  to 
satisfy  an  additional  constraint:  No  two  congruent  pieces  are  adjacent.  How  many  of 
the  92  sextillion  coverings  mentioned  in  the  text  are  “separated,”  in  this  sense? 

► 218.  [24]  Apply  BDD  and  ZDD  techniques  to  the  problem  of  Langford  pairs,  discussed 
at  the  beginning  of  this  chapter. 
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219.  [20]  What  is  Z(F)  when  F is  the  family  (a)  WORDS(IOOO);  . . . ; (e)  WORDS(5000)? 

► 220.  [21  ] The  2-profile  of  the  5757  SGB  words,  represented  with  130  variables  a i . . 25 
as  discussed  in  (131),  is  (1,  1,  1,  . . . , 1,  1,  1,  23,  3,  . . . , 6,  2,  0,  3,  2,  1,  1,  2). 

a)  Explain  the  entries  23  and  3,  which  correspond  to  the  variables  a 2 and  62 . 

b)  Explain  the  final  entries  0,  3,  2,  1,  1,  2,  which  correspond  to  v$,  w&,  £5,  etc. 

► 221.  [MS 7]  Only  5020  nodes  are  needed  to  represent  the  5757  most  common  five-letter 
words  of  English,  using  the  130-variable  representation,  because  of  special  linguistic 
properties.  But  there  are  265  = 11,881,376  possible  five-letter  words.  Suppose  we 
choose  5757  of  them  at  random;  how  big  will  the  ZDD  be  then,  on  average? 

► 222.  [27]  When  family  algebra  is  applied  to  five-letter  words  as  in  (131),  the  130 
variables  are  called  01,  61,  . . . , 25  instead  of  xi,  £2,  • • • , £130 ; and  the  corresponding 
elementary  families  are  denoted  by  the  symbols  a1;  bi,  . . . , Z5  instead  of  e\,  e2,  . . . , ei3o- 
Thus  the  family  F = WORDS  (5757)  can  be  constructed  by  synthesizing  the  formula 

F = (wi  U h2  LI  i3  U C4  U h5)  U • • • U (f  i U u2  Un3  U n4  U y5)  U • • • U (pj  U U2  U p3  U a4  U 15). 

a)  Let  p denote  the  universal  family  of  all  subsets  of  {ai, . . . ,25},  also  called  the 
“power  set.”  What  does  the  formula  Fnp  signify? 

b)  Let  X = Xi  U U Xs,  where  Xj  = {aj,bj, . . . ,Zj}.  Interpret  the  formula  FnX. 

c)  Find  a simple  formula  for  all  words  of  F that  match  the  pattern  t*u*h. 

d)  Find  a formula  for  all  SGB  words  that  contain  exactly  k vowels,  for  0 < k < 5 
(considering  only  a,  e,  i,  o,  and  u to  be  vowels).  Let  V}  = aj  U e3  U ij  U o j Uuj. 

e)  How  many  patterns  in  which  exactly  three  letters  are  specified  are  matched  by 
at  least  one  SGB  word?  (For  example,  m*tc*  is  such  a pattern.)  Give  a formula. 

f)  How  many  of  those  patterns  are  matched  at  least  twice  (e.g.,  *atc*)? 

g)  Express  all  words  that  remain  words  when  a ‘b’  is  changed  to  ‘o’. 

h)  What’s  the  significance  of  the  formula  F/V2? 

i)  Contrast  (Xi  U V2  LI  V3  U V4  U X5)  n F with  (Xi  U X5)  \ ((p\F)/(V2  U V3  LI  14)). 

223.  [28 j A “median  word”  is  a five-letter  word  p = p 4 . . ,p5  that  can  be  obtained 
from  three  words  a = a\  ...  as,  /3  = /34  . . . /35,  7 = 71  ...  75  by  the  rule  [a*  = p, ] + 
[j8i  = Pi]  + [7;  = pi]  = 2 for  1 < i < 5.  For  example,  mixed  is  a median  of  the  words 
{fixed, mixer, mound},  and  also  of  {mated, mixup, nixed}.  But  noted  is  not  a median 
of  {notes,  voted, naked},  because  each  of  those  words  has  e in  position  4. 

a)  Show  that  {d(a,  p),  d(f3,  p),  d(7,  p)}  is  either  {1,1,3}  or  {1,2,2}  whenever  p is  a 
median  of  {a,  /3, 7 } . (Here  d denotes  Hamming  distance.) 

b)  How  many  medians  can  be  obtained  from  WORDS  (n) , when  n = 100?  1000?  5757? 

c)  How  many  of  those  medians  belong  to  WORDS  (m),  when  m = 100?  1000?  5757? 

► 224.  [20]  Suppose  we  form  the  ZDD  for  all  source-to-sink  paths  in  a dag,  as  in  Fig.  28, 
when  the  dag  happens  to  be  a forest;  that  is,  assume  that  every  non-source  vertex  of 
the  dag  has  in-degree  I . Show  that  the  corresponding  ZDD  is  essentially  the  same  as 
the  binary  tree  that  represents  the  forest  under  the  “natural  correspondence  between 
forests  and  binary  trees,”  Eqs.  2.3.2-(i)  through  2.3.2-(3). 

► 225.  [30]  Design  an  algorithm  that  will  produce  a ZDD  for  all  sets  of  edges  that  form 
a simple  path  from  s to  t,  given  a graph  and  two  distinct  vertices  {s,£}  of  the  graph. 

► 226.  [20]  Modify  the  algorithm  of  exercise  225  so  that  it  yields  a ZDD  for  all  of  the 
simple  cycles  in  a given  graph. 

227.  [20]  Similarly,  modify  it  so  that  it  considers  only  Hamiltonian  paths  from  s to  t. 
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228.  [21]  And  mutate  it  once  more,  for  Hamiltonian  paths  from  s to  any  other  vertex. 

229.  [15]  There  are  587,218,421,488  paths  from  CA  to  ME  in  the  graphs  (18),  but  only 
437,525,772,584  such  paths  in  (133)-  Explain  the  discrepancy. 


230.  [25]  Find  the  Hamiltonian  paths  of  (133)  that  have  minimum  and  maximum 
total  length.  What  is  the  average  length,  if  all  Hamiltonian  paths  are  equally  likely? 

231.  [23]  In  how  many  ways  can  a king  travel  from  one  corner  of  a chessboard  to 
the  opposite  corner,  never  occupying  the  same  cell  twice?  (These  are  the  simple  paths 
from  corner  to  corner  of  the  graph  Ps  Kl  Ps.) 


► 232.  [23]  Continuing  exercise  231,  a king’s  tour  of  the  chessboard  is  an  oriented 
Hamiltonian  cycle  of  PgKlPg.  Determine  the  exact  number  of  king’s  tours.  What  is  the 
longest  possible  king’s  tour,  in  terms  of  Euclidean  distance  traveled? 


► 233.  [25]  Design  an  algorithm  that  builds  a ZDD  for  the  family  of  all  oriented  cycles 
of  a given  digraph.  (See  exercise  226.) 


234.  [22]  Apply  the  algorithm  of  exercise  233  to  the  directed  graph  on  the  49  postal 
codes  AL,  AR,  . . . , WY  of  (18),  with  XY  — ► YZ  as  in  exercise  7 54(b).  For  example,  one 
such  oriented  cycle  is  NC  — ► CT  — ► TN  — > NC.  How  many  oriented  cycles  are  possible? 
What  are  the  minimum  and  maximum  cycle  lengths? 

235.  [22]  Form  a digraph  on  the  five-letter  words  of  English  by  saying  that  x — > y 
when  the  last  three  letters  of  x match  the  first  three  letters  of  y (e.g.,  crown— 4 owner). 
How  many  oriented  cycles  does  this  digraph  have?  What  are  the  longest  and  shortest? 

► 236.  [M25]  Many  extensions  to  the  family  algebra  of  exercise  203  suggest  themselves 
when  ZDDs  are  applied  to  combinatorial  problems,  including  the  following  five  opera- 
tions on  families  of  sets: 


• The  maximal  elements  f = {a  <=  / | /3  € / and  a C /3  implies  a = f3}; 

• The  minimal  elements  /J'  = {a€/|/36/andaD/3  implies  a = /3}; 

• The  nonsubsets  fSg  = {a  <E  f \ /3  E g implies  a g /?}; 

• The  nonsupersets  f'\g  = {aef\/3eg  implies  a g /3); 

• The  minimal  hitting  sets  f>  = {a\/3ef  implies  aO^  0}1. 

For  example,  when  / and  g are  the  families  of  exercise  203(a)  we  have  fr  = ei  U(e2Ue3), 
f = G /#  = 0,  = g,  g‘  = (eiUe2)Ue3,  fPg  = eiUe3,  f\g  = e,  gsf  = g\f  = 0. 

a)  Prove  that  / T g = f \ (/  n g),  and  give  a similar  formula  for  f \ g. 

b)  Let  fc  = (q  I a € /}  = / ffl  U,  where  U = e 1 U e2  U ■ ■ • is  the  “universal  set.” 
Clearly  fcc  = /,  ( fUgf  = fcUgc , ( fHgf  = fcngc,  (. f\gf  = fC\gc.  Show 
that  we  also  have  the  duality  laws  ftc  = fci  f^c  — fc t.  (f  M „\C  _ fc  n c 

(fngf=fcugc;  {f,j = A i,  A = 7=  <; ' ,4. ' 

c)  True  or  false?  (i)  xf  = ei;  (ii)  x\  = eU  (hi)  x»  = Cl;  (iv)  (i.Vx^  = e,Ue2; 
(v)  (xi  A x2)^  = ei  Ue2. 

d)  Which  of  the  following  formulas  hold  for  all  families  /,  g,  and  h?  (i)  fn  = /t- 
(ii)  fn  = fl;  (hi)  fn  = /T;  (iv)  = A (v)  fii  = ft.  (vi)  fn  = ft 
(Vh)  fU  = /#;  (vih)  /t«  = /«;  (ix)  /«»  =/»;(x)//(5Ul.)  = (//j)n(//ft); 
(xi)  f\(g\Jh)  = (f\g)n(f \h);  (xii)  f\(g(Jh)  = (/ \g)\h;  (xiii)  f Pg^  = f Sg\ 
(xiv)  f\gt  = f\g]  (xv)  (/  U p)#  = (/«  U <?»)L  (xvi)  (/  U g )«  = (/«  U . 

e)  Suppose  g = \Ju_v(eu  U ev)  is  the  family  of  all  edges  in  a graph,  and  let  / be 
the  family  of  all  the  independent  sets.  Using  the  operations  of  extended  family 
algebra,  find  simple  formulas  that  express  (i)  / in  terms  of  g\  (ii)  g in  terms  of  /. 
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237.  [25]  Implement  the  five  operations  of  exercise  236,  in  the  style  of  exercise  205. 

► 238.  [22]  Use  ZDDs  to  compute  the  maximal  induced  bipartite  subgraphs  of  the  con- 
tiguous-USA  graph  G in  (18),  namely  the  maximal  subsets  U such  that  G \ U has  no 
cycles  of  odd  length.  How  many  such  sets  U exist?  Give  examples  of  the  smallest  and 
largest.  Consider  also  the  maximal  induced  tripartite  (3-colorable)  subgraphs. 

► 239.  [21]  Explain  how  to  compute  the  maximal  cliques  of  a graph  G using  family 
algebra,  when  G is  specified  by  its  edges  g as  in  exercise  236(e).  Find  the  maximal  sets 
of  vertices  that  can  be  covered  by  k cliques,  for  k = 1,  2,  ... , when  G is  the  graph  (18). 


► 240.  [22]  A set  of  vertices  U is  called  a dominating  set  of  a graph  if  every  vertex  is 
at  most  one  step  away  from  U. 

a)  Prove  that  every  kernel  of  a graph  is  a minimal  dominating  set. 

b)  How  many  minimal  dominating  sets  does  the  USA  graph  (18)  have? 

c)  Find  seven  vertices  of  (18)  that  dominate  36  of  the  others. 

► 241.  [28]  The  queen  graph  Q g consists  of  the  64  squares  of  a chessboard,  with  u — v 
when  squares  u and  v lie  in  the  same  row,  column,  or  diagonal.  How  large  are  the  ZDDs 
for  its  (a)  kernels?  (b)  maximal  cliques?  (c)  minimal  dominating  sets?  (d)  minimal 
dominating  sets  that  are  also  cliques?  (e)  maximal  induced  bipartite  subgraphs? 

Illustrate  each  of  these  five  categories  by  exhibiting  smallest  and  largest  examples. 


242.  [24]  Find  all  of  the  maximal  ways  to  choose  points  on  an  8 x 8 grid  so  that  no 
three  points  lie  on  a straight  line  of  any  slope. 

243.  [M23]  The  closure  fr  of  a family  / of  sets  is  the  family  of  all  sets  that  can  be 
obtained  by  intersecting  one  or  more  members  of  /. 

a)  Prove  that  /n  = {a  | a=  H{/3  | P G / and  fl  D a}}. 

b)  What’s  a good  way  to  compute  the  ZDD  for  fn,  given  the  ZDD  for  /? 

c)  Find  the  generating  function  for  Fc  when  F = WORDS  (5757)  as  in  exercise  222. 


244.  [25]  What  is  the  ZDD  for  the  connectedness  function  of  P3C1P3  (Fig.  22)?  What 
is  the  BDD  for  the  spanning  tree  function  of  the  same  graph?  (See  Corollary  S.) 

► 245.  [M22]  Show  that  the  prime  clauses  of  a monotone  function  / are  PI(/)®. 

246.  [M21]  Prove  Theorem  S,  assuming  that  (137)  is  true. 

► 247.  [M27]  Determine  the  number  of  sweet  Boolean  functions  of  n variables  for  n < 7. 

248.  [M22]  True  or  false:  If  / and  g are  sweet,  so  is  f(x  1, . . . ,x„)  A g(x  1, . . . ,xn). 

249.  [HM31]  The  connectedness  function  of  a graph  is  “ultrasweet,”  in  the  sense  that 
it  is  sweet  under  all  permutations  of  its  variables.  Is  there  a nice  way  to  characterize 
ultrasweet  Boolean  functions? 


250.  [28]  There  are  7581  monotone  Boolean  functions  /(xi,  x2,  X3,  X4,  £5).  What  are 
the  average  values  of  B(f)  and  Z(Pl(f))  when  one  of  them  is  chosen  at  random?  What 
is  the  probability  that  Z(Pl(f))  > B(f)7  What  is  the  maximum  of  Z(PI(/))/jB(/)? 

251.  [M46]  Is  Z(Pl(f))  = 0(B(f))  for  all  monotone  Boolean  functions  /? 

252.  [M30]  When  a Boolean  function  isn’t  monotone,  its  prime  implicants  involve 
negative  literals;  for  example,  the  prime  implicants  of  (xi?  X2 : X3 ) are  xiAx2,  X1AX3, 
and  x2Ax3.  In  such  cases  we  can  conveniently  represent  them  with  ZDDs  if  we  consider 
them  to  be  words  in  the  2n-letter  alphabet  {elt  e[, . . . , en,  e'n}.  A “subcube”  such  as 
01*0*  is  then  e\  U e2  U e'4  in  family  algebra  (see  7.1.1  (29));  and  PI(xi?  x2:  X3)  = 
(ex  U e2)  U (ei  U e3)  U (e2  U e3). 
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Exercise  7.1.1-116  shows  that  symmetric  functions  of  n variables  might  have 
n(3n/n)  prime  implicants.  How  large  can  Z(PI(/))  be  when  / is  symmetric? 

► 253.  [ M26 ] Continuing  exercise  252,  prove  that  if  / = (xiA/0)  V (*iA/i)  we  have 
PI(/)  = A U (eiU B)  U (eiUC),  where  A = PI(/0  A/i),  B = PI(/0)  \ A,  and  C = 
PI(/2)  \ A.  (Equation  (137)  is  the  special  case  when  / is  monotone.) 

► 254.  [M23]  Let  the  functions  / and  g of  (52)  be  monotone,  with  / C g.  Prove  that 

P%)\PI(/)  = (PI(9,)\PI(/i))u(PI(flh)\PI(/hUjW)). 

► 255.  [25]  A multifamily  of  sets,  in  which  members  of  / are  allowed  to  occur  more 
than  once,  can  be  represented  as  a sequence  of  ZDDs  (/o,  /1,  /2, . . . ) in  which  fk  is  the 
family  of  sets  that  occur  ( . . . <i2d\  0:0 ) 2 times  in  f where  ak  — 1.  For  example,  if  ct 
appears  exactly  9 = (1001)2  times  in  the  multifamily,  a would  be  in  /3  and  /0. 

a)  Explain  how  to  insert  and  delete  items  from  this  representation  of  a multifamily. 

b)  Implement  the  multiset  union  h = f l±!  g for  multifamilies. 

256.  [M32]  Any  nonnegative  integer  x can  be  represented  as  a family  of  subsets  of 
the  binary  powers  U = {22k  | k > 0}  = {21, 22 , 24, 2s, . . . },  in  the  following  way:  If 
x = - 1 + ' ' ' + 2e‘,  where  ei  > • • • > et  > 0 and  t > 0,  the  corresponding  family  has 
t sets  Ej  C U,  where  2e*  = }“[{«  | u 6 Ej}.  Conversely,  every  finite  family  of  finite 
subsets  of  U corresponds  in  this  way  to  a nonnegative  integer  x.  For  example,  the 
number  41  = 25  + 23  + 1 corresponds  to  the  family  { (21 , 24},  {21, 22},  0}. 

a)  Find  a simple  connection  between  the  binary  representation  of  x and  the  truth 
table  of  the  Boolean  function  that  corresponds  to  the  family  for  x. 

b)  Let  Z(x)  be  the  size  of  the  ZDD  for  the  family  that  represents  x,  when  the  ele- 
ments of  U are  tested  in  reverse  order  . . . , 24,  22,  21  (with  highest  exponents  near- 
est to  the  root);  for  example,  Z(41)  = 5.  Show  that  Z(x)  = 0(logx/logloga:). 

c)  The  integer  x is  called  “sparse”  if  Z(x)  is  substantially  smaller  than  the  upper 
bound  in  (b).  Prove  that  the  sum  of  sparse  integers  is  sparse,  in  the  sense  that 
Z(x  + y)  = 0(Z(x)Z(y)). 

d)  Is  the  saturating  difference  of  sparse  integers,  x — y,  always  sparse? 

e)  Is  the  product  of  sparse  integers  always  sparse? 

257.  [40]  (S.  Minato.)  Explore  the  use  of  ZDDs  to  represent  polynomials  with  nonneg- 
ative integer  coefficients.  Hint:  Any  such  polynomial  in  x,  y,  and  2 can  be  regarded  as 
a family  of  subsets  of  {2,  22, 24, . . . , x,  x2 , x4, . . . , y,  y2,  y4 , . . . , 2,  22, 24, . . . for  exam- 
ple, x3  + 3xy  + 2z  corresponds  naturally  to  the  family  {{ x,x 2),  {x,y},  {2 ,x,y},  {2 ,2}}. 

► 258.  [25]  Given  a positive  integer  n,  what  is  the  minimum  size  of  a BDD  that  has 
exactly  n solutions?  Answer  this  question  also  for  a ZDD  of  minimum  size. 

► 259.  [25]  A sequence  of  parentheses  can  be  encoded  as  a binary  string  by  letting  0 
represent  T and  1 represent  *)’.  For  example,  ())(()  is  encoded  as  011001. 

Every  forest  of  n nodes  corresponds  to  a sequence  of  2 n parentheses  that  are 
properly  nested,  in  the  sense  that  left  and  right  parentheses  match  in  the  normal  way. 
(See,  for  example,  2.3.3-(i)  or  7.2.1.6-(i).)  Let 

Nn(xi, x2  n)  = [xi . . . X2n  represents  properly  nested  parentheses]. 

For  example,  N3{0, 1, 1,0,0, 1)  = 0 and  7V3(0,0, 1,0, 1, 1)  = 1;  in  general,  Nn  has  Cn  « 
47(\ Arn3/2)  solutions,  where  Cn  is  a Catalan  number.  What  are  B(N„)  and  Z(Nn)? 
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► 260.  [M27]  We  will  see  in  Section  7. 2. 1.5  that  every  partition  of  {1, . . . , n}  into  disjoint 
subsets  corresponds  to  a “restricted  growth  string”  a\...an,  which  is  a sequence  of 
nonnegative  integers  with 

ai  = 0 and  fflj+i  < 1 + max(ai, . . . ,aj)  for  1 < j < n. 

Elements  j and  k belong  to  the  same  subset  of  the  partition  if  and  only  if  aj  = a*, . 

a)  Let  Xjtk  = [dj  = k]  for  0 < k < j < n,  and  let  Rn  be  the  function  of  these  ("j1) 
variables  that  is  true  if  and  only  if  ai  . . . a„  is  a restricted  growth  string.  (By 
studying  this  Boolean  function  we  can  study  the  family  of  all  set  partitions,  and 
by  placing  further  restrictions  on  Rn  we  can  study  set  partitions  with  special 
properties.  There  are  ruioo  ~ 5 X 10115  set  partitions  when  n = 100.)  Calculate 
B(R\oo)  and  Z(Rioo).  Approximately  how  large  are  B(Rn)  and  Z(Rn)  as  n -) 
oo? 

b)  Show  that,  with  a proper  ordering  of  the  variables  Xj tk,  the  BDD  base  for 
{ Ri , . . . , Rn  } has  the  same  number  of  nodes  as  the  BDD  for  Rn  alone. 

c)  We  can  also  use  fewer  variables,  approximately  nlgn  instead  of  ("J1),  if  we 
represent  each  a*,  as  a binary  integer  with  [ Ig  k } bits.  How  large  are  the  BDD 
and  ZDD  bases  in  this  representation  of  set  partitions? 

261.  [ HM21 ] “The  deterministic  finite-state  automaton  with  fewest  states  that  ac- 
cepts any  given  regular  language  is  unique.”  What  is  the  connection  between  this 
famous  theorem  of  automata  theory  and  the  theory  of  binary  decision  diagrams? 

262.  [ M26 ] The  determination  of  optimum  Boolean  chains  in  Section  7.1.2  was  greatly 
accelerated  by  restricting  consideration  to  Boolean  functions  that  are  normal,  in  the 
sense  that  /( 0, . . . , 0)  = 0.  (See  Eq.  7.1.2-(io).)  Similarly,  we  could  restrict  BDDs  so 
that  each  of  their  nodes  denotes  a normal  function. 

a)  Explain  how  to  do  this  by  introducing  “complement  links,”  which  point  to  the 
complement  of  a subfunction  instead  of  to  the  subfunction  itself. 

b)  Show  that  every  Boolean  function  has  a unique  normalized  BDD. 

c)  Draw  the  normalized  BDDs  for  the  16  functions  in  exercise  1. 

d)  Let  B°(f)  be  the  size  of  the  normalized  BDD  for  /.  Find  the  average  and  worst 
case  of  B°(f),  and  compare  B°(f)  to  B(f).  (See  (8o)  and  Theorem  U.) 

e)  The  BDD  base  for  3x3  multiplication  in  (58)  has  B(F\, . . . , Fe)  = 52  nodes. 
What  is  B°(Fi, . . . , F6)? 

f)  How  do  (54)  and  (55)  change,  when  AND  is  implemented  with  complement  links? 

263.  [HM25]  A linear  block  code  is  the  set  of  binary  column  vectors  x = (aq, . . . ,xn)T 
such  that  Fix  = 0,  where  H is  a given  m x n “parity  check  matrix.” 

a)  The  linear  block  code  with  n = 2m  — 1,  whose  columns  are  the  nonzero  binary 
m-tuples  from  (0, . . . , 0, 1)T  to  (1, . . . , 1, 1)T,  is  called  the  Hamming  code.  Prove 
that  the  Hamming  code  is  1-error  correcting  in  the  sense  of  exercise  7-23. 

b)  Let  /( x)  = [Hx  = 0],  where  H is  an  m x n matrix  with  no  all-zero  columns.  Show 
that  the  BDD  profile  of  / has  a simple  relation  to  the  ranks  of  submatrices  of  H 
mod  2,  and  compute  B(f)  for  the  Hamming  code. 

c)  In  general  we  can  let  f(x)  = [x  is  a codeword]  define  any  block  code.  Suppose 
some  codeword  x = x\  . . ,xn  has  been  transmitted  through  a possibly  noisy 
channel,  and  that  we’ve  received  the  bits  y = yi  ■ ■ . yn,  where  the  channel  delivers 
yk  = Xk  with  probability  pk  for  each  k independently.  Explain  how  to  determine 
the  most  likely  codeword  x,  given  y,  pi,  . . . , pn,  and  the  BDD  for  /. 
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264.  [M46]  The  text’s  “sweeping  generalization”  of  Algorithms  B and  C,  based  on  (22) 
embraces  many  important  applications;  but  it  does  not  appear  to  include  quantities 
such  as 


max 

/(*)= 1 


n—  1 

Wkxk  + ^2  w'kXkXk+l 
k= 1 


) 


or 


max 
/(*)  = ! 


n — 1 n — j 

£ 

3= 0 k= 1 


which  also  can  be  computed  efficiently  from  the  BDD  or  ZDD  for  /. 

Develop  a generalization  that  is  even  more  sweeping. 

► 265.  [21]  Devise  an  algorithm  that  finds  the  mth  smallest  solution  to  f(x)  = 1 in 
lexicographic  order  of  Xl  . . .xn,  given  m and  the  BDD  for  a Boolean  function  f of  n 
variables.  Your  algorithm  should  take  0(nB(f)  + n 2)  steps. 

► 266.  [20]  Every  forest  F whose  nodes  are  numbered  {l,...,n}  in  preorder  defines 
two  families  of  sets 


o(F)  _ {anc(l),...,anc(n)}  and  d(F)  = {dec(l), . . . , dec(n)}, 

where  anc(fc)  and  dec(k)  are  the  inclusive  ancestors  and  descendants  of  node  k.  For 
example,  if  F is 


then  a(F)  = {{!},  {1,2},  {3},  {3,4},  {3,5}}  and  d(F)  = {{1,2},  {2},  {3,4,5},  {4},  {5}}. 
Conversely,  F can  be  reconstructed  from  either  a(F)  or  d(F). 

Prove  that  the  ZDD  for  the  family  a(F)  has  exactly  n + 2 nodes. 

267.  [HM32]  Continuing  exercise  266,  find  the  minimum,  maximum,  and  average  size 
of  the  ZDD  for  the  family  d(F),  as  F ranges  over  all  forests  on  n nodes. 


We  dare  not  lengthen  this  book  much  more, 
lest  it  be  out  of  due  proportion, 
and  repel  men  by  its  size. 
— AELFRIC,  Catholic  Homilies  II  (c.  1000) 

There  are  a thousand  hacking  at  the  branches  of  evil 
to  one  who  is  striking  at  the  root. 
HENRY  D.  THOREAU,  Walden;  or,  Life  in  the  Woods  (1854) 


7.2. 1.1 


GENERATING  ALL  n-TUPLES  281 


7.2.  GENERATING  ALL  POSSIBILITIES 

All  present  or  accounted  for,  sir. 

— Traditional  American  military  saying 

All  present  and  correct,  sir. 

— Traditional  British  military  saying 

7.2.1.  Generating  Basic  Combinatorial  Patterns 

Our  GOAL  in  this  section  is  to  study  methods  for  running  through  all  of  the 
possibilities  in  some  combinatorial  universe,  because  we  often  face  problems 
in  which  an  exhaustive  examination  of  all  cases  is  necessary  or  desirable.  For 
example,  we  might  want  to  look  at  all  permutations  of  a given  set. 

Some  authors  call  this  the  task  of  enumerating  all  of  the  possibilities;  but 
that’s  not  quite  the  right  word,  because  “enumeration”  most  often  means  that 
we  merely  want  to  count  the  total  number  of  cases,  not  that  we  actually  want 
to  look  at  them  all.  If  somebody  asks  you  to  enumerate  the  permutations  of 
{1,  2,  3},  you  are  quite  justified  in  replying  that  the  answer  is  3!  = 6;  you  needn’t 
give  the  more  complete  answer  {123,132,213,231,312,321}. 

Other  authors  speak  of  listing  all  the  possibilities;  but  that’s  not  such  a great 
word  either.  No  sensible  person  would  want  to  make  a list  of  the  10!  = 3,628,800 
permutations  of  {0, 1, 2,  3,  4,  5, 6,  7,  8, 9}  by  printing  them  out  on  thousands  of 
sheets  of  paper,  nor  even  by  writing  them  all  in  a computer  file.  All  we  really 
want  is  to  have  them  present  momentarily  in  some  data  structure,  so  that  a 
program  can  examine  each  permutation  one  at  a time. 

So  we  will  speak  of  generating  all  of  the  combinatorial  objects  that  we  need, 
and  visiting  each  object  in  turn.  Just  as  we  studied  algorithms  for  tree  traversal 
in  Section  2.3.1,  where  the  goal  was  to  visit  every  node  of  a tree,  we  turn  now 
to  algorithms  that  systematically  traverse  a combinatorial  space  of  possibilities. 


He's  got  'em  on  the  list  — 
he's  got  'em  on  the  list; 

And  they’ll  none  of  ’em  be  missed  — 
they’ll  none  of  'em  be  missed. 

— WILLIAM  S.  GILBERT,  The  Mikado  (1885) 

7.2. 1.1.  Generating  all  n-tuples.  Let’s  start  small,  by  considering  how  to 
run  through  all  2"  strings  that  consist  of  n binary  digits.  Equivalently,  we  want 
to  visit  all  n-tuples  (aj,...,an)  where  each  aj  is  either  0 or  1.  This  task  is 
also,  in  essence,  equivalent  to  examining  all  subsets  of  a given  set  {aq, . . . , xn}, 
because  we  can  say  that  Xj  is  in  the  subset  if  and  only  if  aj  = 1. 

Of  course  such  a problem  has  an  absurdly  simple  solution.  All  we  need  to 
do  is  start  with  the  binary  number  (0 . . . 00)2  = 0 and  repeatedly  add  1 until 
we  reach  (1 . . . 11)2  = 2n—  1.  We  will  see,  however,  that  even  this  utterly  trivial 
problem  has  astonishing  points  of  interest  when  we  look  into  it  more  deeply.  And 
our  study  of  n-tuples  will  pay  off  later  when  we  turn  to  the  generation  of  more 
difficult  kinds  of  patterns. 
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In  the  first  place,  we  can  see  that  the  binary-notation  trick  extends  to  other 
kinds  of  n-tuples.  If  we  want,  for  example,  to  generate  all  («!,...,  an)  in  which 
each  a,j  is  one  of  the  decimal  digits  {0, 1, 2, 3, 4,  5, 6,  7,  8,  9},  we  can  simply  count 
from  (0...00)io  = 0 to  (9...99)io  — 10"  — 1 in  the  decimal  number  system. 
And  if  we  want  more  generally  to  run  through  all  cases  in  which 

0 < cij  < rrij  for  1 < j < n,  M 

where  the  upper  limits  nij  might  be  different  in  different  components  of  the 
vector  (cq, . . . , an),  the  task  is  essentially  the  same  as  repeatedly  adding  unity 
to  the  number 

.mi,  m2,  . . . , mn\  ^ 

in  a mixed-radix  number  system;  see  Eq.  4.1 -(9)  and  exercise  4.3. 1-9. 

We  might  as  well  pause  to  describe  the  process  more  formally: 

Algorithm  M ( Mixed-radix  generation).  This  algorithm  visits  all  n-tuples 
that  satisfy  (1),  by  repeatedly  adding  1 to  the  mixed-radix  number  in  (2)  until 
overflow  occurs.  Auxiliary  variables  a0  and  m0  are  introduced  for  convenience. 

Ml.  [Initialize.]  Set  a3  <-  0 for  0 < j < n,  and  set  m0  2. 

M2.  [Visit.]  Visit  the  n-tuple  (ax, ... , a„).  (The  program  that  wants  to  examine 
all  n-tuples  now  does  its  thing.) 

M3.  [Prepare  to  add  one.]  Set  j <—  n. 

M4.  [Carry  if  necessary.]  If  aj  = mj  — 1,  set  aj  <—  0,  j y-  j — 1,  and  repeat  this 
step. 

M5.  [Increase,  unless  done.]  If  j = 0,  terminate  the  algorithm.  Otherwise  set 
aj  «-  aj  + 1 and  go  back  to  step  M2.  | 

Algorithm  M is  simple  and  straightforward,  but  we  shouldn’t  forget  that 
nested  loops  are  even  simpler,  when  n is  a fairly  small  constant.  When  n = 4, 
we  could  for  example  write  out  the  following  instructions: 

For  ax  = 0,  1,  . . . , mi  — 1 (in  this  order)  do  the  following: 

For  a2  — 0,  1,  . . . , m2  — 1 (in  this  order)  do  the  following: 

For  a3  — 0,  1,  . . . , m3  — 1 (in  this  order)  do  the  following:  (3) 

For  a4  = 0,  1,  . . . , m4  - 1 (in  this  order)  do  the  following: 

Visit  (ai,  a2, 03,  a4). 

These  instructions  are  equivalent  to  Algorithm  M,  and  they  are  easily  expressed 
in  any  programming  language. 

Gray  binary  code.  Algorithm  M runs  through  all  (a4, . . . , an)  in  lexicographic 
order,  as  in  a dictionary.  But  there  are  many  situations  in  which  we  prefer  to  visit 
those  n-tuples  in  some  other  order.  The  most  famous  alternative  arrangement  is 
the  so-called  Gray  binary  code,  which  lists  all  2"  strings  of  n bits  in  such  a way 
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that  only  one  bit  changes  each  time,  in  a simple  and  regular  way.  For  example, 
the  Gray  binary  code  for  n = 4 is 


0000, 0001, 0011, 0010, 0110,  011  1, 0101, 0100, 

1100, 1101, 1111, 1110, 1010, 1011, 1001, 1000.  (4) 

Such  codes  are  especially  important  in  applications  where  analog  information 
is  being  converted  to  digital  or  vice  versa.  For  example,  suppose  we  want  to 
identify  our  current  position  on  a rotating  disk  that  has  been  divided  into  16 
sectors,  using  four  sensors  that  each  distinguish  black  from  white.  If  we  use 
lexicographic  order  to  mark  the  tracks  from  0000  to  1111,  as  in  Fig.  30(a),  wildly 
inaccurate  measurements  can  occur  at  the  boundaries  between  sectors;  but  the 
code  in  Fig.  30(b)  never  gives  a bad  reading. 

Gray  binary  code  can  be  defined  in  many  equivalent  ways.  For  example, 
if  r„  stands  for  the  Gray  binary  sequence  of  n-bit  strings,  we  can  define  Tn 
recursively  by  the  two  rules 


To  — e; 

r„+i  = or„,  ir£. 


(5) 


Here  e denotes  the  empty  string,  0Fn  denotes  the  sequence  Tn  with  0 prefixed  to 
each  string,  and  1F((  denotes  the  sequence  Tn  in  reverse  order  with  1 prefixed 
to  each  string.  Since  the  last  string  of  Tn  equals  the  first  string  of  F^,  it  is  clear 
from  (5)  that  exactly  one  bit  changes  in  every  step  of  Fn+1  if  Fn  enjoys  the  same 
property. 

Another  way  to  define  the  sequence  Tn  = g(0),  g(  1),  . . . , g( 2n  — 1)  is  to  give 
an  explicit  formula  for  its  individual  elements  g{k).  Indeed,  since  Tn+i  begins 
with  0rn,  the  infinite  sequence 

Too  =0(0),  5(1),  s(2),  3(3),  s(4),... 

= (0)2,  (1)2,(11)2,(10)2,(110)2,... 

is  a permutation  of  all  the  nonnegative  integers,  if  we  regard  each  string  of  0s 
and  Is  as  a binary  integer  with  optional  leading  0s.  Then  Tn  consists  of  the  first 
2"  elements  of  (6),  converted  to  n-bit  strings  by  inserting  0s  at  the  left  if  needed. 
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When  k = 2n  + r,  where  0 < r < 2n,  relation  (5)  tells  us  that  g{k)  is  equal 
to  2”  + g(2n - 1 - r).  Therefore  we  can  prove  by  induction  on  n that  the  integer 
k whose  binary  representation  is  (. . . 626160)2  has  a Gray  binary  equivalent  g(k) 
with  the  representation  (. . . a2aia0)2,  where 

aj  = bj  © bj+ 1,  for  j > 0.  (7) 

(See  exercise  6.)  For  example,  3((111001000011)2)  = (100101100010)2.  Con- 
versely, if  g(k)  = (. . . a2aia0)2  is  given,  we  can  find  k = (. . . b2b1b0)2  by  inverting 
the  system  of  equations  (7),  obtaining 

bj  = aj  ® aj+ 1 ® aj+2  ® • ■ ■ , for  j > 0;  (8) 

this  infinite  sum  is  really  finite  because  aj+t  = 0 for  all  large  t. 

One  of  the  many  pleasant  consequences  of  Eq.  (7)  is  that  g(k)  can  be  com- 
puted very  easily  with  bitwise  arithmetic: 

g(k)  = k®[k/ 2J.  (9) 

Similarly,  the  inverse  function  in  (8)  satisfies 

g[~1]{l)  = l®\l/2\  ® L//4J  (10) 

this  function,  however,  requires  more  computation  (see  exercise  7.1.3-117).  We 
can  also  deduce  from  (7)  that,  if  k and  k'  are  any  nonnegative  integers, 

g(k®  k')  = g(k)eg(k').  (n) 

Yet  another  consequence  is  that  the  (n  + l)-bit  Gray  binary  code  can  be  written 

r„+i  = orn,  (orn)®iio...o; 

this  pattern  is  evident,  for  example,  in  (4).  Comparing  with  (5),  we  see  that 
reversing  the  order  of  Gray  binary  code  is  equivalent  to  complementing  the  first, 

bit:  n-i 

= r„®  10...0,  also  written  Tn  ® lO”-1.  (12) 

Tf16  exercises  below  show  that  the  function  g(k)  defined  in  (7),  and  its  inverse 
g defined  in  (8),  have  many  further  properties  and  applications  of  interest. 
Sometimes  we  think  of  these  as  functions  taking  binary  strings  to  binary  strings; 
at  other  times  we  regard  them  as  functions  from  integers  to  integers,  via  binary 
notation,  with  leading  zeros  irrelevant. 

Gray  binary  code  is  named  after  Frank  Gray,  a physicist  who  became  fa- 
mous for  helping  to  devise  the  method  long  used  for  compatible  color  television 
broadcasting  [Bell  System  Tech.  J.  13  (1934),  464-515],  He  invented  Tn  for 
applications  to  pulse  code  modulation,  a method  for  analog  transmission  of  dig- 
ital signals  [see  Bell  System  Tech.  J.  30  (1951),  38-40;  U.S.  Patent  2632058  (17 
March  1953);  W.  R.  Bennett,  Introduction  to  Signal  Transmission  (1971),  238- 
240],  But  the  idea  of  “Gray  binary  code”  was  known  long  before  he  worked  on  if 
for  example,  it  appeared  in  U.S.  Patent  2307868  by  George  Stibitz  (12  January 
1943).  More  significantly,  T5  was  used  in  a telegraph  machine  demonstrated 
m 1878  by  Emile  Baudot,  after  whom  the  term  “baud”  was  later  named.  At 
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about  the  same  time,  a similar  but  less  systematic  code  for  telegraphy  was 
independently  devised  by  Otto  Schaffler  [see  Journal  Telegraphique  4 (1878), 
252-253;  Annales  Telegraphiques  6 (1879),  361,  382-383].* 

In  fact,  Gray  binary  code  is  implicitly  present  in  a classic  toy  that  has 
fascinated  people  for  centuries,  now  generally  known  as  the  “Chinese  ring  puzzle” 
in  English,  although  Englishmen  used  to  call  it  the  “tiring  irons.”  Figure  31 
shows  a seven-ring  example.  The  challenge  is  to  remove  the  rings  from  the  bar, 
and  the  rings  are  interlocked  in  such  a way  that  only  two  basic  types  of  move  are 
possible  (although  this  may  not  be  immediately  apparent  from  the  illustration): 

a)  The  rightmost  ring  can  be  removed  or  replaced  at  any  time; 

b)  Any  other  ring  can  be  removed  or  replaced  if  and  only  if  the  ring  to  its  right 
is  on  the  bar  and  all  rings  to  the  right  of  that  one  are  off. 

We  can  represent  the  current  state  of  the  puzzle  in  binary  notation,  writing  1 
if  a ring  is  on  the  bar  and  0 if  it  is  off;  thus  Fig.  31  shows  the  rings  in  state 
1011000.  (The  second  ring  from  the  left  is  encoded  as  0,  because  it  lies  entirely 
above  the  bar.) 


Fig.  31. 

The  Chinese  ring  puzzle. 


A French  magistrate  named  Louis  Gros  demonstrated  an  explicit  connection 
between  Chinese  rings  and  binary  numbers,  in  a booklet  called  Theorie  du  Bague- 
nodier  [sic]  (Lyon:  Aime  Vingtrinier,  1872)  that  was  published  anonymously. 
If  the  rings  are  in  state  a„_i  . . .ao,  and  if  we  define  the  binary  number  k — 
(&„_ i . . . 60)2  by  Eq.  (8),  he  showed  that  exactly  k more  steps  are  necessary  and 
sufficient  to  solve  the  puzzle.  Thus  Gros  is  the  true  inventor  of  Gray  binary  code. 

Certainly  no  home  should  be  without 
this  fascinating,  historic,  and  instructive  puzzle. 

— HENRY  E.  DUDENEY  (1901) 

When  the  rings  are  in  any  state  other  than  00 ...  0 or  10 ...  0,  exactly  two 
moves  are  possible,  one  of  type  (a)  and  one  of  type  (b).  Only  one  of  these  moves 
advances  toward  the  desired  goal;  the  other  is  a step  backward  that  will  need  to 
be  undone.  A type  (a)  move  changes  k to  k 0 1;  thus  we  want  to  do  it  when  k 
is  odd,  since  this  will  decrease  k.  A type  (b)  move  from  a position  that  ends  in 
(10J-1)2  for  1 < j < n changes  k to  fc©(P+1)2  = fc©( 2J+1  — 1).  [In  this  formula 
‘P+1’  stands  for  j + 1 repetitions  of  T’,  but  "2-)+1  ’ denotes  a power  of  2.]  When 

* Some  authors  have  asserted  that  Gray  code  was  invented  by  Elisha  Gray,  who  developed  a 
printing  telegraph  machine  at  the  same  time  as  Baudot  and  Schaffler.  Such  claims  are  untrue, 
although  Elisha  did  get  a raw  deal  with  respect  to  priority  for  inventing  the  telephone  [see 
L.  W.  Taylor,  Amer.  Physics  Teacher  5 (1937),  243-251]. 
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k is  even,  we  want  k ® (2*+l  - 1)  to  equal  k - 1,  which  means  that  k must  be  a 
multiple  of  2'1  but  not  a multiple  of  2-?+1;  in  other  words, 

j P(^)»  (13) 

where  p is  the  “ruler  function”  of  Eq.  7.1.3-(44).  Therefore  the  rings  follow  a nice 
pattern  when  the  puzzle  is  solved  properly:  If  we  number  them  0,  1,  — 1 

(starting  at  the  free  end),  the  sequence  of  ring  moves  on  or  off  the  bar  is  the 
sequence  of  numbers  that  ends  with  . . . , p( 4),  p(3),  p(2),  p(i). 

Going  backwards,  successively  putting  rings  on  or  off  until  we  reach  the 
u timate  state  10  ...  0 (which,  as  John  Wallis  observed  in  1693,  is  more  difficult  to 
reach  than  the  supposedly  harder  state  11 ...  1),  yields  an  algorithm  for  counting 
m Gray  binary  code: 

Algorithm  G (Gray  binary  generation).  This  algorithm  visits  all  binary  n- 
tuples  («„_!,  • • • , ar,  u0)  by  starting  with  (0, . . . , 0, 0)  and  changing  only  one  bit 
at  a time,  also  maintaining  a parity  bit  such  that 

aoo  an  — l ® • * • ® CLi  0 (Iq.  (14) 

It  successively  complements  bits  p(  1),  p( 2),  p( 3),  . . . , p(2«-  1)  and  then  stops. 
Gl.  [Initialize.]  Set  aj  t—  0 for  0 < j < n;  also  set  a t—  0. 

G2.  [Visit.]  Visit  the  n-tuple  (an_j, . . . ,ai,a0). 

G3.  [Change  parity.]  Set  aoo  4-  1 - a<x. 

G4.  [Choose  j.]  If  = 1,  set  j f-  0.  Otherwise  let  j > 1 be  minimum  such 
that  dj_i  = 1.  (After  the  fcth  time  we  have  performed  this  step,  j = p(k).) 

G5.  [Complement  coordinate  j]  Terminate  if  j = n;  otherwise  set  a,  <-  1 - a ■ 
and  return  to  G2.  | j j 

The  parity  bit  comes  in  handy  if  we  are  computing  a sum  like 

Aooo  - A001  - Amo  + Xon  - A100  + X101  + Ai10  - Xin 
or 

Xf,  - xa  - Xb  + Xab  - Xc  + Xac  + Xbc  - Xabc , 

where  the  sign  depends  on  the  parity  of  a binary  string  or  the  number  of  elements 
in  a subset.  Such  sums  arise  frequently  in  “inclusion-exclusion”  formulas  such 
as  Eq.  1.3.3-(29).  The  parity  bit  is  also  necessary,  for  efficiency:  Without  it  we 
could  not  easily  choose  between  the  two  ways  of  determining  j , which  correspond 
to  performing  a type  (a)  or  type  (b)  move  in  the  Chinese  ring  puzzle.  But  the 
most  important  feature  of  Algorithm  G is  that  step  G5  makes  only  a single 
coordinate  change.  Therefore  only  a simple  change  is  usually  needed  to  the 
terms  X that  we  are  summing,  or  to  whatever  other  structures  we  are  concerned 
with  as  we  visit  each  n-tuple. 

It  is  impossible,  of  course,  to  remove  all  ambiguity  in  the  lowest-order  digit 
except  by  a scheme  like  one  the  Irish  railways  are  said  to  have  used 

of  removing  the  last  car  of  every  train 
because  it  is  too  susceptible  to  collision  damage. 

G.  R.  STIBITZ  and  J.  A.  LARRIVEE,  Mathematics  and  Computers  (1957) 
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Fig.  32.  Walsh  functions  wk(x)  for 
0 < k < 8,  with  the  analogous  trigo- 
nometric functions  y/2  cos  knx  shown 
in  gray  for  comparison. 
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Another  key  property  of  Gray  binary  code  was  discovered  by  J.  L.  Walsh 
in  connection  with  an  important  sequence  of  functions  now  known  as  Walsh 
functions  [see  Amer.  J.  Math.  45  (1923),  5-24].  Let  w0(x)  — 1 for  all  real 
numbers  x,  and 

wk(x)  = (- l)L2a:-lrfc/2’lu;Lfc/2j(2a:),  for  k > 0.  (15) 

For  example,  w\{x)  = ( — l)L2xJ  changes  sign  whenever  x is  an  integer  or  an 
integer  plus  It  follows  that  wk(x)  = wk(x  + 1)  for  all  k,  and  that  wk{x)  = ±1 
for  all  x.  More  significantly,  wk{ 0)  = 1 and  wk(x)  has  exactly  k sign  changes  in 
the  interval  (0 . . 1),  so  that  it  approaches  (— l)fc  as  x approaches  1 from  the  left. 
Therefore  wk(x)  behaves  rather  like  a trigonometric  function  cos  kxx  or  sin  knx, 
and  we  can  represent  other  functions  as  a linear  combination  of  Walsh  functions 
in  much  the  same  way  as  they  are  traditionally  represented  as  Fourier  series.  This 
fact,  together  with  the  simple  discrete  nature  of  wk(x),  makes  Walsh  functions 
extremely  useful  in  computer  calculations  related  to  information  transmission, 
image  processing,  and  many  other  applications. 

Figure  32  shows  the  first  eight  Walsh  functions  together  with  their  trigono- 
metric cousins.  Engineers  commonly  call  wk(x)  the  Walsh  function  of  sequency 
fc,  by  analogy  with  the  fact  that  cos  knx  and  sin/c7rx  have  frequency  k/2.  [See, 
for  example,  the  book  Sequency  Theory:  Foundations  and  Applications  (New 
York:  Academic  Press,  1977),  by  H.  F.  Harmuth.] 
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Although  Eq.  (15)  may  look  formidable  at  first  glance,  it  actually  provides  an 
easy  way  to  see  by  induction  why  wk(x)  has  exactly  k sign  changes  as  claimed.  If 
k is  even,  say  k = 21,  we  have  w2i(x)  = w{( 2x)  for  0 < x < §;  the  effect  is  simply 
to  compress  the  function  wi(x ) into  half  the  space,  so  w2 /(x)  has  accumulated 
l sign  changes  so  far.  Then  w2;(x)  = (-l)'w,(2x)  = {~l)lWl(2x  - 1)  in  the 
range  ^ ^ x < 1;  this  concatenates  another  copy  of  w;(2x),  flipping  the  sign  if 
necessary  to  avoid  a sign  change  at  x = §.  The  function  w2i+i(x)  is  similar,  but 
it  forces  a sign  change  when  x = I . 

What  does  this  have  to  do  with  Gray  binary  code?  Walsh  discovered  that 
his  functions  could  all  be  expressed  neatly  in  terms  of  simpler  functions  called 
Rademacher  functions  [Hans  Rademacher,  Math.  Annalen  87  (1922),  112-138], 


rk{x)  = (-1)^*1,  (l6) 

which  take  the  value  (— l)c~lc  when  (. . . c2C1C0.c_jc._2  . . . )2  is  the  binary  represen- 
tation of  x.  Indeed,  we  have  wk{x)  = rr(x),  w2(x)  = rr(x)r2(x),  w3(x)  = r2(x), 
and  in  general 

wk{x)  = J]  rj+i(x)bj®bj+1  when  k = (. . . 62Mo)2.  (17) 

J>0 

(See  exercise  33.)  Thus  the  exponent  of  rj+1(x)  in  wk(x)  is  the  jth  bit  of  the 
Gray  binary  number  g(k),  according  to  (7),  and  we  have 

wk(x)  - rp{k)+1( x)wfc_1(x),  for  k > 0.  (18) 

Equation  (17)  implies  the  handy  formula 

wk(x)wk,(x)  = wk(Bk,{x),  (i9) 


which  is  much  simpler  than  the  corresponding  product  formulas  for  sines  and 
cosines.  This  identity  follows  easily  because  r^x)2  = 1 for  all  j and  x,  hence 
rj(x)  ® = rj{x)a+b.  It  implies  in  particular  that  wk(x ) is  orthogonal  to  wk/(x) 
when  k T k . in  the  sense  that  the  average  value  of  wk  (x)  v)ki  (x)  is  zero.  We  also 
can  use  (17)  to  define  wk(x)  for  fractional  values  of  k like  1/2  or  13/8. 

The  Walsh  transform  of  2"  numbers  {X0, . . . , X2n_1)  is  the  vector  defined  by 
the  equation  (x<j, . . . , x2»_i)t  = Wn(X  0, , X2n_1)T , where  Wn  is  the  2n  x 2” 
matrix  having  Wj(k/ 2n)  in  row  j and  column  k,  for  0 < j,  k < 2n.  For  example, 
Fig.  32  tells  us  that  the  Walsh  transform  when  n — 3 is 
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(Here  1 stands  for  — 1,  and  the  subscripts  are  conveniently  regarded  as  binary 
strings  000-111  instead  of  as  the  integers  0-7.)  The  Hadamard  transform  is 
defined  similarly,  but  with  the  matrix  Hn  in  place  of  Wn,  where  Hn  has  (— l)J'fc 

in  row  j and  column  fc;  here  ‘j  ■ k'  denotes  the  dot  product  an_jfon_i  H \-a0bo 

of  the  binary  representations  j = (an_  i-..ao)2  and  k = (6n_i . . . 60)2-  For 
example,  the  Hadamard  transform  for  n = 3 is 

1 1 1 1 1 1 1 1\  /X0oo\ 

11111111  X001 

lllTllTT  X010 

1 1 1 1 1 1 1 1 Xou 

11111111  Xioo  ' (21> 

1 I 1 I 1 1 T 1 Xioi 

lllTTTll  X110 

1 1 1 1 I 1 1 1/  \Xm/ 

This  is  the  same  as  the  discrete  Fourier  transform  on  an  n-dimensional  cube, 
Eq.  4.6.4-(38),  and  we  can  evaluate  it  quickly  “in  place”  by  adapting  the  method 
of  Yates  discussed  in  Section  4.6.4: 

Given  First  step  Second  step  Third  step 

*000  *000+*001  *ooo+*ooi+*oio+*on  *ooo+*ooi4-*oio+*oii+*ioo+*ioi+*iio+*m 

*001  *000— *001  *000  — *001+*010— *011  *000  — *001+*010  — *011-F*100—  *10lT*110  — *111 
*010  *010+*011  *000+*001  — *010  — *011  *000  + *001—  *010—  *011 +*100+*101  — *110  — *111 
*011  *010  — *011  *000  — *001  — *010  + *011  *000  — *001  — *010+*011+*100  — *101  — *110+*111 
*100  *100+*101  *100+*101+*110+*111  *000+*001+*010  + *011  — *100— *101  — *110  — *111 
*101  *100  — *101  *100— *101+*110— *111  *000— *001-|-*010— *011  — *100+*101  — *110+*111 
*110  *110  + *111  *100  + *101  — *110  — *111  *000+*001—  *010  — *011—  *100  — *101+*110  + *111 
*111  *110  — *111  *100  — *101  — *110  + *111  *000  — *001  — *010  + *011  — *ioo4-*ioi+*no  — *111 

Notice  that  the  rows  of  H3  are  a permutation  of  the  rows  of  W3.  This  is  true  in 
general,  so  we  can  obtain  the  Walsh  transform  by  permuting  the  elements  of  the 
Hadamard  transform.  Exercise  36  discusses  the  details. 

Going  faster.  When  we’re  running  through  2"  possibilities,  we  usually  want 
to  reduce  the  computation  time  as  much  as  possible.  Algorithm  G needs  to 
complement  only  one  bit  aj  per  visit  to  (a„_  1, . . . ,a 0),  but  it  loops  in  step  G4 
while  choosing  an  appropriate  value  of  j.  Another  approach  has  been  suggested 
by  Gideon  Ehrlich  [JACM  20  (1973),  500-513],  who  introduced  the  notion  of 
loopless  combinatorial  generation:  With  a loopless  algorithm,  the  number  of 
operations  performed  between  successive  visits  is  required  to  be  bounded  in 
advance,  so  there  never  is  a long  wait  before  a new  pattern  has  been  generated. 

We  learned  some  tricks  in  Section  7.1.3  about  quick  ways  to  determine  the 
number  of  leading  or  trailing  Os  in  a binary  number.  Those  methods  could  be 
used  in  step  G4  to  make  Algorithm  G loopless,  assuming  that  n isn’t  unreason- 
ably large.  But  Ehrlich’s  method  is  quite  different,  and  much  more  versatile, 
so  it  provides  us  with  a new  weapon  in  our  arsenal  of  techniques  for  efficient 
computation.  Here  is  how  his  approach  can  be  used  to  generate  binary  n-tuples 
[see  Bitner,  Ehrlich,  and  Reingold,  CACM  19  (1976),  517-521]: 


Algorithm  L ( Loopless  Gray  binary  generation).  This  algorithm,  like  Algo- 
rithm G,  visits  all  binary  n-tuples  (on_lv  . . ,a0)  in  the  order  of  the  Gray  binary 
code.  But  instead  of  maintaining  a parity  bit,  it  uses  an  array  of  “focus  pointers” 
( fn , ■ ■ ■ , /o),  whose  significance  is  discussed  below. 

LI.  [Initialize.]  Set  aj  <—  0 and  fj  j for  0 < j < n;  also  set  /„  n.  (A 
loopless  algorithm  is  allowed  to  have  loops  in  its  initialization  step,  as  long 
as  the  initial  setup  is  reasonably  efficient;  after  all,  every  program  needs  to 
be  loaded  and  launched.) 

L2.  [Visit.]  Visit  the  n-tuple  (a„_i, . . . , ax,  a0). 

L3.  [Choose  j.\  Set  j <-  /0,  /0  •<—  0.  (If  this  is  the  ktb.  time  we  are  performing 
the  present  step,  j is  now  equal  to  p(k).)  Terminate  if  j = n;  otherwise  set 
fj  fj+i  and  fj+i  <—  j + I- 

L4.  [Complement  coordinate  j.]  Set  a,  <-  1 - a}  and  return  to  L2.  | 

For  example,  the  computation  proceeds  as  follows  when  n — 4.  Elements  aj  have 
been  underlined  in  this  table  if  the  corresponding  bit  bj  is  1 in  the  binary  string 
b3b2bib0  such  that  a3a2aia0  = g{b3b2bib0): 

a3  0000000011111111 
a2  000011111  1 1 10000 
°i  o 0 1 1 1 1000  0 111  100 

«o  011001100I100I10 
/a  3333333344443333 
h 2222332222224422 
A 1121113111211141 
/o  01020103010201  0 4 

Although  the  binary  number  k = (frn_i  . . . 60)2  never  appears  explicitly  in  Al- 
gorithm L,  the  focus  pointers  fj  represent  it  implicitly  in  a clever  way,  so  that 
we  can  repeatedly  form  g{k ) = (an_i . . . ao)2  by  complementing  bit  ap^  as  we 
should.  Let’s  say  that  aj  is  passive  when  it  is  underlined,  active  otherwise.  Then 
the  focus  pointers  satisfy  the  following  invariant  relations: 

1)  If  aj  is  passive  and  is  active,  then  fj  is  the  smallest  index  j'  > j such 
that  aj / is  active.  (Bits  an  and  a_j  are  considered  to  be  active  for  purposes 
of  this  rule,  although  they  aren’t  really  present  in  the  algorithm.) 

2)  Otherwise  fj  = j. 

Thus,  the  rightmost  element  aj  of  a block  of  passive  elements  Oj_!  . . . aJ+1a.) , 
with  decreasing  subscripts,  has  a focus  fj  that  points  to  the  element  a,  just  to 
the  left  of  that  block.  All  other  elements  aj  have  fj  pointing  to  themselves. 

In  these  terms,  the  first  two  operations  ‘j  /o,  fo  <-  0’  in  step  L3  are 
equivalent  to  saying,  “Set  j to  the  index  of  the  rightmost  active  element,  and 
activate  all  elements  to  the  right  of  aj”  Notice  that  if  f0  = 0,  the  operation 
/o  0 is  redundant;  but  it  doesn’t  do  any  harm.  The  other  two  operations  of  L3, 
‘/f  ^ fj+ 1>  fj+i  j + T,  are  equivalent  to  saying,  “Make  aj  passive,”  because 
we  know  that  aj  and  aj_x  are  both  active  at  this  point  in  the  computation. 
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(Again  the  operation  fJ+i  j + 1 might  be  harmlessly  redundant.)  The  net 
effect  of  activation  and  passivation  is  therefore  equivalent  to  counting  in  binary 
notation,  as  in  Algorithm  M,  with  1-bits  passive  and  0-bits  active. 

Algorithm  L is  almost  blindingly  fast,  because  it  does  only  five  assignment 
operations  and  one  test  for  termination  between  each  visit  to  a generated  n-tuple. 
But  we  can  do  even  better.  In  order  to  see  how,  let’s  consider  an  application 
to  recreational  linguistics:  Rudolph  Castown,  in  Word  Ways  1 (1968),  165- 
169,  noted  that  all  16  of  the  ways  to  intermix  the  letters  of  sins  with  the 
corresponding  letters  of  fate  produce  words  that  are  found  in  a sufficiently  large 
dictionary  of  English:  sine,  sits,  site,  etc.;  and  all  but  three  of  those  words 
(namely  fane,  f ite,  and  sats)  are  sufficiently  common  as  to  be  unquestionably 
part  of  standard  English.  Therefore  it  is  natural  to  ask  the  analogous  question 
for  five-letter  words:  What  two  strings  of  five  letters  will  produce  the  maximum 
number  of  words  in  the  Stanford  GraphBase,  when  letters  in  corresponding 
positions  are  swapped  in  all  32  possible  ways? 

To  answer  this  question,  we  need  not  examine  all  (2„6)5  = 3,625,908,203,125 
essentially  different  pairs  of  strings;  it  suffices  to  look  at  all  ( 2 ) — 16,568,646 
pairs  of  words  in  the  GraphBase,  provided  that  at  least  one  of  those  pairs 
produces  at  least  17  words,  because  every  set  of  17  or  more  five-letter  words 
obtainable  from  two  five-letter  strings  must  contain  two  that  are  “antipodal” 
(with  no  corresponding  letters  in  common).  For  every  antipodal  pair,  we  want 
to  determine  as  rapidly  as  possible  whether  the  32  possible  subset-swaps  produce 
a significant  number  of  English  words. 

Every  5-letter  word  can  be  represented  as  a 25-bit  number  using  5 bits  per 
letter,  from  "a"  = 00000  to  "z"  = 11001.  A table  of  225  bits  or  bytes  will  then 
determine  quickly  whether  a given  five-letter  string  is  a word.  So  the  problem 
is  reduced  to  generating  the  bit  patterns  of  the  32  potential  words  obtainable 
by  mixing  the  letters  of  two  given  words,  and  looking  those  patterns  up  in  the 
table.  We  can  proceed  as  follows,  for  each  pair  of  25-bit  words  w and  w': 

Wl.  [Check  the  difference.]  Set  z «—  w ® w' . Reject  the  word  pair  ( w,w ')  if 
m'  & (z  — m)  & fh  / 0,  where  m — 220  + 215  + 210  + 25  + 1 and  m'  = 2 5m; 
this  test  eliminates  cases  where  w and  w'  have  a common  letter  in  some 
position.  (See  7.1.3-(go).  It  turns  out  that  10,614,085  of  the  16,568,646 
word  pairs  have  no  such  common  letters.) 

W2.  [Form  individual  masks.]  Set  mo  2 & (25  — 1),  m\  4—  z & (210  — 2s), 
to2  4-  2 & (215  - 210),  m3^~  zk  (220  - 215),  and  m4  4-  z k (225  - 220),  in 
preparation  for  the  next  step. 

W3.  [Count  words.]  Set  l 4—  1 and  Aq  4—  w:  the  variable  l will  count  how  many 
words  starting  with  w we  have  found  so  far.  Then  perform  the  operations 
swap  (A)  defined  below. 

W4.  [Print  a record-setting  solution.]  If  l exceeds  or  equals  the  current  maxi- 
mum, print  Aj  for  0 <j<l-  I 

The  heart  of  this  high-speed  method  is  the  sequence  of  operations  swap( 4),  which 
should  be  expanded  inline  (for  example  with  a macro-processor)  to  eliminate  all 
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unnecessary  overhead.  It  is  defined  in  terms  of  the  basic  operation 

SWU ):  Set  w <—  w © rrij.  Then  if  w is  a word,  set  A;  4-  w and  l <—  l + 1. 
Given  sw(j),  which  flips  the  letters  in  position  j,  we  define 
swap{  0)  = sw(0); 

swap(l)  = swap(0),  sw(l),  swap{0)-, 

swap (2)  = swap(  1),  sw( 2),  swap(  1);  (22) 

swap{ 3)  = swap(2 ),  su>(3),  swap(2); 
swap (4)  = swap(3),  sw(4),  swap(3). 

Thus  swap(4)  expands  into  a sequence  of  31  steps  sw(0),  sw(l),  sw(0),  sw(2), 
. sw(0)  = sw(p(  1)),  sw(p(2)),  ...,  sw(p(31));  these  steps  will  be  used  10 
million  times.  We  clearly  gain  speed  by  embedding  the  ruler  function  values 
p(k)  directly  into  our  program,  instead  of  recomputing  them  repeatedly  for  each 
word  pair  via  Algorithm  M,  G,  or  L. 

The  winning  pair  of  words  generates  a set  of  21,  namely 

ducks,  ducky,  duces,  dunes,  dunks,  dinks,  dinky, 

dines,  dices,  dicey,  dicky,  dicks,  picks,  picky,  (23) 

pines,  piney , pinky,  pinks,  punks,  punky,  pucks. 

If,  for  example,  w = ducks  and  w'  = piney,  then  m0  = s 0 y,  so  the  first 
operation  sw(0)  changes  ducks  to  ducky,  which  is  seen  to  be  a word.  The  next 
operation  sw(l)  applies  m1;  which  is  k © e in  the  next-to-last  letter  position,  so 
it  produces  the  nonword  ducey.  Another  application  of  sw(0)  changes  ducey  to 
duces  (a  legal  term  generally  followed  by  the  word  tecum).  And  so  on.  All  word 
pairs  can  be  processed  by  this  method  in  at  most  a few  seconds. 

Further  streamlining  is  also  possible.  For  example,  once  we  have  found 
a pair  that  yields  k words,  we  can  reject  later  pairs  as  soon  as  they  generate 
33  - k nonwords.  But  the  method  we’ve  discussed  is  already  quite  fast,  and  it 
demonstrates  the  fact  that  even  the  loopless  Algorithm  L can  be  beaten. 

Fans  of  Algorithm  L may,  of  course,  complain  that  we  have  speeded  up 
the  process  only  in  the  small  special  case  n = 5,  while  Algorithm  L solves  the 
generation  problem  for  n in  general.  A similar  idea  does,  however,  work  also 
for  general  values  of  n > 5:  We  can  expand  out  a program  so  that  it  rapidly 
generates  all  32  settings  of  the  rightmost  bits  a4a3a2aia0,  as  above;  then  we  can 
apply  Algorithm  L after  every  32  steps,  using  it  to  generate  successive  changes 
to  the  other  bits  an_i  . . . a5.  This  approach  reduces  the  amount  of  unnecessary 
work  done  by  Algorithm  L by  nearly  a factor  of  32. 

Other  binary  Gray  codes.  The  Gray  binary  code  5(0),  g(  1),  . . . , g(2n—  1)  is 
only  one  of  many  ways  to  traverse  all  possible  n-bit  strings  while  changing  only 
a single  bit  at  each  step.  Let  us  say  that,  in  general,  a “Gray  cycle”  on  binary 
n-tuples  is  any  sequence  («0,  «lt . . . , v2n_1)  that  includes  every  n-tuple  and  has 
the  property  that  vk  differs  from  U(fc+1)mod2„  in  just  one  bit  position.  Thus, 
111  the  terminology  of  graph  theory,  a Gray  cycle  is  an  oriented  Hamiltonian 
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Fig.  33.  (a)  Complementary  Gray  code. 


cycle  on  the  n-cube.  We  can  assume  that  subscripts  have  been  chosen  so  that 
vo  = 0 ...  0. 

If  we  think  of  the  u’s  as  binary  numbers,  there  are  integers  So  . . . 82^-1  such 

that 

^(fc+i)  mod  2n  = ® 2 fc , for  0 < k < 2n;  (24) 

this  so-called  “delta  sequence”  is  another  way  to  describe  a Gray  cycle.  For 
example,  the  delta  sequence  for  standard  Gray  binary  when  n = 3 is  01020102; 
it  is  essentially  the  ruler  function  8k  = p(k  + 1)  of  (13),  but  the  final  value  <S2«-i 
is  n — 1 instead  of  n,  so  that  the  cycle  closes.  The  individual  elements  8k  always 
lie  in  the  range  0 < 8k  < n , and  they  are  called  “coordinates.” 

Let  d(n)  be  the  number  of  different  delta  sequences  that  define  an  n-bit 
Gray  cycle,  and  let  c(n)  be  the  number  of  “canonical”  delta  sequences  in  which 
each  coordinate  k appears  before  the  first  appearance  of  k + 1.  Then  d(n)  = 
n\  c(n),  because  every  permutation  of  the  coordinate  numbers  in  a delta  sequence 
obviously  produces  another  delta  sequence.  The  only  possible  canonical  delta 
sequences  for  n < 3 are  easily  seen  to  be 

00;  0101;  01020102  and  01210121.  (25) 

Therefore  c(l)  = c(2)  = 1,  c(3)  = 2;  d(l)  = 1,  d( 2)  = 2,  and  d( 3)  - 12.  A 
straightforward  computer  calculation,  using  techniques  for  the  enumeration  of 
Hamiltonian  cycles  that  we  will  study  later,  establishes  the  next  values, 

c(4)  = 112;  d( 4)  = 2688; 

c(5)  = 15,109,096;  d( 5)  = 1,813,091,520. 

No  simple  pattern  is  evident,  and  the  numbers  grow  quite  rapidly  (see  exer- 
cise 47);  therefore  it’s  a fairly  safe  bet  that  nobody  will  ever  know  the  exact 
values  of  c(8)  and  d( 8). 

Since  the  number  of  possibilities  is  so  huge,  people  have  been  encouraged 
to  look  for  Gray  cycles  that  have  additional  useful  properties.  For  example, 
Fig.  33(a)  shows  a 4-bit.  Gray  cycle  in  which  every  string  aza^a^ao  is  diametrically 
opposite  to  its  complement  dz^a \do.  Such  coding  schemes  are  possible  whenever 
the  number  of  bits  is  even  (see  exercise  49) . 
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An  even  more  interesting  Gray  cycle,  found  by  G.  C.  Tootill  [Proc.  IEE  103 
Part  B Supplement  (1956),  435],  is  shown  in  Fig.  33(b).  This  one  has  the  same 
number  of  changes  in  each  of  the  four  coordinate  tracks,  hence  all  coordinates 
share  equally  in  the  activities.  Gray  cycles  that  are  balanced  in  a similar  way  can 
in  fact  be  constructed  for  all  larger  values  of  n,  by  using  the  following  versatile 
method  to  extend  a cycle  from  n bits  to  n + 2 bits: 


Theorem  D.  Let  aij1a2j2  ■ ■ ■ oqji  be  a delta  sequence  for  an  n-bit  Gray  cycle , 
where  each  jk  is  a single  coordinate,  each  ak  is  a possibly  empty  sequence  of 
coordinates,  and  l is  odd.  Then 

aj(n+l)af  na1 

j1a2na2  (n+l)a2  j2a3{n+l)a3  na3  . . . ji-iat(n+l)a^ nat  (27) 

(n+l)afj, .jajLj . • -afijaf  n 
is  the  delta  sequence  of  an  ( n + 2)-bit  Gray  cycle. 

For  example,  if  we  start  with  the  sequence  0 1020 102  for  n = 3 and  let  the  three 
underlined  elements  be  ji,  j2,  j3,  the  new  sequence  (27)  for  a 5-bit  cycle  is 

01410301020131024201043401020103.  (28) 

Proof.  Let  ak  have  length  mk  and  let  vkt  be  the  vertex  reached  if  we  start  at 
0 ...  0 and  apply  the  coordinate  changes  ak  jx . . . ak^1jk_1  and  the  first  t of  ak. 
We  need  to  prove  that  all  vertices  00ufct,  Qlvkt,  I0vku  and  Uvkt  occur  when  (27) 
is  used,  for  1 < k < l and  0 < t < rnk . (The  leftmost  coordinate  is  n+1.) 
Starting  with  000  . . . 0 = 00ulo,  we  proceed  to  obtain  the  vertices 


00tq!,  . . . , 00vimi , 10uimi , . . . , lOiqo,  Huio,  . . . , llulmi ; 
then  ji  yields  llr>20,  which  is  followed  by 


11^21,  . . . , 1 lU2m2 , 10u2m2,  . . . , 10^20, 00^20,  . . . , 00u2m2 ; 

then  comes  00 u30,  etc.,  and  we  eventually  reach  llvlmi.  The  glorious  finale  then 
uses  the  third  line  of  (27)  to  generate  all  the  missing  vertices  01  virn  ....  01u10 
and  take  us  back  to  000  ...  0.  | 


The  transition  counts  (c0, . . . , cn_j)  of  a delta  sequence  are  defined  by  letting 
cj  be  the  number  of  times  Sk  = j.  For  example,  (28)  has  transition  counts 
(12, 8, 4, 4, 4),  and  it  arose  from  a sequence  with  transition  counts  (4,  2, 2).  If  we 
choose  the  original  delta  sequence  carefully  and  underline  appropriate  elements 
jk,  we  can  obtain  transition  counts  that  are  as  equal  as  possible: 

Corollary  B.  For  all  n > 1 , there  is  an  n-bit  Gray  cycle  with  transition  counts 
(c0,  ci, ... , Cji—i'j  that  satisfy  the  condition 


I cj  ~ ck  | < 2 for  0 < j < k < n.  (29) 

(This  is  the  best  possible  balance  condition,  because  each  Cj  must  be  an  even 
number,  and  we  must  have  c0  + Cl  + • • • + cn_x  = 2".  Indeed,  condition  (29) 
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holds  if  and  only  if  n — r of  the  counts  are  equal  to  2 q and  r are  equal  to  2 q -f  2, 
where  q = [2n_1/nJ  and  r = 2"-1  mod  n .) 

Proof.  Given  a delta  sequence  for  an  n-bit  Gray  cycle  with  transition  counts 
(co, . . . , the  counts  for  cycle  (27)  are  obtained  by  starting  with  the  values 

{c'o,  ■ ■ ■ ,c'n-\ic'mc'n+i)  = (4c0,  ...,4cn_1,/+l,/+l),  then  subtracting  2 from  c'jk 
for  1 < k < l and  subtracting  4 from  c'  ; . For  example,  when  n = 3 we  can  obtain 
a balanced  5-bit  Gray  cycle  having  transition  counts  (8  — 2, 16  — 10,8,6,6)  = 
(6, 6, 8,  6, 6)  if  we  apply  Theorem  D to  the  delta  sequence  01210121.  Exercise  51 
works  out  the  details  for  other  values  of  n.  | 

Another  important  class  of  n-bit  Gray  cycles  in  which  each  of  the  coordinate 
tracks  has  equal  responsibility  arises  when  we  consider  run  lengths,  namely  the 
distances  between  consecutive  appearances  of  the  same  5 value.  Standard  Gray 
binary  code  has  run  length  2 in  the  least  significant  position,  and  this  can  lead  to 
a loss  of  accuracy  when  precise  measurements  need  to  be  made  [see,  for  example, 
the  discussion  by  G.  M.  Lawrence  and  W.  E.  McClintock,  Proc.  SPIE  2831 
(1996),  104-111],  But  all  runs  have  length  4 or  more  in  the  remarkable  5-bit 
Gray  cycle  whose  delta  sequence  is 

(0 1230421 032 10423)2.  (30) 

Let  r(n)  be  the  maximum  value  r such  that  an  n-bit  Gray  cycle  can  be 
found  in  which  all  runs  have  length  > r.  Clearly  r(l)  = 1,  and  r(2)  = r(3)  = 
r(4)  = 2;  and  it  is  easy  to  see  that  r(n)  must  be  less  than  n when  n > 2,  hence 
(30)  proves  that  r(5)  = 4.  Exhaustive  computer  searches  establish  the  values 
r(6)  = 4 and  r(7)  = 5.  Indeed,  a fairly  straightforward  backtrack  calculation 
for  the  case  n — 7 needs  a tree  of  only  about  60  million  nodes  to  determine 
that  r( 7)  < 6,  and  exercise  61(a)  constructs  a 7-bit  cycle  with  no  run  shorter 
than  5.  The  exact  values  of  r(n)  are  unknown  for  n > 8;  but  r(10)  is  almost 
certainly  8,  and  interesting  constructions  are  known  by  which  we  can  prove  that 
r(n)  = n — O(logn)  as  n -»  00.  (See  exercises  60-64.) 

^Binary  Gray  paths.  We  have  defined  an  n-bit  Gray  cycle  as  a way  to  arrange 
all  binary  n-tuples  into  a sequence  (vo,  iq, . . . , i^-i)  with  the  property  that 
vk  is  adjacent  to  vk+i  in  the  n-cube  for  0 < k < 2”  — 1,  and  such  that  u2"-i 
is  also  adjacent  to  u0 . The  cyclic  property  is  nice,  but  not  always  essential; 
sometimes  we  can  do  better  without  it.  Therefore  we  say  that  an  n-bit  Gray  path, 
also  commonly  called  a Gray  code,  is  any  sequence  that  satisfies  the  conditions 
of  a Gray  cycle  except  that  the  last  element  need  not  be  adjacent  to  the  first.  In 
other  words,  a Gray  cycle  is  a Hamiltonian  cycle  on  the  vertices  of  the  n-cube, 
but  a Gray  code  is  simply  a Hamiltonian  path  on  that  graph. 

The  most  important  binary  Gray  paths  that  are  not  also  Gray  cycles  are 
n-bit  sequences  (vq,vi,  . . . that  are  monotonic,  in  the  sense  that 

v{vk)  < i/(ufc+2)  for  0 < fc  < 2"  — 2.  (31) 

(Here,  as  elsewhere,  we  use  v to  denote  the  “weight”  or  the  “sideways  sum”  of  a 
binary  string,  namely  the  number  of  Is  that  it  has.)  Trial  and  error  shows  that 
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there  are  essentially  only  two  monotonic  n-bit  Gray  codes  for  each  n < 4,  one 
starting  with  0n  and  the  other  starting  with  0n-1l.  The  two  for  n = 3 are 

000,  001,  Oil,  010,  110,  100,  101,  111;  (32) 

001,  000,  010,  110,  100,  101,  111,  Oil.  (33) 

The  two  for  n = 4 are  slightly  less  obvious,  but  not  really  difficult  to  discover. 

Since  v{yk+\)  = i'(i’fc)  ± 1 whenever  v is  adjacent  to  Vk+i,  we  obviously 
can’t  strengthen  (31)  to  the  requirement  that  all  n-tuples  be  strictly  sorted  by 
weight.  But  relation  (31)  is  strong  enough  to  determine  the  weight  of  each  Vk , 
given  k and  the  weight  of  Vq,  because  we  know  that  exactly  (")  of  the  n-tuples 
have  weight  j. 

Figure  34  summarizes  our  discussions  so  far,  by  illustrating  seven  of  the 
zillions  of  Gray  codes  that  make  a grand  tour  through  all  256  of  the  possible 
8-bit  bytes.  Black  squares  represent  ones  and  white  squares  represent  zeros. 
Figure  34(a)  is  the  standard  Gray  binary  code,  while  Fig.  34(b)  is  balanced  with 
exactly  256/8  = 32  transitions  in  each  coordinate  position.  Figure  34(c)  is  a Gray 
code  analogous  to  Fig.  33(a),  in  which  the  bottom  128  codes  are  complements 
of  the  top  128.  In  Fig.  34(d),  the  transitions  in  each  coordinate  position  never 
occur  closer  than  five  steps  apart;  in  other  words,  all  run  lengths  are  at  least  5. 
The  cycle  in  Fig.  34(e)  is  nonlocal  in  the  sense  of  exercise  59.  A rnonotonic  path 
for  n = 8 appears  in  Fig.  34(f);  notice  how  black  it  gets  near  the  bottom.  Finally, 
Fig.  34(g)  illustrates  a Gray  code  that  is  totally  nonmonotonic,  in  the  sense  that 
the  center  of  gravity  of  the  black  squares  lies  exactly  at  the  halfway  point  in  each 
column.  Standard  Gray  binary  code  has  this  property  in  seven  of  the  coordinate 
positions,  but  Fig.  34(g)  achieves  perfect  black-white  weight  balance  in  all  eight. 
Such  codes  are  called  trend-free;  they  are  important  in  the  design  of  agricultural 
and  other  experiments  (see  exercises  75  and  76). 

Carla  Savage  and  Peter  Winkler  [J.  Combinatorial  Theory  A70  (1995),  230- 
248]  found  an  elegant  way  to  construct  monotonic  binary  Gray  codes  for  all  n > 0. 
Such  paths  are  necessarily  built  from  subpaths  Pnj  in  which  all  transitions  are 
between  n-tuples  of  weights  j and  j + 1.  Savage  and  Winkler  defined  suitable 
subpaths  recursively  by  letting  P10  = 0, 1 and,  for  all  n > 0, 

P(n+i)j  — 0Pn j',  (34) 

Pnj  = 0 if  j < 0 or  j > n.  (35) 

Here  7rn  is  a permutation  of  the  coordinates  that  we  will  specify  later,  and  the 
notation  P*  means  that  every  element  an_  1 . . . ajao  of  the  sequence  P is  replaced 
by  6n_i . . . bi&o,  where  bjn  = aj.  (We  don’t  define  Pn  by  letting  bj  = ajn, 
because  we  want  (2J')7r  to  be  2J7r.)  It  follows,  for  example,  that 

P20  = OP10  = 00,  01  (36) 

because  Pi(-i)  is  vacuous;  also 


P21  = lift  =10,  11 


(37) 
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because  P\\  is  vacuous  and  7Ti  must  be  the  identity  permutation.  In  general, 
Pnj  is  a sequence  of  n-bit  strings  containing  exactly  (n_1)  strings  of  weight  j 
interleaved  with  ("J1)  strings  of  weight  j + 1. 

Let  anj  and  u„j  be  the  first  and  last  elements  of  Pnj.  Then  we  easily  find 


wnj=0n-i~1V+1, 

for  0 < j < n; 

(38) 

an o = 0", 

for  n > 0; 

(39) 

anj  = 

for  1 < j < n. 

(40) 

In  particular,  anj  always  has  weight  j , and  tonj  always  has  weight  j + 1.  We  will 
define  permutations  nn  of  {0, 1, . . . , n - 1}  so  that  both  of  the  sequences 


p r>R  jd  tdR 

rn0'  rn\i  ■fn2’  n3’  • • ■ 

and  Pnfl0,  Pnl,  P‘<2,  Pn3,  ... 


(41) 

(42) 


are  monotonic  binary  Gray  paths  for  n = 1,  2,  3, In  fact,  the  monotonicity 

is  clear,  so  only  the  Grayness  is  in  doubt;  and  the  sequences  (41),  (42)  link  up 
nicely  because  the  adjacencies 


C*nO  Oin  1 •••  Otn(n- 1)>  Wno Wnl ^n(n-l)  (43) 

follow  immediately  from  (34),  regardless  of  the  permutations  7rn.  Thus  the 
crucial  point  is  the  transition  at  the  comma  in  formula  (34),  which  makes  P(n+ i)j 
a Gray  subpath  if  and  only  if 

un(j- 1)  = anj  for  0 <j<n.  (44) 

For  example,  when  n — 2 and  j = 1 we  need  (01 )*2  = a2 1 = 10,  by  (38)- 
(40);  hence  7r2  must  transpose  coordinates  0 and  1.  The  general  formula  (see 
exercise  71)  turns  out  to  be 

^n^n— 1)  (45) 

where  an  is  the  n-cycle  (n— 1 ...  10).  The  first  few  cases  are  therefore 

*Tl  = (0),  7T4  = (03), 

3t2  = (0 1),  7r5  = (04321), 

7T3  = (0  21),  ttq  — (0  5 2 41  3); 

no  simple  “closed  form”  for  the  magic  permutations  7 rn  is  apparent.  Exercise  73 
shows  that  the  Savage-Winkler  codes  can  be  generated  efficiently. 

Nonbinary  Gray  codes.  We  have  studied  the  case  of  binary  n-tuples  in 
great  detail,  because  it  is  the  simplest,  most  classical,  most  applicable,  and 
most  thoroughly  explored  part  of  the  subject.  But  of  course  there  are  numerous 
applications  in  which  we  want  to  generate  (aj, . . . , an)  with  integer  components 
in  the  more  general  ranges  0 < aj  < m:n  as  in  Algorithm  M.  Gray  codes  apply 
nicely  to  this  case  as  well. 

Consider,  for  example,  decimal  digits,  where  we  want  0 < aj  < 10  for 
each  j.  Is  there  a decimal  way  to  count  that  is  analogous  to  the  Gray  binary 
code,  changing  only  one  digit  at  a time?  Yes;  in  fact,  two  natural  schemes  are 
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available.  In  the  first,  called  reflected  Gray  decimal , the  sequence  for  counting 
up  to  a thousand  with  3-digit  strings  has  the  form 

000, 001, . . . , 009, 019, 018, . . . , Oil,  010, 020, 021, . . . , 091, 090, 190, 191, . . . , 900, 

with  each  component  moving  alternately  from  0 up  to  9 and  then  back  down  from 
9 to  0.  In  the  second,  called  modular  Gray  decimal , the  digits  always  increase 
by  1 mod  10,  therefore  they  “wrap  around”  from  9 to  0: 


000, 001, . . . , 009, 019, 010, . . . , 017, 018, 028, 029, . . . , 099, 090, 190, 191, ... , 900. 


In  both  cases  the  digit  that  changes  on  step  k is  determined  by  the  radix-ten 
ruler  function  pio(k),  the  largest  power  of  10  that  divides  k.  Therefore  each 
n-tuple  of  digits  occurs  exactly  once:  We  generate  10-7  different  settings  of  the 
rightmost  j digits  before  changing  any  of  the  others,  for  1 < j < n. 

In  general,  the  reflected  Gray  code  in  any  mixed-radix  system  can  be  re- 
garded as  a permutation  of  the  nonnegative  integers,  a function  that  maps  an 
ordinary  mixed-radix  number 


k = 


bn  — 1 1 
- ^-n  — 1 5 


• , bi,  bo 
. , mi,  to0. 


bn-\mn-2  ■ ■ ■ mimo  + • • • + fqrao  + bo 


(46) 


into  its  reflected-Gray  equivalent 

— 1 5 • • • 5 ■> 

Lran_i,  . . . , mi,  m0J 
just  as  (7)  does  this  in  the  special  case  of  binary  numbers.  Let 


<j(k)  = 


= a„_imn_2  . . . mimo -I h dim0  + a0,  (47) 


. . . , U,j 


^n  — 1 ? 


Lmn_i,  . . . , rrij 


Bj  = 


r • • • , bj 
mn- 1, . . . , nij  J 


with  An  = Bn  = 0,  so  that  when  0 < j < n we  have 


(48) 


Aj  — mjAj+x  + aj  and  Bj  — rrijBj+i  + bj.  (49) 

The  rule  connecting  the  a’s  and  b's  is  not  difficult  to  derive  by  induction  on  n — j: 

( bj,  if  Bj+i  is  even; 

a,-  = < (e;o) 

\ TOj  — 1 — bj,  if  Bj+ 1 is  odd. 

(Here  we  are  numbering  the  coordinates  of  the  n-tuples  (o„_i, . . . , ai,ao)  and 
(6„_i, . . . , 61,  &o)  from  right  to  left,  for  consistency  with  (7)  and  the  conven- 
tions of  mixed-radix  notation  in  Eq.  4.1-(g).  Readers  who  prefer  notations  like 
(di, . . . , an ) can  change  j to  n — j in  all  the  formulas  if  they  wish.)  Going  the 
other  way,  we  have 


bj  = 


f ah 

\irij 


if  dj-i-i  + aj+ 2 + • • • is  even; 
— 1 — aj,  if  aj+ 1 + dj+2  + • • • is  odd. 


(51) 


Curiously,  rule  (50)  and  its  inverse  in  (51)  are  exactly  the  same  when  all  of  the 
radices  rrij  are  odd.  I11  Gray  ternary  code,  for  example,  when  mo  = mi  = ■ ■ ■ = 3, 
we  have  </((10010211012)3)  = (12210211010)3  and  also  g((12210211010)3)  = 


300  COMBINATORIAL  SEARCHING' 


7.2. 1.1 


(10010211012)3.  Exercise  78  proves  (50)  and  (51),  and  discusses  similar  formulas 
that  hold  in  the  modular  case. 

We  can  in  fact  generate  such  Gray  sequences  looplessly,  generalizing  Algo- 
rithms M and  L: 

Algorithm  H ( Loopless  reflected  mixed-radix  Gray  generation).  This  algorithm 
visits  all  n-tuples  (a„_i,  ...,a0)  such  that  0 < a,  < m3  for  0 < j < n,  changing 
only  one  component  by  ±1  at  each  step.  It  maintains  an  array  of  focus  pointers 
[fn,  ■ ■ ■ , /o)  to  control  the  actions  as  in  Algorithm  L,  together  with  an  array  of 
directions  (on_!, . . . , o0).  We  assume  that  each  radix  m3  is  > 2. 

Hi.  [Initialize.]  Set  a3  0,  fj  «—  j,  and  oj  <—  1,  for  0 < j < n;  also  set  fn  n. 
H2.  [Visit.]  Visit  the  n-tuple  (a„_i, . . . alt  a0). 

H3.  [Choose  j.]  Set  j t—  f0  and  f0  <—  0.  (As  in  Algorithm  L,  j was  the  rightmost 
active  coordinate;  all  elements  to  its  right  have  now  been  reactivated.) 

H4.  [Change  coordinate  j.)  Terminate  if  j - n;  otherwise  set  a3  <-  aj  + oy 

H5.  [Reflect?]  If  a3  = 0 or  a,  = m3  - 1,  set  Qj  < 0j,  <-  fj+1,  and 

fj+ 1 J + 1-  (Coordinate  j has  thus  become  passive.)  Return  to  H2.  | 

A similar  algorithm  generates  the  modular  variation  (see  exercise  77). 

*Subforests.  An  interesting  and  instructive  generalization  of  Algorithm  H, 
discovered  by  Y.  Koda  and  F.  Ruskey  [J.  Algorithms  15  (1993),  324-340],  sheds 
fuither  light  on  the  subject  of  Gray  codes  and  loopless  generation.  Suppose  we 
have  a forest  of  n nodes,  and  we  want  to  visit  all  of  its  “principal  subforests,” 
namely  all  subsets  of  nodes  S such  that  if  x is  in  5 and  x is  not  a root,  the 
parent  of  x is  also  in  S.  For  example,  the  7-node  forest  has  33  such  subsets, 
corresponding  to  the  black  nodes  in  the  following  33  diagrams: 

tJk  SJk  !-X  iJk  tJk  ’-X  iJk 

tJk  I-Ov  I-X  !•>.  (52) 

tJk  tJk  '•X  '-X  tJk  :”X  :-X  =-X  '-X  '•X 

Notice  that  if  we  read  the  top  row  from  left  to  right,  the  middle  row  from  right 
to  left,  and  the  bottom  row  from  left  to  right,  the  status  of  exactly  one  node 
changes  at  each  step. 

If  the  given  forest  consists  of  degenerate  nonbranching  trees,  the  principal 
subforests  are  equivalent  to  mixed-radix  numbers.  For  example,  a forest  like 


has  3 x 2 x 4 x 2 principal  subforests,  corresponding  to  4-tuples  (x1,x2,x3,x4) 
such  that  0 < xY  < 3,  0 < a?2  < 2,  0 < a:3  < 4,  and  0 < x4  < 2;  the  value  of  Xj  is 
the  number  of  nodes  selected  in  the  jth  tree.  When  the  algorithm  of  Koda  and 
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Ruskey  is  applied  to  such  a forest,  it  will  visit  the  subforests  in  the  same  order 
as  the  reflected  Gray  code  on  radices  (3, 2, 4,  2). 

Algorithm  K ( Loopless  reflected  subforest  generation).  Given  a forest  whose 
nodes  are  (1, . . . , n)  when  arranged  in  postorder,  this  algorithm  visits  all  binary 
n-tuples  (oi,...,an)  such  that  ap  > aq  whenever  p is  a parent  of  q.  (Thus, 
ap  — 1 means  that  p is  a node  in  the  current  subforest.)  Exactly  one  bit  aj 
changes  between  one  visit  and  the  next.  Focus  pointers  (/o,  fi,  ■ ■ ■ , fn)  analogous 
to  those  of  Algorithm  L are  used  together  with  additional  arrays  of  pointers 
(lo,  h, . ■ ■ , ln)  and  (r0,  r i, . . . , r„),  which  represent  a doubly  linked  list  called  the 
“current  fringe.”  The  current  fringe  contains  all  nodes  of  the  current  subforest 
and  their  children;  r0  points  to  its  leftmost  node  and  l0  to  its  rightmost. 

An  auxiliary  array  (co,ci, . . . , cn ) defines  the  forest  as  follows:  If  p has  no 
children,  cp  = 0;  otherwise  cp  is  the  leftmost  (smallest)  child  of  p.  Also  c0  is  the 
leftmost  root  of  the  forest  itself.  When  the  algorithm  begins,  we  assume  that 
rp  = q and  lq  = p whenever  p and  q are  consecutive  children  of  the  same  family. 
Thus,  for  example,  the  forest  in  (52)  has  the  postorder  numbering 


therefore  we  should  have  (c0,...,C7)  = (2, 0, 1, 0, 0, 0, 4, 3)  and  r2  = 7,  l7  = 2, 

^3  = 6,  Iq  = 3,  r 4 = 5,  and  I5  — 4 at  the  beginning  of  step  K1  in  this  case. 

Kl.  [Initialize.]  Set  aj  t—  0 and  fj  <—  j for  1 < j < n,  thereby  making  the  initial 
subforest  empty  and  all  nodes  active.  Set  f0  t—  0,  Zo  <—  n,  rn  <—  0,  r0  «—  Co, 
and  lCo  t—  0,  thereby  putting  all  roots  into  the  current  fringe. 

K2.  [Visit.]  Visit  the  subforest  defined  by  (ax, . . . , a„). 

K3.  [Choose  p.)  Set  q t—  lo,  p t—  fq.  (Now  p is  the  rightmost  active  node  of  the 
fringe.)  Also  set  /,<—<?  (thereby  activating  all  nodes  to  p’s  right). 

K4.  [Check  ap .]  Terminate  the  algorithm  if  p = 0.  Otherwise  go  to  K6  if  ap  — 1. 

K5.  [Insert  p's  children.]  Set  ap  «—  1.  Then,  if  cp  / 0,  set  q <—  rp,  lq  «-  p - 1, 
rp-\  «—  q,  rp  t—  cp,  lCp  t—  p (thereby  putting  p's  children  to  the  right  of  p 
in  the  fringe).  Go  to  K7. 

K6.  [Delete  p's  children.]  Set  ap  0.  Then,  if  cp  / 0,  set  q rp_x,  rp  t—  q, 
lq  4—  p (thereby  removing  p's  children  from  the  fringe). 

K7.  [Make  p passive.]  (At  this  point  we  know  that  p is  active.)  Set  fp  t—  /j 
and  fip  t—  lp.  Return  to  K2.  | 

The  reader  is  encouraged  to  play  through  this  algorithm  on  examples  like  (52), 

in  order  to  understand  the  beautiful  mechanism  by  which  the  fringe  grows  and 

shrinks  at  just  the  right  times. 
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*Shift  register  sequences.  A completely  different  way  to  generate  all  n-tuples  of 
m-ary  digits  is  also  possible:  We  can  generate  one  digit  at  a time,  and  repeatedly 
work  with  the  n most  recently  generated  digits,  thus  passing  from  one  n-tuple 
(xo,Xi, . . . ,xn_i)  to  another  one  (aq, . . . , x„_i, xn)  by  shifting  an  appropriate 
new  digit  in  at  the  right.  For  example,  Fig.  35  shows  how  all  5-bit,  numbers  can 
be  obtained  as  blocks  of  5 consecutive  bits  in  a certain  cyclic  pattern  of  length  32. 
This  general  idea  has  already  been  discussed  in  some  of  the  exercises  of  Sections 
2. 3.4. 2 and  3.2.2,  and  we  now  are  ready  to  explore  it  further. 


Fig.  35. 

A de  Bruijn  cycle 
for  5-bit  numbers. 


Algorithm  S ( Generic  shift  register  generation).  This  algorithm  visits  all  n- 
tuples  (ai, . . . , an)  such  that  0 < aj  < m for  1 < j < n,  provided  that  a suitable 
function  / is  used  in  step  S3. 

51.  [Initialize.]  Set  a,  0 for  -n  < j < 0 and  k «-  1. 

52.  [Visit.]  Visit  the  n-tuple  (afc_n, . . . Terminate  if  k = mn . 

53.  [Advance.]  Set  a*,  «—  /(afc_n, . . . ,ak~i),  k «—  k + 1,  and  return  to  S2.  | 

Every  function  / that  makes  Algorithm  S valid  corresponds  to  a cycle  of 
mn  radix-m  digits  such  that  every  combination  of  n digits  occurs  consecutively 
in  the  cycle.  For  example,  the  case  m = 2 and  n = 5 illustrated  in  Fig.  35 
corresponds  to  the  binary  cycle 

00000100011001010011101011011111;  (53) 

and  the  first  to2  digits  of  the  infinite  sequence 

0011021220313233041424344 . . . (54) 

yield  an  appropriate  cycle  for  n = 2 and  arbitrary  m.  Such  cycles  are  commonly 
called  m-ary  de  Bruijn  cycles , because  N.  G.  de  Bruijn  treated  the  binary  case 
for  arbitrary  n in  Indagationes  Mathematical  8 (1946),  461-467. 

Exercise  2.3.4.2-23  proves  that  exactly  m!m"_1/mn  functions  / have  the 
required  properties.  That’s  a huge  number,  but  only  a few  of  those  functions  are 
known  to  be  efficiently  computable.  We  will  discuss  three  kinds  of  / that  appear 
to  be  the  most  useful. 
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Table  1 

PARAMETERS  FOR  ALGORITHM  A 


3 

1 

8 : 1,5 

13 

: 1,3 

18 

7 

23 

: 5 

28 

3 

4 

1 

9 : 4 

14 

: 1,11 

19 

1,5 

24 

: 1, 

3 

29 

2 

5 

2 

10  : 3 

15 

: 1 

20 

3 

25 

: 3 

30 

1,15 

6 

1 

11  : 2 

16 

:2,3 

21 

2 

26 

: 1, 

7 

31 

3 

7 

1 

12  : 3,4 

17 

: 3 

22 

1 

27 

: 1. 

7 

32 

1,27 

The  entries  ‘n  : 

: s’  or  ‘n  : s,  V 

mean 

that  the  polynomials  xn 

+ xs  + l 

or  xn 

+ (xs 

+ l)(x*  + 1) 

are  primitive  modulo  2.  Additional  values  up  to  n = 168  have  been  tabulated  by  W.  Stahnke, 
Math.  Comp.  27  (1973),  977-980. 


The  first  important  case  occurs  when  m is  a prime  number,  and  / is  the 
almost  linear  recurrence 

' ci,  if  (xi,x2, . . . ,xn)  = (0,0, . . . ,0); 

f(xi, . . . , xn)  = < 0,  if  (xi,x2,...,xn)  = (1,0,  ...,0);  (55) 

_ (ciXi  + c2x2  + b cnxn)  mod  m,  otherwise. 

Here  the  coefficients  (ci, . . . , c„)  must  be  such  that 

x11  - cnxn_1  - • ■ • - c2x  - ci  (56) 

is  a primitive  polynomial  modulo  m,  in  the  sense  discussed  following  Eq.  3.2.2- 
(9).  The  number  of  such  polynomials  is  ip(mn  — 1 )/n,  large  enough  to  allow  us 
to  find  one  in  which  only  a few  of  the  c’s  are  nonzero.  [This  construction  goes 
back  to  a pioneering  paper  of  Willem  Mantel,  Nieuw  Archief  voor  Wiskunde  (2) 
1 (1897),  172-184.] 

For  example,  suppose  m = 2.  We  can  generate  binary  n-tuples  with  a very 
simple  loopless  procedure: 

Algorithm  A ( Almost  linear  bit-shift,  generation ).  This  algorithm  visits  all 
n-bit  vectors,  by  using  either  a special  offset  s [Case  1]  or  two  special  offsets  s 
and  t [Case  2],  as  found  in  Table  1. 

Al.  [Initialize.]  Set  (x0,  xi, . . . , xn_i)  <—  (1,0,  ...,0)  and  k 4-  0,  j t—  s.  In 
Case  2,  also  set  i «—  t and  h «—  s + t. 

A2.  [Visit.]  Visit  the  n-tuple  (xfc_i, . . . ,Xo,x„_i, . . . ,Xfc+i,Xfc). 

A3.  [Test  for  end.]  If  x*,  / 0,  set  r <—  0;  otherwise  set  r <—  r + 1,  and  go  to  A6 
if  r — n — 1.  (We  have  just  seen  r consecutive  zeros.) 

A4.  [Shift.]  Set  k ( k — 1)  mod  n and  j (j  — 1)  mod  n.  In  Case  2 also  set 
*«—(*  — 1)  mod  n and  h <—  (h  — 1)  mod  n. 

A5.  [Compute  a new  bit.]  Set  Xfc  <—  Xk  © Xj  [Case  1]  or  x^  <—  Xk  © Xj  © x;  © Xh 
[Case  2].  Return  to  A2. 

A6.  [Finish.]  Visit  (0, ...  ,0)  and  terminate.  | 

Appropriate  offset  parameters  s and  possibly  t almost  certainly  exist  for  all  n, 
because  primitive  polynomials  are  so  abundant;  for  example,  eight  different 
choices  of  ( s,t ) would  work  when  n = 32,  and  Table  1 merely  lists  the  smallest. 
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However,  a rigorous  proof  of  existence  in  all  cases  lies  well  beyond  the  present 
state  of  mathematical  knowledge. 

Our  first  construction  of  de  Bruijn  cycles,  in  (55),  was  algebraic,  relying  for 
its  validity  on  the  theory  of  finite  fields.  A similar  method  that  works  when  m 
is  not  a prime  number  appears  in  exercise  3.2.2-21.  Our  next  construction,  by 
contrast,  will  be  purely  combinatorial.  In  fact,  it  is  strongly  related  to  the  idea 
of  modular  Gray  m- ary  codes. 

Algorithm  R ( Recursive  de  Bruijn  cycle  generation) . Suppose  /()  is  a coroutine 
that  will  output  the  successive  digits  of  an  m-ary  de  Bruijn  cycle  of  length  mn, 
beginning  with  n zeros,  when  it  is  invoked  repeatedly.  This  algorithm  is  a similar 
coroutine  that  outputs  a cycle  of  length  mn+1,  provided  that  n > 2.  It  maintains 
three  private  variables  x,  y,  and  t;  variable  x should  initially  be  zero. 

Rl.  [Output.]  Output  x.  Go  to  R3  if  x ± 0 and  t > n. 

R2.  [Invoke  /.]  Set  y <—  /(). 

R3.  [Count  ones.]  If  y ~ 1,  set  t 4—  t + 1;  otherwise  set  1 4—  0. 

R4.  [Skip  one?]  If  t = n and  x ^ 0,  go  back  to  R2. 

R5.  [Adjust  x.]  Set  x 4-  (x  + y)  mod  to  and  return  to  Rl.  | 

For  example,  let  m — 3 and  n = 2.  If  /()  produces  the  infinite  9-cycle 

001102122  001102122  0...,  (57) 

then  Algorithm  R will  produce  the  following  infinite  27-cycle  at  step  Rl: 

V = 001021220011110212200102122  001 .. . 
t = 001001000012340010000100100  001 .. . 

x = 000110102220120020211122121  0001 

The  proof  that  Algorithm  R works  correctly  is  interesting  and  instructive  (see 
exercise  93).  And  the  proof  of  the  next  algorithm,  which  doubles  the  window 
size  n,  is  even  more  so  (see  exercise  95). 

Algorithm  D ( Doubly  recursive  de  Bruijn  cycle  generation).  Suppose  /() 
and  / ()  are  coroutines  that  each  will  output  the  successive  digits  of  an  m-ary 
de  Bruijn  cycle  of  length  mn  when  invoked  repeatedly,  beginning  with  n zeros. 
(The  two  cycles  may  be  identical,  but  they  must  be  generated  by  independent 
coroutines,  because  we  will  consume  their  values  at  different  rates.)  This  algo- 
rithm is  a similar  coroutine  that  outputs  a cycle  of  length  m2n.  It  maintains  six 
private  variables  x,  y,  t,  x , y , and  t'\  variables  x and  s'  should  initially  be  to. 
The  special  parameter  r must  be  set  to  a constant  value  such  that 

0 <r  <m  and  gcd(m"  - r,  to"  + r)  = 2.  (58) 

The  best  choice  is  usually  r = 1 when  to  is  odd  and  r = 2 when  to  is  even. 

Dl.  [Possibly  invoke  /.]  If  t / n or  x > r,  set  y 4-  /(). 

D2.  [Count  repeats.]  If  x ± y,  set  x 4-  y and  t 4-  1.  Otherwise  set  t 4-  t + 1. 
D3.  [Output  from  /.]  Output  the  current  value  of  x. 
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D4.  [Invoke  /'.]  Set  y'  <-/'(). 

D5.  [Count  repeats.]  If  x'  ^ y' , set  x'  <—  y'  and  t' 4—  1.  Otherwise  set  t'  4-  t'  + l. 

D6.  [Possibly  reject  /'.]  If  t'  = n and  x'  < r and  either  t < n or  x'  < x,  go  to 
D4.  If  t'  = n and  x'  < r and  x'  = x,  go  to  D3. 

D7.  [Output  from  /'.]  Output  the  current  value  of  x' . Return  to  D3  if  t!  = n 
and  x'  < r;  otherwise  return  to  Dl.  | 

The  basic  idea  of  Algorithm  D is  to  output  from  /()  and  /'()  alternately,  making 
special  adjustments  when  either  sequence  generates  n consecutive  x’s  for  x < r. 
For  example,  when  /()  and  /'()  produce  the  9-cycle  (57),  we  take  r — 1 and  get 

t in  step  D2:  12  31211112  12312111  12123121  11121231  21111212  ... 
x in  step  D3:  00001102122  00011021  22000110  21220001  102122000... 

t'  in  step  D5:  121211112121211112121211112121211112121211112121  . . . 
x'  in  step  D7:  0 11021220  11021220  11021220  11021220  11021220  1 ...; 

so  the  81-cycle  produced  in  steps  D3  and  D7  is  00001011012  . . . 2222  00001 .... 

The  case  m — 2 of  Algorithm  R was  discovered  by  Abraham  Lempel  [IEEE 
Trans.  C-19  (1970),  1204-1209];  Algorithm  D was  not  discovered  until  more  than 
25  years  later  [C.  J.  Mitchell,  T.  Etzion,  and  K.  G.  Paterson,  IEEE  Trans.  IT- 
42  (1996),  1472-1478].  By  using  them  together,  starting  with  simple  coroutines 
for  n = 2 based  on  (54),  we  can  build  up  an  interesting  family  of  cooperating 
coroutines  that  will  generate  a de  Bruijn  cycle  of  length  m”  for  any  desired  m > 2 
and  n > 2,  using  only  O(logn)  simple  computations  for  each  digit  of  output. 
(See  exercise  96.)  Furthermore,  in  the  simplest  case  m = 2,  this  combination 
“R&D  method”  has  the  property  that  its  fcth  output  can  be  computed  directly, 
as  a function  of  k,  by  doing  0(n  log  n)  simple  operations  on  n-bit  numbers. 
Conversely,  given  any  n-bit  pattern  (3,  the  position  of  (3  in  the  cycle  can  also  be 
computed  in  O(nlogn)  steps.  (See  exercises  97-99.)  No  other  family  of  binary 
de  Bruijn  cycles  is  presently  known  to  have  the  latter  property. 

Our  third  construction  of  de  Bruijn  cycles  is  based  on  the  theory  of  prime 
strings,  which  will  be  of  great  importance  to  us  when  we  study  pattern  matching 
in  Chapter  9.  Suppose  7 = af3  is  the  concatenation  of  two  strings;  we  say  that 
a is  a prefix  of  7 and  (3  is  a suffix.  A prefix  or  suffix  of  7 is  called  proper  if  its 
length  is  positive  but  less  than  the  length  of  7.  Thus  (3  is  a proper  suffix  of  a(3 
if  and  only  if  a/e  and  (3  ^ e. 

Definition  P.  A string  is  prime  if  it  is  nonempty  and  (lexicographically)  less 
than  all  of  its  proper  suffixes.  | 

For  example,  01101  is  not  prime,  because  it  is  greater  than  01;  but  01102  is 
prime,  because  it  is  less  than  1102,  102,  02,  and  2.  (We  assume  that  strings  are 
composed  of  letters,  digits,  or  other  symbols  from  a linearly  ordered  alphabet. 
Lexicographic  or  dictionary  order  is  the  normal  way  to  compare  strings,  so  we 
write  a < (3  and  say  that  a is  less  than  (3  when  a is  lexicographically  less  than  (3. 
In  particular,  we  always  have  a < a(3 , and  a < a/3  if  and  only  if  (3  e.) 
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Prime  strings  have  often  been  called  Lyndon  words,  because  they  were 
introduced  by  R.  C.  Lyndon  [Trans.  Amer.  Math.  Soc.  77  (1954),  202-215]; 
Lyndon  called  them  “standard  sequences.”  The  simpler  term  “prime”  is  justified 
because  of  the  fundamental  factorization  theorem  in  exercise  101.  We  will, 
however,  continue  to  pay  respect  to  Lyndon  implicitly  by  often  using  the  letter  A 
to  denote  strings  that  are  prime. 

Several  of  the  most  important  properties  of  prime  strings  were  derived  by 
Chen,  Fox,  and  Lyndon  in  an  important  paper  on  group  theory  [Annals  of  Math. 
(2)  68  (1958),  81-95],  including  the  following  easy  but  basic  result: 

Theorem  P.  A nonempty  string  that  is  less  than  all  its  cyclic  shifts  is  prime. 

(The  cyclic  shifts  of  ai ...  an  are  a2  . . . ana1;  a3  . . . anaia2,  ■ • • , ano-i  • • ■ an~i-) 

Proof.  Suppose  y = a/3  is  not  prime,  because  a / e and  7 > /?  / e;  but  suppose 
7 is  also  less  than  its  cyclic  shift  /3a.  Then  the  conditions  /3  < 7 < /3a  imply 
that  7 = PO  for  some  string  8 < a.  Therefore,  if  7 is  also  less  than  its  cyclic 
shift  8/3,  we  have  8 < a < a(3  < 8(3.  But  that  is  impossible,  because  a and  8 
have  the  same  length.  | 

Let  Lm(n)  be  the  number  of  m- ary  primes  of  length  n.  Every  string  m . . . an, 
together  with  its  cyclic  shifts,  yields  d distinct  strings  for  some  divisor  d of  n, 
corresponding  to  exactly  one  prime  of  length  d.  For  example,  from  010010  we 
get  also  100100  and  001001  by  cyclic  shifting,  and  the  smallest  of  the  periodic 
parts  {010, 100,001}  is  the  prime  001.  Therefore  we  must  have 

^2,dLm{d)  = mn,  for  all  m, n > 1.  (59) 

d\n 

This  family  of  equations  can  be  solved  for  Lm(n)  using  exercise  4.5.3-28(a),  and 
we  obtain 

Lm(n)  = i ^2p(d)mn/d.  (60) 

d\n 

During  the  1970s,  Harold  Fredricksen  and  James  Maiorana  discovered  a 
beautifully  simple  way  to  generate  all  of  the  to- ary  primes  of  length  n or  less, 
in  increasing  order  [Discrete  Math.  23  (1978),  207-210],  Before  we  are  ready  to 
understand  their  algorithm,  we  need  to  consider  the  n-extension  of  a nonempty 
string  A,  namely  the  first  n characters  of  the  infinite  string  AAA For  example. 

tl|n/l?TtenSi°n  °f  123  iS  123123123L  In  general  if  |A|  = k,  its  n-extension  is 
ALn/  A',  where  A'  is  the  prefix  of  A whose  length  is  n mod  k. 

Definition  Q.  A string  is  preprime  if  it  is  a nonempty  prefix  of  a prime,  on 
some  alphabet.  | 

Theorem  Q.  A string  of  length  n > 0 is  preprime  if  and  only  if  it  is  the  n- 
extension  of  a prime  string  A of  length  k < n.  This  prime  string  is  uniquely 
determined. 


Proof.  See  exercise  105.  | 
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Theorem  Q states,  in  essence,  that  there  is  a one-to-one  correspondence  between 
primes  of  length  < n and  preprimes  of  length  n.  The  following  algorithm 
generates  all  of  the  m- ary  instances,  in  increasing  order. 

Algorithm  F (Prime  and  preprime  string  generation).  This  algorithm  visits 
all  m- ary  n-tuples  (ai, . . . , an)  such  that  the  string  ai . . . an  is  preprime.  It  also 
identifies  the  index  j such  that  ai  ...  an  is  the  n-extension  of  the  prime  a± . . ,aj. 

FI.  [Initialize.]  Set  ai  4—  • ■ • 4—  an  4—  0 and  j 4—  1;  also  set  a o 4 1. 

F2.  [Visit.]  Visit  (ai, . . . , an ) with  index  j. 

F3.  [Prepare  to  increase.]  Set  j 4—  n.  Then  if  aj  = m — 1,  decrease  j until 
finding  aj  < m — 1 . 

F4.  [Add  one.]  Terminate  if  j = 0.  Otherwise  set  a3  4—  aj  + 1.  (Now  ai . . . aj  is 
prime,  by  exercise  105(a).) 

F5.  [Make  n-extension.]  For  k 4—  j + 1,  . . . , n (in  this  order)  set  a*,  4—  a^-j. 
Return  to  F2.  | 

For  example,  Algorithm  F visits  32  ternary  preprimes  when  m = 3 and  n — 4: 


0000 

0011 

A 

0022 

A 

oni 

0122 

A 

0212 

A 

mi 

1212 

A 

i-H  < 

O 

O 

O 

0012a 

0101 

A 

0112 

A 

0202 

A 

0220 

A 

1112 

A 

1221 

A 

0002 

A 

0020A 

A 

0102 

A 

0120 

A 

0210 

A 

0221 

A 

1121 

A 

1222 

A 

0010 

0021 

0110 

0121 

0211 

0222 

1122 

2222 

A AA  A A A AA 


(The  digits  preceding  ‘a’  are  the  prime  strings  0,  0001,  0002,  001,  0011,  . . . , 2.) 

Theorem  Q explains  why  this  algorithm  is  correct,  because  steps  F3  and  F4 
obviously  find  the  smallest  m- ary  prime  of  length  < n that  exceeds  the  previous 
preprime  a\  ...an.  Notice  that  after  ai  increases  from  0 to  1,  the  algorithm 
proceeds  to  visit  all  the  (m  — l)-ary  primes  and  preprimes,  increased  by  1 ...  1. 

Algorithm  F is  quite  beautiful,  but  what  does  it  have  to  do  with  de  Bruijn 
cycles?  Here  now  comes  the  punch  line:  If  we  output  the  digits  ai,  . . . , aj  in 
step  F2  whenever  j is  a divisor  of  n,  the  sequence  of  all  such  digits  forms  a 
de  Bruijn  cycle!  For  example,  in  the  case  m — 3 and  n = 4,  the  following  81 
digits  are  output: 

0 0001  0002  0011 0012  0021  0022  01  0102  0111  0112 

01210122  02  02110212  022102221  11121122121222  2.  (62) 

(We  omit  the  primes  001,  002,  Oil,  . . . , 122  of  (61)  because  their  length  does 
not  divide  4.)  The  reasons  underlying  this  almost  magical  property  are  explored 
in  exercise  108.  Notice  that  the  cycle  has  the  correct  length,  by  (59). 

There  is  a sense  in  which  the  outputs  of  this  procedure  are  actually  equiva- 
lent to  the  “granddaddy”  of  all  de  Bruijn  cycle  constructions  that  work  for  all  m 
and  n,  namely  the  construction  first  published  by  M.  H.  Martin  in  Bull.  Amer. 
Math.  Soc.  40  (1934),  859-864:  Martin’s  original  cycle  for  m = 3 and  n = 4 
was  2222122202211 . . . 10000,  the  twos’  complement  of  (62).  In  fact,  Fredricksen 
and  Maiorana  discovered  Algorithm  F almost  by  accident  while  looking  for  a 
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simple  way  to  generate  Martin’s  sequence.  The  explicit  connection  between 
their  algorithm  and  preprime  strings  was  not  noticed  until  many  years  later, 
when  Ruskey,  Savage,  and  Wang  carried  out  a careful  analysis  of  the  running 
time  [J. Algorithms  13  (1992),  414-430].  The  principal  results  of  that  analysis 
appear  in  exercise  107,  namely 

i)  The  average  value  of  n — j in  steps  F3  and  F5  is  approximately  1/(to  — 1). 

ii)  The  total  running  time  to  produce  a de  Bruijn  cycle  like  (62)  is  0(mn). 


► 

► 


► 


EXERCISES 

1.  [10]  Explain  how  to  generate  all  n-tuples  (01, . . . , an)  in  which  < a,  < Uj 
lower  bounds  lj  and  upper  bounds  Uj  for  each  component.  (Assume  that  lj  <Uj.) 

2.  [15]  What  is  the  1000000th  n-tuple  visited  by  Algorithm  M if  n = 10  and  m = j 
for  1 < j < n?  Hint:  [°;  °;  J;  J;  J;  J;  °]  = 1000000. 


3.  [M20]  How  many  times  does  Algorithm  M perform  step  M4? 

4.  [1#]  On  most  computers  it  is  faster  to  count  down  to  0 rather  than  up  to  m. 
Revise  Algorithm  M so  that  it  visits  all  n-tuples  in  the  opposite  order,  starting  with 
(mi  — 1, ... , mn  — 1)  and  finishing  with  (0, . . . , 0). 


5.  [20]  Algorithms  such  as  the  “fast  Fourier  transform”  (exercise  4.6.4-  14)  often 
end  with  an  array  of  answers  in  bit-reflected  order,  having  A[(60  . . . ),1  in  the  nlace 

where  A[(bn-i  . . . 60)2]  is  desired.  What  is  a good  way  to  rearrange  the  answers  into 
proper  order?  [Hint:  Reflect  Algorithm  M.] 


6.  [Ml 7]  Prove  (7),  the  basic  formula  for  Gray  binary  code. 

7.  [20]  Figure  30(b)  shows  the  Gray  binary  code  for  a disk  that  is  divided  into  16 
sectors.  What  would  be  a good  Gray-like  code  to  use  if  the  number  of  sectors  were  12 
or  60  (for  hours  or  minutes  on  a clock),  or  360  (for  degrees  in  a circle)? 

8.  [15]  What’s  an  easy  way  to  run  through  all  n-bit  strings  of  even  parity,  changing 
only  two  bits  at  each  step? 


9.  [16]  What  move  should  follow  Fig.  31,  when  solving  the  Chinese  ring  puzzle? 

► 10.  [M21]  Find  a simple  formula  for  the  total  number  of  steps  An  or  Bn  in  which  a 
ring  is  (a)  removed  or  (b)  replaced,  in  the  shortest  procedure  for  removing  n Chinese 
rings.  For  example,  A3  = 4 and  B3  = 1. 


11.  [M22]  (H.  J.  Purkiss,  1865.)  The  two  smallest  rings  of  the  Chinese  ring  puzzle 
can  actually  be  taken  on  or  off  the  bar  simultaneously.  How  many  steps  does  the  puzzle 
require  when  such  accelerated  moves  are  permitted? 

► 12.  [25]  The  compositions  of  n are  the  sequences  of  positive  integers  that  sum  to  n 
For  example,  the  compositions  of  4 are  1111,  112,  121,  13,  211,  22,  31,  and  4.  An  integer 
n has  exactly  2"  compositions,  corresponding  to  all  subsets  of  the  points  {1, . . . , n-1} 
that  might  be  used  to  break  the  interval  (0 . . n)  into  integer-sized  subintervals.  ’ 

a)  Design  a loopless  algorithm  to  generate  all  compositions  of  n,  representing  each 
composition  as  a sequential  array  of  integers  81S2  ...  Sj. 

b)  Similarly,  design  a loopless  algorithm  that  represents  the  compositions  implicitly 
m an  array  of  pointers  q0qi...qt,  where  the  elements  of  the  composition  are 
(90  - qi)(qi  - <72)  • . . (qt-i  - qt)  and  we  have  q0  = n,  qt  = 0.  For  example,  the 
composition  211  would  be  represented  under  this  scheme  by  the  pointers  q0  = 4. 
9i  = 2,  <72  = 1,  (73  = 0,  and  with  t = 3. 
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13.  [21]  Continuing  the  previous  exercise,  compute  also  the  multinomial  coefficient 
C = (sli.n.,Sj)  for  use  as  the  composition  si  . . . Sj  is  being  visited. 

14.  [20]  Design  an  algorithm  to  generate  all  strings  ai  . . . aj  such  that  0 < j < n and 
0 < a,  < nii  for  1 < i < j,  in  lexicographic  order.  For  example,  if  m i = m2  = n = 2, 
your  algorithm  should  successively  visit  e,  0,  00,  01,  1,  10,  11. 

► 15.  [25]  Design  a loopless  algorithm  to  generate  the  strings  of  the  previous  exercise. 
All  strings  of  the  same  length  should  be  visited  in  lexicographic  order  as  before,  but 
strings  of  different  lengths  can  be  intermixed  in  any  convenient  way.  For  example, 
0,  00,  01,  e,  10,  11,  1 is  an  acceptable  order  when  m\  = m2  = n = 2. 

16.  [23]  A loopless  algorithm  obviously  cannot  generate  all  binary  vectors  (ai , . . . , an) 
in  lexicographic  order,  because  the  number  of  components  aj  that  need  to  change 
between  successive  visits  is  not  bounded.  Show,  however,  that  loopless  lexicographic 
generation  does  become  possible  if  a linked  representation  is  used  instead  of  a sequential 
one:  Suppose  there  are  2n  + 1 nodes  {0, 1, . . . , 2n},  each  containing  a LINK  field.  The 
binary  n-tuple  (ai, . . . , an)  is  represented  by  letting 

LINK(0)  = 1 + noi; 

LINK(j  — 1 + noj-i)  — j + naj,  for  1 < j < n; 

LINK(n  + na„)  = 0; 

the  other  n LINK  fields  can  have  any  convenient  values. 

17.  [20]  A well-known  construction  called  the  Karnaugh  map  [M.  Karnaugh,  Amer. 
Inst.  Elect.  Eng.  Trans.  72,  part  I (1953),  593-599]  uses  Gray  binary  code  in  two 
dimensions  to  display  all  4-bit  numbers  in  a 4 x 4 torus: 

000(1  0001  0011  0010 
0100  0101  0111  0110 
1100  1101  1111 1110 
1000  1001  1011  1010 


(The  entries  of  a torus  “wrap  around”  at  the  left  and  right  and  also  at  the  top  and 
bottom  — just  as  if  they  were  tiles,  replicated  infinitely  often  in  a plane.)  Show  that, 
similarly,  all  6-bit  numbers  can  be  arranged  in  an  8 x 8 torus  so  that  only  one  coordinate 
position  changes  when  we  move  north,  south,  east,  or  west  from  any  point. 

► 18.  [20]  The  Lee  weight  of  a vector  u = (iti, . . . , un),  where  each  component  satisfies 
0 < uj  < rrij , is  defined  to  be 

n 

vl(u)  = ^ mm[uj, mj  — Uj ); 

3= 1 

and  the  Lee  distance  between  two  such  vectors  u and  v is 

dn[u,  v ) = ul(u  — v ),  where  u — v = ((uj  — ui)  mod  mi, ... , ( u„  — vn)  mod  mn). 

(This  is  the  minimum  number  of  steps  needed  to  change  u to  v if  we  adjust  some 
component  Uj  by  ±1  (modulo  mj)  in  each  step.) 

A quaternary  vector  has  mj  = 4 for  1 < j < n,  and  a binary  vector  has  all  mj  = 2. 
Find  a simple  one-to-one  correspondence  between  quaternary  vectors  u = (uj, . . . ,un) 
and  binary  vectors  v!  — (y!x, . . . , u'2n),  with  the  property  that  vl(u)  = t'(u')  and 
dn(u,  v)  = u(u'  ® v'). 
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19. 


[23]  ( The  octacode.)  Let  g(x)  = x3  + 2x2  + x - 1. 

Use  one  of  the  algorithms  in  this  section  to  evaluate  £ zUo  zui  zU2  zU3  zU4  zU5  zU6  zUoo , 
a polynomial  in  the  variables  zo , zi,  Z2,  and  Z3 , summed  over  all  256  polynomials 

(no  + vix  + v2x2  + v3x3)g(x)  mod  4 = u0+uix  + u2x2  + u3x3  + u4x4  + u5x 5 + u6x6 


for  0 < v0,v!,v2,v3  < 4,  where  Uoo  is  chosen  so  that  0 < Uoo  < 4 and  (u0  +«i  + 
u2  + u3  + u4  + u3  + u6  + Uoo)  mod  4 = 0. 

b)  Construct  a set  of  256  16-bit  numbers  that  differ  from  each  other  in  at  least  six 
different  bit  positions.  (Such  a set,  first  discovered  by  Nordstrom  and  Robinson 
[Information  and  Control  11  (1967),  613-616],  is  essentially  unique.) 

20.  [M36]  The  16-bit  codewords  in  the  previous  exercise  can  be  used  to  transmit  8 
bits  of  information,  allowing  transmission  errors  to  be  corrected  if  any  one  or  two  bits 
are  corrupted;  furthermore,  mistakes  will  be  detected  (but  not  necessarily  correctable) 
if  any  three  bits  are  received  incorrectly.  Devise  an  algorithm  that  either  finds  the 
nearest  codeword  to  a given  16-bit  number  u ' or  determines  that  at  least  three  bits  of 
u are  erroneous.  How  does  your  algorithm  decode  the  number  (1100100100001111)2? 
[Hmi:  Use  the  facts  that  aR  = 1 (modulo  g(x)  and  4),  and  that  every  quaternary 
polynomial  of  degree  < 3 is  congruent  to  xi  + 2xk  (modulo  g(x)  and  4)  for  some 
j,  k£  {0, 1,  2, 3, 4, 5, 6,  oo},  where  x°°  = 0.] 

21.  [M30]  A t-subcube  of  an  n-cube  can  be  represented  by  a string  like  **10**0* 
containing  t asterisks  and  n - t specified  bits.  If  all  2n  binary  n-tuples  are  written  in 
lexicographic  order,  the  elements  belonging  to  such  a subcube  appear  in  2''  clusters 
of  consecutive  entries,  where  t'  is  the  number  of  asterisks  that  lie  to  the  left  of  the 
rightmost  specified  bit.  (In  the  example  given,  n = 8,  t = 5,  and  t'  = 4.)  But  if  the 
n-tuples  are  written  in  Gray  binary  order,  the  number  of  clusters  might  be  reduced. 
For  example,  the  (n  - l)-subcubes  * ...  *0  and  * ...  *1  occur  in  only  2n~2  + 1 and  2n~2 
clusters,  respectively,  when  Gray  binary  order  is  used,  not  in  2n~1  of  them. 

a)  Explain  how  to  compute  C(a),  the  number  of  Gray  binary  clusters  of  the  subcube 
defined  by  a given  string  a of  asterisks,  0s,  and  Is.  What  is  C(**10**0*)? 

b)  Prove  that  C(a)  always  lies  between  2t'~1  and  2*'.  inclusive. 

c)  What  is  the  average  value  of  C(a),  over  all  2"-‘(")  possible  t-subcubes? 

► 22.  [22]  A “right  subcube”  is  a subcube  such  as  0110**  in  which  all  the  asterisks 
appear  after  all  the  specified  digits.  Any  binary  trie  (Section  6.3)  can  be  regarded  as  a 
way  to  partition  a cube  into  disjoint  right  subcubes,  as  in  Fig.  36(a).  If  we  interchange 
the  left  and  right  subtries  of  every  right  subtrie,  proceeding  downward  from  the  root, 
we  obtain  a Gray  binary  trie,  as  in  Fig.  36(b). 

Prove  that  if  the  “lieves”  of  a Gray  binary  trie  are  traversed  in  order,  from  left  to 
right,  consecutive  lieves  correspond  to  adjacent  subcubes.  (Subcubes  are  adjacent  if 
they  contain  adjacent  vertices.  For  example,  00**  is  adjacent  to  011*  because  the  first 
contains  0010  and  the  second  contains  0110;  but  Oil*  is  not  adjacent  to  10**.) 


Fig.  36. 


1010  1011 


(a)  Normal  binary  trie. 


(b)  Gray  binary  trie. 
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23.  [20]  Suppose  g(k)  ® 2J  = g(l).  What  is  a simple  way  to  find  l,  given  j and  fc? 

24.  [M21]  Consider  extending  the  Gray  binary  function  g to  all  2-adic  integers  (see 
Section  7.1.3).  What  is  the  corresponding  inverse  function  (/ ' ~ 1 ' ? 

► 25.  [ M25 ] Prove  that  if  g(k)  and  g(l)  differ  in  t > 0 bits,  and  if  0 < k,l  < 2n,  then 
[2731  < \k-l\  <2n  - [273]. 

26.  [25]  (Frank  Ruskey.)  For  which  integers  N is  it  possible  to  generate  all  of  the 
nonnegative  integers  less  than  N in  such  a way  that  only  one  bit  of  the  binary  repre- 
sentation changes  at  each  step? 

► 27.  [20]  Let  So  = {1}  and  S„+i  = 1/(2  + Sn)  U 1/(2  — S„);  thus,  for  example, 


S2 


2 + 1 


2-1 


2+1 


3 13. 
7’  3’ 5 ’ 


and  Sn  has  2n  elements  that  lie  between  | and  1.  Compute  the  10loth  smallest  element 

of  S loo- 


28.  [M2 7]  A median  of  n-bit  strings  (on, . . . ,c*t},  where  a*,  has  the  binary  represen- 
tation a/c  = ak(n- 1)  ■ • • afco,  is  a string  a = an-\  . . .a 0 whose  bits  aj  for  0 < j < n 
agree  with  the  majority  of  the  bits  akj  for  1 < k < t.  (If  t is  even  and  the  bits 
akj  are  half  0 and  half  1,  the  median  bit  aj  can  be  either  0 or  1.)  For  example,  the 
strings  {0010,0100,0101,  1110}  have  two  medians,  0100  and  0110,  which  we  can  denote 
by  01*0. 

a)  Find  a simple  way  to  describe  the  medians  of  Gt  = {<7(0), . . . ,g(t  — 1)},  the  first  t 
Gray  binary  strings,  when  0 < t < 2n . 

b)  Prove  that  if  a = an-i  ■ ■ - a0  is  such  a median,  and  if  2n~1  < t < 2n,  then  the 
string  /3  obtained  from  a by  complementing  any  bit  a3  is  also  an  element  of  Gt. 


29.  [M24]  If  integer  values  k are  transmitted  as  n-bit  Gray  binary  codes  g(k)  and 
received  with  errors  described  by  a bit  pattern  p = (pn- 1 • • -Po)2,  the  average  numerical 
error  is 


1 

2n 


^2  9[  11  ® p)  — k 


assuming  that  all  values  of  k are  equally  likely.  Show  that  this  sum  is  equal  to 
El=o\(k  © p)  ~ k\/2n,  just  as  if  Gray  binary  code  were  not  used,  and  evaluate  it 
explicitly. 

► 30.  [M2 7]  ( Gray  permutation.)  Design  a one-pass  algorithm  to  replace  the  array 
elements  (X0,  Xi,  X2,  ■ ■ ■ , X2"-i)  by  (X9(0),  A9(1),  X9(2), . . . , A9(2n_1)),  using  only  a 
constant  amount  of  auxiliary  storage.  Hint:  Considering  the  function  g(n ) as  a per- 
mutation of  all  nonnegative  integers,  show  that  the  set 


L = {0, 1,  (10)2,  ( 100) 2 , (100*)2,  ( 100*0)2 , (100*0*)2, . . . } 


is  the  set  of  cycle  leaders  (the  smallest  elements  of  the  cycles). 

31.  [HM35]  (Gray  fields.)  Let  /„( x)  = g(rn(x))  denote  the  operation  of  reflecting 
the  bits  of  an  n-bit  binary  string  as  in  exercise  5 and  then  converting  to  Gray  binary 
code.  For  example,  the  operation  fs(x)  takes  (001)2  1+  (110)2  i-t  (010)2  <-+  (011)2  <-* 
(101)2  h->  ( 1 1 1 )2  i->  ( 100)2  <->  (001)2,  hence  all  of  the  nonzero  possibilities  appear  in 
a single  cycle.  Therefore  we  can  use  to  define  a field  of  8 elements,  with  © as  the 
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addition  operator  and  with  multiplication  defined  by  the  rule 

fs]W  X jf(l)  = f[3j+k](  1)  = fl3j](f[3k](  1)). 

The  functions  /2,  /5,  and  /6  have  the  same  nice  property.  But  /4  does  not,  because 
/4((1011)2j  = (1011)2. 

Find  all  n < 100  for  which  /„  defines  a field  of  2"  elements. 

32.  [M20]  True  or  false:  Walsh  functions  satisfy  wk(—x)  — ( — l)kwk(x). 

► 33.  [ M20 ] Prove  the  Rademacher-to- Walsh  law  (17). 

34.  [M21]  The  Paley  functions  pk(x ) are  defined  by 


po(x)  = 1 and  pk(x)  = (-l)[2x]kpLk/2J(2x). 

Show  that  pk(x)  has  a simple  expression  in  terms  of  Rademacher  functions,  analogous 
to  (17)?  and  relate  Paley  functions  to  Walsh  functions. 

35.  [HM23]  The  2n  x 2n  Paley  matrix  Pn  is  obtained  from  Paley  functions  just  as 
the  Walsh  matrix  Wn  is  obtained  from  Walsh  functions.  (See  (20).)  Find  interesting 
relations  between  Pn,  Wn,  and  the  Hadamard  matrix  Hn.  Prove  that  all  three  matrices 
are  symmetric. 


36.  [21  ] Spell  out  the  details  of  an  efficient  algorithm  to  compute  the  Walsh  transform 
(*o, . . . ,*2"-i)  of  a given  vector  (X0, . . . ,X2«_i). 

37.  [HM23]  Let  zki  be  the  location  of  the  Zth  sign  change  in  wk(x),  for  1 < l < k and 
0 < zki  < 1.  Prove  that  |zfci  - l/(k+  1)|  = 0((log k)/k). 

► 38.  [M25]  Devise  a ternary  generalization  of  Walsh  functions. 

► 39.  [HM30]  (J.  J.  Sylvester.)  The  rows  of  (g  _6a)  are  orthogonal  to  each  other  and 
have  the  same  magnitude;  therefore  the  matrix  identity 


(AB) 


_ ( Aa  + Bb  Ab  — Ba  ) / a A + bB  \ 
V bA  - aB  ) 


implies  the  sum-of-two-squares  identity  (a2  + b2)(A 2 + B2)  = (aA  + bB )2  + (bA  - aB)2 
Similarly,  the  matrix 

( a b c d \ 
b —a  d — c 
d c —b  —a 
\ c — d —a  b ) 

leads  to  the  sum-of-four-squares  identity 


(a  +b  +c  +d2)(A2  + B2  +C2  + D2)  = (aA  + bB+cC  + dD)2  + (bA-aB+dC-cD)2 

+ ( dA  + cB  -bC  - aD )2  + ( cA  - dB  - aC  + bD)2 . 

a)  Attach  the  signs  of  the  matrix  H3  in  (21)  to  the  symbols  {a,b,c,d,e,  f,g,h}, 
obtaining  a matrix  with  orthogonal  rows  and  a sum-of-eight-squares  identity. 

b)  Generalize  to  H4  and  higher-order  matrices. 

► 40.  [21]  Would  the  text’s  five-letter  word  computation  scheme  produce  correct  an- 
swers also  if  the  masks  in  step  W2  were  computed  as  nij  = z & (25-J  — 1)  for  0 < j < 5? 

41.  [25]  If  we  use  only  the  3000  most  common  five-letter  words  — thereby  omitting 
ducky,  duces,  dunks,  dinks,  dinky,  dices,  dicey,  dicky,  dicks,  picky,  pinky,  punky, 
and  pucks  from  (23)  — how  many  valid  words  can  still  be  generated  from  a single  pair? 
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42.  [35]  (M.  L.  Fredman.)  Algorithm  L uses  0(nlogn)  bits  of  auxiliary  memory 
for  focus  pointers  as  it  chooses  the  Gray  binary  bit  a,j  to  complement  next.  Step  L3 
examines  ©(logn)  of  the  auxiliary  bits,  and  it  occasionally  changes  fl(logn)  of  them. 

Show  that,  from  a theoretical  standpoint,  we  can  do  better:  The  n-bit  Gray  binary 
code  can  be  generated  by  changing  at  most  2 auxiliary  bits  between  visits.  (We  still 
allow  ourselves  to  examine  O(logn)  of  the  auxiliary  bits  on  each  step,  so  that  we  know 
which  of  them  should  be  changed.) 

43.  [47]  Determine  d( 6),  the  number  of  6-bit  Gray  cycles.  (See  (26).) 

44.  [M20]  Show  that  d(n)  < if  the  n-cube  has  M(n)  perfect  matchings. 

45.  [M4O]  (T.  Feder  and  C.  Subi,  2009.)  This  exercise  constructs  a large  number  of 
Gray  cycles  in  the  (4r  + 2)-cube  G = G-jaGad^oGiDGodG-i,  where  Gi  is  an  r-cube 
for  i > 0 and  Go  = G_  1 = P2.  The  vertices  v are  (4r  + 2)-bit  strings  V4  . . . vov-i,  where 
Vi  has  r bits  for  i > 0 and  1 bit  for  i < 0.  The  “signature”  of  v is  the  4-bit  string  <r{v)  = 
S4S3S2(si®vo),  where  Si  is  the  parity  of  u,.  We  treat  bit  strings  as  binary  numbers. 

For  1 < l < 4,  let  Mi(v)  be  a perfect  matching  in  G with  v — v'  = v\  . . . ngu'_i 
and  v[  = vt  for  i / l.  (Note  that  Mi(v')  = v.)  Also  define  Mo(v)  = v 0 2.  Consider 
the  cycles  formed  by  the  edges  v — Mi(v)(v),  where  l(v)  depends  on  u’s  signature: 

a(v)  = 0000  0001  0011  0010  0110  0111  0101  0100  1100  1101  1111  1110  1010  1011  1001  1000 
l{v)  =0203120412131204 

a)  Suppose  r = 2 and  A 4i(v)  = v 0 22!+S1-1  for  l > 1 and  M i(v)  = 11® 

What  cycle  contains  vertex  0 ...  0 in  this  case? 

b)  A vertex  whose  signature  is  a power  of  2 is  called  a “ground  vertex.”  Four  vertices 
with  the  same  114  .. . V\  are  called  “siblings.”  Define  u = v if  n and  v are  in  the  same 
cycle,  or  if  u and  v are  sibling  ground  vertices,  or  if  a chain  of  such  equivalences 
leads  from  u to  v.  Explain  how  to  construct  cycles  in  G for  each  equivalence  class. 

c)  Furthermore,  if  u and  v are  sibling  ground  vertices,  there  is  such  a cycle  that 
retains  the  edges  {u©2  — u,  u©2  — v } of  the  original  cycles. 

d)  Finally,  show  how  to  convert  the  cycles  of  (b)  and  (c)  into  a single  cycle. 

e)  When  Mi,  ■ ■ . , M 4 vary,  how  many  different  Hamiltonian  cycles  do  we  get? 

46.  [M23]  Extend  exercise  45  to  the  ( kr  + 2)-cube,  for  k even. 

47.  [ HM24 ] What  asymptotic  estimates  do  exercises  44  and  46  give  for  d(n)1^2  ? 

48.  [HM48]  Determine  the  asymptotic  behavior  of  <f(n)1//2  as  n A 00. 

49.  [20]  Prove  that  for  all  n > 1 there  is  a 2n-bit  Gray  cycle  in  which  vk+22n-i  is  the 
complement  of  vk,  for  all  k > 0. 

► 50.  [21]  Find  a construction  like  that  of  Theorem  D but  with  l even. 

51.  [M24]  ( Balanced  Gray  cycles.)  Complete  the  proof  of  Corollary  B to  Theorem  D. 

52.  [M20]  Prove  that  if  the  transition  counts  of  an  n-bit  Gray  cycle  satisfy  Co  < Ci  < 
• • • < c„-i,  we  must  have  Co  + • • • + Cj- 1 > 2\  with  equality  when  j = n. 

53.  [M46]  If  the  numbers  (co, . . . , cn_i)  are  even  and  satisfy  the  condition  of  the 
previous  exercise,  is  there  always  an  n-bit  Gray  cycle  with  these  transition  counts? 

54.  [M20]  (H.  S.  Shapiro,  1953.)  Show  that  if  a sequence  of  integers  (a  1,. . . , 02")  con- 
tains only  n distinct  values,  then  there  is  a subsequence  whose  product  (ik+i  a.k+2  ■ ■ - at 
is  a perfect  square,  for  some  0 < k < l < 2n . However,  this  conclusion  might  not  be 
true  if  we  disallow  the  case  l = 2n . 

► 55.  [35]  (F.  Ruskey  and  C.  Savage,  1993.)  If  (r>o, . . . , U2«-i)  is  an  n-bit  Gray  cycle, 
the  pairs  { {v2k,  V2k+\}  \ 0 < k < 2n_1  } form  a perfect  matching  between  the  vertices 
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of  even  and  odd  parity  in  the  n-cube.  Conversely,  does  every  such  perfect  matching 
arise  as  half  of  some  72-bit  Gray  cycle? 

56.  [M30]  (E.  N.  Gilbert,  1958.)  Say  that  two  Gray  cycles  are  equivalent  if  their  delta 
sequences  can  be  made  equal  by  permuting  the  coordinate  names,  or  by  reversing  the 
cycle  and/or  starting  the  cycle  at  a different  place.  Show  that  the  2688  different  4-bit 
Gray  cycles  fall  into  just  9 equivalence  classes. 

57.  [32]  Consider  a graph  whose  vertices  are  the  2688  possible  4-bit  Gray  cycles, 
where  two  such  cycles  are  adjacent  if  they  are  related  by  one  of  the  following  simple 
transformations: 


After  Type  1 After  Type  2 After  Type  3 After  Type  4 


(Type  1 changes  arise  when  the  cycle  can  be  broken  into  two  parts  and  reassembled 
with  one  part  reversed.  Types  2,  3,  and  4 arise  when  the  cycle  can  be  broken  into  three 
parts  and  reassembled  after  reversing  0,  1,  or  2 of  the  parts.  The  parts  need  not  have 
equal  size.  Such  transformations  of  Hamiltonian  cycles  are  often  possible.) 

Write  a program  to  discover  which  4-bit  Gray  cycles  are  transformable  into  each 
other  by  finding  the  connected  components  of  the  graph;  restrict  consideration  to  only 
one  of  the  four  types  at  a time. 


► 58.  [21]  Let  a be  the  delta  sequence  of  an  n-bit  Gray  cycle,  and  obtain  0 from  q by 
changing  q occurrences  of  0 to  n,  where  q is  odd.  Prove  that  6(3  is  the  delta  sequence 
of  an  (n  -f-  l)-bit  Gray  cycle. 

59.  [22]  The  5-bit  Gray  cycle  of  (30)  is  nonlocal  in  the  sense  that  no  2f  consecutive 
elements  belong  to  a single  f-subcube,  for  1 < t < n.  Prove  that  nonlocal  n-bit  Gray 
cycles  exist  for  all  n > 5.  [Hint:  See  the  previous  exercise.] 

60.  [20]  Show  that  the  run-length-bound  function  satisfies  r(n  + 1)  > r(n). 

61  [M30]  Show  that  r(m  + n)  > r(m)  + r(n)  - 1 if  (a)  m = 2 and  2 < r(n)  < 8-  or 

(b)  m < n and  r(n)  < 2m_3.  ’ 

62.  [46]  Does  r(8)  = 6? 

63.  [30]  (Luis  Goddyn.)  Prove  that  r(10)  > 8. 


► 64.  [HM35]  (L.  Goddyn  and  P.  Gvozdjak.)  An  n-bit  Gray  stream  is  a sequence  of 
permutations  (a0,au. . . , cr,_x)  where  each  ak  is  a permutation  of  the  vertices  of  the 
n-cube,  taking  every  vertex  to  one  of  its  neighbors. 

a)  Suppose  (u0, . . . , «2™_1)  is  an  m-bit  Gray  cycle  and  (a0,  au  . . . , a,™^)  is  an  n-bit 
ray  stream.  Let  v0  - 0 ...  0 and  nfc+1  = vkak,  where  ak  = ak  mod  2m  if  k > 2m. 
Under  what  conditions  is  the  sequence 


W — (uoVo,  UoVi,  UiVi,  U1V2, 


• 9 ^2m  + Tl—  1 — 1 r>2m  + n — 1 _ 1 5 


'U'2rn+n~  1 — 1^2m  + n_  1 ) 


an  (ra  + 72)-bit  Gray  cycle? 

b)  Show  that  if  m is  sufficiently  large,  there  is  an  n-bit  Gray  stream  satisfying  the 
conditions  of  (a)  for  which  all  run  lengths  of  the  sequence  (n0,  vi, . . . ) are  > n - 2 

c)  Apply  these  results  to  prove  that  r(n)  > n - O(logn). 

65.  [30]  (Brett  Stevens.)  In  Samuel  Beckett’s  play  Quad,  the  stage  begins  and  ends 
empty;  n actors  enter  and  exit  one  at  a time,  running  through  all  2n  possible  subsets, 
and  the  actor  who  leaves  is  always  the  one  whose  previous  entrance  was  earliest.  When 
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n = 4,  as  in  the  actual  play,  some  subsets  are  necessarily  repeated.  Show,  however, 
that  there  is  a perfect  pattern  with  exactly  2n  entrances  and  exits  when  n = 5. 

66.  [40]  Is  there  a perfect  Beckett-Gray  pattern  for  8 actors? 

67.  [20]  Sometimes  it  is  desirable  to  run  through  all  n-bit  binary  strings  by  changing 
as  many  bits  as  possible  from  one  step  to  the  next,  for  example  when  testing  a physical 
circuit  for  reliable  behavior  in  worst-case  conditions.  Explain  how  to  traverse  all  binary 
n-tuples  in  such  a way  that  each  step  changes  n or  n — 1 bits,  alternately. 

68.  [21]  Rufus  Q.  Perverse  decided  to  construct  an  anti-Gray  ternary  code,  in  which 
each  n-trit  number  differs  from  its  neighbors  in  every  digit  position.  Is  such  a code 
possible  for  all  n? 

► 69.  [M25]  Modify  the  definition  of  Gray  binary  code  (7)  by  letting 

h(k)  — (. . . (be  © be)  (be  © 64 ) (64  © 63  © 62  © bo)  (63  © bo)  (62  ffi  61  © bo)  bi)2, 
when  k = (. . . 656463626160)2- 

a)  Show  that  the  sequence  6(0),  6(1),  . . . , 6(2n—  1)  runs  through  all  n-bit  numbers 
in  such  a way  that  exactly  3 bits  change  each  time,  when  n > 3. 

b)  Generalize  this  rule  to  obtain  sequences  in  which  exactly  t bits  change  at  each 
step,  when  t is  odd  and  n > t. 

70.  [21]  How  many  monotonic  n-bit  Gray  codes  exist  for  n = 5 and  n = 6? 

71.  [M22]  Derive  (45),  the  recurrence  that  defines  the  Savage-Winkler  permutations. 

72.  [20]  What  is  the  Savage-Winkler  code  from  00000  to  11111? 

► 73.  [32]  Design  an  efficient  algorithm  to  construct  the  delta  sequence  of  an  n-bit 
monotonic  Gray  code. 

74.  [M25]  (Savage  and  Winkler.)  How  far  apart  can  adjacent  vertices  of  the  n-cube 
be,  in  a monotonic  Gray  code? 

75.  [32]  Find  all  5-bit  Gray  paths  no,  . . . , W31  that  are  trend-free,  in  the  sense  that 
£3*Lo  k(  — l)Vkj  = 0 in  each  coordinate  position  j. 

76.  [M25]  Prove  that  trend-free  n-bit  Gray  codes  exist  for  all  n > 5. 

77.  [21]  Modify  Algorithm  H in  order  to  visit  mixed-radix  n-tuples  in  modular  Gray 
order. 

78.  [M26]  Prove  the  conversion  formulas  (50)  and  (51)  for  reflected  mixed-radix  Gray 
codes,  and  derive  analogous  formulas  for  the  modular  case. 

► 79.  [M22]  When  is  the  last  n-tuple  of  the  (a)  reflected  (b)  modular  mixed-radix  Gray 
code  adjacent  to  the  first? 

80.  [M20]  Explain  how  to  run  through  all  divisors  of  a number,  given  its  prime  fac- 
torization p\l  . . . p[‘ , repeatedly  multiplying  or  dividing  by  a single  prime  at  each  step. 

81.  [M21]  Let  (ao,60),  (ai,6j),  ...,  (am2_1,bm2_1)  be  the  2-digit  m- ary  modular 
Gray  code.  Show  that,  if  m > 2,  every  edge  (x,  y)  — (x,  (y  + 1)  mod  m)  and  (x,  y ) — 
((x  + 1)  mod  m,y)  occurs  in  one  of  the  two  cycles 

(ao,  b0)  — (ai,  61) (avn2_1,bm2_1)  — (ao,  bo), 

(60,  ao)  — (61,  di) (bm2_1,  am2_!)  — (bo,  ao). 

► 82.  [M25]  (G.  Ringel,  1956.)  Use  the  previous  exercise  to  deduce  that  there  exist  four 
8-bit  Gray  cycles  that,  together,  cover  all  edges  of  the  8-cube. 

83.  [41]  Can  four  balanced  8-bit  Gray  cycles  cover  all  edges  of  the  8-cube? 


316  COMBINATORIAL  SEARCHING 


7.2.1. 1 


► 84.  [25]  (Howard  L.  Dyckman.)  Figure  37  shows  a fascinating  puzzle  called  Loony 
Loop  or  the  Gordian  Knot,  in  which  the  object  is  to  remove  a flexible  cord  from  the 
rigid  loops  that  surround  it.  Show  that  the  solution  to  this  puzzle  is  inherently  related 
to  the  reflected  Gray  ternary  code. 


► 85.  [M25]  (Dana  Richards.)  If  T — (a0, . . . , a(_j)  is  any  sequence  of  t strings  and 
r = (a'0, , . . , av_i)  is  any  sequence  of  t1  strings,  the  boustrophedon  product  FiT'  is  the 
sequence  of  tt'  strings  that  begins 

• • • > aO®t'-l>  al a*'-l>  • • ■ , Ofitto,  Q2Q0,  ■ • • , c*2ai'-l,  a3Qt' ) 

and  ends  with  at_ ia'0  if  t is  even,  at_1a't'_1  if  t is  odd.  For  example,  the  basic  definition 
of  Gray  binary  code  in  (5)  can  be  expressed  in  this  notation  as  F„  = (0, 1)  l rn_x  when 
n > 0.  Prove  that  the  operation  l is  associative,  hence  Tm+n  = Tm  ) rn. 

► 86.  [26]  Define  an  infinite  Gray  code  that  runs  through  all  possible  nonnegative 
integer  n-tuples  (ax,...,a„)  in  such  a way  that  max(a1, . . . ,on)  < max(ai, . . . , a'n) 
when  (01,. . . ,a„)  is  followed  by  (a'x, . . . ,a'n). 

87.  [27]  Continuing  the  previous  exercise,  define  an  infinite  Gray  code  that  runs 
through  all  integer  n-tuples  (oi,...,a„),  in  such  a way  that  max(|ax|, . . . , |an|)  < 
max(|ai  I, . . . , \a'n\)  when  (ax, . . . ,an)  is  followed  by  (a'x, . . . ,a'n). 

► 88.  [25]  After  Algorithm  K has  terminated  in  step  K4,  what  would  happen  if  we 
immediately  restarted  it  in  step  K2? 

► 89.  [25]  [Gray  code  for  Morse  code.)  The  Morse  code  words  of  length  n (exercise 
4.5.3-32)  are  strings  of  dots  and  dashes,  where  n is  the  number  of  dots  plus  twice  the 
number  of  dashes. 

a)  Show  that  it  is  possible  to  generate  all  Morse  code  words  of  length  n by  successively 
changing  a dash  to  two  dots  or  vice  versa.  For  example,  the  path  for  n = 3 must 
be  • — , • • • , — . or  its  reverse. 

b)  What  string  follows  . in  your  sequence  for  n = 15? 

90.  [26]  For  what  values  of  n can  the  Morse  code  words  be  arranged  in  a cycle,  under 
the  ground  rules  of  exercise  89?  [Hint:  The  number  of  code  words  is  Fn+1.] 

► 91.  [34]  Design  a loopless  algorithm  to  visit  all  binary  n-tuples  (a1,...,a„)  such  that 
“i  < a2  > a3  < a4  > ■ ■ ■ . [The  number  of  such  n-tuples  is  Fn+ 2.] 

92.  [M30]  Is  there  an  infinite  sequence  whose  first  mn  elements  form  an  m-ary 
de  Bruijn  cycle,  for  all  m?  [The  case  n = 2 is  solved  in  (54).] 

► 93.  [M28]  Prove  that  Algorithm  R outputs  a de  Bruijn  cycle  as  advertised. 

94.  [22]  What  is  the  output  of  Algorithm  D when  m = 5,  n = 1,  and  r = 3,  if  the 
coroutines  /()  and  /'()  generate  the  trivial  cycles  01234  01234  01 . . .? 
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► 95.  [M23]  Suppose  an  infinite  sequence  aoaia.2  ...  of  period  p is  interleaved  with  an 
infinite  sequence  b0b\b2  . . . of  period  q to  form  the  infinite  cyclic  sequence 

C0C1C2C3C4C5  ...  = ao6o®ifci“2&2  • 

a)  Under  what  circumstances  does  C0C1C2  . . . have  period  pql  (The  “period”  of  a 
sequence  0,00.10,2  . . . , for  the  purposes  of  this  exercise,  is  the  smallest  integer  p > 0 
such  that  a,k  = a k+p  for  all  fc  > 0.) 

b)  Which  2n-tuples  would  occur  as  consecutive  outputs  of  Algorithm  D if  step  D6 
were  changed  to  say  simply  “If  t'  — n and  x'  < r,  go  to  D4”? 

c)  Prove  that  Algorithm  D outputs  a de  Bruijn  cycle  as  advertised. 

► 96.  [ M28 ] Suppose  a family  of  coroutines  has  been  set  up  to  generate  a de  Bruijn 
cycle  of  length  mn  using  Algorithms  R and  D,  based  recursively  on  simple  coroutines 
like  Algorithm  S for  the  base  case  n = 2,  and  using  Algorithm  D when  n > 2 is  even. 

a)  How  many  coroutines  (Rn,  Dn,  Sn)  of  each  type  will  there  be? 

b)  What  is  the  maximum  number  of  coroutine  activations  needed  to  get  one  top-level 
digit  of  output? 

97.  [M29]  The  purpose  of  this  exercise  is  to  analyze  the  de  Bruijn  cycles  constructed 
by  Algorithms  R and  D in  the  important  special  case  m = 2.  Let  }n[k)  be  the  (fc  + l)st 
bit  of  the  2"-cycle,  so  that  fn[k)  = 0 for  0 < k < n.  Also  let  jn  be  the  index  such  that 
0 < jn  < 2n  and  f„[k)  — 1 for  jn  < k < j„  + n. 

a)  Write  out  the  cycles  (/n(0) . . . /„( 2n  — 1))  for  n = 2,  3,  4,  and  5. 

b)  Prove  that,  for  all  even  values  of  n,  there  is  a number  5n  = ±1  such  that  we  have 

, ,.vfE  fn(k),  if  0 < fc  < j„  or  2n  + jn  < k < 2n+1, 

JU+  ' ’ ~ ll  + £/„(fc  + <5n),  if  jn  < k < 2”+  jn, 

where  the  congruence  is  modulo  2.  (In  this  formula  £/  stands  for  the  summation 
function  £/(fc)  = Yl'jZo  /0)-)  Hence  j„+i  = 2n-  Sn  when  n is  even. 

c)  Let  (c„(0)c„(l) . . .cn(22"—  5))  be  the  cycle  produced  when  the  simplified  version 
of  Algorithm  D in  exercise  95(b)  is  applied  to  fn{)-  Where  do  the  (2 n — l)-tuples 
l2n_1  and  (01)Il_10  occur  in  this  cycle? 

d)  Use  the  results  of  (c)  to  express  f2n{k)  in  terms  of  fn()- 

e)  Find  a (somewhat)  simple  formula  for  jn  as  a function  of  n. 

98.  [M34]  Continuing  the  previous  exercise,  design  an  efficient  algorithm  to  compute 
fn{k),  given  n > 2 and  k > 0. 

► 99.  [ M23 ] Exploit  the  technology  of  the  previous  exercises  to  design  an  efficient 
algorithm  that  locates  any  given  n-bit  string  in  the  cycle  (/n(0)/n(l) . . . /„( 2n—  1)). 
100.  [40]  Do  the  de  Bruijn  cycles  of  exercise  97  provide  a useful  source  of  pseudo- 
random bits  when  n is  large? 

► 101.  [M30]  [Unique  factorization  of  strings  into  nonincreasing  primes.) 

a)  Prove  that  if  A and  A'  are  prime,  then  A A'  is  prime  if  A < A'. 

b)  Consequently  every  string  a can  be  written  in  the  form 

a = AiA2  ...  A*,  Ai  > A2  > • • • > A(,  where  each  A j is  prime. 

c)  In  fact,  only  one  such  factorization  is  possible.  Hint:  Show  that  At  must  be  the 
lexicographically  smallest  nonempty  suffix  of  a. 

d)  True  or  false:  Ai  is  the  longest  prime  prefix  of  a. 

e)  What  are  the  prime  factors  of  3141592653589793238462643383279502884197? 
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102.  [HM28]  Deduce  the  number  of  m-ary  primes  of  length  n from  the  unique  factor- 
ization theorem  in  the  previous  exercise. 

103.  [M20]  Use  Eq.  (59)  to  prove  Fermat’s  theorem  that  mv  = m (modulo  p). 

104.  [17]  According  to  formula  (60),  about  1/n  of  all  n-letter  words  are  prime.  How 
many  of  the  5757  five-letter  GraphBase  words  are  prime?  Which  of  them  is  the  smallest 
nonprime?  The  largest  prime? 

105.  [ M31 } Let  a be  a preprime  string  of  length  n on  an  infinite  alphabet. 

a)  Show  that  if  the  final  letter  of  a is  increased,  the  resulting  string  is  prime. 

b)  If  a has  been  factored  as  in  exercise  101,  show  that  it  is  the  n-extension  of  Ai. 

c)  Furthermore  a cannot  be  the  n-ex tension  of  two  different  primes. 

► 106.  [M30]  By  reverse-engineering  Algorithm  F,  design  an  algorithm  that  visits  all 
m-ary  primes  and  preprimes  in  decreasing  order. 

107.  [HM30]  Analyze  the  running  time  of  Algorithm  F,  for  fixed  m as  n ->  00. 

108.  [M35]  Let  Ai  < • • • < A4  be  the  m-ary  prime  strings  whose  lengths  divide  n,  and 
let  ai  . . . a„  be  any  m-ary  string.  The  object  of  this  exercise  is  to  prove  that  01  . . . an 
appears  in  Ax  . . . AtAiA2;  hence  Ai  . . . At  is  a de  Bruijn  cycle  (since  it  has  length  mn ) 
For  convenience  we  may  assume  that  m = 10  and  that  strings  correspond  to  decimal 
numbers;  the  same  arguments  will  apply  for  arbitrary  m > 2. 

a)  Show  that  if  m ...an=a/3  is  distinct  from  all  its  cyclic  shifts,  and  if  /3a  = \k  is 
prime,  then  a/3  is  a substring  of  \k\k+i  ■ unless  a = IF  for  some  j > 1. 

b)  Where  does  a/3  appear  in  Ai  . . . \t  if  /3a  is  prime  and  a consists  of  all  9s?  Hint: 
Show  that  if  an+1_;  . . . an  = 9l  in  step  F2  for  some  l > 0,  and  if  j is  not  a divisor 
of  n,  the  previous  step  F2  had  an_;  ...an  = 9,+1. 

c)  Now  consider  n-tuples  of  the  form  ( a/3)d , where  d > 1 is  a divisor  of  n and 
/3a  = A*,  is  prime. 

d)  Where  do  899135,  997879,  913131,  090909,  909090,  and  911911  occur  when  n = 6? 

e)  Is  Ai  . . . A<  the  lexicographically  least  m-ary  de  Bruijn  cycle  of  length  mn? 

109.  [M22]  An  m-ary  de  Bruijn  torus  of  size  m2  x m2  for  2 x 2 windows  is  a matrix 
of  m-ary  digits  dij  such  that  each  of  the  m4  submatrices 


( dij  di(j+i)  \ 
\d(i+l)j  d(t+l)(j+l)  / 


0 < i,j  < m2 


is  different,  where  subscripts  wrap  around  modulo  m2.  Thus  every  possible  m-ary  2x2 
submatrix  occurs  exactly  once;  Ian  Stewart  [Game,  Set,  and  Math  (Oxford:  Blackwell. 
1989),  Chapter  4]  has  therefore  called  it  an  m-ary  ourotorus.  For  example, 

/0  0 1 0\ 

0 0 0 1 
0 111 
\1  0 1 l) 

is  a binary  ourotorus;  indeed,  it  is  essentially  the  only  such  matrix  when  m = 2,  except 
for  shifting  and/or  transposition. 

Consider  the  infinite  matrix  D whose  entry  in  row  i = (.  ..a2aia 0)2  and  column 
j — (. . . &2&1 60)2  is  dij  = (. . . C2CiCo)2,  where 

Co  = (a 0 © bo)(ai  © 61)  © 61; 

Ck  = (a,2ka0  © b2k)bo  © (a2k+iao  © &2fc+i)(ho  © 1),  for  k > 0. 

Show  that  the  upper  left  22n  x 22n  submatrix  of  I)  is  a 2"-ary  ourotorus  for  all  n > 0. 
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110.  [ M25 ] Continuing  the  previous  exercise,  construct  m- ary  ourotoruses  for  all  m. 

111.  [20]  We  can  obtain  the  number  100  in  twelve  ways  by  inserting  + and  — signs 
into  the  sequence  123456789;  for  example,  100  = 1 + 23  — 4 + 5 + 6 + 78  — 9 = 
123  - 45  - 67  + 89  = -1  + 2 - 3 + 4 + 5 + 6 + 78  + 9. 

a)  What  is  the  smallest  positive  integer  that  cannot  be  represented  in  such  a way? 

b)  Consider  also  inserting  signs  into  the  10-digit  sequence  9876543210. 

► 112.  [25]  Continuing  the  previous  exercise,  how  far  can  we  go  by  inserting  signs  into 
12345678987654321?  For  example,  100  = -1234  - 5 - 6 + 7898  - 7 - 6543  - 2 - 1. 

7. 2. 1.2.  Generating  all  permutations.  After  n-tuples,  the  next  most  im- 
portant item  on  nearly  everybody’s  wish  list  for  combinatorial  generation  is  the 
task  of  visiting  all  permutations  of  some  given  set  or  multiset.  Many  different 
ways  have  been  devised  to  solve  this  problem.  In  fact,  almost  as  many  different 
algorithms  have  been  published  for  unsorting  as  for  sorting!  We  will  study  the 
most  important  permutation  generators  in  this  section,  beginning  with  a classical 
method  that  is  both  simple  and  flexible: 

Algorithm  L ( Lexicographic  permutation  generation) . Given  a sequence  of  n 
elements  a\a2  ■ ■ ■ a„,  initially  sorted  so  that 

ai  < a2  < • • • < a„,  (i) 

this  algorithm  generates  all  permutations  of  {ai,  <12, . . . , an},  visiting  them  in 
lexicographic  order.  (For  example,  the  permutations  of  {1,2,2,  3}  are 

1223,  1232,  1322,  2123,  2132,  2213,  2231,  2312,  2321,  3122,  3212,  3221, 

ordered  lexicographically.)  An  auxiliary  element  is  assumed  to  be  present  for 
convenience;  ao  must  be  strictly  less  than  the  largest  element  an. 

LI.  [Visit.]  Visit  the  permutation  0102  . . . a„. 

L2.  [Find  j]  Set  j «—  n — 1.  If  aj  > aJ+i,  decrease  j by  1 repeatedly  until 
aj  < a:j  + 1 . Terminate  the  algorithm  if  j = 0.  (At  this  point  j is  the  smallest 
subscript  such  that  we’ve  already  visited  all  permutations  beginning  with 
a 1 . . . aj.  So  the  lexicographically  next  permutation  will  make  aj  larger.) 
L3.  [Increase  aj.]  Set  l +-  n.  If  a3  > a;,  decrease  l by  1 repeatedly  until  <+  < a;. 
Then  interchange  aj  «+  a;.  (Since  aJ+i  > •••  > an,  element  a;  is  the 
smallest  element  greater  than  aj  that  can  legitimately  follow  ai . . . aj_i  in  a 
permutation.  Before  the  interchange  we  had  aj+i  > ■ ■ ■ > a;_i  > a;  > aj  > 

ai+i  > ■ • • > an ; after  the  interchange,  we  have  aj+ 1 > • ■ • > a;_  1 > aj  > 

a;  > a;+i  > • ■ • > an.) 

L4.  [Reverse  aJ+l  . . . an.\  Set  k <—  j + 1 and  l +-  n.  Then,  while  k < l,  inter- 
change af.  <+  ai  and  set  k <—  k + 1,  Z +-  l — 1.  Return  to  LI.  | 

This  algorithm  goes  back  to  Narayana  Pandita  in  14th-century  India  (see  Section 
7. 2. 1.7);  it  also  appeared  in  C.  F.  Hindenburg’s  preface  to  Specimen  Analyticum 
de  Lineis  Curvis  Secundi  Ordinis  by  C.  F.  Riidiger  (Leipzig:  1784),  xlvi-xlvii, 
and  it  has  been  frequently  rediscovered  ever  since.  The  parenthetical  remarks  in 
steps  L2  and  L3  explain  why  it  works. 
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Tin  tan  din  dan  bim  bam  bom  bo  — 
tan  tin  din  dan  bam  bim  bo  bom  — 
tin  tan  dan  din  bim  bam  bom  bo  — 
tan  tin  dan  din  bam  bim  bo  bom  — 
tan  dan  tin  bam  din  bo  bim  bom  — 
....  Tin  tan  din  dan  bim  bam  bom  bo. 

DOROTHY  L.  SAYERS,  The  Nine  Tailors  (1934) 

A permutation  on  the  ten  decimal  digits  is  simply  a 10  digit  decimal  number 
m which  all  digits  are  distinct.  Hence  all  we  need  to  do  is  to  produce 
all  10  digit  numbers  and  select  only  those  whose  digits  are  distinct. 

Isn  t it  wonderful  how  high  speed  computing  saves  us  from 
the  drudgery  of  thinking!  We  simply  program  k + 1 ->  k 
and  examine  the  digits  of  k for  undesirable  equalities. 

This  gives  us  the  permutations  in  dictionary  order  too! 
On  second  sober  thought  ...  we  do  need  to  think  of  something  else. 

— D.  H.  LEHMER  (1957) 


In  general,  the  lexicographic  successor  of  any  combinatorial  pattern  ax  . . ,an 
is  obtainable  by  a three-step  procedure: 

1)  Find  the  largest  j such  that  a3  can  be  increased. 

2)  Increase  a3  by  the  smallest  feasible  amount. 

3)  Find  the  lexicographically  least  way  to  extend  the  new  ax  . . . a3  to  a complete 
pattern. 

Algorithm  L follows  this  general  procedure  in  the  case  of  permutation  generation, 
just  as  Algorithm  7.2.1.1M  followed  it  in  the  case  of  n-tuple  generation;  we  will 
see  numerous  further  instances  later,  as  we  consider  other  kinds  of  combinatorial 
patterns.  Notice  that  we  have  aj+1  > • • • > an  at  the  beginning  of  step  L4. 
Therefore  the  first  permutation  beginning  with  the  current  prefix  ax  . . . a3  is 
ai . . . a3an  . . . aj+1,  and  step  L4  produces  it  by  doing  [(n  - j)/2j  interchanges. 

In  practice,  step  L2  finds  j — n - 1 half  of  the  time  when  the  elements  are 
distinct,  because  exactly  n!/2  of  the  n!  permutations  have  an_x  < an.  Therefore 
Algorithm  L can  be  speeded  up  by  recognizing  this  special  case,  without  making 
it  significantly  more  complicated.  (See  exercise  1.)  Similarly,  the  probability 
that  j < n - 1 is  only  1/t!  when  the  a’s  are  distinct;  hence  the  loops  in  steps  L2 
L4  usually  go  very  fast.  Exercise  6 analyzes  the  running  time  in  general,  showing 
that  Algorithm  L is  reasonably  efficient  even  when  equal  elements  are  present, 
unless  some  values  appear  much  more  often  than  others  do  in  the  multiset 
{oi,a2, . 

Adjacent  interchanges.  We  saw  in  Section  7.2.1. 1 that  Gray  codes  are  ad- 
vantageous for  generating  n-tuples,  and  similar  considerations  apply  when  we 
want  to  generate  permutations.  The  simplest  possible  change  to  a permutation 
is  to  interchange  adjacent  elements,  and  we  know  from  Chapter  5 that  any 
permutation  can  be  sorted  into  order  if  we  make  a suitable  sequence  of  such 
interchanges.  (For  example,  Algorithm  5.2.2B  works  in  this  way.)  Hence  we  can 
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go  backward  and  obtain  any  desired  permutation,  by  starting  with  all  elements 
in  order  and  then  exchanging  appropriate  pairs  of  adjacent  elements. 

A natural  question  now  arises:  Is  it  possible  to  run  through  all  permutations 
of  a given  multiset  in  such  a way  that  only  two  adjacent  elements  change  places 
at  every  step?  If  so,  the  overall  program  that  is  examining  all  permutations  will 
often  be  simpler  and  faster,  because  it  will  only  need  to  calculate  the  effect  of 
an  exchange  instead  of  to  reprocess  an  entirely  new  array  a\  . . ,an  each  time. 

Alas,  when  the  multiset  has  repeated  elements,  we  can’t  always  find  such 
a Gray-like  sequence.  For  example,  the  six  permutations  of  {1, 1,2,2}  are  con- 
nected to  each  other  in  the  following  way  by  adjacent  interchanges: 

2112  

1 122  1212  ___  _ 2121  2211;  (2) 

1221  ' w 


this  graph  has  no  Hamiltonian  path. 

But  most  applications  deal  with  permutations  of  distinct  elements,  and  for 
this  case  there  is  good  news:  A simple  algorithm  makes  it  possible  to  generate 
all  n!  permutations  by  making  just  n!  — 1 adjacent  interchanges.  Furthermore, 
another  such  interchange  returns  to  the  starting  point,  so  we  have  a Hamiltonian 
cycle  analogous  to  Gray  binary  code. 

The  idea  is  to  take  such  a sequence  for  {l,...,n  — 1}  and  to  insert  the 
number  n into  each  permutation  in  all  ways.  For  example,  if  n = 4 the  sequence 
(123, 132,312,321,231,213)  leads  to  the  columns  of  the  array 


1234 

1324 

3124 

3214 

2314 

2134 

1243 

1342 

3142 

3241 

2341 

2143 

1423 

1432 

3412 

3421 

2431 

2413 

4123 

4132 

4312 

4321 

4231 

4213 

when  4 is  inserted  in  all  four  possible  positions.  Now  we  obtain  the  desired 
sequence  by  reading  downwards  in  the  first  column,  upwards  in  the  second,  down- 
wards in  the  third,  . . . , upwards  in  the  last:  (1234, 1243, 1423,4123,4132, 1432, 
1342, 1324,  3124, 3142, . . . , 2143,  2134). 

In  Section  5.1.1  we  studied  the  inversions  of  a permutation,  namely  the  pairs 
of  elements  (not  necessarily  adjacent)  that  are  out  of  order.  Every  interchange 
of  adjacent  elements  changes  the  total  number  of  inversions  by  ±1.  In  fact,  when 
we  consider  the  so-called  inversion  table  Ci  . . .cn  of  exercise  5. 1.1-7,  where  Cj  is 
the  number  of  elements  lying  to  the  right  of  j that  are  less  than  j,  we  find  that 
the  permutations  in  (3)  have  the  following  inversion  tables: 


0000 

0010 

0020 

0120 

0110 

0100 

0001 

0011 

0021 

0121 

0111 

0101 

0002 

0012 

0022 

0122 

0112 

0102 

0003 

0013 

0023 

0123 

0113 

0103 

And  if  we  read  these  columns  alternately  down  and  up  as  before,  we  obtain 
precisely  the  reflected  Gray  code  for  mixed  radices  (1,  2,  3, 4),  as  in  Eqs.  (46)  (51) 
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of  Section  7.2.1. 1.  The  same  property  holds  for  all  n,  as  noticed  by  E.  W Dijkstra 
[Acta  Information  6 (1976),  357-359],  and  it  leads  us  to  the  following  formulation: 

Algorithm  P ( Plain  changes).  Given  a sequence  a1a2...a„  of  n distinct 
elements,  this  algorithm  generates  all  of  their  permutations  by  repeatedly  inter- 
changing adjacent  pairs.  It  uses  an  auxiliary  array  cic2  . ..c„,  which  represents 
inversions  as  described  above,  running  through  all  sequences  of  integers  such  that 

0 <cj  < j for  1 < j < n.  (5) 

Another  array  0lo2  ...on  governs  the  directions  by  which  the  entries  Cj  change. 
PI.  [Initialize.]  Set  cj  T-  0 and  oj  «-  1 for  1 < j < n. 

P2.  [Visit.]  Visit  the  permutation  Oia2  . . . an. 

P3.  [Prepare  for  change.]  Set  j <-  n and  s <-  0.  (The  following  steps  determine 
the  coordinate  j for  which  cj  is  about  to  change,  preserving  (5);  variable  s 
is  the  number  of  indices  k > j such  that  ck  = k - 1.) 

P4.  [Ready  to  change?]  Set  q «-  Cj  + 0j.  If  q < 0,  go  to  P7;  if  q = j , go  to  P6. 
P5.  [Change.]  Interchange  aJ_Cj+s  -h-  Oj_9+s.  Then  set  cj  <-  q and  return  to  P2. 
P6.  [Increase  s .]  Terminate  if  j = 1;  otherwise  set  s <-  s + 1. 

P7.  [Switch  direction.]  Set  Oj  < Oj,  j <—  j — 1,  and  go  back  to  P4.  | 

This  procedure,  which  clearly  works  for  all  n > 1,  originated  in  17th-century 
England,  when  bell  ringers  began  the  delightful  custom  of  ringing  a set  of  bells 
in  all  possible  permutations.  They  called  Algorithm  P the  method  of  plain 
changes.  Figure  38(a)  illustrates  the  “Cambridge  Forty-Eight,”  an  irregular 
and  ad  hoc  sequence  of  48  permutations  on  5 bells  that  had  been  used  in 
the  early  1600s,  before  the  plain-change  principle  revealed  how  to  achieve  all 
5!  = 120  possibilities.  The  venerable  history  of  Algorithm  P has  been  traced  to 
a manuscript  by  Peter  Mundy  now  in  the  Bodleian  Library,  written  about  1653 
and  transcribed  by  Ernest  Morris  in  The  History  and  Art  of  Change  Ringing 
(1931),  29-30.  Shortly  afterwards,  a famous  book  called  Tintinnalogia , published 
anonymously  in  1668  but  now  known  to  have  been  written  by  Richard  Duckworth 
and  Fabian  Stedman,  devoted  its  first  60  pages  to  a detailed  description  of  plain 
changes,  working  up  from  n — 3 to  the  case  of  arbitrarily  large  n. 

Cambridge  Forty-eight,  for  many  years, 
was  the  greatest  Peal  that  was  Rang  or  invented;  but  now, 
neither  Forty-eight,  nor  a Hundred,  nor  Seven-hundred  and  twenty, 
nor  any  Number  can  confine  us;  for  we  can  Ring  Changes,  Ad  infinitum! 

. . . On  four  Bells,  there  are  Twenty  four  several  Changes, 
in  Ringing  of  which,  there  is  one  Bell  called  the  Hunt, 
and  the  other  three  are  Extream  Bells; 
the  Hunt  moves,  and  hunts  up  and  down  continually  . . . ; 

two  of  the  Extream  Bells  makes  a Change 
every  time  the  Hunt  comes  before  or  behind  them. 

R.  DUCKWORTH  and  F.  STEDMAN,  Tintinnalogia  (1668) 
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(incomplete) 

(a)  The  Cambridge  Forty-Eight. 


(b)  Plain  Changes. 


' \JOOOB 

C/\J  NjooooLA.  \-> 

^ANaneaL/  s\ Y", 

(c)  Grandsire  Doubles. 


(d)  Stedman  Doubles. 


Fig.  38.  Four  patterns  that  were  used  in  17th-century 
England  to  ring  permutations  of  five  different  church- 
bells.  Pattern  (b)  corresponds  to  Algorithm  P. 


British  bellringing  enthusiasts  soon  went  on  to  develop  more  complicated 
schemes  in  which  two  or  more  pairs  of  bells  change  places  simultaneously.  For 
example,  they  devised  the  pattern  in  Fig.  38(c)  known  as  Grandsire  Doubles, 
“the  best  and  most  ingenious  Peal  that  ever  was  composed,  to  be  rang  on  five 
bells”  [Tintinnalogia,  page  95],  Such  fancier  methods  are  more  interesting  than 
Algorithm  P from  a musical  standpoint,  but  they  are  less  useful  in  computer 
applications,  so  we  shall  not  dwell  on  them  here.  Interested  readers  can  learn 
more  by  reading  W.  G.  Wilson’s  book,  Change  Ringing  (1965);  see  also  A.  T. 
White,  AMM  103  (1996),  771-778. 

H.  F.  Trotter  published  the  first  computer  implementation  of  plain  changes 
in  CACM  5 (1962),  434-435.  The  algorithm  is  quite  efficient,  especially  when  it 
is  streamlined  as  in  exercise  16,  because  n — 1 out  of  every  n permutations  are 
generated  without  using  steps  P6  and  P7.  By  contrast,  Algorithm  L enjoys  its 
best  case  only  about  half  of  the  time. 

The  fact  that  Algorithm  P does  exactly  one  interchange  per  visit  means  that 
the  permutations  it  generates  are  alternately  even  and  odd  (see  exercise  5.1.1- 
13).  Therefore  we  can  generate  all  the  even  permutations  by  simply  bypassing 
the  odd  ones.  In  fact,  the  c and  o tables  make  it  easy  to  keep  track  of  the  current 
total  number  of  inversions,  cj  + • • • + c„ , as  we  go. 

Many  programs  need  to  generate  the  same  permutations  repeatedly,  and  in 
such  cases  we  needn’t  run  through  the  steps  of  Algorithm  P each  time.  We  can 
simply  prepare  a list  of  suitable  transitions,  using  the  following  method: 

Algorithm  T ( Plain  change  transitions).  This  algorithm  computes  a table  t[l], 
t [2] , ...,  t[n\  — 1]  such  that  the  actions  of  Algorithm  P are  equivalent  to  the 
successive  interchanges  at[k]  at[fe]+1  for  1 < k < n\.  We  assume  that  n > 2. 
Tl.  [Initialize.]  Set  N n!,  d N/ 2,  t[d\  1,  and  m <—  2. 


k 
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T2.  [Loop  on  m.]  Terminate  if  m = n.  Otherwise  set  m <—  m + 1,  d *r-  d/m , 
and  k «—  0.  (We  maintain  the  condition  d = n\/m\.) 

T3.  [Hunt  down.]  Set  k <-  k + d and  j <-  m — 1.  Then  while  j > 0,  set  t\k]  <-  j 
k <—  k + d,  and  j <—  j — 1. 

T4.  [Offset.]  Set  t[k]  <—  t[k\  + 1. 

T5.  [Hunt  up.]  Set  k <—  k + d,  j «—  1.  While  j < m,  set  t[k]  <—  j , k k + d , 
3 7 + 1-  Then  return  to  T3  if  A;  < AT,  otherwise  return  to  T2.  | 

For  example,  if  n = 4 we  get  the  table  (t[l],  t[2], . . . , t\  23])  = (3,  2, 1 3 1 2 3 1 
3, 2, 1,3, 1,2,  3, 1,3, 2, 1,3, 1,2, 3). 

Alphametics.  Now  let’s  consider  a simple  kind  of  puzzle  in  which  permutations 
are  useful:  How  can  the  pattern 

SEND 

+ MORE  (6) 

MONEY 

represent  a correct  sum,  if  every  letter  stands  for  a different  decimal  digit? 
[H.  E.  Dudeney,  Strand  68  (1924),  97,  214.]  Such  puzzles  are  often  called 
“alphametics,”  a word  coined  by  J.  A.  H.  Hunter  [Globe  and  Mail  (Toronto: 
27  October  1955),  27];  another  term,  “cryptarithm,”  has  also  been  suggested  by 
S.  Vatriquant  [Sphinx  1 (May  1931),  50], 

The  classic  alphametic  (6)  can  easily  be  solved  by  hand  (see  exercise  21).  But 
let’s  suppose  we  want  to  deal  with  a large  set  of  complicated  alphametics,  some 
of  which  may  be  unsolvable  while  others  may  have  dozens  of  solutions.  Then  we 
can  save  time  by  programming  a computer  to  try  out  all  permutations  of  digits 
that  match  a given  pattern,  seeing  which  permutations  yield  a correct  sum. 
[A  computer  program  for  solving  alphametics  was  published  by  John  Beidler  in 
Creative  Computing  4,6  (November-December  1978),  110-113.] 

We  might  as  well  raise  our  sights  slightly  and  consider  additive  alphametics 
m general,  dealing  not  only  with  simple  sums  like  (6)  but  also  with  examples  like 

VIOLIN + VIOLIN + VIOLA  = TRIO  + SONATA. 

Equivalently,  we  want  to  solve  puzzles  such  as 

2(VI0LIN)  + VIOLA  - TRIO  - SONATA  = 0,  (7) 

where  a sum  of  terms  with  integer  coefficients  is  given  and  the  goal  is  to  obtain 
zero  by  substituting  distinct  decimal  digits  for  the  different  letters.  Each  letter 
m such  a problem  has  a “signature”  obtained  by  substituting  1 for  that  letter 
and  0 for  the  others;  for  example,  the  signature  for  I in  (7)  is 

2(010010)  + 01000  - 0010  - 000000, 

namely  21010.  If  we  arbitrarily  assign  the  codes  (1,2,...,  10)  to  the  letters 
(V,  I,  0,  L,  N,  A,  T,  R,  S,  X),  the  respective  signatures  corresponding  to  (7)  are 

si  = 210000,  s2  = 21010,  s3  = -7901,  s4  = 210,  s5  = -998, 
s6  = -100,  s7  = -1010,  s8  = -100,  s9  = -100000,  s10  = 0.’  ^ 
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(An  additional  letter,  X,  has  been  added  because  we  need  ten  of  them.)  The 
problem  now  is  to  find  all  permutations  ai . . . am  of  {0, 1, . . . , 9}  such  that 

10 

a-s  = ^ ajSj  = 0.  (g) 

1=1 

There  also  is  a side  condition,  because  the  numbers  in  alphametics  should  not 
have  zero  as  a leading  digit.  For  example,  the  sums 

7316  5731  6524  2817 

+ 0823  and  + 0647  and  + 0735  and  + 0368 
08139  06378  07259  03185 

and  numerous  others  are  not  considered  to  be  valid  solutions  of  (6).  In  general 
there  is  a set  F of  first  letters  such  that  we  must  have 

a,j  ^ 0 for  all  j G F;  (10) 

the  set  F corresponding  to  (7)  and  (8)  is  {1,7,9}. 

One  way  to  tackle  a family  of  additive  alphametics  is  to  start  by  using 
Algorithm  T to  prepare  a table  of  10!  — 1 transitions  t[k\.  Then,  for  each  problem 
defined  by  a signature  sequence  (si, . ■ . , sio)  and  a first-letter  set  F,  we  can 
exhaustively  look  for  solutions  as  follows: 

Al.  [Initialize.]  Set  a1a2...o10  4-  01... 9,  v 4-  Yl)=iU  ~ 1 )sj,  k 4-  1,  and 
6j  4—  Sj+i  — Sj  for  1 < j < 10. 

A2.  [Test.]  If  v — 0 and  if  (10)  holds,  output  the  solution  ax  . . . ax 0. 

A3.  [Swap.]  Stop  if  k = 10!.  Otherwise  set  j 4—  t[k ],  v 4—  v — (aj+ x — aj)6j , 
a,j+i  <->■  dj , k 4—  k + 1,  and  return  to  A2.  | 

Step  A3  is  justified  by  the  fact  that  swapping  dj  with  dj+x  simply  decreases  a ■ s 
by  (aJ+i  — aj)(sj+i  — Sj).  Even  though  10!  is  3,628,800,  a fairly  large  number, 
the  operations  in  step  A3  are  so  simple  that  the  whole  job  takes  only  a fraction 
of  a second  on  a modern  computer. 

An  alphametic  is  said  to  be  pure  if  it  has  a unique  solution.  Unfortunately 
(7)  is  not  pure;  the  permutations  1764802539  and  3546281970  both  solve  (9)  and 
(10),  hence  we  have  both 

176478  + 176478  + 17640  = 2576  + 368020 

and 

354652  + 354652  + 35468  = 1954  + 742818. 

Furthermore  «6  = Ss  in  (8),  so  we  can  obtain  two  more  solutions  by  interchanging 
the  digits  assigned  to  A and  R. 

On  the  other  hand  (6)  is  pure,  yet  the  method  we  have  described  will  find 
two  different  permutations  that  solve  it.  The  reason  is  that  (6)  involves  only 
eight  distinct  letters,  hence  we  will  set  it  up  for  solution  by  using  two  dummy 
signatures  sg  = sjo  = 0.  In  general,  an  alphametic  with  to  distinct  letters  will 
have  10  — to.  dummy  signatures  sm+i  = ■ • • = s10  = 0,  and  each  of  its  solutions 
will  be  found  (10  — to)!  times  unless  we  insist  that,  say,  < • ■ • < aig. 
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A general  framework.  A great  many  algorithms  have  been  proposed  for 
generating  permutations  of  distinct  objects,  and  the  best  way  to  understand 
them  is  to  apply  the  multiplicative  properties  of  permutations  that  we  studied 
m Section  1.3.3.  For  this  purpose  we  will  change  our  notation  slightly,  by  using 
0-origin  indexing  and  writing  a^a-i . . . ara_j  for  permutations  of  {0, 1, . . . , n — 1} 
instead  of  writing  a1a2  ...an  for  permutations  of  {1,  2, . . . , n}.  More  importantly, 
we  will  consider  schemes  for  generating  permutations  in  which  most  of  the  action 
takes  place  at  the  left,  so  that  all  permutations  of  {0,1,...,  k - 1}  will  be 
generated  during  the  first  fc!  steps,  for  1 < k < n.  For  example,  one  such 
scheme  for  n = 4 is 

0123, 1023,  0213,  2013,  1203,  2103,  0132,  1032,  0312,  3012,  1302,  3102 

0231,  2031,  0321,  3021,  2301,  3201,  1230,  2130,  1320,  3120,  2310,  321o’;  (ll) 


this  is  called  “reverse  colex  order,”  because  if  we  reflect  the  strings  from  right 
to  left  we  get  3210,  3201,  3120,  . . . , 0123,  the  reverse  of  lexicographic  order. 
Another  way  to  think  of  (n)  is  to  view  the  entries  as  (n— an) . . . (n— a2)(n— ai), 
where  aia2  ...an  runs  lexicographically  through  the  permutations  of  {1, 2, 

Let’s  recall  from  Section  1.3.3  that  a permutation  like  a = 250143  can  be 
written  either  in  the  two-line  form 


/012345\ 
V 250143  ) 


or  in  the  more  compact  cycle  form 


a = (0  2)(1  5 3), 

with  the  meaning  that  a takes  0 2,  1 4 5,  2 K 0,  3 H 1,  4 i->  4,  and 
5 ^ 3;  a 1-cycle  like  ‘(4)’  need  not  be  indicated.  Since  4 is  a fixed  point  of  this 
permutation  we  say  that  “a  fixes  4.”  We  also  write  0 a = 2,  la  = 5,  and  so  on. 
saying  that  ja  is  “the  image  of  j under  a.”  Multiplication  of  permutations,  like 
a times  /3  where  /3  = 543210,  is  readily  carried  out  either  in  the  two-line  form 


/012345X 

/012345N 

_ /012345\ 

/250143\ 

_ /012345N 

V250143/ 

V543210/ 

~ V250143/ 

V305412J 

“ V305412/ 

or  in  the  cycle  form 


a/3  = (0  2) (1  5 3)  ■ (0  5)(1  4)(2  3)  = (0  3 4 1)(2  5). 

Notice  that  the  image  of  1 under  a(3  is  1 (a/3)  = (1  a)(3  -5/3  = 0,  etc.  Warning: 
About  half  of  all  books  that  deal  with  permutations  multiply  them  the  other  way 
(from  right  to  left),  imagining  that  a/3  means  that  /?  should  be  applied  before  a. 
The  reason  is  that  traditional  functional  notation,  in  which  one  writes  a(l)  = 5, 
makes  it  natural  to  think  that  a/3(l)  should  mean  a(/3(l))  = a(4)  = 4.  However, 
the  present  book  subscribes  to  the  other  philosophy,  and  we  shall  always  multiply 
permutations  from  left  to  right. 

The  order  of  multiplication  needs  to  be  understood  carefully  when  permu- 
tations are  represented  by  arrays  of  numbers.  For  example,  if  we  “apply”  the 
reflection  /3  = 543210  to  the  permutation  a = 250143,  the  result  341052  is  not  a/3 
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but  /3a.  In  general,  the  operation  of  replacing  a permutation  a = aoai . . . on_ i 
by  some  rearrangement  ao/Jfli/?  • • • a(n-i)/3  takes  k afc/3  = kfia.  Permuting 
the  positions  by  /3  corresponds  to  premultiplication  by  /3,  changing  a to  /3a; 
permuting  the  values  by  /3  corresponds  to  postmultiplication  by  /3,  changing  a 
to  a/3.  Thus,  for  example,  a permutation  generator  that  interchanges  ai  a?  is 
premultiplying  the  current  permutation  by  (1  2),  postmultiplying  it  by  (ai  a2). 

Following  a proposal  made  by  Evariste  Galois  in  1830,  a nonempty  set  G 
of  permutations  is  said  to  form  a group  if  it  is  closed  under  multiplication,  that 
is,  if  the  product  a/3  is  in  G whenever  a and  /3  are  elements  of  G [see  Ecrits 
et  Memoires  Mathematiques  d’Evariste  Galois  (Paris:  1962),  47],  Consider,  for 
example,  the  4-cube  represented  as  a 4 x 4 torus 


0 13  2 
4 5 7 6 
c d f e 
8 9 b a 


(12) 


as  in  exercise  7.2.1.1-17,  and  let  G be  the  set  of  all  permutations  of  the  vertices 
{0,  l,...,f}  that  preserve  adjacency:  A permutation  a is  in  G if  and  only  if 
u — v implies  ua  — va  in  the  4-cube.  (Here  we  are  using  hexadecimal 
digits  (0,1,...,  f)  to  stand  for  the  integers  (0, 1, . . . , 15).  The  labels  in  (12) 
are  chosen  so  that  u — v if  and  only  if  u and  v differ  in  only  one  bit  position.) 
This  set  G is  obviously  a group,  and  its  elements  are  called  the  symmetries  or 
“automorphisms”  of  the  4-cube. 

Groups  of  permutations  G are  conveniently  represented  inside  a computer  by 
means  of  a Sims  table,  introduced  by  Charles  C.  Sims  [Computational  Methods 
in  Abstract  Algebra  (Oxford:  Pergamon,  1970),  169-183],  which  is  a family  of 
subsets  Si,  S2,  ...  of  G having  the  following  property:  S*,  contains  exactly  one 
permutation  that  takes  k j and  fixes  the  values  of  all  elements  greater 
than  k,  whenever  G contains  such  a permutation.  We  let  er^/t  be  the  identity 
permutation,  which  is  always  present  in  G;  but  when  0 < j < k,  any  suitable 
permutation  can  be  selected  to  play  the  role  of  o^j.  The  main  advantage  of  a 
Sims  table  is  that  it  provides  a convenient  representation  of  the  entire  group: 


Lemma  S.  Let  S 1,  S2,  . . . , Sn_i  be  a Sims  table  for  a group  G of  permutations 
on  {0, 1, . . . , n — 1}.  Then  every  element  a of  G has  a unique  representation 

a = <ti£72  • • • <r„— 1,  where  er*,  G Sfc  for  1 < k < n.  (13) 


Proof.  If  a has  such  a representation  and  if  <rn_i  is  the  permutation  \ )j  € 
S„_  1,  then  a takes  n — 1 1-4  j,  because  all  elements  of  Si  U • • • U S„_2  fix  the 
value  of  n — 1.  Conversely,  if  a takes  n — 1 j we  have  a = a'<7(n_i)j,  where 

a'  = a <j7  , , . 

is  a permutation  of  G that  fixes  n — 1.  (As  in  Section  1.3.3,  a~  denotes  the 
inverse  of  a.)  The  set  G'  of  all  such  permutations  is  a group,  and  Si,  . . . , S„_2 
is  a Sims  table  for  G'\  therefore  the  result  follows  by  induction  on  n.  | 
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For  example,  a bit  of  calculation  shows  that  one  possible  Sims  table  for  the 
automorphism  group  of  the  4-cube  is 


S*  = {()»  (01)(23)(45)(67)(89)(ab)(cd)(ef ),  . . . , 

(Of  )(le)  (2d)  (3c)  (4b)  (5a)  (69)  (78)}; 
Se  = {(),  (12)(56)(9a)(de),  (l4)(36)(9c)(be),  (l8)(3a)(5c)(7e)}; 

= {(),  (24)(35)(ac)(bd),  (28)(39)(6c)(7d)}; 

•5c  = {()}; 

•5b  = {(),  (48) (59) (6a) (7b)}; 

Sa  = S9  = ...  = Sl  = {()}; 


here  5,  contains  16  permutations  <rtj  for  0 < j < 15,  which  respectively  take 
i •— >■  i © (15  - j)  for  0 < * < 15.  The  set  Se  contains  only  four  permutations, 
because  an  automorphism  that  fixes  f must  take  e into  a neighbor  of  f ; thus  the 
image  of  e must  be  either  e or  d or  b or  7.  The  set  Sc  contains  only  the  identity 
permutation,  because  an  automorphism  that  fixes  f,  e,  and  d must  also  fix  c. 
Most  groups  have  Sk  = {()}  for  all  small  values  of  k,  as  in  this  example;  hence  a 
Sims  table  usually  needs  to  contain  only  a fairly  small  number  of  permutations 
although  the  group  itself  might  be  quite  large. 

The  Sims  representation  (13)  makes  it  easy  to  test  if  a given  permutation  a 
lies  in  G:  First  we  determine  an_j  = <r(„_lb,  where  a takes  n-l^j,  and  we 
let  a'  = acrn_i;  then  we  determine  o-„_2  = a(n_2)j,,  where  a'  takes  n-  2n  j' , 
and  we  let  a"  = afo“_2;  and  so  on.  If  at  any  stage  the  required  ak;l  does  not 
exist  in  Sk,  the  original  permutation  a does  not  belong  to  G.  In  the  case  of  (14) 
this  process  must  reduce  a to  the  identity  after  finding  ae,  ad,  ac,  and  <rb. 

For  example,  let  a be  the  permutation  (14)(28)(3c)(69)(7d)(be),  which  cor- 
responds to  transposing  (12)  about  its  main  diagonal  {0,  5,  f , a}.  Since  a fixes  f 
<rf  will  be  the  identity  permutation  (),  and  a'  = a.  Then  ue  is  the  member  of  Se 
that  takes  e b,  namely  (l4)(36)(9c)(be),  and  we  find  a"  = (28)(39)(6c)(7d). 
This  permutation  belongs  to  Sd,  so  a is  indeed  an  automorphism  of  the  4-cube. 

Conversely,  (13)  also  makes  it  easy  to  generate  all  elements  of  the  corre- 
sponding group.  We  simply  run  through  all  permutations  of  the  form 


(t(1,c1)o-(2,c2)  ...a (n-  l,cn_i), 

where  <r(k,ck)  is  the  (ck  + l)st  element  of  Sk  for  0 < ck  < sk  = |S*|  and 
1 <k<n,  using  any  algorithm  of  Section  7.2. 1.1  that  runs  through  all  (n  - 1)- 
tuples  (ci, . . . , cn_j)  for  the  respective  radices  (slt . . . , s„_x). 

Using  the  general  framework.  Our  chief  concern  is  the  group  of  all  permuta- 
tions on  {0, 1, . . . , n- 1},  and  in  this  case  every  set  Sk  of  a Sims  table  will  contain 
A;  + l elements  {a(k,0),a(k,  1),. . ,,ar(k,k)},  where  <r(k,  0)  is  the  identity  and  the 
others  take  k to  the  values  {0, . . . , k- 1}  in  some  order.  (The  permutation  cr (k,  j) 
need  not  be  the  same  as  crkj,  and  it  usually  is  different.)  Every  such  Sims  table 
leads  to  a permutation  generator,  according  to  the  following  outline: 
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Algorithm  G ( General  permutation  generator).  Given  a Sims  table  (Si,S2, 
...,Sn-i)  where  each  S*.  has  k + 1 elements  <r(k,j ) as  just  described,  this 
algorithm  generates  all  permutations  a0oi...a„_i  of  {0, 1,  ...,n—  1},  using 
an  auxiliary  control  table  c.n  ...  C2  c1 . 

Gl.  [Initialize.]  Set  aj  e-  j and  Cj+i  «—  0 for  0 < j < n. 

G2.  [Visit.]  (At  this  point  the  mixed-radix  number  [ Cn~  1 ’ 32’  Cl  ] is  the  number 

of  permutations  visited  so  far.)  Visit  the  permutation  a0ai . . . a„_i. 

G3.  [Add  1 to  c„  . . . c2cj.]  Set  k <—  1.  While  Cfc  = k , set  c*,  <—  0 and  k k + 1. 
Terminate  the  algorithm  if  k — n;  otherwise  set  c*,  <—  + 1. 

G4.  [Permute.]  Apply  the  permutation  r(fc,Cfc)w(/c  - l)-  to  a0ai . . . a„_i,  as 
explained  below,  and  return  to  G2.  | 

Applying  a permutation  tf  to  aoai . . . «„_i  means  replacing  aj  by  aJ7T  for 
0 < j < n;  this  corresponds  to  premultiplication  by  if  as  explained  earlier.  Let 
us  define 


r(k,j)  = a(k,j)cr(k,j  - 1)  , for  1 < j < k;  (15) 

u(k)  = cr(l,  1) . . .cr(k,k).  (16) 

Then  steps  G3  and  G4  maintain  the  property  that 

a0ai . . . an_!  is  the  permutation  er(l,  ci)a(2,  c2) . . . a (n  - 1,  cn_i),  (17) 
and  Lemma  S proves  that  every  permutation  is  visited  exactly  once. 


The  tree  in  Fig.  39  illustrates  Algorithm  G in  the  case  n — 4.  According 
to  (17),  every  permutation  a0OiO2a3  of  {0, 1,2,3}  corresponds  to  a three-digit 
control  string  c3c2ci,  with  0 < c3  < 3,  0 < c2  < 2,  and  0 < c\  < 1.  Some  nodes 
of  the  tree  are  labeled  by  a single  digit  c3;  these  correspond  to  the  permutations 
cr(3,c3)  of  the  Sims  table  being  used.  Other  nodes,  labeled  with  two  digits  c3c2, 
correspond  to  the  permutations  <r(2,  c2)er(3,  c3).  A heavy  line  connects  node  c3 
to  node  c30  and  node  c3c2  to  node  c3c20,  because  <r(2,0)  and  <r(l,0)  are  the 
identity  permutation  and  these  nodes  are  essentially  equivalent.  Adding  1 to  the 
mixed-radix  number  c3c2ci  in  step  G3  corresponds  to  moving  from  one  node  of 
Fig.  39  to  its  successor  in  preorder,  and  the  transformation  in  step  G4  changes 
the  permutations  accordingly.  For  example,  when  c3c2ci  changes  from  121  to 
200,  step  G4  premultiplies  the  current  permutation  by 


r(3,2M2)~  =t(3,2)<t(2,2)-<7(1,1)-; 
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premultiplying  by  cr(l,  1)  takes  us  from  node  121  to  node  12,  premultiplying 
by  2,2)  takes  us  from  node  12  to  node  1,  and  premultiplying  by  t(3.2)  = 
er(3,2)er(3, 1)“  takes  us  from  node  1 to  node  2 = 200,  which  is  the  preorder  suc- 
cessor of  node  121.  Stating  this  another  way,  premultiplication  by  t(3,2)w(2)~ 
is  exactly  what  is  needed  to  change  cr(l,  l)cr(2,  2)<j(3, 1)  to  er(l,  ojcr(2, 0)er(3, 2), 
preserving  (17). 

Algorithm  G defines  a huge  number  of  permutation  generators  (see  exer- 
cise 37),  so  it  is  no  wonder  that  many  of  its  special  cases  have  appeared  in  the 
literature.  Of  course  some  of  its  variants  are  much  more  efficient  than  others, 
and  we  want  to  find  examples  where  the  operations  are  particularly  well  suited 
to  the  computer  we  are  using. 

We  can,  for  instance,  obtain  permutations  in  reverse  colex  order  as  a special 
case  of  Algorithm  G (see  (11)),  by  letting  a(k,j)  be  the  (j  + l)-cycle 

a(kJ)  = (k-j  k-j+1  ...  k).  (1.8) 

The  reason  is  that  cr(k,  j)  should  be  the  permutation  that  corresponds  to  cn  . . . c\ 
in  reverse  colex  order  when  cy.  = j and  Cj  = 0 for  i 7^  k , and  this  permutation 
doai  ■ is  01 . . . (k-j -1) (k-j+1) . . . (k)(k-j)(k+l) . . . (n— 1).  For  exam- 

ple, when  n = 8 and  cn  . . ,c\  = 00030000  the  corresponding  reverse  colex 
permutation  is  01345267,  which  is  (2  3 4 5)  in  cycle  form.  When  a(k,j)  is  given 
by  (18),  Eqs.  (15)  and  (16)  lead  to  the  formulas 

r{k,j)  = (k-j  k);  (lg) 

w(*)  = (01)(012)...(01  ...  A;)  = (0  fc)  (1  *— 1)  (2  fc— 2) . . . = 4>{k)]  (20) 

here  4>(k)  is  the  (A 4-  l)-flip”  that  changes  a0  . . . a*,  to  a*, . . . a0.  In  this  case  w(fc) 
turns  out  to  be  the  same  as  ui(k)~ , because  4>(k)2  = (). 

Equations  (19)  and  (20)  are  implicitly  present  behind  the  scenes  in  Algo- 
rithm L and  in  its  reverse  colex  equivalent  (exercise  2),  where  step  L3  essentially 
applies  a transposition  and  step  L4  does  a flip.  Step  G4  actually  does  the  flip 
first;  but  the  identity 

(k-j  k)(/>(k  - 1)  = (f>(k  - 1)0-1  k ) (21) 

shows  that  a flip  followed  by  a transposition  is  the  same  as  a (different)  trans- 
position followed  by  the  flip. 

In  fact,  equation  (21)  is  a special  case  of  the  important  identity 

n~  til  h ■■■  jt)  7T  = 0i7t  j2n  ...  jt  tt),  (22) 

which  is  valid  for  any  permutation  n and  any  t-cycle  (jx  j2  ...  jt).  On  the 
left  of  (22)  we  have,  for  example,  jin  ^ jx  j2  >->■  j2n,  in  agreement  with 
the  cycle  on  the  right.  Therefore  if  a and  7r  are  any  permutations  whatsoever, 
the  permutation  n an  (called  the  conjugate  of  a by  n)  has  exactly  the  same 
cycle  structure  as  a;  we  simply  replace  each  element  j in  each  cycle  by  jn. 

Another  significant  special  case  of  Algorithm  G was  introduced  by  R.  J. 
Ord-Smith  [CACM  10  (1967),  452;  12  (1969),  638;  see  also  Comp.  J.  14  (1971), 
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136-139],  whose  algorithm  is  obtained  by  setting 

cr{k,j)  = (A  ...  10)h  (23) 

Now  it  is  clear  from  (15)  that 

r(k,j)  = (A  ...  1 0);  (24) 

and  once  again  we  have 

w(A)  = (0  A)(  1 k- 1)(2  k—2) . . . = <f>(k),  (25) 

because  a(k,  k)  = (0  1 ...  A;)  is  the  same  as  before.  The  nice  thing  about  this 
method  is  that  the  permutation  needed  in  step  G4,  namely  r(A,  Cfc)w(A  — 1)_, 
does  not  depend  on  c*,: 


r(k,j)uj(k-  1)  = (k  ...  10)<^>(A  — 1)  — <j)(k).  (26) 


Thus,  Ord-Smith’s  algorithm  is  the  special  case  of  Algorithm  G in  which  step  G4 
simply  interchanges  a 0 «-»  a*,,  ai  a*,_  1,  ...;  this  operation  is  usually  quick, 
because  k is  small,  and  it  saves  some  of  the  work  of  Algorithm  L.  (See  exercise  38 
and  the  reference  to  G.  S.  Kliigel  in  Section  7. 2. 1.7.) 

We  can  do  even  better  by  rigging  things  so  that  step  G4  needs  to  do  only  a 
single  transposition  each  time,  somewhat  as  in  Algorithm  P but  not  necessarily 
on  adjacent  elements.  Many  such  schemes  are  possible.  The  best  is  probably 
to  let 


~(k,j)w(k  - 1)  = 


= { 


(AO),  if  A is  even, 
(A  j—  1),  if  A is  odd, 


(27) 


as  suggested  by  B.  R.  Heap  [Comp.  J.  6 (1963),  293-294],  Notice  that  Heap’s 
method  always  transposes  a ^ a0  except  when  A = 3,  5,  ... ; and  the  value  of  A, 
in  5 of  every  6 steps,  is  either  1 or  2.  Exercise  40  proves  that  Heap’s  method 
does  indeed  generate  all  permutations. 


Bypassing  unwanted  blocks.  One  noteworthy  advantage  of  Algorithm  G is 
that  it  runs  through  all  permutations  of  ao  . . .afc_i  before  touching  a*,;  then  it 
performs  another  A!  cycles  before  changing  a*,  again,  and  so  on.  Therefore  if  at 
any  time  we  reach  a setting  of  the  final  elements  . . . a„_  1 that  is  unimportant 
to  the  problem  we’re  working  on,  we  can  skip  quickly  over  all  permutations  that 
end  with  the  undesirable  suffix.  More  precisely,  we  could  replace  step  G2  by  the 
following  substeps: 

G2.0.  [Acceptable?]  If  afc  . . . an_  1 is  not  an  acceptable  suffix,  go  to  G2.1.  Oth- 
erwise set  A t—  A — 1.  Then  if  A > 0,  repeat  this  step;  if  A = 0,  proceed  to 
step  G2.2. 

G2.1.  [Skip  this  suffix.]  While  Cf.  — A,  apply  er(A,  A)-  to  ao...a„-i  and  set 
Ck  i — 0,  A 4 — A T 1.  Terminate  if  A = n;  otherwise  set  <—  c*,  + 1,  apply 
r(A,  Cfc)  to  ao  . . . a„_i,  and  return  to  G2.0. 

G2.2.  [Visit.]  Visit  the  permutation  ao  . . . an_i.  | 

Step  G1  should  also  set  A 4—  n — 1.  Notice  that  the  new  steps  are  careful  to 
preserve  condition  (17).  The  algorithm  has  become  more  complicated,  because 
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we  need  to  know  the  permutations  r(k,j)  and  a(k,k)  in  addition  to  the  permu- 
tations T(k,j)u(k  - 1)  that  appear  in  G4.  But  the  additional  complications 
are  often  worth  the  effort,  because  the  resulting  program  might  run  significantly 

IdblGl. 


tree  of  Fig.  39,  if  Algorithm  G is  suitably  extended. 

For  example,  Fig.  40  shows  what  happens  to  the  tree  of  Fig.  39  when 
the  suffixes  of  a0a1a2a3  that  correspond  to  nodes  00,  11,  121,  and  2 are  not 
acceptable.  (Each  suffix  ak  . . . an_i  of  the  permutation  a0  . . . on_!  corresponds 
to  a prefix  cn...ck  of  the  control  string  c„...c x,  because  the  permutations 
vy  >ci)  ••  ■ ~ l>cfc-i)  do  not  affect  ak  . . ,an_x.)  Step  G2.1  premultiplies  by 

T(k,j)  to  move  from  node  cn_1  . . . ck+1(j- 1)  to  its  right  sibling  cn_i . . . ck+1j, 
and  it  premultiphes  by  a(k,  k)  to  move  up  from  node  cn_1...Cfc+ifc  to  its 
parent  cn~1  ...ck+1.  Thus,  to  get  from  the  rejected  prefix  121  to  its  preorder 
successor,  the  algorithm  premultiplies  by  a(  1, 1)-,  a(2,  2)“,  and  r( 3,  2),  thereby 
moving  from  node  121  to  12  to  1 to  2.  (This  is  a somewhat  exceptional  case, 
because  a prefix  with  k = 1 is  rejected  only  if  we  don’t  want  to  visit  the  unique 
permutation  a0ai  . . . an _x  that  has  suffix  ax  . . . an_1.)  After  node  2 is  rejected. 
t( 3,3)  takes  us  to  node  3,  etc. 

Notice,  incidentally,  that  bypassing  a suffix  ak...an^  in  this  extension 
of  Algorithm  G is  essentially  the  same  as  bypassing  a prefix  ax . . . a,  in  our 
original  notation,  if  we  go  back  to  the  idea  of  generating  permutations  ax...an 
of  {1, ...  ,n}  and  doing  most  of  the  work  at  the  right-hand  end.  Our  original 
notation  corresponds  to  choosing  ax  first,  then  a2,  ...,  then  an;  the  notation 
m Algorithm  G essentially  chooses  an_x  first,  then  an_2,  ...,  then  a0.  Algo- 
rithm G’s  conventions  may  seem  backward,  but  they  make  the  formulas  for  Sims 
table  manipulation  a lot  simpler.  A good  programmer  soon  learns  to  switch 
without  difficulty  from  one  viewpoint  to  another. 

We  can  apply  these  ideas  to  alphametics,  because  it  is  clear  for  example  that 
most  choices  of  the  values  for  the  letters  D,  E,  and  Y will  make  it  impossible  for 
SEND  plus  MORE  to  equal  MONEY:  We  need  to  have  (D  -f-  E - Y)  mod  10  = 0 in  that 
problem.  Therefore  many  permutations  can  be  eliminated  from  consideration. 

In  general,  if  rk  is  the  maximum  power  of  10  that  divides  the  signature 
value  sfc,  we  can  sort  the  letters  and  assign  codes  {0,1,..., 9}  so  that  r0  > 
G > • • • > r9.  For  example,  to  solve  the  trio  sonata  problem  (7),  we  could  us^ 
(0, 1, . . . , 9)  respectively  for  (X,  S,  V,  A,  R,  I,  L,  T,  0,  N),  obtaining  the  signatures 

s0  - 0,  Si  = -100000,  s2  = 210000,  s3  = -100,  s4  = -100, 
s5  = 21010,  s6  = 210,  s7  = -1010,  s8  = -7901,  Sg  = -998; 
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hence  (ro, . . . , rg)  = (oo,  5, 4, 2, 2, 1, 1, 1, 0, 0).  Now  if  we  get  to  step  G2.0  for  a 
value  of  k with  rk-\  ^ r k,  we  can  say  that  the  suffix  ak  . . . ag  is  unacceptable 

unless  OfcSfc  H + a9s9  is  a multiple  of  10rfc-1.  Also,  (10)  tells  us  that  ak  . . . ag 

is  unacceptable  if  ak  = 0 and  k € F;  the  first-letter  set  F is  now  {1, 2,  7}. 

Our  previous  approach  to  alphametics  with  steps  A1A  3 above  used  brute 
force  to  run  through  10!  possibilities.  It  operated  rather  fast  under  the  circum- 
stances, since  the  adjacent-transposition  method  allowed  it  to  get  by  with  only 
6 memory  references  per  permutation;  but  still,  10!  is  3,628,800,  so  the  entire 
process  cost  almost  22  megamems,  regardless  of  the  alphametic  being  solved. 
By  contrast,  the  extended  Algorithm  G with  Heap’s  method  and  the  cutoffs  just 
described  will  find  all  four  solutions  to  (7)  with  fewer  than  128  Aikmiems!  Thus 
the  suffix-skipping  technique  runs  more  than  170  times  faster  than  the  previous 
method,  which  simply  blasted  away  blindly. 

Most  of  the  128  kilomems  in  the  new  approach  are  spent  applying  T(k,ck) 
in  step  G2.1.  The  other  memory  references  come  primarily  from  applications  of 
a(k,k)~  in  that  step,  but  r is  needed  7812  times  while  a~  is  needed  only  2162 
times.  The  reason  is  easy  to  understand  from  Fig.  40,  because  the  “shortcut 
move”  r(fc,  Cfc)  w(A’  — 1)“  in  step  G4  hardly  ever  applies;  in  this  case  it  is  used 
only  four  times,  once  for  each  solution.  Thus,  preorder  traversal  of  the  tree  is 
accomplished  almost  entirely  by  r steps  that  move  to  the  right  and  o~  steps 
that  move  upward.  The  r steps  dominate  in  a problem  like  this,  where  very 
few  complete  permutations  are  actually  visited,  because  each  step  a(k,k)~  is 
preceded  by  k steps  r(fc,  1),  r(k,  2),  ...,  r(k,k). 

This  analysis  reveals  that  Heap’s  method  — which  goes  to  great  lengths  to 
optimize  the  permutations  r(k,j)uj(k  — 1)~  so  that  each  transition  in  step  G4 
is  a simple  transposition  — is  not  especially  good  for  the  extended  Algorithm  G 
unless  comparatively  few  suffixes  are  rejected  in  step  G2.0.  The  simpler  reverse 
colex  order,  for  which  r(k,j)  itself  is  always  a simple  transposition,  is  now  much 
more  attractive  (see  (19)).  Indeed,  Algorithm  G with  reverse  colex  order  solves 
the  alphametic  (7)  with  only  97  kilomems. 

Similar  results  occur  with  respect  to  other  alphametic  problems.  For  ex- 
ample, if  we  apply  the  extended  Algorithm  G to  the  alphametics  in  exercise  24, 
parts  (a)  through  (h),  the  computations  involve  respectively 

(551,  110,  14,  8,  350,  84,  153,  1598)  kilomems  with  Heap’s  method; 

(429,  84,  10,  5,  256,  63,  117,  1189)  kilomems  with  reverse  colex. 

The  speedup  factor  for  reverse  colex  in  these  examples,  compared  to  brute  force 
with  Algorithm  T,  ranges  from  18  in  case  (h)  to  4200  in  case  (d),  and  it  is  about 
80  on  the  average;  Heap’s  method  gives  an  average  speedup  of  about  60. 

We  know  from  Algorithm  L,  however,  that  lexicographic  order  is  easily  han- 
dled without  the  complication  of  the  control  table  cn. . ,c\  used  by  Algorithm  G. 
And  a closer  look  at  Algorithm  L shows  that  we  can  improve  its  behavior  when 
permutations  are  frequently  being  skipped,  by  using  a linked  list  instead  of  a 
sequential  array.  The  improved  algorithm  is  well-suited  to  a wide  variety  of 
algorithms  that  wish  to  generate  restricted  classes  of  permutations: 
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Algorithm  X ( Lexicographic  permutations  with  restricted  prefixes).  This  al- 
gorithm generates  all  permutations  aia2  . . . an  of  {1, 2, . . . , n}  that  pass  a given 
sequence  of  tests 

M«i),  t2(ai,  a2),  ...,  tn(a1,a2, . . . , an), 

visiting  them  in  lexicographic  order.  It  uses  an  auxiliary  table  of  links  l0, 

■ ■ ■ , ln  to  maintain  a cyclic  list  of  unused  elements,  so  that  if  the  currently 
available  elements  are 

{!)•■•  ,n}  \ {ai, ...  ,ak}  = {6i, ...,  6n_fc},  where  b1  <•••<  6n_fc,  (29) 
then  we  have 

lo  = h,  lbj=bJ+1  for  1 < j < n — k,  and  lbn_k  = 0.  (30) 

It  also  uses  an  auxiliary  table  ui  . . ,un  to  undo  operations  that  have  been 
performed  on  the  l array. 

XI.  [Initialize.]  Set  lk  «-  A:  + 1 for  0 < k < n,  and  ln  <—  0.  Then  set  k t-  1. 

X2.  [Enter  level  k.]  Set  p «-  0,  q <-  l0. 

X3.  [Test  ax . . . ak.]  Set  ak  i-  q.  lftk(ai,...,  ak)  is  false,  go  to  X5.  Otherwise, 
if  k — n,  visit  aj  . . . an  and  go  to  X6. 

X4.  [Increase  A:.]  Set  uk  <—  p,  lp  lq,  k <r-  k + 1,  and  return  to  X2. 

X5.  [Increase  ak.]  Set  p q,  q lp.  If  q ^ 0 return  to  X3. 

X6.  [Decrease  k.]  Set  k <—  k — 1,  and  terminate  if  k = 0.  Otherwise  set  p <- uk, 
q ak,  lp  <r-  q,  and  go  to  X5.  | 

The  basic  idea  of  this  elegant  algorithm  is  due  to  M.  C.  Er  [Comp.  J.  30  (1987). 
282],  We  can  apply  it  to  alphametics  by  changing  notation  slightly,  obtaining 
permutations  o0  . . . a9  of  {0, . . . , 9}  and  letting  l10  play  the  former  role  of  l0.  The 
resulting  algorithm  needs  only  49  kilomems  to  solve  the  trio-sonata  problem  (7), 
and  it  solves  the  alphametics  of  exercise  24(a)-(h)  in 

(248,  38,  4,  3,  122,  30,  55,  553)  kilomems,  (31) 

respectively.  Thus  it  runs  about  165  times  faster  than  the  brute-force  approach. 

Another  way  to  apply  Algorithm  X to  alphametics  is  often  faster  yet  (see 
exercise  49). 


Fig.  41.  The  tree  implicitly  traversed  by  Algorithm  X when  n = 4,  if  all  permu- 
tations are  visited  except  those  beginning  with  132,  14,  2,  314,  or  4312. 
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*Dual  methods.  If  Si,  . . . , Sn_i  is  a Sims  table  for  a permutation  group  G, 
we  learned  in  Lemma  S that  every  element  of  G can  be  expressed  uniquely  as 
a product  <7T  ...  crn_i,  where  ak  € Sk;  see  (13).  Exercise  50  shows  that  every 
element  a can  also  be  expressed  uniquely  in  the  dual  form 

a = <V_  1 . . . ujerf , where  ak  G Sk  for  1 < k < n,  (32) 

and  this  fact  leads  to  another  large  family  of  permutation  generators.  In  par- 
ticular, when  G is  the  group  of  all  n!  permutations,  every  permutation  can  be 
written 

cr(n  - 1 , Cn _ ! ) — . . .cr(2,c2)“cr(l,ci)“,  (33) 

where  0 < ck  < k for  1 < k < n and  the  permutations  cr(k,j)  are  the  same  as 
in  Algorithm  G.  Now,  however,  we  want  to  vary  c„_i  most  rapidly  and  cx  least 
rapidly,  so  we  arrive  at  an  algorithm  of  a different  kind: 

Algorithm  H ( Dual  permutation  generator).  Given  a Sims  table  as  in  Algo- 
rithm G,  this  algorithm  generates  all  permutations  a0  . . . a„_  1 of  {0, . . . , n - 1}, 
using  an  auxiliary  table  Co  . . . cra_i. 

HI.  [Initialize.]  Set  aj  4—  j and  Cj  4-  0 for  0 < j < n. 

H2.  [Visit.]  (At  this  point  the  mixed-radix  number  32’  c”n'1]  is  the  number 

of  permutations  visited  so  far.)  Visit  the  permutation  a0ai . . . a„_i. 

H3.  [Add  1 to  c0 c\ . . .c„_x.]  Set  k 4-  n-  1.  If  ck  = k,  set  ck  4-  0,  k 4-  k-  1,  and 

repeat  until  k = 0 or  ck  < k.  Terminate  the  algorithm  if  k — 0;  otherwise 

set  cfc  4-  cfc  + 1. 

H4.  [Permute.]  Apply  the  permutation  r(k,  ck)uj(k  + 1)~  to  a0ai . . .an~i,  as 
explained  below,  and  return  to  H2.  | 

Although  this  algorithm  looks  almost  identical  to  Algorithm  G,  the  permutations 
t and  ui  that  it  needs  in  step  H4  are  quite  different  from  those  needed  in  step  G4. 
The  new  rules,  which  replace  (15)  and  (16),  are 

T(k,j)  = <r(k,j)-<r(k,j  - 1),  for  1 < j < k;  (34) 

u(k)  = er(n  - 1,  n — l)~cr(n  - 2,  n - 2)~  . . . a{k , k)~ . (35) 

The  number  of  possibilities  is  just  as  vast  as  it  was  for  Algorithm  G,  so  we 
will  confine  our  attention  to  a few  cases  that  have  special  merit.  One  natural 
case  to  try  is,  of  course,  the  Sims  table  that  makes  Algorithm  G produce  reverse 
colex  order,  namely 

a(k,j)  = ( k-j  k—j+1  ...  k)  (36) 

as  in  (18).  The  resulting  permutation  generator  turns  out  to  be  very  nearly  the 
same  as  the  method  of  plain  changes;  so  we  can  say  that  Algorithms  L and  P 
are  essentially  dual  to  each  other.  (See  exercise  52.) 

Another  natural  idea  is  to  construct  a Sims  table  for  which  step  H4  always 
makes  a single  transposition  of  two  elements,  by  analogy  with  the  construction 
of  (27)  that  achieves  maximum  efficiency  in  step  G4.  But  such  a mission  now 
turns  out  to  be  impossible:  We  cannot  achieve  it  even  when  n = 4.  For  if 
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we  start  with  the  identity  permutation  00010203  = 0123,  the  transitions  that 
take  us  from  control  table  c0cic2c3  = 0000  to  0001  to  0002  to  0003  must  move 
the  3;  so,  if  they  are  transpositions,  they  must  be  (3a),  (a b),  and  (be)  for  some 
permutation  abc  of  {0, 1, 2}.  The  permutation  corresponding  to  c0cic2c3  = 0003 
is  now  cr(3, 3)  = (be)  (a  b) (3  a)  = (3  a be);  and  the  next  permutation,  which 

corresponds  to  coCiC2c3  = 0010,  will  be  <r(2, 1)~,  which  must  fix  the  element  3. 
The  only  suitable  transposition  is  (3  c),  hence  a(2, 1)~  must  be  (3  c)  (3  a be)  = 
(abc).  Similarly  we  find  that  ct(2,2)~  must  be  (acb),  and  the  permutation 
corresponding  to  c0cic2c3  = 0023  will  be  (3 abc)(acb)  = (3c).  Step  H4  is  now 
supposed  to  convert  this  to  the  permutation  cr(l,  1)_,  which  corresponds  to  the 
control  table  0100  that  follows  0023.  But  the  only  transposition  that  will  convert 
(3  c)  into  a permutation  that  fixes  2 and  3 is  (3  c);  and  the  resulting  permutation 
also  fixes  1,  so  it  cannot  be  er(l,  1)~. 

The  proof  in  the  preceding  paragraph  shows  that  we  cannot  use  Algorithm  H 
to  generate  all  permutations  with  the  minimum  number  of  transpositions.  But  it 
also  suggests  a simple  generation  scheme  that  comes  very  close  to  the  minimum, 
and  the  resulting  algorithm  is  quite  attractive  because  it  needs  to  do  extra  work 
only  once  per  n(n  - 1)  steps.  (See  exercise  53.) 

Finally,  let’s  consider  the  dual  of  Ord-Smith’s  method,  when 

a(k,j)  = (k  ...  1 0)J  (37) 

as  in  (23).  Once  again  the  value  of  r(k,j)  is  independent  of  j, 

T(k,j)  = (0  1 ...  k),  (38) 

and  this  fact  is  particularly  advantageous  in  Algorithm  H because  it  allows  us 
to  dispense  with  the  control  table  coCj  . . .cn-1.  The  reason  is  that  cn_  1 = 0 in 
step  H3  if  and  only  if  a„_j  = n - 1,  because  of  (32);  and  indeed,  when  c}  = 0 
for  k < j < n in  step  H3  we  have  c^  = 0 if  and  only  if  a^  — k.  Therefore  we  can 
reformulate  this  variant  of  Algorithm  H as  follows. 

Algorithm  C (Permutation  generation  by  cyclic  shifts).  This  algorithm  visits 
all  permutations  a\  . . .an  of  the  distinct  elements  {aq, . . . ,xn}. 

Cl.  [Initialize.]  Set  a3  xj  for  1 < j < n. 

C2.  [Visit.]  Visit  the  permutation  ai . . . an,  and  set  k <-  n. 

C3.  [Shift.]  Replace  aia2  . . . a^  by  the  cyclic  shift  a2  . . . a^ai,  and  return  to  C2 
if  -f1  %k  • 

C4.  [Decrease  k.]  Set  k <-  k - 1,  and  go  back  to  C3  if  k > 1.  | 

For  example,  the  successive  permutations  of  {1,  2,  3, 4}  generated  when  n = 4 are 

1234,  2341,  3412,  4123,  (1234), 

2314,  3142,  1423,  4231,  (2314), 

3124,  1243,  2431,  4312,  (3124),  (1234), 

2134,  1342,  3421,  4213,  (2134), 

1324,  3241,  2413,  4132,  (1324), 

3214,  2143,  1432,  4321,  (3214),  (2134),  (1234), 
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with  unvisited  intermediate  permutations  shown  in  parentheses.  This  algorithm 
may  well  be  the  simplest  permutation  generator  of  all,  in  terms  of  minimum 
program  length.  It  is  due  to  G.  G.  Langdon,  Jr.  [CACM  10  (1967),  298-299; 
11  (1968),  392];  similar  methods  had  been  published  previously  by  C.  Tompkins 
[Proc.  Symp.  Applied  Math.  6 (1956),  202-205]  and,  more  explicitly,  by  R.  Seitz 
[ Unternehmensforschung  6 (1962),  2-15],  The  procedure  is  particularly  well 
suited  to  applications  in  which  cyclic  shifting  is  efficient,  for  example  when  suc- 
cessive permutations  are  being  kept  in  a machine  register  instead  of  in  an  array. 

The  main  disadvantage  of  dual  methods  is  that  they  usually  do  not  adapt 
well  to  situations  where  large  blocks  of  permutations  need  to  be  skipped,  be- 
cause the  set  of  all  permutations  with  a given  value  of  the  first  control  entries 
coCi...Cfc_i  is  usually  not  of  importance.  The  special  case  (36)  is,  however, 
sometimes  an  exception,  because  the  n\/k\  permutations  with  cqct  . . . = 

00 . . . 0 in  that  case  are  precisely  those  aoai...a„_i  in  which  0 precedes  1, 
1 precedes  2,  . . . , and  k — 2 precedes  k — 1. 

*Ehrlich’s  swap  method.  Gideon  Ehrlich  has  discovered  a completely  different 
approach  to  permutation  generation,  based  on  yet  another  way  to  use  a control 
table  Ci  . . . cn_i.  His  method  obtains  each  permutation  from  its  predecessor  by 
interchanging  the  leftmost  element  with  another: 

Algorithm  E ( Ehrlich  swaps).  This  algorithm  generates  all  permutations  of  the 
distinct  elements  ao  . . . a„_  1 by  using  auxiliary  tables  bo  ■ ■ ■ &n-i  and  cx  . . . cn. 

El.  [Initialize.]  Set  bj  t—  j and  Cj+ x «—  0 for  0 < j < n. 

E2.  [Visit.]  Visit  the  permutation  ao  . . . an-i- 

E3.  [Find  k.]  Set  k 4—  1.  Then  while  c = k,  set  c^  0 and  k 4—  k + 1. 
Terminate  if  k — n,  otherwise  set  c*  «—  c*  + 1. 

E4.  [Swap.]  Interchange  ao  at,k  ■ 

E5.  [Flip.]  Set  j 4—  1,  k 4—  k — 1.  While  j < k,  interchange  bj  4->  and  set 
j 4—  j + 1,  k 4—  k — 1.  Return  to  E2.  | 

Notice  that  steps  E2  and  E3  are  identical  to  steps  G2  and  G3  of  Algorithm  G. 
The  most  amazing  thing  about  this  algorithm,  which  Ehrlich  communicated  to 
Martin  Gardner  in  1987,  is  that  it  works;  exercise  55  contains  a proof.  A similar 
method,  which  simplifies  the  operations  of  step  E5,  can  be  validated  in  the  same 
way  (see  exercise  56).  The  average  number  of  interchanges  performed  in  step  E5 
is  less  than  0.18  (see  exercise  57). 

As  it  stands,  Algorithm  E isn’t  faster  than  other  methods  we  have  seen.  But 
it  has  the  nice  property  that  it  changes  each  permutation  in  a minimal  way,  using 
only  n — 1 different  kinds  of  transpositions.  Whereas  Algorithm  P used  adjacent 
interchanges,  at- 1 4-»  at,  Algorithm  E uses  first-element  swaps,  ao  4->  at,  also 
called  star  transpositions,  for  some  well-chosen  sequence  of  indices  t[l],  f[2],  . . . , 
t[n\  — 1].  And  if  we  are  generating  permutations  repeatedly  for  the  same  fairly 
small  value  of  n,  we  can  precompute  this  sequence,  as  we  did  in  Algorithm  T 
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for  the  index  sequence  of  Algorithm  P.  Notice  that  star  transpositions  have  an 
advantage  over  adjacent  interchanges,  because  we  always  know  the  value  of  a0 
from  the  previous  swap;  we  need  not  read  it  from  memory. 

Let  En  be  the  sequence  of  n!  — 1 indices  t such  that  Algorithm  E swaps  o0 
with  at  in  step  E4.  Since  En+1  begins  with  En,  we  can  regard  En  as  the  first 
n\  — 1 elements  of  an  infinite  sequence 

Ex  = 121213212123121213212124313132131312  ...  . (39) 

For  example,  if  n = 4 and  do <2102(13  = 1234,  the  permutations  visited  by 
Algorithm  E are 

1234,  2134,  3124,  1324,  2314,  3214, 

4213,  1243,  2143,  4123,  1423,  2413, 

3412,  4312,  1342,  3142,  4132,  1432,  ^ 

2431,  3421,  4321,  2341,  3241,  4231. 

*Using  fewer  generators.  After  seeing  Algorithms  P and  E,  we  might  naturally 
ask  whether  all  permutations  can  be  obtained  by  using  just  two  basic  operations, 
instead  of  n - 1.  For  example,  Nijenhuis  and  Wilf  [ Combinatorial  Algorithms 
(1975),  Exercise  6]  noticed  that  all  permutations  can  be  generated  for  n = 4 
if  we  replace  axa2a3  . . . an  at  each  step  by  either  a2a3  . . . anai  or  a2aia3  . . . an, 
and  they  wondered  whether  such  a method  exists  for  all  n. 

In  general,  if  G is  any  group  of  permutations  and  if  ax,  ... , ak  are  ele- 
ments of  G,  the  Cayley  graph  for  G with  generators  (aq, . . . , ak)  is  the  directed 
graph  whose  vertices  are  the  permutations  tt  of  G and  whose  arcs  go  from  tt 
to  ai tt,  ...,  akTr.  [Arthur  Cayley,  American  J.  Math.  1 (1878),  174-176.]  The 
question  of  Nijenhuis  and  Wilf  is  equivalent  to  asking  whether  the  Cayley  graph 
for  all  permutations  of  {1, 2, . . . , n},  with  generators  a and  r where  a is  the  cyclic 
permutation  (12  ...  n)  and  r is  the  transposition  (1  2),  has  a Hamiltonian  path. 

A basic  theorem  due  to  R.  A.  Rankin  [Proc.  Cambridge  Philos.  Soc.  44 
(1948),  17-25]  allows  us  to  conclude  in  many  cases  that  Cayley  graphs  with  two 
generators  do  not  have  a Hamiltonian  cycle: 

Theorem  R.  Let  G be  a group  consisting  of  g permutations.  If  the  Cayley  graph 
for  G with  generators  (a,/?)  has  a Hamiltonian  cycle,  and  if  the  permutations 
(a,  /3,  a/3~)  are  respectively  of  order  (a,  b,  c),  then  either  c is  even  or  g/a  and  g/b 
are  odd. 

(The  order  of  a permutation  a is  the  least  positive  integer  a such  that  aa  is  the 
identity.) 

Proof.  See  exercise  73.  | 

In  particular,  when  a = a and  f3  = r as  above,  we  have  g — n!,  a — n,  b = 2,  and 
c — n-l,  because  ot~  = (2  ...  n ).  Therefore  we  conclude  that  no  Hamiltonian 
cycle  is  possible  when  n > 4 is  even.  However,  a Hamiltonian  path  is  easy  to 
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construct  when  n — 4,  because  we  can  join  up  the  12-cycles 

1234  ->  2341  -4  3412  -4  4312  -4  3124  -4  1243  -4  2431 

-4  4231  -4  2314  -4  3142  -4  1423  -4  4123  -4  1234, 

2134  -4  1342  -4  3421  -4  4321  -4  3214  -4  2143  -4  1432  ^41' 

-4  4132  -4  1324  -4  3241  -4  2413  -4-  4213  -4  2134, 
by  starting  at  2341  and  jumping  from  1234  to  2134,  ending  at  4213. 

Ruskey,  Jiang,  and  Weston  [Discrete  Applied  Math.  57  (1995),  75-83]  un- 
dertook an  exhaustive  search  in  the  cr-r  graph  for  n = 5 and  discovered  that 
it  has  five  essentially  distinct  Hamiltonian  cycles,  one  of  which  (the  “most 
beautiful”)  is  illustrated  in  Fig.  42(a).  They  also  found  a Hamiltonian  path 
for  n = 6;  this  was  a difficult  feat,  because  it  is  the  outcome  of  a 720-stage 
binary  decision  tree.  Unfortunately  the  solution  they  discovered  has  no  apparent 
logical  structure.  A somewhat  less  complex  path  is  described  in  exercise  70,  but 
even  that  path  cannot  be  called  simple.  Therefore  a cr-r  approach  will  probably 
not  be  of  practical  interest  for  larger  values  of  n unless  a new  construction 
is  discovered.  R.  C.  Compton  and  S.  G.  Williamson  [Linear  and  Multilinear 
Algebra  35  (1993),  237-293]  have  proved  that  Hamiltonian  cycles  exist  for  all  n 
if  the  three  generators  cr,  o~ , and  r are  allowed  instead  of  just  a and  r;  their 
cycles  have  the  interesting  property  that  every  nth  transformation  is  r,  and  the 
intervening  n - 1 transformations  are  either  all  cr  or  all  cr-.  But  their  method  is 
too  complicated  to  explain  in  a short  space. 

Exercise  69  describes  a general  permutation  algorithm  that  is  reasonably 
simple  and  needs  only  three  generators,  each  of  order  2.  Figure  42(b)  illustrates 
the  case  n = 5 of  this  method,  which  was  motivated  by  examples  of  bell-ringing. 


(a)  Using  only  transitions  (12  34  5)  and  (12). 


(b)  Using  only  transitions  (12)  (3  4),  (2  3)  (4  5),  and  (3  4). 


Fig.  42.  Hamiltonian  cycles  for  5!  permutations. 

Faster,  faster.  What  is  the  fastest  way  to  generate  permutations?  This  question 
has  often  been  raised  in  computer  publications,  because  people  who  examine  n! 
possibilities  want  to  keep  the  running  time  as  small  as  possible.  But  the  answers 
have  generally  been  contradictory,  because  there  are  many  different  ways  to 
formulate  the  question.  Let’s  try  to  understand  the  related  issues  by  studying 
how  permutations  might  be  generated  most  rapidly  on  the  MMIX  computer. 

Suppose  first  that  our  goal  is  to  produce  permutations  in  an  array  of  n 
consecutive  memory  words  (octabytes).  The  fastest  way  to  do  this,  of  all  those 
we’ve  seen  in  this  section,  is  to  streamline  Heap’s  method  (27),  as  suggested  by 
R.  Sedgewick  [Computing  Surveys  9 (1977),  157-160]. 
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The  key  idea  is  to  optimize  the  code  for  the  most  common  cases  of  steps  G2 
and  G3,  namely  the  cases  in  which  all  activity  occurs  at  the  beginning  of  the 
array.  If  registers  u,  v,  and  w contain  the  contents  of  the  first  three  words,  and 
if  the  next  six  permutations  to  be  generated  involve  permuting  those  words  in 
all  six  possible  ways,  we  can  clearly  do  the  job  as  follows: 


PUSHJ  0, Visit 
STO  v,A0;  STO  u,Al 
STO  vi , AO;  STO  v,A2 
STO  u.AO;  STO  w,Al 
STO  v,A0;  STO  u,A2 
STO  w, AO;  STO  v,Al 


PUSHJ  0, Vi sit 
PUSHJ  0, Visit 
PUSHJ  0, Visit 
PUSHJ  0, Visit 
PUSHJ  0, Visit 


(42) 


(Here  A0  is  the  address  of  octabyte  a0,  etc.)  A complete  permutation  program 
which  takes  care  of  getting  the  right  things  into  u,  v.  and  w,  appears  in  exer- 
cise 77,  but  the  other  instructions  are  less  important  because  they  need  to  be 
performed  only  g of  the  time.  The  total  cost  per  permutation,  not  counting  the 
4u  needed  for  PUSHJ  and  POP  on  each  call  to  Visit,  comes  to  approximately 
2.77p  + 3.690  with  this  approach.  If  we  use  four  registers  u,  v,  w,  x , and  if 
we  expand  (42)  to  24  calls  on  Visit,  the  running  time  per  permutation  drops 
to  about  2.19/1  + 3.07m  And  with  r registers  and  r\  Visits,  exercise  78  shows 

that  the  cost  is  (2  + 0(l/r!))(/i  + v),  which  is  very  nearly  the  cost  of  two  STO 
instructions. 


The  latter  is,  of  course,  the  minimum  possible  time  for  any  method  that 
generates  all  permutations  in  a sequential  array.  ...  Or  is  it?  We  have  assumed 
that  the  visiting  routine  wants  to  see  permutations  in  consecutive  locations,  but 
perhaps  that  routine  is  able  to  read  the  permutations  from  different  starting 
points.  Then  we  can  arrange  to  keep  an_1  fixed  and  to  keep  two  copies  of  the 
other  elements  in  its  vicinity: 


a0ax  . . . an_2an_ia0ai 


(43) 


If  we  now  let  a0ax  . . . an_2  run  through  (n  - 1)!  permutations,  always  changing 
both  copies  simultaneously  by  doing  two  STO  commands  instead  of  one,  we  can 
let  every  call  to  Visit  look  at  the  n permutations 


ao°i'.-an-i,  ai ■ ■ ■ an_iao,  •■.,  an-iao  . 


• Q,n  — 2i 


(44) 


which  all  appear  consecutively.  The  cost  per  permutation  is  now  reduced  to  the 
cost  of  three  simple  instructions  like  ADD,  CMP,  PBNZ,  plus  0(l/n).  [See  Varol 
and  Rotem,  Comp.  J.  24  (1981),  173-176.] 

Furthermore,  we  might  not  want  to  waste  time  storing  permutations  into 
memory  at  all.  Suppose,  for  example,  that  our  goal  is  to  generate  all  permuta- 
tions of  {0, 1, . . . , n - 1}.  The  value  of  n will  probably  be  at  most  16,  because 
16.  - 20,922,789,888,000  and  17!  = 355,687,428,096,000.  Therefore  an  entire 
permutation  will  fit  in  the  16  nybbles  of  an  octabyte,  and  we  can  keep  it  in  a 
single  register.  This  will  be  advantageous  only  if  the  visiting  routine  doesn’t 
need  to  unpack  the  individual  nybbles;  but  let’s  suppose  that  it  doesn’t.  How 
fast  can  we  generate  permutations  in  the  nybbles  of  a 64-bit  register? 
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One  idea,  suggested  by  a technique  due  to  A.  J.  Goldstein  [U.  S.  Patent 
3383661  (14  May  1968)],  is  to  precompute  the  table  (t[l], . . . , t [5039] ) of  plain- 
change  transitions  for  seven  elements,  using  Algorithm  T.  These  numbers  t[k]  lie 
between  1 and  6,  so  we  can  pack  20  of  them  into  a 64-bit  word.  It  is  convenient 
to  put  the  number  = i 23fc_1t[20j  + k]  into  word  j of  an  auxiliary  table,  for 
0 < j < 252,  with  f[5040]  = 1;  for  example,  the  table  begins  with  the  codeword 

00001010011 100 101 110 100 110 101 100011010001 110001010011 100 10111000. 

The  following  program  reads  such  codes  efficiently: 


Perm 

( Set  register  a 

to  the  first  permutation) 

OH 

LDA  p,T 

JMP  3F 

p 4—  address  of  first  codeword. 

1H 

( Visit  the  permutation  in  register  a ) 

( Swap  the  nybbles  of  a that  lie  t bits  from  the  right ) 

SRU  c , c , 3 

c<-c>3. 

2H 

AND  t,c,#lc  t +-  c& (11100)2. 

PBNZ  t , IB 
ADD  p , p , 8 

Branch  if  t ^ 0. 

3H 

LD0  c,p,0 

c <—  next  codeword. 

PBNZ  c ,2B 

(The  final  codeword  is  followed  by  0. 

(45) 


( If  not  done,  advance  the  leading  n — 7 nybbles  and  return  to  OB ) 

Exercise  79  shows  how  to  ( Swap  the  nybbles  . . . ) with  seven  instructions,  using 
bit  manipulation  operations  that  are  found  on  most  computers.  Therefore  the 
cost  per  permutation  is  just  a bit  more  than  10u.  (The  instructions  that  fetch 
new  codewords  cost  only  (//  + 5v) /20:  and  the  instructions  that  advance  the 
leading  n — 7 nybbles  are  even  more  negligible  since  their  cost  is  divided  by  5040.) 
Notice  that  there  is  now  no  need  for  PUSHJ  and  POP  as  there  was  with  (42);  we 
ignored  those  instructions  before,  but  they  did  cost  Av. 

We  can,  however,  do  even  better  by  adapting  Langdon’s  cyclic-shift  method, 
Algorithm  C.  Suppose  we  start  with  the  lexicographically  largest  permutation 
and  operate  as  follows: 

GREG  <8 


(46) 


( Continue  with  Langdon’s  method ) 

The  running  time  per  permutation  is  now  only  5v  + 0(1/ n),  again  without  the 
need  for  PUSHJ  and  POP.  See  exercise  81  for  an  interesting  way  to  extend  (46)  to 
a complete  program,  obtaining  a remarkably  short  and  fast  routine. 


OH 

0CTA 

#f edcba9876543210& (1« (4*N) -1) 

Perm 

LD0U 

a,  0B 

Set  a «-  * . . .3210. 

JMP 

2F 

1H 

SRU 

a,a,4*(16-N) 

a 4 — |_a/1616_nJ. 

OR 

a,a,t 

a «—  a | t. 

2H 

(Visit  the  permutation  in  register  a) 

SRU 

t , a,4* (N-l) 

t <-  [a/16n_1J . 

SLU 

a,a,4*(17-N) 

a <—  1617-na  mod  16 

PBNZ 

t , IB 

To  IB  if  t ^ 0. 
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Fast  permutation  generators  are  amusing,  but  in  practice  we  can 
save  more  time  by  streamlining  the  visiting  routine  than  by  speeding 
generator. 


usually 
up  the 


Topological  sorting.  Instead  of  working  with  all  n!  permutations  of  {1, , n) 
we  often  want  to  look  only  at  permutations  that  obey  certain  restrictions.  For 
example,  we  might  be  interested  only  in  permutations  for  which  1 precedes  3. 

2 precedes  3,  and  2 precedes  4;  there  are  five  such  permutations  of  jl  2 3 4} 
namely  1 ’ ’ ’ 1 ’ 

1234,  1243,  2134,  2143,  2413.  (4r) 

The  problem  of  topological  sorting , which  we  studied  in  Section  2.2.3  as  a first 
example  of  nontrivial  data  structures,  is  the  general  problem  of  finding  a permu- 
tation that  satisfies  to  such  conditions  Xl  A yu  . . . , xm  -<  ym,  where  x<y  means 
that  x should  precede  y m the  permutation.  This  problem  arises  frequently  in 
practice,  so  it  has  several  different  names;  for  example,  it  is  often  called  the  linear 
embedding  problem,  because  we  want  to  arrange  objects  in  a line  while  preserving 
certain  order  relationships.  It  is  also  the  problem  of  extending  a partial  ordering 
to  a total  ordering  (see  exercise  2.2.3-14). 

Our  goal  in  Section  2.2.3  was  to  find  a single  permutation  that  satisfies 
all  the  relations.  But  now  we  want  rather  to  find  all  such  permutations,  all 
topological  sorts.  Indeed,  we  will  assume  in  the  present  section  that  the  elements 
* and  V on  which  the  relations  are  defined  are  integers  between  1 and  n and 
that  we  have  x < y whenever  x < y.  Consequently  the  permutation  12  „ 

will  always  be  topologically  correct.  (If  this  simplifying  assumption  is  not  met 

we  can  preprocess  the  data  by  using  Algorithm  2.2.3T  to  rename  the  objects 
appropriately.) 

Many  important  classes  of  permutations  are  special  cases  of  this  topological 
ordering  problem.  For  example,  the  permutations  of  {1, . . . , 8}  such  that 

1^2,  2^3,  3^4,  6^7,  7^8 

are  equivalent  to  permutations  of  the  multiset  {1,1,1,1,2,3,3,31,  because  we 
can  map  {1,2,  3,  4}  ^ 1,  5 ^ 2,  and  {6,7,8}  ^ 3.  We  know  how  to  generate 
permutations  of  a multiset  using  Algorithm  L,  but  now  we  will  learn  another  way. 

Notice  that  x precedes  y in  a permutation  a1...an  if  and  only  if  a'  < a'  in 
the  inverse  permutation  a{  . . . <.  Therefore  the  algorithm  we  are  aboutto  study 
will  also  find  all  permutations  a\  . . . a'n  such  that  a'  < a'k  whenever  j -x  k For 
example  we  learned  in  Section  5.1.4  that  a Young  tableau  is  an  arrangement  of 
{!’•••,  ni  m rows  and  columns  so  that  each  row  is  increasing  from  left  to  right 
and  each  column  is  increasing  from  top  to  bottom.  The  problem  of  generating  all 
3X3  Young  tableaux  is  therefore  equivalent  to  generating  all  a[  . . . a'g  such  that 

a}  < a'2  < a'3,  a'4  < a'5  < a'6, 
al  < a4  < a7>  a2  < a$  < a'8, 
and  this  is  a special  kind  of  topological  sorting. 


a7  < a8  < a9> 

a3  < a6  < a9i 


(48) 
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We  might  also  want  to  find  all  perfect  matchings  of  2 n elements,  namely  all 
ways  to  partition  {1, . . . , 2 n}  into  n pairs.  There  are  (2 n - 1)(2 n - 3) ...  (1)  = 
(2n)!/(2"n!)  ways  to  do  this,  and  they  correspond  to  permutations  that  satisfy 


a,  < a 


2> 


l2n-l 


< a 


2 n 5 


< 


< a. 


2n—  1 • 


(49) 


An  elegant  algorithm  for  exhaustive  topological  sorting  was  discovered  by 
Y.  L.  Varol  and  D.  Rotem  [Comp.  J.  24  (1981),  83-84],  who  realized  that  a 
method  analogous  to  plain  changes  (Algorithm  P)  can  be  used.  Suppose  we 
have  found  a way  to  arrange  {l,...,n  — 1}  topologically,  so  that  ai...on_i 
satisfies  all  the  conditions  that  do  not  involve  n.  Then  we  can  easily  write  down 
all  the  allowable  ways  to  insert  the  final  element  n without  changing  the  relative 
order  of  ai  . . . an_ j:  We  simply  start  with  a\  . . . an_in,  then  shift  n left  one  step 
at  a time,  until  it  cannot  move  further.  Apifiying  this  idea  recursively  yields  the 
following  straightforward  procedure. 


Algorithm  V (All  topological  sorts).  Given  a relation  -<  on  {1, . . . , n}  with  the 
property  that  x -<  y implies  x < y,  this  algorithm  generates  all  permutations 
al. . ,an  and  their  inverses  a[  . . . a'n  with  the  property  that  a'  < a'k  whenever 
j -<  k.  We  assume  for  convenience  that  a0  = a'0  — 0 and  that  0 -<  k for  1 < k < n. 
VI.  [Initialize.]  Set  a ■ «—  j and  a'-  «—  j for  0 < j <n. 

V2.  [Visit.]  Visit  the  permutation  ax  . . .an  and  its  inverse  a\  . . . a'n.  Then  set 
k 4—  n. 


V3.  [Can  k move  left?]  Set  j 4—  a'k  and  l 4—  aj_1.  If  l -<  k,  go  to  V5. 

V4.  [Yes,  move  it.]  Set  a^_1  4-  k,  a.j  4-  l,  a'k  4—  j - 1,  and  a\  4-  j.  Go  to  V2. 
V5.  [No,  put  k back.]  While  j < k,  set  l 4-  a -+1,  a-  4-  l,  a\  4-  j,  and  j 4-  j + 1. 

Then  set  ak  4—  a'k  4—  k.  Decrease  k by  1 and  return  to  V3  if  k > 0.  | 

For  example,  Theorem  5.1.4H  tells  us  that  there  are  exactly  42  Young  tableaux 
of  size  3x3.  If  we  apply  Algorithm  V to  the  relations  (48)  and  write  the  inverse 
permutation  in  array  form 


al  a2a3 
°4a5a6  ) 
fly  flg  flg 


(50) 


we  get  the  following  42  results: 


123 

123 

123 

123 

123 

124 

124 

124 

124 

124 

125 

125 

125 

125 

456 

457 

458 

467 

468 

356 

357 

358 

367 

368 

367 

368 

346 

347 

789 

689 

679 

589 

579 

789 

689 

679 

589 

579 

489 

479 

789 

689 

125 

348 

679 

126 

347 

589 

126 

348 

579 

127 

348 

569 

126 

357 

489 

126 

358 

479 

127 

358 

469 

134 

256 

789 

134 

257 

689 

134 

258 

679 

134 

267 

589 

134 

268 

579 

135 

267 

489 

135 

268 

479 

145 

145 

135 

135 

135 

136 

136 

137 

136 

136 

137 

146 

146 

147 

267 

268 

246 

247 

248 

247 

248 

248 

257 

258 

258 

257 

258 

258 

389 

379 

789 

689 

679 

589 

579 

569 

489 

479 

469 

389 

379 

369 
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Let  tT  be  the  number  of  topological  sorts  for  which  the  final  n-r  elements 
are  in  their  initial  position  a,  = j for  r < j < n.  Equivalently,  tr  is  the  number 
of  topological  sorts  ax . . . aT  of  {1, . . . , r},  when  we  ignore  the  relations  involving 
e ements  greater  than  r.  Then  the  recursive  mechanism  underlying  Algorithm  V 

shows  that  step  V2  is  performed  N times  and  step  V3  is  performed  M times 
where 

M = tn-\ f ti  and  N = tn.  (51) 

Also,  step  V4  and  the  loop  operations  of  V5  are  performed  N — 1 times-  the  rest 
of  step  V5  is  done  M-N+  1 times.  Therefore  the  total  running  time  of  the 
algorithm  is  a linear  combination  of  M,  N , and  n. 

If  the  element  labels  are  chosen  poorly,  M might  be  much  larger  than  N. 
For  example,  if  the  constraints  input  to  Algorithm  V are 

2 A 3,  3^4,  ...,  n-l^n,  (52) 

then  tj=  j for  1 < j < n and  we  have  M = |(n2  + n),  N — n.  But  those 
constraints  are  also  equivalent  to 

l^2,  2 -<  3,  n-2^n-l,  (53) 

under  renaming  of  the  elements;  then  M is  reduced  to  2n  - 1 = 2N  - 1. 

Exercise  89  shows  that  a simple  preprocessing  step  will  find  element  labels 
so  that  a slight  modification  of  Algorithm  V is  able  to  generate  all  topological 
sorts  in  0(N  + n)  steps.  Thus  topological  sorting  can  always  be  done  efficiently. 

Think  twice  before  you  permute.  We  have  seen  several  attractive  algorithms 
for  permutation  generation  in  this  section,  but  many  algorithms  are  known  by 
which  permutations  that  are  optimum  for  particular  purposes  can  be  found 
~ut  runninS  through  all  possibilities.  For  example,  Theorem  6.1S  showed 
that  we  can  find  the  best  way  to  arrange  records  on  a sequential  storage  simply 
by  sorting  them  with  respect  to  a certain  cost  criterion,  and  this  process  takes 
only  6>(n  log  n)  steps.  In  Section  7.5.2  we  will  study  the  assignment  problem, 
which  asks  how  to  permute  the  columns  of  a square  matrix  so  that  the  sum  of 
the  diagonal  elements  is  maximized.  That  problem  can  be  solved  in  at  most 
0(n  ) operations,  so  it  would  be  foolish  to  use  a method  of  order  n!  unless  n 
is  extremely  small.  Even  in  cases  like  the  traveling  salesrep  problem,  when  no 
efficient  algorithm  is  known,  we  can  usually  find  a much  better  approach  than 
to  examine  every  possible  solution.  Permutation  generation  is  best  used  when 
there  is  good  reason  to  look  at  each  permutation  individually. 

EXERCISES 

► 1.  [20]  Explain  how  to  make  Algorithm  L run  faster,  by  streamlining  its  operations 

when  the  value  of  j is  near  n. 

2.  [20]  Rewrite  Algorithm  L so  that  it  produces  all  permutations  of  a,  an  in 
reverse  colex  order.  (In  other  words,  the  values  of  the  reflections  an  ...a,  should  be 
lexicographically  decreasing,  as  in  (n).  This  form  of  the  algorithm  is  often  simpler 
and  taster  than  the  original,  because  fewer  calculations  depend  on  the  value  of  n.) 
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► 3.  [M21]  The  rank  of  a combinatorial  arrangement  X with  respect  to  a generation 
algorithm  is  the  number  of  other  arrangements  that  the  algorithm  visits  prior  to  X. 
Explain  how  to  compute  the  rank  of  a given  permutation  ai . . ,an  with  respect  to 
Algorithm  L,  if  {ai, . . . ,a„)  = {1, ...  ,n}.  What  is  the  rank  of  314592687? 

4.  [M23]  Generalizing  exercise  3,  explain  how  to  compute  the  rank  oi  a\  ...  an  with 
respect  to  Algorithm  L when  {ai,...,a„}  is  the  multiset  {n i • x-i,...,nt  ■ Xt}',  here 
ni  + • ■ ■ + rif  = n and  x\  < ■ ■ ■ < xt.  (The  total  number  of  permutations  is,  of  course, 
the  multinomial  coefficient 

( " ) = — 

see  Eq.  5.1.2-(3).)  What  is  the  rank  of  314159265? 

5.  [HM25]  Compute  the  mean  and  variance  of  the  number  of  comparisons  made  by 
Algorithm  L in  (a)  step  L2,  (b)  step  L3,  when  the  elements  {ai, . . . ,an}  are  distinct. 

6.  [HM34]  Derive  generating  functions  for  the  mean  number  of  comparisons  made 
by  Algorithm  L in  (a)  step  L2,  (b)  step  L3,  when  {ai, . . . ,a„}  is  a general  multiset 
as  in  exercise  4.  Also  give  the  results  in  closed  form  when  {ai, . . . ,an}  is  the  binary 
multiset  (s  • 0,  (n  — s)  ■ 1}. 

7.  [HM35]  What  is  the  limit  as  f A oo  of  the  average  number  of  comparisons 
made  per  permutation  in  step  L2  when  Algorithm  L is  being  applied  to  the  multiset 
(a)  (2  • 1,  2 • 2,  . . . , 2 ■ f}?  (b)  {1  • 1,  2 • 2,  . . . , t • t}?  (c)  {2  • 1,  4 • 2,  . . . , 2*  ■ t}? 

► 8.  [21]  The  variations  of  a multiset  are  the  permutations  of  all  its  submultisets.  For 
example,  the  variations  of  {1,2, 2, 3}  are 

e,  1,  12,  122,  1223,  123,  1232,  13,  132,  1322, 

2,  21,  212,  2123,  213,  2132,  22,  221,  2213,  223,  2231,  23,  231,  2312,  232,  2321, 

3,  31,  312,  3122,  32,  321,  3212,  322,  3221. 


Show  that  simple  changes  to  Algorithm  L will  generate  all  variations  of  a given  multiset 

{“1, 02,  • ■ ■ , on}. 

9.  [22]  Continuing  the  previous  exercise,  design  an  algorithm  to  generate  all  r- 
variations  of  a given  multiset  {ai,  02, . . . , an},  also  called  its  r-permutations,  namely  all 
permutations  of  its  r-element  submultisets.  (For  example,  the  solution  to  an  alphametic 
with  r distinct  letters  is  an  r-variation  of  {0, 1, . . . , 9}.) 

10.  [20]  What  are  the  values  of  a 102  . . . an,  cjc2  . . . cn,  and  01O2  . . . on  at  the  end  of 
Algorithm  P,  if  aia2  . . . an  = 12  . . . n at  the  beginning? 

11.  [M22]  How  many  times  is  each  step  of  Algorithm  P performed?  (Assume  that 

n > 2.) 


► 12.  [M23]  What  is  the  1000000th  permutation  visited  by  (a)  Algorithm  L,  (b)  Algo- 
rithm P,  (c)  Algorithm  C,  if  {ax, . . . , a„}  = {0, . . . , 9}?  Hint:  In  mixed-radix  notation 


we  have  1000000  = 


2,  6,  6,  2,  5,  1,  2,  2,  0,  01 
10,  9,  8,  7,  6,  5,  4,  3,  2,  1 J 


0,  0,  1,  2,  3,  0,  2,  7,  1,  01 

1,  2,  3,  4,  5,  6,  7,  8,  9,  10 1 ' 


13.  [M21]  (Martin  Gardner,  1974.)  True  or  false:  If  aia2...an  is  initially  12  ...n, 
Algorithm  P begins  by  visiting  all  n!/2  permutations  in  which  1 precedes  2;  then  the 
next  permutation  is  n ...  21. 


14.  [ M22 ] True  or  false:  If  aia.2  . . . an  is  initially  X\X2  ■ ■ ■ xn  in  Algorithm  P,  we  always 
have  aj-Cj+s  = Xj  at  the  beginning  of  step  P5. 
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15.  [M23]  (Selmer  Johnson,  1963.)  Show  that  the  offset  variable  s never  exceeds  2 in 
Algorithm  P. 

16.  [21]  Explain  how  to  make  Algorithm  P run  faster,  by  streamlining  its  operations 
when  the  value  of  j is  near  n.  (This  problem  is  analogous  to  exercise  1.) 

► 17.  [20]  Extend  Algorithm  P so  that  the  inverse  permutation  a\  . . . a'n  is  available  for 

processing  when  m . . . an  is  visited  in  step  P2.  (The  inverse  satishes  a'k  = j if  and  onlv 
if  (ij  = k.)  - 

18.  [21]  {Rosary  permutations.)  Devise  an  efficient  way  to  generate  (n- l)!/2  permu- 
tations that  represent  all  possible  undirected  cycles  on  the  vertices  (1, . . . , n};  that  is. 
no  cyclic  shift  of  cu  . . . an  or  «„  . . . ax  will  be  generated  if  oi  . . . o„  is  generated.  The 
permutations  (1234,  1324,  3124)  could,  for  example,  be  used  when  n = 4. 

19.  [25]  Construct  an  algorithm  that  generates  all  permutations  of  n distinct  elements 
looplessly  in  the  spirit  of  Algorithm  7.2.1.1L. 

► 20.  [20]  The  n-cube  has  2nn!  symmetries,  one  for  each  way  to  permute  and/or  com- 
plement the  coordinates.  Such  a symmetry  is  conveniently  represented  as  a signed 
permutation, ji amely  a permutation  with  optional  signs  attached  to  the  elements.  For 
example,  231  is  a signedjjermutation  that  transforms  the  vertices  of  the  3-cube  by 
changing  xlX2x3  to  x2x3xu  so  that  000  m-  001,  001  n-  Oil,  . . . , 111  ^ no.  Design 
a simple  algorithm  that  generates  all  signed  permutations  of  {1,2,...,  ri},  where  each 
step  either  interchanges  two  adjacent  elements  or  negates  the  first  element. 

21.  [M21]  (E.  P.  McCravy,  1971.)  How  many  solutions  does  the  alphametic  (6)  have 
in  radix  b? 


22.  [M15]  True  or  false:  If  an  alphametic  has  a solution  in  radix  b,  it  has  a solution 
in  radix  6+1. 

23.  [M20]  True  or  false:  A pure  alphametic  cannot  have  two  identical  signatures 
Sj  = sk  / 0 when  j ^ k. 

24.  [25]  Solve  the  following  alphametics  by  hand  or  by  computer: 

a)  SEND  + A + TAD  + MORE  = MONEY. 

b)  ZEROES  + ONES  = BINARY.  (Peter  MacDonald,  1977) 

c)  DCLIX  + DLXVI  = MCCXXV.  (Willy  Enggren,  1972) 


d) 

e) 

f) 

g) 

h) 


i) 

25. 


COUPLE  + COUPLE  = QUARTET.  (Michael  R.  W.  Buckley,'  1977) 

FISH  + N + CHIPS  = SUPPER.  (Bob  Vinnicombe,  1978) 

SATURN  + URANUS  + NEPTUNE  + PLUTO  = PLANETS.  (Willy  Enggren  19681 

EARTH  + AIR  + FIRE  + WATER  = NATURE.  (Herman  Nijon,’  1977) 

AN  + ACCELERATING  + INFERENTIAL  + ENGINEERING  + TALE  + ELITE  + GRANT  + FEE  + 
ET  + CETERA  = ARTIFICIAL  + INTELLIGENCE. 

HARDY  + NESTS  = NASTY  + HERDS. 

[M21]  Devise  a fast  way  to  compute  min(a  • s)  and  max(«  • s)  over  all  valid 
permutations  «i  . . . a10  of  (0, ... , 9),  given  the  signature  vector  s = (si, . . . , s10)  and 
the  first-letter  set  F of  an  alphametic  problem.  (Such  a procedure  makes  it  possible 
to  rule  out  many  cases  quickly  when  a large  family  of  alphametics  is  being  considered 
as  m several  of  the  exercises  that  follow,  because  a solution  can  exist  only  when 
min(a  • s)  < 0 < max(a  • s).) 

26.  [25]  What  is  the  unique  alphametic  solution  to 

NIIHAU  ± KAUAI  ± OAHU  ± MOLOKAI  ± LANAI  ± MAUI  ± HAWAII  = 0? 

27.  [30]  Construct  pure  additive  alphametics  in  which  all  words  have  five  letters. 
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28.  [M25]  A partition  of  the  integer  n is  an  expression  of  the  form  n = m + - ■ -+nt  with 

n\  > ■ ■ ■ >nt  > 0.  Such  a partition  is  called  doubly  true  if  a(n)  = a(ni)H \-a(nt)  is 

also  a pure  alphametic,  where  a(n)  is  the  “name”  of  n in  some  language.  Doubly  true 
partitions  were  introduced  by  Alan  Wayne  in  AMM  54  (1947),  38,  412-414,  where  he 
suggested  solving  TWENTY  = SEVEN  + SEVEN  + SIX  and  a few  others. 

a)  Find  all  partitions  that  are  doubly  true  in  English  when  1 < n < 20. 

b)  Wayne  also  gave  the  example  EIGHTY  = FIFTY  + TWENTY  + NINE  + ONE.  Find  all 
doubly  true  partitions  for  1 < n < 100  in  which  the  parts  are  distinct,  using  the 
names  ONE,  TWO,  . . . , NINETYNINE,  ONEHUNDRED. 

► 29.  [M25]  Continuing  the  previous  exercise,  find  all  equations  of  the  form  n1  4 h 

nt  — n'i  + • • ■ + n't'  that  are  both  mathematically  and  alphametically  true  in  English, 
when  {rii, . . . , nt,  n'x, . . . , n't,}  are  distinct  positive  integers  less  than  20.  For  example, 


TWELVE  + NINE  + TWO  = ELEVEN  + SEVEN  + FIVE ; 


the  alphametics  should  all  be  pure. 


30.  [25]  Solve  these  multiplicative  alphametics  by  hand  or  by  computer: 

a)  TWO  x TWO  = SQUARE.  (H.  E.  Dudeney,  1929) 

b)  HIP  x HIP  = HURRAY.  (Willy  Enggren,  1970) 

c)  PI  x R x R = AREA.  (Brian  Barwell,  1981) 

d)  NORTH/SOUTH  = EAST/WEST.  (Nob  Yoshigahara,  1995) 

e)  NAUGHT  x NAUGHT  = ZERO  x ZERO  x ZERO.  (Alan  Wayne,  2003) 

31.  [ M22 ] (Nob  Yoshigahara.)  (a)  What  is  the  unique  solution  to  A/BC+D/EF+G/HI  = 

1,  when  (A 1}  = {1, . . . ,9}?  (b)  Similarly,  make  AB  mod  2 = 0,  ABC  mod  3 = 0,  etc. 

32.  [M25]  (H.  E.  Dudeney,  1901.)  Find  all  ways  to  represent  100  by  inserting  a 
plus  sign  and  a slash  into  a permutation  of  the  digits  {1, . . . ,9}.  For  example,  100  = 
91  + 5742/638.  The  plus  sign  should  precede  the  slash. 

33.  [25]  Continuing  the  previous  exercise,  find  all  positive  integers  less  than  150  that 
(a)  cannot  be  represented  in  such  a fashion;  (b)  have  a unique  representation. 

34.  [M26]  Make  the  equation  EVEN  + ODD  + PRIME  = x doubly  true  when  (a)  a:  is  a 
perfect  5th  power;  (b)  a:  is  a perfect  7th  power. 

► 35.  [M20]  The  automorphisms  of  a 4-cube  have  many  different  Sims  tables,  only  one 
of  which  is  shown  in  (14).  How  many  different  Sims  tables  are  possible  for  that  group, 
when  the  vertices  are  numbered  as  in  (12)? 


36.  [ M23 ] 
board 


Find  a Sims  table  for  the  group  of  all  automorphisms  of  the  4x4  tic-tac-toe 


0 12  3 
4 5 6 7 
8 9 a b 
c d e f 


namely  the  permutations  that  take  lines  into  lines,  where  a “line”  is  a set  of  four 
elements  that  belong  to  a row,  column,  or  diagonal. 

► 37.  [HM22]  How  many  Sims  tables  can  be  used  with  Algorithms  G or  H?  Estimate 
the  logarithm  of  this  number  as  n — » 00. 

38.  [HM21]  Prove  that  the  average  number  of  transpositions  per  permutation  when 
using  Ord-Smith’s  algorithm  (26)  is  approximately  sinh  1 « 1.175. 
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39.  [16]  Write  down  the  24  permutations  generated  for  n = 4 by  (a)  Ord-Smith’s 

method  (26);  (b)  Heap’s  method  (27).  Y 1 ’ h S 

40.  [MSS]  Show  that  Heap’s  method  (27)  corresponds  to  a valid  Sims  table. 

► 41.  [MSS).  Design  an  algorithm  that  generates  all  r- variations  of  {0, 1 n - 1}  bv 

interchanging  just  two  elements  when  going  from  one  variation  to’  the  next.  (See 
exercise  9.)  Hint:  Generalize  Heap’s  method  (27),  obtaining  the  results  in  positions 
an_,  . an- 1 of  an  array  a0  . . . on_ 1.  For  example,  one  solution  when  n = 5 and  r = 2 
tWO  elements  of  the  respective  permutations  01234,  31204,  30214,  30124 

34012' 14032- 13042' IM02’  — • «“*•  o243>' 02341,' 

42.  [MW]  Construct  a Sims  table  for  all  permutations  in  which  every  a(k  j)  and 
every  r(k,j)  for  1 <j  < k is  a cycle  of  length  < 3. 

43.  [M24]  Construct  a Sims  table  for  all  permutations  in  which  every  a(k.k),  u(k) 
and  T(k,j)u>(k  - 1)  for  1 < j < k is  a cycle  of  length  < 3. 

44.  [20]  When  blocks  of  unwanted  permutations  are  being  skipped  by  the  extended 
Algorithm  G,  is  the  Sims  table  of  Ord-Smith’s  method  (23)  superior  to  the  Sims  table 
ol  the  reverse  colex  method  (18)? 

miSivWMwuT  the  indiCeS  Ul  ■ • • ' “9  When  A1g°rithm  X Visits  the  permutation 
Ji4t)92b87!'  (b)  What  permutation  is  visited  when  ux...u9  = 161800000? 

46  [20]  True  or  false:  When  Algorithm  X visits  a,  ...  a„,  we  have  uk  > uk+l  if  and 

only  it  ak  > ak+ 1,  for  1 < k < n. 

► 47  [M21]  Express  the  number  of  times  that  each  step  of  Algorithm  X is  performed 
in  terms  of  the  numbers  N0,  Nlt  . . . , Nn,  where  Nk  is  the  number  of  prefixes  ax...ak 
that  satisfy  t3  (ax , . . . , aj ) for  1 < j < k. 

* thei^fel  w rUnn^g  tim6S/°f  A1gorithm  X and  Algorithm  L,  in  the  case 

ess  1 (ui),  t2(ax,  02),  ...,  tn(ax,  a2, ...,  an)  always  are  true. 

► 49.  [28]  The  text’s  suggested  method  for  solving  additive  alphametics  with  Algo- 

rithm X essentially  chooses  digits  from  right  to  left;  in  other  words,  it  assigns  tentative 
values  to  the  least  significant  digits  before  considering  digits  that  correspond  to  higher 
powers  of  10.  6 

Explore  an  alternative  approach  that  chooses  digits  from  left  to  right.  For  example 
such  a method  will  deduce  immediately  that  M = 1 when  SEND  + MORE  = MONEY  Hint- 
bee  exercise  25. 

50.  [ M15 ] Explain  why  the  dual  formula  (32)  follows  from  (13). 

51.  [M16]  True  or  false:  If  the  sets  Sk  = {a(k,  0), . . . , a(k,  k)}  form  a Sims  table  for 
the  group  of  all  permutations,  so  also  do  the  sets  S~  = {a(k,  0)~,  a(k,  k)~}. 

* iM22}  What  permutations  r(k,j)  and  u ,(k)  arise  when  Algorithm  H ’is  used  with 
the  Sims  table  (36)?  Compare  the  resulting  generator  with  Algorithm  P. 

► 53.  [M26]  (F.  M.  Ives.)  Construct  a Sims  table  for  which  Algorithm  H will  generate 
all  permutations  by  making  only  n\  + 0((n  - 2)!)  transpositions. 

54.  [20]  Would  Algorithm  C work  properly  if  step  C3  did  a right-cyclic  shift,  setting 
“x  • ' • afc-iafc  “fc“i  • • • a*:-i,  instead  of  a left-cyclic  shift? 

55.  [M27]  Consider  the  factorial  ruler  function 

pi{m)  = max{fc  | m mod  k\  = 0}. 
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Let  a k and  r*  be  permutations  of  the  nonnegative  integers  such  that  ayrt,  = Tk  a, 
whenever  j < k.  Let  ao  and  /3o  be  the  identity  permutation,  and  for  m > 0 define 

— fivn—  lL>! ( m ) 0m  — \OLm  — 1 , fim  ~ & p,(m)  fim—1  • 

For  example,  if  erfe  is  the  flip  operation  (1  k— 1)(2  k— 2) ...  = (0  k)<j>{k ) and  if  Tk  = (0  k), 
and  if  Algorithm  E is  started  with  a,j  = j for  0 < j < n,  then  am  and  /3m  are  the 
contents  of  ao  • • ■ an-i  and  bo  ■■  ■ 6„_i  after  step  E5  has  been  performed  m times. 

a)  Prove  that /3(n+1),a{n+1),  = an+1a~  Tn+1r~ (/3n,an,)n+1 . 

b)  Use  the  result  of  (a)  to  establish  the  validity  of  Algorithm  E. 

56.  [ M22 ] Prove  that  Algorithm  E remains  valid  if  step  E5  is  replaced  by 

E5'.  [Transpose  pairs.)  If  k > 2,  interchange  b]  + 1 <->  bj  for  j = k — 2,  k — 4,  . . . , 
(2  or  1).  Return  to  E2.  | 

57.  [ HM22 ] What  is  the  average  number  of  interchanges  made  in  step  E5? 

58.  [M21  ] True  or  false:  If  Algorithm  E begins  with  aQ  . . . an-i  = x\  . . . xn  then  the 
final  permutation  visited  begins  with  ao  = xn. 

59.  [M20]  Some  authors  define  the  arcs  of  a Cayley  graph  as  running  from  n to  naj 
instead  of  from  n to  oy  7r . Are  the  two  definitions  essentially  different? 

► 60.  [21]  A Gray  cycle  for  permutations  is  a cycle  (7ro,7Ti, . . . ,7r„!_i)  that  includes 
every  permutation  of  {1, 2, . . . , n}  and  has  the  property  that  tt*,  differs  from  n^+i)  mod  «! 
by  an  adjacent  transposition.  It  can  also  be  described  as  a Hamiltonian  cycle  on  the 
Cayley  graph  for  the  group  of  all  permutations  on  {1,2,...,  n},  with  the  n-1  generators 
((1  2),  (2  3),...,  (n—  1 n)).  The  delta  sequence  of  such  a Gray  cycle  is  the  sequence  of 
integers  5o<5i  . . . <5„!-i  such  that 

^(Hl)  modn!  = ^fc“bl)7Tfc. 

(See  7. 2. 1.1  (24),  which  describes  the  analogous  situation  for  binary  n-tuples.)  For 
example,  Fig.  43  illustrates  the  Gray  cycle  defined  by  plain  changes  when  n = 4;  its 
delta  sequence  is  (32131231)3. 

a)  Find  all  Gray  cycles  for  permutations  of  {1,2, 3, 4}. 

b)  Two  Gray  cycles  are  considered  to  be  equivalent  if  their  delta  sequences  can  be 
obtained  from  each  other  by  cyclic  shifting  (Sk  ■ ■ ■ 4n!-i<5o  . . . Sk-i  ) and/or  reversal 
(<5„!_i  . . .5i<5o)  and/or  complementation  ((n— <50)(n— <Si) . . . (n— <5„!_i)).  Which  of 
the  Gray  cycles  in  (a)  are  equivalent? 

1234 


Fig.  43.  Algorithm  P traces  out 
this  Hamiltonian  cycle  on  the 
truncated  octahedron  of  Fig.  5-1. 

4321 
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61.  [21]  Continuing  the  previous  exercise,  a Gray  code  for  permutations  is  like  a Gray 
cycle  except  that  the  final  permutation  is  not  required  to  be  adjacent  to  the  initial 
permutation  n0.  Study  the  set  of  all  Gray  codes  for  n = 4 that  start  with  1234. 

► 62  [M23]  What  permutations  can  be  reached  as  the  final  element  of  a Gray  code 

that  starts  at  12  ...  n?  y 

63.  [M25]  Estimate  the  total  number  of  Gray  cycles  for  permutations  of  {1,  2, 3, 4,  5}. 

64.  [23]  A “doubly  Gray”  code  for  permutations  is  a Gray  cycle  with  the  additional 
property  that  <W  = 8k  ± 1 for  all  k.  Compton  and  Williamson  have  proved  that  such 
codes  exist  for  all  n > 3.  How  many  doubly  Gray  codes  exist  for  n = 5? 

65.  [M25]  For  which  integers  IV  is  there  a Gray  path  through  the  N lexicographically 

smallest  permutations  of  {1, ....  n}?  (Exercise  7.2.1.1-26  solves  the  analogous  problem 
tor  binary  n-tuples.)  ^ 

66'  i Pi!  ffhrlich’s  swaP  method  suggests  another  type  of  Gray  cycle  for  permutations, 
m which  the  n - 1 generators  are  the  star  transpositions  (1  2),  (1  3)  ...  (1  n)  For 

of  this  graph  44  Sh°WS  th<3  feleVant  gmph  When  71  = 4 Analyze  the  Hamiltonian  cycles 


Fig.  44.  The  Cayley  graph  for  permutations  of  {1,2  3 4} 
star  transpositions  (1  2),  (1  3),  and  (1  4),  drawn  as  a twisted 


generated  by  the 
torus. 


67_1  [26]  Continuing  the  previous  exercise,  find  a first-element-swap  Gray  cycle  for 
n - 5 m which  each  star  transposition  (1  j)  occurs  30  times,  for  2 < j < 5. 

68.  [M30]  (V.  L.  Kompel ’makher  and  V.  A.  Liskovets,  1975.)  Let  G be  the  Cayley 

fsTtra  With  generators  (#1 fli)  where  each  a3 

s transposi  ion  (ujvj);  also  let  A be  the  graph  with  vertices  {1  and  edges 

~ ] S Ve  that  ° haS  a Hamil,°nian  cycle  if  and  only  if  A is 

'he  ‘P“‘'  C“'  "h“  14  * * Pa‘hi  « is 


" 6r'rjf{  If  n ~ 4’  the  following  algorithm  generates  all  permutations  A j A 2 A:t  . . . An 
oi  (1,  2, 3, . . . , nj  using  only  three  transformations, 


P = (1  2)  (3 4)  (5  6) ... , a = (2  3)  (4  5)  (6  7) ... , r = (3  4)  (5  6)  (7  8) . . . , 

never  applying  p and  r next  to  each  other.  Explain  why  it  works. 

Zl.  [Initialize.]  Set  A,  <-  j for  1 < j < n.  Also  set  a3  e-  2 j for  1 < j < n/2  and 

an~\  *7  2j  +,1  f°r  1 - i < n/2-  Then  invoke  Algorithm  P,  but  with  parameter 
n 1 instead  of  n.  We  will  treat  that  algorithm  as  a coroutine,  which  should 
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return  control  to  us  whenever  it  “visits”  ai  . . .a„-i  in  step  P2.  We  will  also 
share  its  variables  (except  n). 

Z2.  [Set  x and  y. J Invoke  Algorithm  P again,  obtaining  a new  permutation 
ai  . . . a„_ i and  a new  value  of  j.  If  j = 2,  interchange  ai+„  44  02+s  (thereby 
undoing  the  effect  of  step  P5)  and  repeat  this  step;  in  such  a case  we  are  at  the 
halfway  point  of  Algorithm  P.  If  j = 1 (so  that  Algorithm  P has  terminated), 
set  x -4—  y 4—  0 and  go  to  Z3.  Otherwise  set 

X i Uj— cj+s+[o.j  = +1],  y 4—  Ojj— Cj+s—  [oj=—  1]  i 

these  are  the  two  elements  most  recently  interchanged  in  step  P5. 

Z3.  [Visit.]  Visit  the  permutation  Ai  . . . An.  Then  go  to  Z5  if  Ai  = x and  .4 2 = y. 

Z4.  [Apply  p,  then  <7.]  Interchange  A\  44  A 2,  A3  44  A4,  A5  4 A A6,  ....  Visit 

Ai ...  An-  Then  interchange  A2  44  A3,  A4  44  A5,  As  44  At,  ....  Terminate 

if  Ai  . . . An  = 1 . . . n,  otherwise  return  to  Z3. 

Z5.  [Apply  t,  then  a.]  Interchange  A3  44  A4,  A5  44  As,  A7  44  As,  ....  Visit 

Ai  . . . A„.  Then  interchange  A2  44  A3,  A4  44  A5,  As  44  A7,  . . . , and  return 

to  Z2.  | 

Hint:  Show  first  that  the  algorithm  works  if  modified  so  that  Aj  4—  n + 1 — j and 
a,j  4-  j in  step  Zl,  and  if  the  “flip”  permutations 

p = (1  n)(2  n—  1) . . . , a'  — (2  n)  (3  n— 1) . . . , t = (2  n— 1)(3  n— 2) . . . 

are  used  instead  of  p,  a,  r in  steps  Z4  and  Z5.  In  this  modification,  step  Z3  should  go 
to  Z5  if  Ai  = x and  An  = y;  step  Z4  should  terminate  when  Ai  . . . An  = n . . . 1. 

► 70.  [M33]  The  two  12-cycles  (41)  can  be  regarded  as  a -r  cycles  for  the  twelve  per- 
mutations of  {1, 1,3,4}: 

1134  — 4 1341  -4  3411  -4  4311  -4  3114  -4  1143  -A  1431 

-A  4131  —4  1314  -4  3141  -4  1413  -4  4113  -4  1134. 

Replacing  {1.1}  by  {1,2}  yields  disjoint  cycles,  and  we  obtained  a Hamiltonian  path  by 
jumping  from  one  to  the  other.  Can  a <r-r  path  for  all  permutations  of  6 elements  be 
formed  in  a similar  way,  based  on  a 360-cycle  for  the  permutations  of  {1, 1,3, 4, 5, 6}? 

71.  [48]  Does  the  Cayley  graph  with  generators  <7  = (1  2 ...  n)  and  r = (1  2)  have  a 
Hamiltonian  cycle  whenever  n > 3 is  odd? 

72.  [M21]  Given  a Cayley  graph  with  generators  (ai, . . . ,«*,),  assume  that  each  aj 
takes  x 1 — 4 y.  (For  example,  both  a and  r in  exercise  71  take  1 1—4  2.)  Prove  that  any 
Hamiltonian  path  starting  at  12  ...  n in  G must  end  at  a permutation  that  takes  y 1-4  x. 

► 73.  [ M30 ] Let  a,  0,  and  a be  permutations  of  a set  X , where  X = A U B.  Assume 
that  xa  = xa  when  x € A and  xu  = x/3  when  x £ B,  and  that  the  order  of  a0~  is  odd. 

a)  Prove  that  all  three  permutations  a,  0,  a have  the  same  sign;  that  is,  they  are  all 
even  or  all  odd.  Hint:  A permutation  has  odd  order  if  and  only  if  its  cycles  all 
have  odd  length. 

b)  Derive  Theorem  R from  part  (a). 

74.  [ M30 } (R.  A.  Rankin.)  Assuming  that  a0  = 0a  in  Theorem  R,  prove  that  a 
Hamiltonian  cycle  exists  in  the  Cayley  graph  for  G if  and  only  if  there  is  a number  k 
such  that  0 < k < g/c  and  f + Hc,  where  09'c  = 7*,  7 = a0~ . Hint:  Represent 
elements  of  the  group  in  the  form  09,yk . 
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75.  [MZ6]  The  directed  torus  Cm  xCn  has  mn  vertices  (x,  y)  for  0 < x < m,  0 < y < n, 
and  arcs  (x,y)  ->  (x  y)a  = ((*  + 1)  mod  m,  y),  (x,  y)  — ► (*,  y)/3  = (*,  (y  + l)  mod  n)! 
Prove  that,  if  m > 1 and  n > 1,  the  number  of  Hamiltonian  cycles  of  this  digraph  is 

(k)  [scd((^  — k)m,  kn)  = d\,  d = gcd(m,  n). 

k= 1 


76.  [M31]  The  cells  numbered  0,  1,  . . . , 63 
in  Fig.  45  illustrate  a northeasterly  knight’s 
tour  on  an  8 x 8 torus:  If  k appears  in  cell 
(xk,yk),  then  (xk+1  ,yk+1)  = (xk+2,yk  + l) 
or  (xk  + l,yk+2),  modulo  8,  and  ( x64,ye4 ) = 
(*0,1/0).  How  many  such  tours  are  possible 
on  an  m x n torus,  when  m,n  > 3? 


Fig.  45.  A northeasterly  knight’s  tour. 


► 77.  [22]  Complete  the  MMIX  program  whose  inner  loop  appears  in  (42),  using  Heap’s 

method  (27).  * 

78.  [M23]  Analyze  the  running  time  of  the  program  in  exercise  77,  generalizing  it  so 
that  the  inner  loop  does  r!  visits  (with  a0...ar-i  in  global  registers). 

79.  [20]  What  seven  MMIX  instructions  will  ( Swap  the  nybbles  . . . ) as  (45)  requires1? 
For  example,  if  register  t contains  the  value  4 and  register  a contains  the  nybbles 

12345678,  register  a should  change  to  # 12345687. 

80.  [21]  Solve  the  previous  exercise  with  only  five  MMIX  instructions.  Hint:  Use  MXOR. 

► 81.  [22]  Complete  the  MMIX  program  (46)  by  specifying  how  to  ( Continue  with  Lang- 

don’s  method).  & 

82.  [M21]  Analyze  the  running  time  of  the  program  in  exercise  81. 

83.  [22]  Use  the  <t-t  path  of  exercise  70  to  design  an  MMIX  routine  analogous  to  (42) 
that  generates  all  permutations  of  * 123456  in  register  a. 

84.  [20]  Suggest  a good  way  to  generate  all  n!  permutations  of  {1, . . . , n]  on  p pro- 
cessors that  are  running  in  parallel. 

► 85.  [25]  Assume  that  n is  small  enough  that  n!  fits  in  a computer  word  What’s  a 
good  way  to  convert  a given  permutation  a = a,  . . . an  of  {1, . . . , n}  into  an  integer 

' r(a)  in  the  range  0 < k < n!?  Both  functions  k = r(a)  and  a = rl~1](k)  should 
be  computable  in  only  0(n ) steps. 

86.  [20]  A partial  order  relation  is  supposed  to  be  transitive;  that  is,  x -<  y and  y -<  2 

s ould  imply  x ^ z.  But  Algorithm  V does  not  require  its  input  relation  to  satisfy  this 
condition.  J 

Show  that  if  x -<  y and  y -<  z,  Algorithm  V will  produce  identical  results  whether 
or  not  x -<  z. 

87‘  ,(F-  Ruskey-)  Consider  the  inversion  tables  d . . . c„  of  the  permutations  vis- 

ited by  Algorithm  V.  What  noteworthy  property  do  they  have?  (Compare  with  the 
inversion  tables  (4)  in  Algorithm  P.) 
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88.  [21]  Show  that  Algorithm  V can  be  used  to  generate  all  ways  to  partition  the 
digits  {0, 1, . . . , 9}  into  two  3-element  sets  and  two  2-element  sets. 

► 89.  [MSO]  Consider  the  numbers  to,  ti,  . . . , tn  defined  before  (51).  Clearly  to  = ti  = 1. 

a)  Say  that  index  j is  “trivial”  if  tj  = tj- 1.  For  example,  9 is  trivial  with  respect  to 
the  Young  tableau  relations  (48).  Explain  how  to  modify  Algorithm  V so  that  the 
variable  k takes  on  only  nontrivial  values. 

b)  Analyze  the  running  time  of  the  modified  algorithm.  What  formulas  replace  (51)? 

c)  Say  that  the  interval  [j  . . fc]  is  not  a chain  if  there  is  an  index  l such  that  j < l < k 
and  we  do  not  have  l -<  l + 1.  Prove  that  in  such  a case  tk  > 2tj-i. 

d)  Every  inverse  topological  sort  a\  . . . a'n  defines  a labeling  that  corresponds  to 
relations  a'n  -<  a'kl,  . . . , a'Jm  -<  a'km,  which  are  equivalent  to  the  original  relations 
ji  -<  ki,  . . . , jm  -<  km.  Explain  how  to  find  a labeling  such  that  [j  . . k]  is  not  a 
chain  when  j and  k are  consecutive  nontrivial  indices. 

e)  Prove  that  with  such  a labeling,  M < 4 N in  the  formulas  of  part  (b). 

90.  [M21]  Algorithm  V can  be  used  to  produce  all  permutations  that  are  /(-ordered 
for  all  ft  in  a given  set,  namely  all  a[  . . . a'n  such  that  a'j  < a'j +h  for  1 < j < n — h 
(see  Section  5.2.1).  Analyze  the  running  time  of  Algorithm  V when  it  generates  all 
permutations  that  are  both  2-ordered  and  3-ordered. 

91.  [ HM21 ] Analyze  the  running  time  of  Algorithm  V when  it  is  used  with  the 
relations  (49)  to  find  perfect  matchings. 

92.  [M18]  How  many  permutations  is  Algorithm  V likely  to  visit,  in  a “random” 
case?  Let  Pn  be  the  number  of  partial  orderings  on  {l,...,n},  namely  the  number 
of  relations  that  are  reflexive,  antisymmetric,  and  transitive.  Let  Qn  be  the  number 
of  such  relations  with  the  additional  property  that  j < k whenever  j -<  k.  Express 
the  expected  number  of  ways  to  sort  n elements  topologically,  averaged  over  all  partial 
orderings,  in  terms  of  Pn  and  Q „ . 

93.  [35]  Prove  that  all  topological  sorts  can  be  generated  in  such  a way  that  only 
one  or  two  adjacent  transpositions  are  made  at  each  step.  (The  example  1 -<  2,  3 A 4 
shows  that  a single  transposition  per  step  cannot  always  be  achieved,  even  if  we  allow 
nonadjacent  swaps,  because  only  two  of  the  six  relevant  permutations  are  odd.) 

► 94.  [25]  Show  that  in  the  case  of  perfect  matchings,  using  the  relations  in  (49),  all 
topological  sorts  can  be  generated  with  just  one  transposition  per  step. 

95.  [21  ] Discuss  how  to  generate  all  up-down  permutations  of  (1, . . . , n},  namely  those 
a-i  ...  an  such  that  ai  < 02  > <13  < 04  > • • •. 

96.  [21]  Discuss  how  to  generate  all  cyclic  permutations  of  {1, . . . ,n},  namely  those 
ai  . . . a„  whose  cycle  representation  consists  of  a single  n-cycle. 

97.  [21  ] Discuss  how  to  generate  all  derangements  of  (1, . . . , n},  namely  those  a\  . . .an 
such  that  ai  / 1,  a2  / 2,  03  / 3,  . . . . 

98.  [ HM23 ] Analyze  the  asymptotic  running  time  of  the  method  in  the  previous 
exercise. 

99.  [MSO]  Given  n > 3,  show  that  all  derangements  of  {1, . . . ,n}  can  be  generated 
by  making  at  most  two  transpositions  between  visits. 

100.  [21  ] Discuss  how  to  generate  all  of  the  indecomposable  permutations  of  {1, ... , n}, 
namely  those  a\  . . ,an  such  that  {01, . . . , Oj}  / {1, . . . for  1 < j < n. 

101.  [21]  Discuss  how  to  generate  all  involutions  of  (1, . . . ,n},  namely  those  permu- 
tations a\  . . ,a„  with  aai  . . . aan  = 1 . . . n. 
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102.  [M30]  Show  that  all  involutions  of  {l,...,n}  can  be  generated  by  making  at 
most  two  transpositions  between  visits. 

103.  [M32]  Show  that  all  even  permutations  of  (l,...,n)  can  be  generated  by  suc- 
cessive rotations  of  three  consecutive  elements. 

► 104.  [M22]  A permutation  <n  . . . an  of  {1, . . . , n}  is  well-balanced  if 

n n 

5 Zk°k  = ^(n  + 1 - k)ak. 

k = 1 k=  1 

For  example,  3142  is  well-balanced  when  n = 4. 

a)  Prove  that  no  permutation  is  well-balanced  when  n mod  4 = 2. 

b)  Prove  that  if  a\  . . . an  is  well-balanced,  so  are  its  reversal  an  . . . ai , its  complement 
(n+l~ a0  • • • (n+1— an),  and  its  inverse  a\  . . ,a'n. 

c)  Determine  the  number  of  well-balanced  permutations  for  small  values  of  n. 

► 105.  [26]  A weak  order  is  a relation  A that  is  transitive  (x  X y and  y X z implies 
x A z)  and  complete  (x  A y or  y X x always  holds).  We  can  write  x = y if  x X y and 

y < x\  x -<  y \i  x < y and  y y<  x.  There  are  thirteen  weak  orders  on  three  elements 
{1,  2, 3},  namely 

1 = 2 = 3’  1 = 2x3,  1 k 2 s 3,  1x2x3,  1 = 3x2,  H 3 ^ 2, 

2 ^ * = 3’  2x1x3,  2 = 3x1,  2 -<  3 -<  1,  3X1  = 2,  3x1x2,  3x2x1. 

a)  Explain  how  to  generate  all  weak  orders  of  {1, . . . , n}  systematically,  as  sequences 
of  digits  separated  by  the  symbols  s or  X. 

b)  A weak  order  can  also  be  represented  as  a sequence  a4  . . . a„  where  «,  = k if  j 
is  preceded  by  k X signs.  For  example,  the  thirteen  weak  orders  on  {1  2 3}  are 
respectively  000,  001,  Oil,  012,  010,  021,  101,  102,  100,  201,  110,  120,  210  in  this 
form.  Find  a simple  way  to  generate  all  such  sequences  of  length  n. 

106.  [M40]  Can  exercise  105(b)  be  solved  with  a Gray-like  code? 

► 107.  [30]  (John  H.  Conway,  1973.)  To  play  the  solitaire  game  of  “topswops,”  start  by 
shuffling  a pack  of  n cards  labeled  (1, . . . , n}  and  place  them  face  up  in  a pile.  Then  if 
the  top  card  is  fc  > 1,  deal  out  the  top  k cards  and  put  them  back  on  top  of  the  pile, 
thereby  changing  the  permutation  from  Oj  . . . an  to  ak  . . . aiafc+1  ...an.  Continue  until 
the  top  card  is  1.  For  example,  the  7-step  sequence 

31452  -X  41352  -x  53142  -x  24135  ->•  42135  ->  31245  ->•  21345  ->  12345 

might  occur  when  n = 5.  What  is  the  longest  sequence  possible  when  n = 13? 

108.  [M27]  If  the  longest  n-card  game  of  topswops  has  length  f(n),  prove  that  f(n ) < 

f’ji+i  — 1. 

109.  [Mf7]  Find  good  upper  and  lower  bounds  on  the  topswops  function  f(n). 

► 110.  [25]  Find  all  permutations  a0  . . . ag  of  {0, . . . , 9}  such  that 

{ao,a2,a3,a7}  = {2, 5, 7, 8},  {ai,a4,o5}  = (0,3,6), 

{ai,a3,a7,a8}  = (3, 4, 5, 7),  {o0,a3,a4}  = (0,7,8). 

Also  suggest  an  algorithm  for  solving  large  problems  of  this  type. 

►111.  [M25]  Several  permutation-oriented  analogs  of  de  Brui.jn  cycles  have  been  pro- 
posed. The  simplest  and  nicest  of  these  is  the  notion  of  a universal  cycle  of  permuta- 
tions, introduced  by  B.  W.  Jackson  in  Discrete  Math.  117  (1993),  141-150,  namely  a 
cycle  of  n!  digits  such  that  each  permutation  of  (1, ... , n)  occurs  exactly  once  as  a block 
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of  n — 1 consecutive  digits  (with  its  redundant  final  element  suppressed).  For  example, 
(121323)  is  a universal  cycle  of  permutations  for  n = 3,  and  it  is  essentially  unique. 

Prove  that  universal  cycles  of  permutations  exist  for  all  n > 2.  What  is  the 
lexicographically  smallest  one  when  n = 4? 

► 112.  [M30]  (A.  Williams,  2007.)  Continuing  exercise  111,  construct  explicit  cycles: 

a)  Show  that  a universal  cycle  of  permutations  is  equivalent  to  a Hamiltonian  cycle 
on  the  Cayley  graph  with  two  generators  p = (1  2 ...  n— 1)  and  a = (1  2 ...  n). 

b)  Prove  that  any  Hamiltonian  path  in  that  graph  is  actually  a Hamiltonian  cycle. 

c)  Find  such  a path  of  the  form  cr2  pn~3ax  . . . cr2pn_3a(„_1)i,  ctj  G {p,  cr} , for  n > 3. 
113.  [HM43]  Exactly  how  many  universal  cycles  exist,  for  permutations  of  < 9 objects? 

7. 2. 1.3.  Generating  all  combinations.  Combinatorial  mathematics  is  often 
described  as  “the  study  of  permutations,  combinations,  etc.,”  so  we  turn  our 
attention  now  to  combinations.  A combination  of  n things,  taken  t at  a time, 
often  called  simply  a t-combination  of  n things,  is  a way  to  select  a subset  of  size  t 
from  a given  set  of  size  n.  We  know  from  Eq.  1.2. 6 (2)  that  there  are  exactly  (") 
ways  to  do  this;  and  we  learned  in  Section  3.4.2  how  to  choose  (-combinations 
at  random. 

Selecting  t of  n objects  is  equivalent  to  choosing  the  n — t elements  not 
selected.  We  will  emphasize  this  symmetry  by  letting 

n — s + t (x) 

throughout  our  discussion,  and  we  will  often  refer  to  a f-combination  of  n things 
as  an  “(s,  f)-combination.”  Thus,  an  (s,  t)-combination  is  a way  to  subdivide 
s + t objects  into  two  collections  of  sizes  s and  t. 

If  I ask  how  many  combinations  of  21  can  be  taken  out  of  25, 
I do  in  effect  ask  how  many  combinations  of  4 may  be  taken. 
For  there  are  just  as  many  ways  of  taking  21  as  there  are  of  leaving  4. 
— AUGUSTUS  DE  MORGAN,  An  Essay  on  Probabilities  (1838) 

There  are  two  main  ways  to  represent  (s,  ^-combinations:  We  can  list  the 
elements  ct . . . C2C1  that  have  been  selected,  or  we  can  work  with  binary  strings 
an_x . . . aiao  for  which 

a„_i  + •••-)-  ax  + ao  = t.  (2) 

The  string  representation  has  s Os  and  t Is,  corresponding  to  elements  that  are 
unselected  or  selected.  The  list  representation  c( . . . C2C1  tends  to  work  out  best 
if  we  let  the  elements  be  members  of  the  set  {0, 1, . . . , n — 1}  and  if  we  list  them 
in  decreasing  order: 

n > ct  > ■ ■ ■ > c2  > ci  > 0.  (3) 

Binary  notation  connects  these  two  representations  nicely,  because  the  item  list 
ct . . . C2C1  corresponds  to  the  sum 

n— 1 

2C‘  + • ■ • + 2C ~ + 2Cl  = afc 2k  — (an_i . . . aiao)2-  (4) 

fc= 0 
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Of  course  we  could  also  list  the  positions  bs  . . . b2bx  of  the  Os  in  an_!  . . . aia0, 
where 

n > bs  > ■ ■ • > b2  > b±  > 0.  (5) 

Combinations  are  important  not  only  because  subsets  are  omnipresent  in 
mathematics  but  also  because  they  are  equivalent  to  many  other  configurations. 
For  example,  every  (s,  ^-combination  corresponds  to  a combination  of  s + 1 
things  taken  t at  a time  with  repetitions  permitted , also  called  a multicombination 
°f  s + 1 things,  namely  a sequence  of  integers  dt . . . d2di  with 

s > dt  > ■ ■ ■ > d2  > di  > 0.  (6) 

One  reason  is  that  dt . . . d2dx  solves  (6)  if  and  only  if  ct . . . c2c1  solves  (3),  where 

ct  = dt+t  — 1,  ...,  c2  = d2  + 1,  ci  = di  (7) 

(see  exercise  1.2.6-60).  And  there  is  another  useful  way  to  relate  combinations 
with  repetition  to  ordinary  combinations,  suggested  by  Solomon  Golomb  fAMM 
75  (1968),  530-531],  namely  to  define 


= f<*.  ^ ^ 

l eCj-s,  if  Cj 


< s\ 
> s. 


(8) 


In  this  form  the  numbers  et . . . ex  don’t  necessarily  appear  in  descending  or- 
der, but  the  multiset  {elte2, . . . ,et}  is  equal  to  {cu  c2, . . . , ct}  if  and  only  if 
\ei)  e2>  ■ ■ • , } is  a set.  (See  Table  1 and  exercise  1.) 

An  (s,  ^(-combination  is  also  equivalent  to  a composition  of  n + 1 into  t + 1 
parts,  namely  an  ordered  sum 


n + 1 — pH  l-pi+po,  where  pt, ...  ,pi,p0  > 1.  (9) 

The  connection  with  (3)  is  now 

Pt  — n~ct,  Pt-i  = ct  - Ct_j,  ...,  pi=c2-ci,  p0  = Ci  + 1.  (10) 

Equivalently,  if  q,  = Pj  - 1,  we  have 


s - qt  + ---  + qi+q0,  where  qt,...,qi,  q0  > 0,  (11) 

a composition  of  s into  t + 1 nonnegative  parts,  related  to  (6)  by  setting 

qt  = s-dt,  qt-i  = dt  - dt-X,  ...,  qi  = d2  - dx,  q0  = dx.  (12) 

Furthermore  it  is  easy  to  see  that  an  (s,  ^(-combination  is  equivalent  to  a 

path  of  length  s + t from  corner  to  corner  of  an  .s  x t grid,  because  such  a path 
contains  s vertical  steps  and  t horizontal  steps. 

Thus,  combinations  can  be  studied  in  at  least  eight  different  guises.  Table  1 
illustrates  all  (®)  = 20  possibilities  in  the  case  s = t = 3. 

These  cousins  of  combinations  might  seem  rather  bewildering  at  first  glance, 
but  most  of  them  can  be  understood  directly  from  the  binary  representation 
an - 1 • • • aia0.  Consider,  for  example,  the  “random”  bit  string 

a23  . . . aiao  = 011001001000011111101101,  (13) 
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Table  1 

THE  (3, 3)-COMBINATIONS  AND  THEIR  EQUIVALENTS 


05040302^1^0 

636261 

C3C2C1 

e3e2ei 

P3P2P1P0 

93929190 

path 

000111 

543 

210 

000 

210 

4111 

3000 

E 

001011 

542 

310 

100 

310 

3211 

2100 

o 

001101 

541 

320 

110 

320 

3121 

2010 

0 

001110 

540 

321 

111 

321 

3112 

2001 

0 

010011 

532 

410 

200 

010 

2311 

1200 

0 

010101 

531 

420 

210 

020 

2221 

1110 

0 

010110 

530 

421 

211 

121 

2212 

1101 

0 

011001 

521 

430 

220 

030 

2131 

1020 

0 

011010 

520 

431 

221 

131 

2122 

1011 

0 

011100 

510 

432 

222 

232 

2113 

1002 

0 

100011 

432 

510 

300 

110 

1411 

0300 

m 

100101 

431 

520 

310 

220 

1321 

0210 

0 

100110 

430 

521 

311 

221 

1312 

0201 

0 

101001 

421 

530 

320 

330 

1231 

0120 

0 

101010 

420 

531 

321 

331 

1222 

0111 

0 

101100 

410 

532 

322 

332 

1213 

0102 

0 

110001 

321 

540 

330 

000 

1141 

0030 

0 

110010 

320 

541 

331 

111 

1132 

0021 

0 

110100 

310 

542 

332 

222 

1123 

0012 

0 

111000 

210 

543 

333 

333 

1114 

0003 

0 

which  has  s = 

11  zeros 

and  t = 

13  ones, 

hence  n 

= 24.  The  dual  combination 

bs . . . bi  lists  the  positions  of  the  zeros,  namely 


23  20  19  17  16  14  13  12  11  4 1, 

because  the  leftmost  position  is  n — 1 and  the  rightmost  is  0.  The  primal 
combination  ct . . .c\  lists  the  positions  of  the  ones,  namely 

22  21  18  15  10  9 8 7 6 5 3 2 0. 

The  corresponding  multicombination  dt . . . d\  lists  the  number  of  Os  to  the  right 
of  each  1: 

10  10  8622222211  0. 

The  composition  pt  ■ ■ - Po  lists  the  distances  between  consecutive  Is,  if  we  imagine 
additional  Is  at  the  left  and  the  right: 

2133511111212  1. 

And  the  nonnegative  composition  qt  ■ ■ q o counts  how  many  0s  appear  between 
“fenceposts”  represented  by  Is: 

1022400000101  0; 

thus  we  have 

a„_1...aia0  = 0"*  lO9'"1 1 . . . 109l109°.  (14) 

The  paths  in  Table  1 also  have  a simple  interpretation  (see  exercise  2). 
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Lexicographic  generation.  Table  1 shows  combinations  a„_1...a1a0  and 
ct...ci  in  lexicographic  order,  which  is  also  the  lexicographic  order  of  dt...dr. 
Notice  that  the  dual  combinations  bs  . . . b\  and  the  corresponding  compositions 
Pt  ■ ■ - Poi  It  ■ ■ ■ qo  then  appear  in  reverse  lexicographic  order. 

Lexicographic  order  usually  suggests  the  most  convenient  way  to  generate 
combinatorial  configurations.  Indeed,  Algorithm  7.2.1.2L  already  solves  the 
problem  for  combinations  in  the  form  an_j...aiao,  since  (s, f)-combinations 
m bitstring  form  are  the  same  as  permutations  of  the  multiset  {s  ■ 0,  t ■ 1}.  That 
general-purpose  algorithm  can  be  streamlined  in  obvious  ways  when  it  is  applied 
to  this  special  case.  (See  also  exercise  7.1.3-20,  which  presents  a remarkable 
sequence  of  seven  bitwise  operations  that  will  convert  any  given  binary  number 
(an_i . . .aia0)2  to  the  lexicographically  next  f-combination,  assuming  that  n 
does  not  exceed  the  computer’s  word  length.) 

Let’s  focus,  however,  on  generating  combinations  in  the  other  principal  form 
ct . . . c2ci,  which  is  more  directly  relevant  to  the  ways  in  which  combinations  are 
often  needed,  and  which  is  more  compact  than  the  bit  strings  when  t is  small 
compared  to  n.  In  the  first  place  we  should  keep  in  mind  that  a simple  sequence 
of  nested  loops  will  do  the  job  nicely  when  t is  very  small.  For  example,  when 
t = 3 the  following  instructions  suffice: 

For  c3  = 2,  3,  . . . , n — 1 (in  this  order)  do  the  following: 

For  c2  = 1,  2,  . . . , c3  - 1 (in  this  order)  do  the  following: 

For  Ci  = 0,  1,  . . .,  c2  - 1 (in  this  order)  do  the  following: 

Visit  the  combination  c3c2ci. 

(See  the  analogous  situation  in  7.2.1.1— (3).) 

On  the  other  hand  when  t is  variable  or  not  so  small,  we  can  generate 
combinations  lexicographically  by  following  the  general  recipe  discussed  after 
Algorithm  7.2.1.2L,  namely  to  find  the  rightmost  element  cj  that  can  be  increased 
and  then  to  set  the  subsequent  elements  cj_1  . . . a to  their  smallest  possible 
values: 

Algorithm  L ( Lexicographic  combinations).  This  algorithm  generates  all  t- 
combinations  ct . . . c2cj  of  the  n numbers  {0, 1, . . . , n — 1},  given  n > t > 0. 
Additional  variables  Cf+i  and  Cj_|_2  are  used  as  sentinels. 

LI.  [Initialize.]  Set  c3  <-  j - 1 for  1 < j < t;  also  set  ct+1  n and  ct+2  «-  0. 

L2.  [Visit.]  Visit  the  combination  ct . . . c2cj. 

L3.  [Find  j.]  Set  j -f-  1.  Then,  while  c3  + 1 = cj+1,  set  c3  <-  j - 1 and  j 4-  j + 1; 

eventually  the  condition  cj  T 1 ^ cj+i  will  occur. 

L4.  [Done?]  Terminate  the  algorithm  if  j > t. 

L5.  [Increase  cj .]  Set  c3  <—  cj  + 1 and  return  to  L2.  | 

The  running  time  of  this  algorithm  is  not  difficult  to  analyze.  Step  L3  sets 
cj  3 ~ 1 just  after  visiting  a combination  for  which  cJ+1  = a + j,  and  the 
number  of  such  combinations  is  the  number  of  solutions  to  the  inequalities 

n>  ct>  ■■■>  Cj+1  > j; 


(16) 
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but  this  formula  is  equivalent  to  a (t  — j)-combination  of  the  n — j objects 
{n- 1, . . . , j},  so  the  assignment  Cj  4-  j — 1 occurs  exactly  ("~j)  times.  Summing 
for  1 < j < t tells  us  that  the  loop  in  step  L3  is  performed 

U)  <-> 

times  altogether,  or  an  average  of 

/n\//n\_  n!  j n!  t 

\a  + l)/\t)  ~ (s  + l)!(t-  1)!  / Iff!  “ a + 1 (l8) 

times  per  visit.  This  ratio  is  less  than  1 when  t < s,  so  Algorithm  L is  quite 
efficient  in  such  cases. 

But  the  quantity  t/(s  + 1)  can  be  embarrassingly  large  when  t is  near  n 
and  s is  small.  Indeed,  Algorithm  L occasionally  sets  Cj  4—  j — 1 needlessly,  at 
times  when  Cj  already  equals  j — 1.  Further  scrutiny  reveals  that  we  need  not 
always  search  for  the  index  j that  is  needed  in  steps  L4  and  L5,  since  the  correct 
value  of  j can  often  be  predicted  from  the  actions  just  taken.  For  example, 
after  we  have  increased  c4  and  reset  c3c2cx  to  their  starting  values  210,  the  next 
combination  will  inevitably  increase  c3.  These  observations  lead  to  a tuned- up 
version  of  the  algorithm: 

Algorithm  T (Lexicographic  combinations) . This  algorithm  is  like  Algorithm  L, 
but  faster.  It  also  assumes,  for  convenience,  that  t < n. 

Tl.  [Initialize.]  Set  Cj  4—  j — 1 for  1 < j < t;  then  set  ct+ 1 4-  n,  ct+2  4—  0,  and 
j 4-  t. 

T2.  [Visit.]  (At  this  point  j is  the  smallest  index  such  that  cJ+1  > j.)  Visit  the 
combination  ct . . . c2ci.  Then,  if  j > 0,  set  x 4—  j and  go  to  step  T6. 

T3.  [Easy  case?]  If  cx  + 1 < c2,  set  Cj  4—  Ci  + 1 and  return  to  T2.  Otherwise  set 
3 2- 

T4.  [Find  j.}  Set  c,-_i  4-  j - 2 and  x 4-  cj  + 1.  If  x — Cj+ x,  set  j 4-  j + 1 and 
repeat  step  T4. 

T5.  [Done?]  Terminate  the  algorithm  if  j > t. 

T6.  [Increase  c,-.]  Set  Cj  4—  x,  j 4—  j — 1,  and  return  to  T2.  | 

Now  j = 0 in  step  T2  if  and  only  if  cx  > 0,  so  the  assignments  in  step  T4  are 
never  redundant.  Exercise  6 carries  out  a complete  analysis  of  Algorithm  T. 

Notice  that  the  parameter  n appears  only  in  the  initialization  steps  LI 
and  Tl,  not  in  the  principal  parts  of  Algorithms  L and  T.  Thus  we  can  think 
of  the  process  as  generating  the  first  (")  combinations  of  an  infinite  list,  which 
depends  only  on  t.  This  simplification  arises  because  the  list  of  f-combinations 
for  n + 1 things  begins  with  the  list  for  n things,  under  our  conventions;  we  have 
been  using  lexicographic  order  on  the  decreasing  sequences  ct . . . cx  for  this  very 
reason,  instead  of  working  with  the  increasing  sequences  cx . . . ct. 

Derrick  Lehmer  noticed  another  pleasant  property  of  Algorithms  L and  T 
[Applied  Combinatorial  Mathematics , edited  by  E.  F.  Beckenbach  (1964),  27-30]: 
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Theorem  L.  The  combination  ct . . . C2C1  is  visited  after  exactly 

(?) + " + (?)  + (?) 

other  combinations  have  been  visited. 


Proof.  There  are  (?)  combinations  c't . . . with  c'  = Cj  for  t>j>k  and 
ck  < ck  1 namely  ct . . . c*,+1  followed  by  the  ^-combinations  of  (0, . . . , c*  — 1}.  | 

When  t = 3,  for  example,  the  numbers 


©+(!)+(?).  g) +©+(?),  ©+©+(?) (!) + (3)  + (f) 

that  correspond  to  the  combinations  c3c2Cl  in  Table  1 simply  run  through  the 
sequence  0,  1,  2,  ...,  19.  Theorem  L gives  us  a nice  way  to  understand  the 
combinatorial  number  system  of  degree  t,  which  represents  every  nonnegative 
integer  N uniquely  in  the  form 


N~  ( /)  +'“+  (T)  + (7)’  nt  > ■ • ■ > n-2  > ni  > 0.  (20) 

[See  Ernesto  Pascal,  Giornale  di  Matematiche  25  (1887),  45-49.] 

Binomial  trees.  The  family  of  trees  T„  defined  by 


arises  in  several  important  contexts  and  sheds  further  light  on  combination 
generation.  For  example,  T4  is 


and  T5,  rendered  more  artistically,  appears  as  the  frontispiece  to  Volume  1 of 
this  series  of  books. 

Notice  that  T„  is  like  Tn_!,  except  for  an  additional  copy  of  T„_x;  therefore 
T„  has  2”  nodes  altogether.  Furthermore,  the  number  of  nodes  on  level  t is  the 
binomial  coefficient  (”);  this  fact  accounts  for  the  name  “binomial  tree.”  Indeed, 
the  sequence  of  labels  encountered  on  the  path  from  the  root  to  each  node  on 
evel t defines  a combination  ct...cx,  and  all  combinations  occur  in  lexicographic 
order  from  left  to  right.  Thus,  Algorithms  L and  T can  be  regarded  as  procedures 
to  traverse  the  nodes  on  level  t of  the  binomial  tree  Tn. 
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The  infinite  binomial  tree  T. is  obtained  by  letting  n — » oo  in  (21).  The  root 
of  this  tree  has  infinitely  many  branches,  but  every  node  except  for  the  overall 
root  at  level  0 is  the  root  of  a finite  binomial  subtree.  All  possible  t-combinations 
appear  in  lexicographic  order  on  level  t of  . 

Let’s  get  more  familiar  with  binomial  trees  by  considering  all  possible  ways 
to  pack  a rucksack.  More  precisely,  suppose  we  have  n items  that  take  up 
respectively  w„_i,  . . . , tvl , «;0  units  of  capacity,  where 

wn- 1 > • • • > u>i  > w0  > 0;  (23) 

we  want  to  generate  all  binary  vectors  an_i . . . dido  such  that 

a - w = an- \wn-i  H h aiWi  + a0w 0 < N,  (24) 

where  N is  the  total  capacity  of  a rucksack.  Equivalently,  we  want  to  find  all 
subsets  C of  {0, 1, . . . , n — 1}  such  that  w(C)  = Ylcec  wc  — such  subsets  will 
be  called  feasible.  We  will  write  a feasible  subset  as  C\ . . . ct , where  C\  > • • • > 
ct  > 0,  numbering  the  subscripts  differently  from  the  convention  of  (3)  above 
because  t is  variable  in  this  problem. 

Every  feasible  subset  corresponds  to  a node  of  Tn,  and  our  goal  is  to  visit 
each  feasible  node.  Clearly  the  parent  of  every  feasible  node  is  feasible,  and  so  is 
the  left  sibling,  if  any;  therefore  a simple  tree  exploration  procedure  works  well: 

Algorithm  F ( Filling  a rucksack).  This  algorithm  generates  all  feasible  ways 
Ci ...  ct  to  fill  a rucksack,  given  wn-i,  . . ■ , «'i,  w 0,  and  N . We  let  Sj  — wj  — Wj _ 1 
for  1 < j < n. 

FI.  [Initialize.]  Set  t <—  0,  Co  <—  n,  and  r <—  N. 

F2.  [Visit.]  Visit  the  combination  Ci . . . ct,  which  uses  N — r units  of  capacity. 
F3.  [Try  to  add  w0-]  If  ct  > 0 and  r > w0,  set  t <—  t + 1,  ct  t-  0,  r t—  r — wq, 
and  return  to  F2. 

F4.  [Try  to  increase  cj.]  Terminate  if  t = 0.  Otherwise,  if  ct_i  > ct  + 1 and 
r > <^ce+i)  set  ct  <—  ct  + 1,  r 4—  r — 6Ct,  and  return  to  F2. 

F5.  [Remove  ct.}  Set  r t—  r + wCt , t <—  t — 1,  and  return  to  F4.  | 

Notice  that  the  algorithm  implicitly  visits  nodes  of  Tn  in  preorder,  skipping  over 
unfeasible  subtrees.  A11  element  c > 0 is  placed  in  the  rucksack,  if  it  fits,  just 
after  the  procedure  has  explored  all  possibilities  using  element  c — 1 in  its  place. 
The  running  time  is  proportional  to  the  number  of  feasible  combinations  visited 
(see  exercise  20). 

Incidentally,  the  classical  “knapsack  problem”  of  operations  research  is  dif- 
ferent: It  asks  for  a feasible  subset  C such  that  v(C)  = ^2ceC  v(c)  is  maximum, 
where  each  item  c has  been  assigned  a value  v{c).  Algorithm  F is  not  a particu- 
larly good  way  to  solve  that  problem,  because  it  often  considers  cases  that  could 
be  ruled  out.  For  example,  if  C and  C'  are  subsets  of  {1, . . . , n — 1}  with  w(C)  < 
w(C')  < N — wq  and  v(C)  > v(C'),  Algorithm  F will  examine  both  C U 0 and 
C'  U 0,  but  the  latter  subset  will  never  improve  the  maximum.  We  will  consider 
methods  for  the  classical  knapsack  problem  later;  Algorithm  F is  intended  only 
for  situations  when  all  of  the  feasible  possibilities  are  potentially  relevant. 
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Gray  codes  for  combinations.  Instead  of  merely  generating  all  combinations, 
we  often  prefer  to  visit  them  in  such  a way  that  each  one  is  obtained  by  making 
only  a small  change  to  its  predecessor. 

For  example,  we  can  ask  for  what  Nijenhuis  and 
Wilf  have  called  a “revolving  door  algorithm” : Imagine 
two  rooms  that  contain  respectively  s and  t people,  with 
a revolving  door  between  them.  Whenever  a person 
goes  into  the  opposite  room,  somebody  else  comes  out.  Can  we  devise  a sequence 
of  moves  so  that  each  (s,  ^-combination  occurs  exactly  once? 

The  answer  is  yes,  and  in  fact  a huge  number  of  such  patterns  exist.  For 
example,  it  turns  out  that  if  we  examine  all  n-bit  strings  an_i . . . ciiao  in  the 
well-known  order  of  Gray  binary  code  (Section  7.2.1. 1),  but  select  only  those 
that  have  exactly  s Os  and  t Is,  the  resulting  strings  form  a revolving-door  code. 

Here’s  the  proof:  Gray  binary  code  is  defined  by  the  recurrence  Tn  = 0rn_! , 
l^n-i  7. 2. 1.1— (5),  so  its  (s,t)  subsequence  satisfies  the  recurrence 


rst 


or 


(s—  l)t  5 


ir 


R 

«(<-!) 


(25) 


when  st  > 0.  We  also  have  rs0  = 0s  and  rot  = 1*.  Therefore  it  is  clear  by 
induction  that  Fst  begins  with  0s  1(  and  ends  with  10s  1*—1  when  st  > 0.  The 
transition  at  the  comma  in  (25)  is  from  the  last  element  of  0r(s_1)t  to  the 
last  element  of  irs(t_j),  namely  from  010s_1lt_1  = 010s_1llt_2  to  1 10s  1*— 2 = 
110s  01‘-2  when  t > 2,  and  this  satisfies  the  revolving-door  constraint.  The 

case  t = 1 also  checks  out.  For  example,  F33  is  given  by  the  columns  of 


000111 

011010 

110001 

101010 

001101 

011100 

110010 

101100 

001110 

010101 

110100 

100101 

001011 

010110 

111000 

100110 

011001 

010011 

101001 

100011 

and  r23  can  be  found  in  the  first  two  columns  of  this  array.  One  more  turn 
of  the  door  takes  the  last  element  into  the  first.  [These  properties  of  Fst  were 
discovered  by  J.  E.  Miller  in  her  Ph.D.  thesis  (Columbia  University,  1971),  then 
independently  by  D.  T.  Tang  and  C.  N.  Liu,  IEEE  Trans.  C-22  (1973),  176-180. 
A loopless  implementation  was  presented  by  J.  R.  Bitner,  G.  Ehrlich,  and  E.  M. 
Reingold,  CACM  19  (1976),  517-521.] 

When  we  convert  the  bit  strings  0.50403(12(11(10  in  (26)  to  the  corresponding 
index-list  forms  c3c2ci,  a striking  pattern  becomes  evident: 


210 

431 

540 

531 

320 

432 

541 

532 

321 

420 

542 

520 

310 

421 

543 

521 

430 

410 

530 

510 

The  first  components  c3  occur  in  nondecreasing  order;  but  for  each  fixed  value 
of  c3,  the  values  of  c2  occur  in  non  increasing  order.  And  for  fixed  c3c2,  the  values 
of  d are  again  nondecreasing.  The  same  is  true  in  general:  All  combinations 
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ct  ■ . . c2ci  appear  in  lexicographic  order  of 

( ct%  ~Ct- 1,  ct- 2,  ...,  (-l)t_1ci)  (28) 

in  the  revolving-door  Gray  code  F st . This  property  follows  by  induction,  because 
(25)  becomes 

rs«  = r(s_1)t,  (s+t- i)rf(t_1}  (29) 

for  st  > 0 when  we  use  index-list  notation  instead  of  bitstring  notation.  Conse- 
quently the  sequence  can  be  generated  efficiently  by  the  following  algorithm  due 
to  W.  H.  Payne  [see  ACM  Trans.  Math.  Software  5 (1979),  163-172]: 

Algorithm  R ( Revolving-door  combinations).  This  algorithm  generates  all  t- 
combinations  ct . . . c?ci  of  {0, 1, ...  ,n  — 1}  in  lexicographic  order  of  the  alter- 
nating sequence  (28),  assuming  that  n > t > 1.  Step  R3  has  two  variants, 
depending  on  whether  t is  even  or  odd. 

Rl.  [Initialize.]  Set  Cj  «—  j - 1 for  t > j > 1,  and  ct+1  n. 

R2.  [Visit.]  Visit  the  combination  ct . . . C2C1. 

R3.  [Easy  case?]  If  t is  odd:  If  cj  + 1 < C2,  increase  c\  by  1 and  return  to  R2, 
otherwise  set  j 4-  2 and  go  to  R4.  If  t is  even:  If  cL  > 0,  decrease  C\  by  1 
and  return  to  R2,  otherwise  set  j 4—  2 and  go  to  R5. 

R4.  [Try  to  decrease  cr)  (At  this  point  cj  = Cj-i  + 1.)  If  Cj  > j,  set  Cj  «-  c_,_ 1, 
Cj_i  <—  j — 2,  and  return  to  R2.  Otherwise  increase  j by  1. 

R5.  [Try  to  increase  cr]  (At  this  point  cj- 1 — j - 2.)  If  Cj  + 1 < cJ+1,  set 
Cj- 1 4—  Cj,  Cj  «—  Cj  + 1,  and  return  to  R2.  Otherwise  increase  j by  1,  and 
go  to  R4  if  j <t.  | 

Exercises  21-25  explore  further  properties  of  this  interesting  sequence.  One  of 
them  is  a nice  companion  to  Theorem  L:  The  combination  ctct-i  . . . c2Ci  is  visited 
by  Algorithm  R after  exactly 

N = (T)  ■ (' YD +-"+(_i)t(C2D  -(-i}t(cli+1)  -i*  °dd]  (30) 

other  combinations  have  been  visited.  We  may  call  this  the  representation  of  N 
in  the  “alternating  combinatorial  number  system”  of  degree  t;  one  consequence, 
for  example,  is  that  every  positive  integer  has  a unique  representation  of  the 
form  N — (3)  — (2)  + (j)  with  a > b > c > 0.  Algorithm  R tells  us  how  to  add  1 
to  N in  this  system. 

Although  the  strings  of  (26)  and  (27)  are  not  in  lexicographic  order,  they 
are  examples  of  a more  general  concept  called  genlex  order,  a name  coined  by 
Timothy  Walsh.  A sequence  of  strings  aq,  . . . , ajy  is  said  to  be  in  genlex  order 
when  all  strings  with  a common  prefix  occur  consecutively.  For  example,  all 
3-combinations  that  begin  with  53  appear  together  in  (27). 

Genlex  order  means  that  the  strings  can  be  arranged  in  a trie  structure,  as 
in  Fig.  31  of  Section  6.3,  but  with  the  children  of  each  node  ordered  arbitrarily. 
When  a trie  is  traversed  in  any  order  such  that  each  node  is  visited  just  before  or 
just  after  its  descendants,  all  nodes  with  a common  prefix  — that  is,  all  nodes  of 
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a subtiie  appear  consecutively.  This  principle  corresponds  to  recursive  genera- 
tion schemes,  so  it  makes  genlex  order  convenient.  Many  of  the  algorithms  we’ve 
seen  for  generating  n-tuples  have  therefore  produced  their  results  in  some  version 
of  genlex  order;  similarly,  the  method  of  “plain  changes”  (Algorithm  7.2.1.2P) 
visits  permutations  in  a genlex  order  of  the  corresponding  inversion  tables. 

The  revolving-door  method  of  Algorithm  R is  a genlex  routine  that  changes 
only  one  element  of  the  combination  at  each  step.  But  it  isn’t  totally  satisfactory, 
because  it  frequently  must  change  two  of  the  indices  cj  simultaneously,  in  order 
to  preserve  the  condition  ct  > ■ ■ ■ > c2  > ci.  For  example,  Algorithm  R changes 
210  into  320,  and  (27)  includes  nine  such  “crossing”  moves. 

The  source  of  this  defect  can  be  traced  to  our  proof  that  (25)  satisfies  the 
revolving-door  property:  We  observed  that  the  string  010s_1lR-2  is  followed 
by  110®-101t-2  when  t > 2.  Hence  the  recursive  construction  Fst  involves 
transitions  of  the  form  110a0  -H-  010al,  when  a substring  like  11000  is  changed 
to  01001  or  vice  versa;  the  two  Is  cross  each  other. 

A Gray  path  for  combinations  is  said  to  be  homogeneous  if  it  changes  only 
one  of  the  indices  c,  at  each  step.  A homogeneous  scheme  is  characterized 
in  bitstring  form  by  having  only  transitions  of  the  forms  10“  h 0“1  within 
strings,  for  a > 1,  when  we  pass  from  one  string 
to  the  next.  With  a homogeneous  scheme  we  can, 
for  example,  play  all  /-note  chords  on  an  n-note 
keyboard  by  moving  only  one  finger  at  a time. 

A slight  modification  of  (25)  yields  a genlex 
scheme  for  (s,  ^-combinations  that  is  pleasantly 
homogeneous.  The  basic  idea  is  to  construct  a 
sequence  that  begins  with  0aR  and  ends  with  R0S,  and  the  following  recursion 
suggests  itself  almost  immediately:  Let  Ks0  = 0s,  Kot  = R,  Ks(_1}  = 0,  and 


Kst  0R'(s_1)t,  llKs{t_2)  for  st  > 0.  (31) 


At  the  commas  of  this  sequence  we  have  0R0S  1 followed  by  10R_10S_1 
10  1 followed  by  110' R 2;  both  of  these  transitions  are  homogeneous 
though  the  second  one  requires  the  1 to  jump  across  s 0s.  The  combinations 
for  s = t — 3 are 


and 
, al- 

^33 


000111 

010101 

101100 

100011 

001011 

010011 

101001 

110001 

001101 

011001 

101010 

110010 

001110 

011010 

100110 

110100 

010110 

011100 

100101 

111000 

m bitstring  form,  and  the  corresponding  “finger  patterns”  are 


210 

420 

532 

510 

310 

410 

530 

540 

320 

430 

531 

541 

321 

431 

521 

542 

421 

432 

520 

543. 

(32) 


(33) 
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When  a homogeneous  scheme  for  ordinary  combinations  ct . . . C\  is  converted 
to  the  corresponding  scheme  (6)  for  combinations  with  repetitions  dt...di,  it 
retains  the  property  that  only  one  of  the  indices  dj  changes  at  each  step.  And 
when  it  is  converted  to  the  corresponding  schemes  (9)  or  (11)  for  compositions 
Pt  ■ ■ -Po  or  qt . . . q0,  only  two  (adjacent)  parts  change  when  cj  changes. 

Near-perfect  schemes.  But  we  can  do  even  better!  All  (s,  ^-combinations 
can  be  generated  by  a sequence  of  strongly  homogeneous  transitions  that  are 
either  01  10  or  001  o 100.  In  other  words,  we  can  insist  that  each  step  causes 

a single  index  cj  to  change  by  at  most  2.  Let’s  call  such  generation  schemes 
near-perfect. 

Imposing  such  strong  conditions  actually  makes  it  fairly  easy  to  discover 
near-perfect  schemes,  because  comparatively  few  choices  are  available.  Indeed, 
if  we  restrict  ourselves  to  genlex  methods  that  are  near-perfect  on  n-bit  strings, 
T.  A.  Jenkyns  and  D.  McCarthy  observed  that  all  such  methods  can  be  easily 
characterized  [Ars  Combinatoria  40  (1995),  153-159]: 

Theorem  N.  If  st  > 0,  there  are  exactly  2s  near-perfect  ways  to  list  all  ( s,t )- 
combinations  in  a genlex  order.  I11  fact,  when  1 < a < s,  there  is  exactly  one 
such  listing,  Nsta,  that  begins  with  l‘0s  and  ends  with  0alt0s_a;  the  other  s 
possibilities  are  the  reverse  lists,  Nfta . 

Proof.  The  result  certainly  holds  when  s — t = 1;  otherwise  we  use  induction  on 
s+t.  The  listing  Nsta,  if  it  exists,  must  have  the  form  lAs(t_1),  for  some 

near-perfect  genlex  listings  Xs(t_!)  and  Y(s_1)t.  If  t = 1,  is  the  single 

string  0s;  hence  Y(s_i)(  must  be  A^(s^i)i(a-i)  if  a > 1,  and  it  must  be 
if  a — 1-  On  the  other  hand  if  t > 1,  the  near-perfect  condition  implies  that  the 
last  string  of  cannot  begin  with  1;  hence  A^^  = for  some  b. 

If  a > 1,  Y(b  -1  )t  must  be  hence  b must  be  1;  similarly,  b must  be  1 

if  s = 1.  Otherwise  we  have  a = 1 < s,  and  this  forces  Y(s_1)t  = for 

some  c.  The  transition  from  10fcY-10s~,>  to  oc+1ltOs_1_c  is  near-perfect  only  if 
c = 1 and  b = 2.  | 


( 1)1  •>  0 N(s 


N„ta  = < 


IN. 


-(1—1)2?  07V*_i)ti, 


The  proof  of  Theorem  N yields  the  following  recursive  formulas  when  st  > 0: 

-i)t(a— i)?  if  1 < a < s; 

if  1 = a < s;  (34) 

G(t- 1)1?  ui'?  if  1 = a = s. 

Also,  of  course,  Nsoa  = 0s. 

Let  us  set  Ast  = Arstl  and  Bst  — Nst2-  These  near-perfect  listings,  discovered 
by  Phillip  J.  Chase  in  1976,  have  the  net  effect  of  shifting  a leftmost  block  of  Is 
to  the  right  by  one  or  two  positions,  respectively,  and  they  satisfy  the  following 
mutual  recursions: 


Ast  - 1 0A^_1)t;  Bst  = l-4s(t-i),  0A(s_1)t.  (35) 

"To  take  one  step  forward,  take  two  steps  forward,  then  one  step  backward;  to 
take  two  steps  forward,  take  one  step  forward,  then  another.”  These  equations 
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Table  2 


CHASE’S  SEQUENCES  FOR  (3, 3)-COMBINATIONS 

r? 


A33 

_ pR 
~ °33 

B33 

— C33 

543 

531 

321 

420 

543 

520 

432 

541 

530 

320 

421 

542 

510 

430 

540 

510 

310 

431 

540 

530 

431 

542 

520 

210 

430 

541 

531 

421 

532 

521 

410 

432 

521 

532 

420 

410 

210 

310 

320 

321 


hold  for  all  integer  values  of  s and  t,  if  we  define  Aat  and  Bst  to  be  0 when  s or 
t is  negative,  except  that  AM)  = Zl00  = e (the  empty  string).  Thus  Ast  actually 
takes  min(s,  1)  forward  steps,  and  Bst  actually  takes  min(s,2).  For  example, 
Table  2 shows  the  relevant  listings  for  s = t = 3,  using  an  equivalent  index-list 
form  c3c2ci  instead  of  the  bit  strings  a5a4a3a2a1ao. 

Chase  noticed  that  a computer  implementation  of  these  sequences  becomes 
simpler  if  we  define 


c-{£ 

if  s + f is  odd;  ~ _ f A*,  if  a + t is  even; 

if  s + f is  even;  st  \ if  a + t is  odd. 

(36) 

[See  Congressus  Numerantium  69  (1989),  215-242.]  Then  we  have 

Cat  = \ 

C lCs(t-i),  0C(s-i)t>  if  s + t is  odd; 

[ lCs(t-i)>  0C(a_1)t,  if  s + f is  even; 

(37) 

Cst  = | 

f 0C(._i)t,  lCsp^j),  if  s + t is  even; 

1 0C(s_ip,  lCs(t_X),  if  s + t.  is  odd. 

(38) 

When  bit  a}  is  ready  to  change,  we  can  tell  where  we  are  in  the  recursion  by 
testing  whether  j is  even  or  odd. 

Indeed,  the  sequence  Cst  can  be  generated  by  a surprisingly  simple  algo- 
rithm, based  on  general  ideas  that  apply  to  any  genlex  scheme.  Let  us  say  that 
bit  aj  is  active  in  a genlex  algorithm  if  it  is  supposed  to  change  before  anything  to 
its  left  is  altered.  (In  other  words,  the  node  for  an  active  bit  in  the  corresponding 
trie  is  not  the  rightmost  child  of  its  parent.)  Suppose  we  have  an  auxiliary  table 
wn  . . . u.’i w0 • where  Wj  = 1 if  and  only  if  either  aj  is  active  or  j < r,  where  r is 
the  least  subscript  such  that  ar  / <z0;  we  also  let  wn  = 1.  Then  the  following 
method  will  find  the  successor  of  an_  t . . . a1ao: 

Set  j 4—  r.  If  Wj  = 0,  set  Wj  4-  1,  j 4—  j + 1,  and  repeat  until 
Wj  — 1.  Terminate  if  j = n ; otherwise  set  w:/  4-  0.  Change  aj 
to  1 — CLj , and  make  any  other  changes  to  cij—i  . . . clq  and  r that  v39) 
apply  to  the  particular  genlex  scheme  being  used. 

The  beauty  of  this  approach  comes  from  the  fact  that  the  loop  is  guaranteed  to 
be  efficient:  We  can  prove  that  the  operation  J <-]  + l will  be  performed  less 
than  once  per  generation  step,  on  the  average  (see  exercise  36). 
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By  analyzing  the  transitions  that  occur  when  bits  change  in  (37)  and  (38), 
we  can  readily  flesh  out  the  remaining  details: 

Algorithm  C ( Chase’s  sequence).  This  algorithm  visits  all  (s, ^-combinations 
an- 1 ■ • • aiao,  where  n = s + t,  in  the  near-perfect  order  of  Chase’s  sequence  Cst. 

Cl.  [Initialize.]  Set  a3  4-  0 for  0 < j < s,  aj  4-  1 for  s < j < n,  and  w3  4-  1 
for  0 < j < n.  If  s > 0,  set  r<-s;  otherwise  set  r 4—  t. 

C2.  [Visit.]  Visit  the  combination  an_!  . . . cqao- 

C3.  [Find  j and  branch.]  Set  j 4-  r.  While  Wj  = 0,  set  wj  4 - 1 and  j 4-  j + 1. 
Terminate  if  j — n\  otherwise  set  Wj  4 — 0 and  make  a four-way  branch:  Go 
to  C4  if  j is  odd  and  aj  7^  0,  to  C5  if  j is  even  and  a3  7^  0,  to  C6  if  j is  even 
and  a3  = 0,  to  C7  if  j is  odd  and  a3  = 0. 

C4.  [Move  right  one.]  Set  aj_1  4-  1,  a3  4-0.  If  r — j and  j > 1,  set  r 4-  j - 1; 

otherwise  if  r = j — 1 set  r 4—  j.  Return  to  C2. 

C5.  [Move  right  two.]  If  a3_2  ± 0,  go  to  C4.  Otherwise  set  a,_2  4-  1,  a3  4-  0. 
If  r — j,  set  r 4—  max(j  - 2, 1);  otherwise  if  r = j - 2,  set  r 4—  j - 1.  Return 
to  C2. 

C6.  [Move  left  one.]  Set  a3  4-  1,  aj^1  4-  0.  If  r = j and  j > 1,  set  r 4-  j - 1; 
otherwise  if  r = j - 1 set  r 4-  j.  Return  to  C2. 

C7.  [Move  left  two.]  If  a3-i  / 0,  go  to  C6.  Otherwise  set  aj  4—  1,  aj_2  4—  0.  If 
r = j - 2,  set  r 4-  j;  otherwise  if  r = j - 1,  set  r 4-  j - 2.  Return  to  C2.  | 

*Analysis  of  Chase’s  sequence.  The  magical  properties  of  Algorithm  C cry 
out  for  further  exploration,  and  a closer  look  turns  out  to  be  quite  instructive. 
Given  a bit  string  an_  1 . . . aido,  let  us  define  an  = 1,  un  = n mod  2,  and 

Uj  = (1  - u3+i)aj+i,  Vj  — (uj  + j)  mod  2,  ujj  — (vj  + aj)  mod  2,  (40) 

for  n > j > 0.  For  example,  we  might  have  n = 26  and 

a25  . . . aiao  = 11001001000011111101101010, 

u25 • ■ • uiu0  = 10100100100001010100100101, 

V2S  ■ ..Vivo  = 00001110001011111110001111, 

W25 . . . uqw0  = 11000111001000000011100101. 

With  these  definitions  we  can  prove  by  induction  that  Vj  = 0 if  and  only  if  bit 
a3  is  being  “controlled”  by  C rather  than  by  C in  the  recursions  (37H38)  that 
generate  un_! . . . aia0,  except  when  aj  is  part  of  the  final  run  of  0s  or  Is  at  the 
right  end.  Therefore  w3  agrees  with  the  value  computed  by  Algorithm  C at  the 
moment  when  an_  1 . . . aiao  is  visited,  for  r < j < n.  These  formulas  can  be  used 
to  determine  exactly  where  a given  combination  appears  in  Chase’s  sequence  (see 
exercise  39). 

If  we  want  to  work  with  the  index-list  form  cj . . . c2ci  instead  of  the  bit 
strings  an_j...aiao,  it  is  convenient  to  change  the  notation  slightly,  writing 
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Ct(n)  for  Cst  and  Ct(n)  for  Cst  when  s + 1 = n.  Then  C0(n)  = C0(n) 
the  recursions  for  t > 0 take  the  form 

= e,  and 

Ct+i(n  + 1)  = | 

f nCt(n),  Ct+1(n), 
[nCt(n),  Ct+i(n), 

if  n is  even; 
if  n is  odd; 

(42) 

Ct+i(n  + 1)  = | 

C Ci+iM,  ndt(n), 
[Ct+i(n),  nCt(n), 

if  n is  odd; 

if  n is  even. 

(43) 

These  new  equations  can  be  expanded  to  tell  us,  for  example,  that 

Ct+i(9)  = 8C7t(8),  6Ct(6),  4Ct(4),  ...,  3Ct(3),  5Ct(5),  7Ct(7); 

Ct+1(8)  = 7Ct(7),  6Ct(6),  4Ct(4),  3Ct(3),  5Ct(5); 

Ct+1(9)=  6C((6),  4Ct(4),  ...,  3Ct(3),  5Ct(5),  7Ct(7),  8Ct(8);  ^ 

Ct+1(8)=  6Ct(6),  4Ct(4),  ...,  3Ct(3),  5Ct(5),  7Ct(7); 

notice  that  the  same  pattern  predominates  in  all  four  sequences.  The  meaning  of 
“•  • •”  in  the  middle  depends  on  the  value  of  t:  We  simply  omit  all  terms  nCt(n) 
and  nCt(n)  where  n < t. 

Except  for  edge  effects  at  the  very  beginning  or  end,  all  of  the  expansions 
in  (44)  are  based  on  the  infinite  progression 

10,  8,  6,  4,  2,  0,  1,  3,  5,  7,  9,  (45) 

which  is  a natural  way  to  arrange  the  nonnegative  integers  into  a doubly  infinite 
sequence.  If  we  omit  all  terms  of  (45)  that  are  < t , given  any  integer  t > 0, 
the  remaining  terms  retain  the  property  that  adjacent  elements  differ  by  either 
1 or  2.  Richard  Stanley  has  suggested  the  name  endo-order  for  this  sequence, 
because  we  can  remember  it  by  thinking  “even  numbers  decreasing,  odd  . . . ” . 
(Notice  that  if  we  retain  only  the  terms  less  than  N and  complement  with  respect 
to  N , endo-order  becomes  organ-pipe  order;  see  exercise  6.1  18.) 

We  could  program  the  recursions  of  (42)  and  (43)  directly,  but  it  is  interest- 
ing to  unwind  them  using  (44),  thus  obtaining  an  iterative  algorithm  analogous 
to  Algorithm  C.  The  result  needs  only  0(t)  memory  locations,  and  it  is  especially 
efficient  when  t is  relatively  small  compared  to  n.  Exercise  45  contains  the  details. 

Near-perfect  multiset  permutations.  Chase’s  sequences  lead  in  a natural 
way  to  an  algorithm  that  will  generate  permutations  of  any  desired  multiset 
{s0  • 0,  si  • 1, . . . , sd  ■ d)  in  a near-perfect  manner,  meaning  that 

i)  every  transition  is  either  aj+iaj  ajaj+1  or  aj+1ajaj_  1 -o-  aJ_1aJaj+ 1; 

ii)  transitions  of  the  second  kind  have  a,j  = min(aj_j,  aj+i). 

Algorithm  C tells  us  how  to  do  this  when  d = 1,  and  we  can  extend  it  to  larger 
values  of  d by  the  following  recursive  construction  [CACM  13  (1970),  368-369 
376]:  Suppose 


c*o,  aq,  . . . , ajv-i 
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is  any  near-perfect  listing  of  the  permutations  of  {sx  ■ 1, . . . , sj  • d}.  Then  Algo- 
rithm C,  with  s = So  and  t = sx  H + Sd,  tells  us  how  to  generate  a listing 

A j = ctj 0s , ...,  0aoy0s“a  (46) 

in  which  all  transitions  are  Ox  xO  or  OOx  o xOO ; the  final  entry  has  a — 1 or  2 
leading  zeros,  depending  on  s and  t.  Therefore  all  transitions  of  the  sequence 

A-o>  A2,  (A-jv-i  or  ^n-i)  (47) 

are  near-perfect;  and  this  list  clearly  contains  all  the  permutations. 

For  example,  the  permutations  of  {0,0,0, 1, 1,2}  generated  in  this  way  are 

211000,  210100,  210001,  210010,  200110,  200101,  200011,  201001,  201010,  201100, 
021100,  021001,  021010,  020110, 020101, 020011, 000211,  002011,  002101,  002110, 
001120,  001102,  001012,  000112,  010012, 010102, 010120,  011020,  011002, 011200, 
101200, 101020, 101002, 100012, 100102, 100120, 110020, 110002, 110200, 112000, 
121000, 120100, 120001, 120010, 100210, 100201, 100021, 102001, 102010, 102100, 
012100,  012001, 012010,  010210,  010201, 010021,  000121,  001021, 001201, 001210. 

*Perfect  schemes.  Why  should  we  settle  for  a near-perfect  generator  like  Csl, 
instead  of  insisting  that  all  transitions  have  the  simplest  possible  form  01  10? 

One  reason  is  that  perfect  schemes  don’t  always  exist.  For  example,  we 
observed  in  7.2.1.2-(2)  that  there  is  no  way  to  generate  all  six  permutations  of 
{1, 1,2,2}  with  adjacent  interchanges;  thus  there  is  no  perfect  scheme  for  (2,2)- 
combinations.  In  fact,  our  chances  of  achieving  perfection  are  only  about  1 in  4: 

Theorem  P.  The  generation  of  all  ( s , t) -combinations  as+t- 1 . . . axoo  by  adja- 
cent interchanges  01  O 10  is  possible  if  and  only  if  s < 1 or  t < 1 or  st  is  odd. 

Proof.  Consider  all  permutations  of  the  multiset  {s  • 0,t  • 1}.  We  learned  in 
exercise  5.1.2-16  that  the  number  nik  of  such  permutations  having  k inversions 
is  the  coefficient  of  zk  in  the  z-nomial  coefficient 

, , s+t  t 

(s t ) = n (i+z+---+zk~i)/i[(i+z+---+zk-1).  (48) 

k=s+ 1 fc= 1 

Every  adjacent  interchange  changes  the  number  of  inversions  by  ±1,  so  a perfect 
generation  scheme  is  possible  only  if  approximately  half  of  all  the  permutations 
have  an  odd  number  of  inversions.  More  precisely,  the  value  of  (s^"t)_1  = 
mo  — mi  + ra2  — • • • must  be  0 or  ±1.  But  exercise  49  shows  that 
/s  + t\  / }(s  + f)/2j  \ . 

{ t J_,  = (,  l«/2J  Jls,‘seTenl-  <«> 

and  this  quantity  exceeds  1 unless  s < 1 or  t.  < 1 or  st  is  odd. 

Conversely,  perfect  schemes  are  easy  with  s < 1 or  t < 1,  and  they  turn 
out  to  be  possible  also  whenever  st  is  odd.  The  first  nontrivial  case  occurs 
for  s = t = 3,  when  there  are  four  essentially  different  solutions;  the  most 
symmetrical  of  these  is 

210  — 310  — 410  — 510  — 520  — 521  — 531  — 532  — 432  — 431  — 

421  — 321  — 320  — 420  — 430  — 530  — 540  — 541  — 542  — 543  (50) 
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(see  exercise  51).  Several  authors  have  constructed  Hamiltonian  paths  in  the 
relevant  graph  for  arbitrary  odd  numbers  s and  t:  for  example,  the  method 
of  Eades,  Hickey,  and  Read  [JACM  31  (1984),  19-29]  makes  an  interesting 
exercise  in  programming  with  recursive  coroutines.  Unfortunately,  however,  none 
of  the  known  constructions  are  sufficiently  simple  to  describe  in  a short  space, 
or  to  implement  with  reasonable  efficiency.  Perfect  combination  generators  have 
therefore  not  yet  proved  to  be  of  practical  importance.  | 

In  summary,  then,  we  have  seen  that  the  study  of  (s,  ^-combinations  leads 
to  many  fascinating  patterns,  some  of  which  are  of  great  practical  importance 
and  some  of  which  are  merely  elegant  and/or  beautiful.  Figure  46  illustrates  the 
principal  options  that  are  available  in  the  case  s = t = 5,  when  (V0)  = 252  combi- 
nations arise.  Lexicographic  order  (Algorithm  L),  the  revolving-door  Gray  code 
(Algorithm  R),  the  homogeneous  scheme  K§ 5 of  (31),  and  Chase’s  near-perfect 
scheme  (Algorithm  C)  are  shown  in  parts  (a),  (b),  (c),  and  (d)  of  the  illustration. 
Part  (e)  shows  the  near-perfect  scheme  that  is  as  close  to  perfection  as  possible 
while  still  being  in  genlex  order  of  the  c array  (see  exercise  34),  while  part  (f)  is 
the  perfect  scheme  of  Eades,  Hickey,  and  Read.  Finally,  Figs.  46(g)  and  46(h) 
are  listings  that  proceed  by  rotating  . . . o,q  <—  . . . a^aj  or  by  swapping 

aj  ao>  akin  to  Algorithms  7.2.1.2C  and  7.2.1.2E  (see  exercises  55  and  56). 

*Combinations  of  a multiset.  If  multisets  can  have  permutations,  they  can 
have  combinations  too.  For  example,  consider  the  multiset  (6,  b,  b , b,  g,  g , g,  r,  r,  r, 
w,  w} , representing  a sack  that  contains  four  blue  balls  and  three  that  are  green, 
three  red,  two  white.  There  are  37  ways  to  choose  five  balls  from  this  sack;  in 
lexicographic  order  (but  descending  in  each  combination)  they  are 

gbbbb , ggbbb , gggbb,  rbbbb,  rgbbb,  rggbb,  rgggb,  rrbbb,  rrgbb , rrggb , 
rrggg,  rrrbb,  rrrgb,  rrrgg,  wbbbb,  wgbbb , wggbb , wgggb,  wrbbb , wrgbb , 
wrggb,  wrggg , wrrbb,  wrrgb , wrrgg,  wrrrb,  wrrrg , wwbbb,  wwgbb , wwggb, 
wwggg , wwrbb,  wwrgb,  wwrgg,  wwrrb , wwrrg , wwrrr.  (31) 

This  fact  might  seem  frivolous  and/or  esoteric,  yet  we  will  see  in  Theorem  W 
below  that  the  lexicographic  generation  of  multiset  combinations  yields  optimal 
solutions  to  significant  combinatorial  problems. 

James  Bernoulli  observed  in  his  Ars  Conjectandi  (1713),  119-123,  that  we 
can  enumerate  such  combinations  by  looking  at  the  coefficient  of  2 5 in  the 
product  (l  + z + z2)(l  + z + z2  + z3)2(l  + z + z2+z3  + z4).  Indeed,  his  observation 
is  easy  to  understand,  because  we  get  all  possible  selections  from  the  sack  if  we 
multiply  out  the  polynomials 

(1  + w + ww)(l  +r  + rr  + rrr){  1 + g + gg  + ggg)(l  + b + bb  + bbb  + bbbb). 

Multiset  combinations  are  also  equivalent  to  bounded  compositions,  namely 
to  compositions  in  which  the  individual  parts  are  bounded.  For  example,  the  37 
multicombinations  listed  in  (51)  correspond  to  37  solutions  of 

5 = r3  + r2  + r1  + r0,  0 < r3  < 2,  0<r2,r!<3,  0 < r0  < 4, 

namely  5 = 0+0+1+4  = 0+0+2+3  = 0+0+3+2  = 0+1+0+4  = • • . = 2+3+0+0. 
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Fig.  46.  Examples 
of  (5,  5)-combinations: 

a)  lexicographic; 

b)  revolving-door; 

c)  homogeneous; 

d)  near-perfect; 

e)  nearer-perfect; 

f)  perfect; 

g)  suffix-rotated; 

h)  right-swapped. 
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Bounded  compositions,  in  turn,  are  special  cases  of  contingency  tables , which 
are  of  great  importance  in  statistics.  And  all  of  these  combinatorial  configura- 
tions can  be  generated  with  Gray-like  codes  as  well  as  in  lexicographic  order. 
Exercises  60-63  explore  some  of  the  basic  ideas  involved. 


Shadows.  Sets  of  combinations  appear  frequently  in  mathematics.  For  example, 
a set  of  2-combinations  (namely  a set  of  pairs)  is  essentially  a graph,  and  a set  of 
t-combmations  for  general  t is  called  a uniform  hypergraph.  If  the  vertices  of  a 
convex  polyhedron  are  perturbed  slightly,  so  that  no  three  are  collinear,  no  four 
he  in  a plane,  and  in  general  no  t + 1 lie  in  a (t  - l)-dimensional  hyperplane, 
the  resulting  (t  - l)-dimensional  faces  are  “simplexes”  whose  vertices  have  great 
significance  in  computer  applications.  Researchers  have  learned  that  such  sets 
of  combinations  have  important  properties  related  to  lexicographic  generation. 

If  a is  any  f-combination  c(...c2c1,  its  shadow  da  is  the  set  of  all  its 

SUbSetS  Cf-1  • • • C2Cl’  • • • . . . . C3C1,  ct...  c3c2.  For  example, 

<9o310  {310,510,530,531}.  We  can  also  represent  a /-combination  as  a bit 

string  a„_i . . . aia0,  in  which  case  da  is  the  set  of  all  strings  obtained  by  chang- 
ing a 1 to  a 0:  5101011  = {001011, 100011, 101001, 101010}.  If  A is  any  set  of 
f-combinations,  we  define  its  shadow 


dA  = |J{  da  | a G A } (52) 

to  be  the  set  of  all  (t  - ^-combinations  in  the  shadows  of  its  members  For 
example,  555310  = {10,30,31,50,51,53}. 

These  definitions  apply  also  to  combinations  with  repetitions,  namely  to 
multicombinations:  55330  = {330,530,533}  and  555330  = {30,33,50,53}.  In 
general,  when  A is  a set  of  /-element  multisets,  dA  is  a set  of  (t  - l)-element 
multisets.  Notice,  however,  that  dA  never  has  repeated  elements  itself. 

The  upper  shadow  ga  with  respect  to  a universe  U is  defined  similarly,  but 
it  goes  from  /-combinations  to  (/  -F  1 {-combinations: 

Qa  = { /3  C U | a e dp },  for  a £ U;  (53) 

eA  = U{  6a  I « e A },  for  A C U.  (54) 

If,  for  example,  U — {0, 1,  2,  3,  4, 5, 6},  we  have  g>5310  = {53210,54310,65310}: 
on  the  other  hand,  if  U = {oo-0,  oo  l, . . . ,oo-6},  we  have  g>5310  = {53100  53110 
53210,53310,54310,55310,65310}.  ’ 

The  following  fundamental  theorems,  which  have  many  applications  in  var- 
ious branches  of  mathematics  and  computer  science,  tell  us  how  small  a set’s 
shadows  can  be: 

Theorem  K.  If  A is  a set  of  N t-combinations  contained  in  U = {0,1 ,n-l} 

then  ’ ’ ’’ 

|5A|  > |5Pjvt|  and  \eA\  > \eQNnt\,  (55) 

where  PNt  denotes  the  first  N combinations  generated  by  Algorithm  L,  namely 
the  N lexicographically  smallest  combinations  ct...c2c1  that  satisfy  (3),  and 
Q Nnt  denotes  the  N lexicographically  largest.  | 
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Theorem  M.  If  A is  a set  of  N t-multicombinations  contained  in  the  multiset 
U = {oo  • 0,  oo  • 1, . . . , oo  • s},  then 

\dA\  > \dPNt\  and  \qA\  > \eQNst\,  (56) 

where  PNt  denotes  theJV  lexicographically  smallest  multicombinations  dt . . . d2di 
that  satisfy  (6),  and  QNst  denotes  the  N lexicographically  largest.  | 


Both  of  these  theorems  are  consequences  of  a stronger  result  that  we  shall 
prove  later.  Theorem  K is  generally  called  the  Kruskal-Katona  theorem,  because 
it  was  discovered  by  J.  B.  Kruskal  [Math.  Optimization  Techniques , edited  by 
R.  Bellman  (1963),  251-278]  and  rediscovered  by  G.  Katona  [Theory  of  Graphs, 
Tihany  1966,  edited  by  Erdos  and  Katona  (Academic  Press,  1968),  187-207]; 
M.  P.  Schiitzenberger  had  previously  stated  it  in  a less- well- known  publication, 
with  incomplete  proof  [RLE  Quarterly  Progress  Report  55  (1959),  117-118]. 
Theorem  M goes  back  to  F.  S.  Macaulay,  many  years  earlier  [Proc.  London 
Math.  Soc.  (2)  26  (1927),  531-555], 

Before  proving  (55)  and  (56),  let’s  take  a closer  look  at  what  those  formulas 
mean.  We  know  from  Theorem  L that  the  first  N of  all  t-combinations  visited 
by  Algorithm  L are  those  that  precede  nt . . . n2rii,  where 


N = 


nt  > ■ ■ ■ > n2  > ni  > 0 


is  the  degree-t  combinatorial  representation  of  N.  Sometimes  this  representation 
has  fewer  than  t nonzero  terms,  because  nj  can  be  equal  to  j - 1;  let’s  suppress 
the  zeros,  and  write 


nt  > nt- 1 >■■■>  nv  > v > 1.  (57) 


Now  the  first  ("')  combinations  ct . . .ci  are  the  t-combinations  of  {0, ... , nt  — 1}; 
the  next  ("(_-/ ) are  those  in  which  ct  = nt  and  q_x  ...  cx  is  a (f  - l)-combination 
of  {0, . . . ,nt_x-l};  and  so  on.  For  example,  if  t = 5 and  N = (®)  + Q + (3),  the 
first  N combinations  are 


Pn 5 = {43210, . . . , 87654}  U {93210, . . . , 96543}  U {97210, . . . , 97321}.  (58) 

The  shadow  of  this  set  Pvs  is,  fortunately,  easy  to  understand:  It  is 

dPN 5 = {3210, . . . , 8765}  U {9210, . . . , 9654}  U {9710, . . . , 9732},  (59) 

namely  the  first  (®)  + Q + (2)  combinations  in  lexicographic  order  when  t = 4. 
I11  other  words,  if  we  define  Kruskal’s  function  Kt  by  the  formula 

KtN=  (t-1)  + (tn-2)+‘--+(n-l)  (6o) 

when  N has  the  unique  representation  (57),  with  nt0  = 0,  we  have 

dPNt  = P(KtN)(t- !)•  (61) 
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Theorem  K tells  us,  for  example,  that  a graph  with  a million  edges  can 
contain  at  most 


/ 1414\  / 1009 \ 

( 3 ) + ( 2 ) 


470,700,300 


triangles,  that  is,  at  most  470,700,300  sets  of  vertices  {u,v,w}  with  u — v 

w — u.  The  reason  is  that  1000000  = (14214)  + (10°9)  by  exercise  17,  and  the  edges 
-P(ioooooo)2  do  support  (14314)  + (102°9)  triangles;  but  if  there  were  more,  the  graph 
would  necessarily  have  at  least  k3470700301  = (14214)  + (1°09)  + Q = 1000001 
edges  in  their  shadow. 

Kruskal  defined  the  companion  function 


A tN  = 


(62) 


to  deal  with  questions  such  as  this.  The  k and  A functions  are  related  by  an 
interesting  law  proved  in  exercise  72: 


M + N=(S^t^J  implies  ksM  + \tN  = , if  st  > 0.  (63) 

Turning  to  Theorem  M,  the  sizes  of  dPNt  and  QQNst  turn  out  to  be 

\dPNt\  = HtN  and  |e<5jvst|  = N + ksN  (64) 

(see  exercise  81),  where  the  function  p.t  satisfies 

= (T-  i1)  + CT-  2 *) + • • • + _ /)  (6s) 

when  N has  the  combinatorial  representation  (57). 

Table  3 shows  how  these  functions  ntN,  A tN,  and  ptN  behave  for  small 
values  of  t and  N . When  t and  N are  large,  they  can  be  well  approximated 
in  terms  of  a remarkable  function  t(x)  introduced  by  Teiji  Takagi  in  1903;  see 
Fig.  47  and  exercises  82-85. 

Theorems  K and  M are  corollaries  of  a much  more  general  theorem  of  discrete 
geometry,  discovered  by  Da-Lun  Wang  and  Ping  Wang  [SIAM  J.  Applied  Math. 
33  (1977),  55-59],  which  we  shall  now  proceed  to  investigate.  Consider  the 
discrete  n- dimensional  torus  T(m,i , . . . ,m„)  whose  elements  are  integer  vectors 
x = (^li  ■ ■ • -,xn)  with  0 < x\  < mi,  . . . , 0 < xn  < mn.  We  define  the  sum  and 
difference  of  two  such  vectors  x and  y as  in  Eqs.  4.3.2-(2)  and  4.3.2  -(3): 

x + y = ((aq+  Vl)  mod  mi,...,  (xn  + yn)  mod  m„),  (66) 

X~V  = ((xi-yi)modm1,...,(xn-yn)modmn).  (67) 

We  also  define  the  so-called  cross  order  on  such  vectors  by  saying  that  x ■<  y if 
and  only  if 


vx  < vy  or  ( vx  = vy  and  x > y lexicographically);  (68) 

here,  as  usual,  f(x1;  . . . ,xn)  = aq  + ■ ■ • + xn.  For  example,  when  mj  = m2  = 2 
and  rn3  = 3,  the  12  vectors  in  increasing  cross  order  are 

000,  100,  010,  001,  110,  101,  Oil,  002,  111,  102,  012,  112,  (69) 
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Table  3 

EXAMPLES  OF  THE  KRUSKAL-MACAULAY  FUNCTIONS  k,  A,  AND  /z 


N = 

0 

1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

11 

12 

13 

14 

15 

16 

17 

18 

19 

20 

«i  N = 

0 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

k2N  = 

0 

2 

3 

3 

4 

4 

4 

5 

5 

5 

5 

6 

6 

6 

6 

6 

7 

7 

7 

7 

7 

k3N  = 

0 

3 

5 

6 

6 

8 

9 

9 

10 

10 

10 

12 

13 

13 

14 

14 

14 

15 

15 

15 

15 

K4  N = 

0 

4 

7 

9 

10 

10 

13 

15 

16 

16 

18 

19 

19 

20 

20 

20 

23 

25 

26 

26 

28 

K5N  = 

0 

5 

9 

12 

14 

15 

15 

19 

22 

24 

25 

25 

28 

30 

31 

31 

33 

34 

34 

35 

35 

Ai  N = 

0 

0 

1 

3 

6 

10 

15 

21 

28 

36 

45 

55 

66 

78 

91 
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\2n  = 

0 

0 

0 

1 

1 

2 

4 

4 

5 

7 

10 

10 

11 

13 

16 

20 

20 

21 

23 

26 

30 

A 3N  = 

0 

0 

0 

0 

1 

1 

1 

2 

2 

3 

5 

5 

5 

6 

6 

7 

9 

9 

10 

12 

15 

A 4N  = 

0 

0 

0 

0 

0 

1 

1 

1 

1 

2 

2 

2 

3 

3 

4 

6 

6 

6 

6 

7 

7 

A 5N  = 

0 

0 

0 

0 

0 

0 

1 

1 

1 

1 

1 

2 

2 

2 

2 

3 

3 

3 

4 

4 

5 

Pi  N = 

0 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

H2N  = 

0 

1 

2 

2 

3 

3 

3 

4 

4 

4 

4 

5 

5 

5 

5 

5 

6 

6 

6 

6 

6 

P-3N  = 

0 

1 

2 

3 

3 

4 

5 

5 

6 

6 

6 

7 

8 

8 

9 

9 

9 

10 

10 

10 

10 

p,4N  = 

0 

1 

2 

3 

4 

4 

5 

6 

7 

7 

8 

9 

9 

10 

10 

10 

11 

12 

13 

13 

14 

P5  N = 

0 

1 

2 

3 

4 

5 

5 

6 

7 

8 

9 

9 

10 

11 

12 

12 

13 

14 

14 

15 

15 

K5N  — N 


0 1/4  1/2  3/4  1 


Fig.  47.  Approximating  a Kruskal  function  with  the  Takagi  function.  (The 
smooth  curve  in  the  left-hand  graph  is  the  lower  bound  k5N  — N of  exercise  80.) 


omitting  parentheses  and  commas  for  convenience.  The  complement  of  a vector 
in  T(nii, . . . , m„)  is 


x = (mi  — 1 — xi, . . . ,mn  - 1 - xn).  (70) 

Notice  that  x -<  y holds  if  and  only  if  x >7  y.  Therefore  we  have 

rank(x)  + rank(ic)  — T - 1,  where  T = mx  . . . mn,  (71) 

if  rank(:r)  denotes  the  number  of  vectors  that  precede  x in  cross  order. 

We  will  find  it  convenient  to  call  the  vectors  “points”  and  to  name  the  points 
e0,  ei,  . . . , er-i  in  increasing  cross  order.  Thus  we  have  e7  = 002  in  (69),  and 
er  = ex—i—r  in  general.  Notice  that 


ei  = 100...00,  e2  = 010  . . . 00,  ...,  e„  = 000...01; 


(72) 
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(73) 
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these  are  the  so-called  unit  vectors.  The  set 

Sn  = {e0,ei, . . . ,ejv_x} 

consisting  of  the  smallest  N points  is  called  a standard  set , and  in  the  special 
case  N = n + 1 we  write 

E = {eo,ei,...,en}  = {000  . . . 00, 100  . . . 00, 010 . . . 00, . . . , 000  . . . 01}.  (74) 

Any  set  of  points  X has  a spread  X+,  a core  X°,  and  a dual  X~  defined 
by  the  rules 

= { a:  e ST  \ x e X or  x - ei  G X or  • ■ • or  a;  - en  € X };  (75) 

X ~ ix  ^ $t  \ x £ X and  x + ex  € X and  • • • and  x + en  e X };  (76) 
X~  = {xeST\xiX}. 

We  can  also  define  the  spread  of  X algebraically,  writing 

X+  = X + E,  (78) 

where  X + Y denotes  { x + y \ x <5  X and  y e Y }.  Clearly 

X+  C Y if  and  only  if  X C Y° . (79) 

These  notions  can  be  illustrated  in  the  two-dimensional  case  m.i  =4 , m2  — 6.  by 
the  more-or-less  random  toroidal  arrangement  X = {00, 12, 13, 14, 15  21,  22  25| 
for  which  we  have,  pictorially, 


o 

• 

• 

+ 

• 

+ 

o 

• 

• 

+ 

o 

o 

• 

+ 

o 

o 

• 

+ 

o 

Z 

+ 

+ 

o 

(8o) 


X°  and  X+ 


X" 


X~°  and  X~+ 


here  X in  the  first,  two  diagrams  consists  of  points  marked  • or  o,  X°  comprises 
just  the  os,  and  X+  consists  of  +s  plus  .s  plus  oS.  Notice  that  if  we  rotate  the 
diagram  for  and  X~+  by  180°,  we  obtain  the  diagram  for  I”  and  X+  but 
with  (.,o,+,  ) respectively  changed  to  (+,  ,.,o);  and  in  fact  the  identities  ’ 

X°  = X+  = x—  (8l) 

hold  in  general  (see  exercise  86). 

Now  we  are  ready  to  state  the  theorem  of  Wang  and  Wang: 

Theorem  W.  Let  X be  any  set  of  N points  in  the  discrete  torus  T(mi  m ) 
where  nn  < • • ■ < m„.  Then  |X+|  > |S+|  and  |X°|  < \S^\. 

Proof.  In  other  words,  the  standard  sets  SN  have  the  smallest  spread  and  largest 
core,  among  all  X-point  sets.  We  will  prove  this  result  by  following  a general 
approach  first  used  by  F.  W.  J.  Whipple  to  prove  Theorem  M [Proc.  London 
Math.  Soc.  (2)  28  (1928),  431-437],  The  first  step  is  to  prove  that  the  spread 
and  the  core  of  standard  sets  are  standard: 
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Lemma  S.  There  are  functions  a and  (5  such  that  = SaN  and  = S@n- 

Proof.  We  may  assume  that  N > 0.  Let  r be  maximum  with  er  £ S^,  and  let 
aN  = r + 1;  we  must  prove  that  eq  £ for  0 < q < r.  Suppose  eq  = x — 
( xi , . . . , xn ) and  er  = y = (yi, , yn ),  and  let  k be  the  largest  subscript  with 
xk  > 0.  Since  y £ S^,  there  is  a subscript  j such  that  y — e,  £ SN.  It  suffices  to 
prove  that  x — ek  ^ y — ej,  and  exercise  88  does  this. 

The  second  part  follows  from  (8i),  with  /3N  = T — a(T  — N),  because 
SN  = &T-N-  I 

Theorem  W is  obviously  true  when  n = 1,  so  we  assume  by  induction  that 
it  has  been  proved  in  n — 1 dimensions.  The  next  step  is  to  compress  the  given 
set  X in  the  &th  coordinate  position,  by  partitioning  it  into  disjoint  sets 

Xk(a)  = {x  £ X \ xk  = a}  (82) 

for  0 < a < mk  and  replacing  each  Xk(a)  by 

Xk(a)  = { (sl>  ■ • • i Sfc-L  <b  Sfc,  ■ ■ ■ , Sn_i)  | (Si,  • • ■ , «n-l)  G 5'|X*(a)|  }>  (83) 

a set  with  the  same  number  of  elements.  The  sets  5 used  in  (83)  are  standard  in 
the  (n  - l)-dimensional  torus  T(mlt . . . , mk-i,  mk+i, . . . , mn).  Notice  that  we 
have  (xi, . . . ,xk-i,a,xk+ 1, . . .,xn)  ^ (2/1 , . . . , J/fc-i,  a,  yk+i, . . . ,yn)  if  and  only 
if  (xi, . . . ,Xfe_i,Xfc+1, . . . ,x„)  ^ (2/1, . . • ,Vk-i,yk+i,  ■ ■ ■ ,yn );  therefore  X'k(a)  = 
Xk(a)  if  and  only  if  the  (n  — l)-dimensional  points  (xi, . . . ,Xfc_i,Xfc+1, . . . , x„) 
with  (xi, . . . ,xk-i,a,xk+ 1, . . . ,xn)  G X are  as  small  as  possible  when  projected 
onto  the  (n  — l)-dimensional  torus.  We  let 

CkX  = X' (0)  u X'k(l)  U • • • U Xk(mk  - 1)  (84) 

be  the  compression  of  X in  position  k.  Exercise  90  proves  the  basic  fact  that 
compression  does  not  increase  the  size  of  the  spread: 

|*+|  > \{CkX)+l  fori  <k<n.  (85) 

Furthermore,  if  compression  changes  X,  it  replaces  some  of  the  elements  by  other 
elements  of  lower  rank.  Therefore  we  need  to  prove  Theorem  W only  for  sets  X 
that  are  totally  compressed,  having  X = CkX  for  all  k. 

Consider,  for  example,  the  case  n — 2.  A totally  compressed  set  in  two 
dimensions  has  all  points  moved  to  the  left  of  their  rows  and  the  bottom  of  their 
columns,  as  in  the  eleven-point  sets 
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the  rightmost  of  these  is  standard,  and  has  the  smallest  spread.  Exercise  91 
completes  the  proof  of  Theorem  W in  two  dimensions. 
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When  n > 2,  suppose  x = (xu  . . . ,xn)  e X and  Xj  > 0.  The  condition 
CkX  — X implies  that,  if  0 < i < j and  i p k / j . we  have  x + et  — e.  £ X. 
Applying  this  fact  for  three  values  of  k tells  us  that  x + a - ej  e X whenever 
0 < t < j.  Consequently 


Xn(a)  + En( 0)  C X„(a  — l)  + e„  for  0 < a < 


(86) 


where  m — mn  and  En(Q)  is  a clever  abbreviation  for  the  set  {e0, . . . , en_j}. 

Let  Xn(a)  have  Na  elements,  so  that  N = |W|  = No  + N -4 IW  , and 

let  Y = X+.  Then  m_1’ 


Yn(a)  - (Xn((a-  1)  mod  to)  + e„)  U (Xn(a)  + En(0)) 
is  standard  in  n - 1 dimensions,  and  (86)  tells  us  that 


ETm-i  < /3Nm~2  < Nm_2  < ■ ■ ■ < Ni  < pN0  < No  < aN0. 
where  a and  /3  refer  to  coordinates  1 through  n - 1 . Therefore 

1^1  = l^n(0)|  + |Tn(l)|  + |rn(2)|  + f |y„(m  — 1)1 

= aN0  + N0  + Ari  d f Arm_2  = a No  + N - Nm_1. 

The  proof  of  Theorem  W now  has  a beautiful  conclusion.  Let  Z = SN,  and 
suppose  \Zn(a)\  = Ma.  We  want  to  prove  that  |X+|  > \Z+\,  namely  that 


aN0  + N - Nm_1  > aM0  + N - Mm_1;  (87) 

because  the  arguments  of  the  previous  paragraph  apply  to  Z as  well  as  to  X 
We  will  prove  (87)  by  showing  that  N m_x  < Mm_x  and  N0  > M0. 

Using  the  (n  - l)-dimensional  a and  /3  functions,  let  us  define 


Nm-l  Nm_  1,  Nm_2  — aNln-i,  ... 
<'  = N0,  N"  = pNg,  TV"  = pN[', 


K = aN l Nq  = aN[- 
iV"_!  = PK_2. 


Then  we  have  N'a  < Na  < Ng  for  0 < a < m,  and  it  follows  that 


(88) 

(89) 


N>  ~ No  + K + ■ ■ ■ + N'm_ j < N < N"  = Ng  + N"  + ■■■  + iV"_1.  (90) 

Exercise  92  proves  that  the  standard  set  Z'  = SN,  has  exactly  N'a  elements  with 
nth  coordinate  equal  to  a,  for  each  a ; and  by  the  duality  between  a and  P,  the 
standard  set  Z"  = S',v„  likewise  has  exactly  Ng  elements  with  nth  coordinate  a. 
Finally,  therefore, 


Mm-i  = I Zn(m  - 1)|  > | Z'n{m  - 1)|  = JVTO_1, 

M.  = |Zn(0)|<|Z"(0)|=JV0, 

because  Z'  C Z C Z"  by  (90).  By  (81)  we  also  have  |X°|  < \Z°\.  | 

Now  we  are  ready  to  prove  Theorems  K and  M,  which  are  in  fact  special 
cases  of  a substantially  more  general  theorem  of  Clements  and  Lindstrom  that 
applies  to  arbitrary  multisets  [J.  Combinatorial  Theory  7 (1969),  230-238]: 
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Corollary  C.  If  A is  a set  of  N t-multicombinations  contained  in  the  multiset 
U = {s0  • 0,  • 1, . . . , Sd  ■ d},  where  s0  > «i  > • • • > s<i,  then 

\dA\  > \dPNt\  and  \qA\  > \qQnA,  (91) 

where  H.y,  denotes  the  N lexicographically  smallest  multicombinations  dt . . . d2d\ 
ofU,  and  Q^t  denotes  the  N lexicographically  largest. 

Proof.  Multicombinations  of  U can  be  represented  as  points  *1  ...  x„  of  the  torus 
T(mi, . . . , mn ) , where  n = d + 1 and  mj  = sn^}  + 1;  we  let  Xj  be  the  number 
of  occurrences  of  n — j . This  correspondence  preserves  lexicographic  order.  For 
example,  if  U = {0,0,0, 1, 1,2,3},  its  3-multicombinations  are 

000,  100,  110,  200,  210,  211,  300,  310,  311,  320,  321,  (92) 

in  lexicographic  order,  and  the  corresponding  points  X1X2X3X4  are 

0003, 0012, 0021, 0102,  0111, 0120, 1002, 1011, 1020, 1101, 1110.  (93) 

Let  Tw  be  the  points  of  the  torus  that  have  weight  Xi  H h xn  = w.  Then 

every  allowable  set  A of  t- multicombinations  is  a subset  of  Tt.  Furthermore  — 
and  this  is  the  main  point  -the  spread  of  T0  U T\  U • ■ • U Tt_  1 U A is 

(T0  U Ti  U • • • U Tt-!  U A)+  = T+  U T+  U ■ • • U T+_x  U A+ 

= T0  U Ti  U • • • U Tt  U qA.  (94) 

Thus  the  upper  shadow  qA  is  simply  (T0  U Ti  U • • • U Tt-\  U A)+  n T(+1,  and 
Theorem  W tells  us  in  essence  that  |A|  = N implies  \qA\  > |g(5M+iv  nTt)|, 
where  M = |To  U • • • UTt_i|.  Hence,  by  the  definition  of  cross  order,  Sm+n  HX} 
consists  of  the  lexicographically  largest  N t- multicombinations,  namely  Q,vt  ■ 
The  proof  that  \dA\  > \dPpft\  now  follows  by  complementation  (see  exer- 
cise 94).  | 

EXERCISES 

1.  [ M23 ) Explain  why  Golomb’s  rule  (8)  makes  all  sets  {ci,. . . ,ct}  C {0, . . . , n — 1} 
correspond  uniquely  to  multisets  {ei, . . . ,et}  C {00  • 0, . . . , 00  • n — t}. 

2.  [16]  What  path  in  an  11  x 13  grid  corresponds  to  the  bit  string  (13)? 

► 3.  [21]  (R.  R.  Fenichel,  1968.)  Show  that  the  compositions  qt  + • • • + qi  +qo  of  s into 
t + 1 nonnegative  parts  can  be  generated  in  lexicographic  order  by  a simple  loopless 
algorithm. 

4.  [16]  Show  that  every  composition  qt  ...  <70  of  s into  t + 1 nonnegative  parts  corre- 
sponds to  a composition  rs  . . . ro  of  t into  s + 1 nonnegative  parts.  What  composition 
corresponds  to  10224000001010  under  this  correspondence? 

► 5.  [20]  What  is  a good  way  to  generate  all  of  the  integer  solutions  to  the  following 
systems  of  inequalities? 

a)  n > xt  > xt-i  > xt-2  > xt-3  > • ■ ■ > x\  > 0,  when  t is  odd. 

b)  > xt~i  X2  x\  0,  where  a » b means  a > b + 2. 

6.  [M22]  How  often  is  each  step  of  Algorithm  T performed? 
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7.  [22]  Design  an  algorithm  that  runs  through  the  “dual”  combinations  bs  ...  in 
decreasing  lexicographic  order  (see  (5)  and  Table  1).  Like  Algorithm  T,  your  algorithm 
should  avoid  redundant  assignments  and  unnecessary  searching. 

8.  [M23]  Design  an  algorithm  that  generates  all  (s,  ^-combinations  a„-i...aiao 
lexicographically  in  bitstring  form.  The  total  running  time  should  be  0((n)),  assuming 
that  st  > 0. 


9.  [M26]  When  all  (s,  ^-combinations  an_i  . . . a\ao  are  listed  in  lexicographic  order, 
let  2 Ast  be  the  total  number  of  bit  changes  between  adjacent  strings.  For  example, 
A33  = 25  because  there  are  respectively 


2 + 2 + 2 + 4 + 2 + 2 + 4 + 2 + 2-1-6  + 2 + 2 + 4-1-2  + 2-1-4-1-2-1-2-1-2  = 50 

bit  changes  between  the  20  strings  in  Table  1. 

a)  Show  that  Ast  = min (s,t)  + + A3(t_l)  when  st  > 0;  Ast  = 0 when  st  = 0. 

b)  Prove  that  Ast  < 2(s+(). 

► 10.  [21]  The  “World  Series”  of  baseball  is  traditionally  a competition  in  which  the 
American  League  champion  (A)  plays  the  National  League  champion  (N)  until  one  of 
them  has  beaten  the  other  four  times.  What  is  a good  way  to  list  all  possible  scenarios 
AAAA,  AAANA,  AAANNA,  . . . , NNNN?  What  is  a simple  way  to  assign  consecutive 
integers  to  those  scenarios? 

11.  [19]  Which  of  the  scenarios  in  exercise  10  occurred  most  often  during  the  1900s? 
Which  of  them  never  occurred?  [Hint:  World  Series  scores  are  easily  found  on  the 
Internet.] 

12.  [HM32]  A set  V of  n-bit  vectors  that  is  closed  under  addition  modulo  2 is  called 
a binary  vector  space. 

a)  Prove  that  every  such  V contains  2f  elements,  for  some  integer  t,  and  can  be 
represented  as  the  set  {xiai  ® • • • © xtat  \ 0 < xu  . . . , xt  < 1}  where  the  vectors 

1 ■ •••!  Ot  form  a “canonical  basis”  with  the  following  property:  There  is  a t- 
combination  c(...c2ci  of  {0,1,..., n - 1}  such  that,  if  ak  is  the  binary  vector 
ak(n-i)  ■ ■ -a/cia/co,  we  have 

akCj  = [j  = k]  for  1 <j,k<t\  au  = 0 for  0 < l < ck,  1 < k < t. 

For  example,  the  canonical  bases  with  n = 9,  t = 4,  and  c4C3C2Cx  = 7641  have  the 
general  form 

ai  = *00*0**10, 
a2  = *00*1000  0, 
a3  = *0100000  0, 
a4  = *1000000  0; 

there  are  28  ways  to  replace  the  eight  asterisks  by  0s  and/or  Is,  and  each  of  these 
defines  a canonical  basis.  We  call  t the  dimension  of  V. 

b)  How  many  t-dimensional  spaces  are  possible  with  n-bit  vectors? 

c)  Design  an  algorithm  to  generate  all  canonical  bases  (au...,at)  of  dimension  t. 
Hint.  Let  the  associated  combinations  ct  . . . c\  increase  lexicographically  as  in 
Algorithm  L. 

d)  What  is  the  1000000th  basis  visited  by  your  algorithm  when  n = 9 and  t = 4? 

13.  [25]  A one-dimensional  Ising  configuration  of  length  n,  weight  t,  and  energy  r. 

is  a binary  string  an^...a0  such  that  al  = * and  bj  - r,  where  b,  = 
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aj  ® aj-\.  For  example,  ai2  . . . oo  = 1100100100011  has  weight  6 and  energy  6,  since 
b12...bi  =010110110010. 

Design  an  algorithm  to  generate  all  such  configurations,  given  n,  t,  and  r. 

14.  [26]  When  the  binary  strings  an-i---aiao  of  (s,  ^-combinations  are  generated 
in  lexicographic  order,  we  sometimes  need  to  change  2 min(s,  t)  bits  to  get  from  one 
combination  to  the  next.  For  example,  011100  is  followed  by  100011  in  Table  1. 
Therefore  we  apparently  cannot  hope  to  generate  all  combinations  with  a loopless 
algorithm  unless  we  visit  them  in  some  other  order. 

Show,  however,  that  there  actually  is  a way  to  compute  the  lexicographic  successor 
of  a given  combination  in  0(1)  steps,  if  each  combination  is  represented  indirectly  in  a 
doubly  linked  list  as  follows:  There  are  arrays  Z[0],  . . . , l[n]  and  r[0],  . . . , r[n ] such  that 
l[r[j]]  = j for  0 < j < n.  If  xo  = Z[0]  and  Xj  = Z[xj_x]  for  0 < j < n,  then  aj  = [xj  > s] 
for  0 < j < n. 

15.  [M22]  Use  the  fact  that  dual  combinations  bs  . . . b2b\  occur  in  reverse  lexico- 
graphic order  to  prove  that  the  sum  (*”)  + • • ■ + (22)  + (611)  has  a simple  relation 
to  the  sum  (c/)  + • • • + (c22)  + (7). 

16.  [M21]  What  is  the  millionth  combination  generated  by  Algorithm  L when  t is 
(a)  2?  (b)  3?  (c)  4?  (d)  5?  (e)  1000000? 

17.  [HM25]  Given  N and  t,  what  is  a good  way  to  compute  the  combinatorial  repre- 
sentation (20)? 

► 18.  [20]  What  binary  tree  do  we  get  when  the  binomial  tree  Tn  is  represented  by 
“right  child”  and  “left  sibling”  pointers  as  in  exercise  2. 3. 2-5? 

19.  [21]  Instead  of  labeling  the  branches  of  the  binomial  tree  T4  as  shown  in  (22),  we 
could  label  each  node  with  the  bit  string  of  its  corresponding  combination: 


0011  0101  0110  1001  1010  1100 


0111  1011  1101  1110 


1111 

If  Too  has  been  labeled  in  this  way,  suppressing  leading  zeros,  preorder  is  the  same  as 
the  ordinary  increasing  order  of  binary  notation;  so  the  millionth  node  turns  out  to  be 
11110100001000111111.  But  what  is  the  millionth  node  of  T0 0 in  postorder ? 

20.  [M20]  Devise  generating  functions  g and  h such  that  Algorithm  F finds  exactly 
[zw]  g(z)  feasible  combinations  and  sets  1 1—  t + 1 exactly  [z'v]  h(z)  times. 

21.  [M22]  (Joan  E.  Miller,  1971.)  Prove  the  alternating  combination  law  (30). 

22.  [M23]  What  is  the  millionth  revolving-door  combination  visited  by  Algorithm  R 
when  t is  (a)  2?  (b)  3?  (c)  4?  (d)  5?  (e)  1000000? 

23.  [M23]  Suppose  we  augment  Algorithm  R by  setting  j «—  t + 1 in  step  Rl,  and 
j «—  1 if  R3  goes  directly  to  R2.  Find  the  probability  distribution  of  j,  and  its  average 
value.  What  does  this  imply  about  the  running  time  of  the  algorithm? 
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► 24.  [ M25 ] (W.  H.  Payne,  1974.)  Continuing  the  previous  exercise,  let  jk  be  the  value 
°f  3 on  the  fcth  visit  by  Algorithm  R.  Show  that  \jk+1  - jk\  < 2,  and  explain  how  to 
make  the  algorithm  loopless  by  exploiting  this  property. 

25.  [ M35 ] Let  ct  . . . c2c1  and  ct  . . . c'2c'i  be  the  TVth  and  ,/V'th  combinations  generated 
by  the  revolving-door  method,  Algorithm  R.  If  the  set  C = {ct, . . . , c2,  cx}  has  m>  0 
elements  not  in  C'  = {c't, . . . ,c'2,ci},  prove  that  | N - IV' | > 

26.  [26]  Do  elements  of  the  ternary  reflected  Gray  code  have  properties  similar  to  the 

revolving-door  Gray  code  Tst,  if  we  extract  only  the  n-tuples  an-i  ■ . .oia0  such  that 
(a)  an-i  -|  1-  ai  + a0  = t?  (b)  {an_i, . . . , ai,  a0}  = {r  ■ 0,  s ■ 1,  t • 2}? 

► 27.  [25]  Show  that  there  is  a simple  way  to  generate  all  combinations  of  at  most  t 

elements  of  {0, 1, . . . , n - 1},  using  only  Gray-code-like  transitions  0 -H-  1 and  01  10. 

(In  other  words,  each  step  should  either  insert  a new  element,  delete  an  element,  or 
shift  an  element  by  ±1.)  For  example, 

0000,  0001,  0011,  0010,  0110,  0101,  0100,  1100,  1010,  1001,  1000 

is  one  such  sequence  when  n = 4 and  t = 2.  Hint:  Think  of  Chinese  rings. 

28.  [M21\  True  or  false:  A listing  of  (s, ^-combinations  a„_i...aio0  in  bitstring 
form  is  in  genlex  order  if  and  only  if  the  corresponding  index-form  listings  bs  . . . b2bi 
(for  the  0s)  and  ct  . . . C2C1  (for  the  Is)  are  both  in  genlex  order. 

► 29.  [M28]  (P.  J.  Chase.)  Given  a string  on  the  symbols  +,  -,  and  0,  say  that  an 
R-block  is  a substring  of  the  form  -k+1  that  is  preceded  by  0 and  not  followed  by  an 
L-block  is  a substring  of  the  form  +-k  that  is  followed  by  0;  in  both  cases  k > 0.  For 
example,  the  string  -KXH-+-+++-000-  has  two  L-blocks  and  one  R-block,  shown  in  gray. 
Notice  that  blocks  cannot  overlap. 

We  form  the  successor  of  such  a string  as  follows,  whenever  at  least  one  block  is 
present:  Replace  the  rightmost  0-k+1  by  -+fc0,  if  the  rightmost  block  is  an  R-block; 
otherwise  replace  the  rightmost  +-*0  by  0+k+1 . Also  negate  the  first  sign,  if  any,  that 
appears  to  the  right  of  the  block  that  has  been  changed.  For  example, 

-+00++-  -4  -0+0-+ > -0+-0—  ->  -0+— +0  -0+—  0+  -00+++-. 

where  the  notation  a — > /3  means  that  /3  is  the  successor  of  a. 

a)  What  strings  have  no  blocks  (and  therefore  no  successor)? 

b)  Can  there  be  a cycle  of  strings  with  a0  — >«i  — t • — >•  ->a0? 

c)  Prove  that  if  a ->■  /3  then  -(3  — >•  -a,  where  ” means  “negate  all  the  signs.” 
(Therefore  every  string  has  at  most  one  predecessor.) 

d)  Show  that  if  a0  -t  «i  ->•••-►  afc  and  k > 0,  the  strings  a0  and  ak  do  not  have 
all  their  0s  in  the  same  positions.  (Therefore,  if  qo  has  s signs  and  t zeros,  k must 
be  less  than  (s|').) 

e)  Prove  that  every  string  a with  s signs  and  t zeros  belongs  to  exactly  one  chain 

tto  Qi  — t > 

30.  [M32]  The  previous  exercise  defines  2s  ways  to  generate  all  combinations  of  s 0s 
and  t Is,  via  the  mapping  + H 0,  - ef  0,  and  0 >4  1.  Show  that  each  of  these  ways 
is  a homogeneous  genlex  sequence,  definable  by  an  appropriate  recurrence.  Is  Chase’s 
sequence  (37)  a special  case  of  this  general  construction? 

31.  [M23]  How  many  genlex  listings  of  (s,  ^-combinations  are  possible  in  (a)  bitstring 
form  a„_i  . . . a iao?  (b)  index-list  form  ct . . . c2ci? 
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► 32.  [ M32 } How  many  of  the  genlex  listings  of  (s,  f)-combination  strings  o„_i  . . . aiao 
(a)  have  the  revolving-door  property?  (b)  are  homogeneous? 

33.  [ HM33 ] How  many  of  the  genlex  listings  in  exercise  31(b)  are  near-perfect? 

34.  [M32]  Continuing  exercise  33,  explain  how  to  find  such  schemes  that  are  as  near 
as  possible  to  perfection,  in  the  sense  that  the  number  of  “imperfect”  transitions  Cj  t— 
Cj  ± 2 is  minimized,  when  s and  t are  not  too  large. 

35.  [M26]  How  many  steps  of  Chase’s  sequence  Cst  use  an  imperfect  transition? 

► 36.  [M21  ] Prove  that  method  (39)  performs  the  operation  j «—  j + 1 a total  of  exactly 
(s+t)  _ times  as  it  generates  all  (s, ^-combinations  an_ 1 ...aiao,  given  any  genlex 
scheme  for  combinations  in  bitstring  form. 

► 37.  [27]  What  algorithm  results  when  the  general  genlex  method  (39)  is  used  to 
produce  (s,  ^-combinations  a„_  1 . . . aiao  in  (a)  lexicographic  order?  (b)  the  revolving- 
door  order  of  Algorithm  R?  (c)  the  homogeneous  order  of  (31)? 

38.  [26]  Design  a genlex  algorithm  like  Algorithm  C for  the  reverse  sequence  C^t. 

39.  [M21]  When  s = 1 2 and  t = 14,  how  many  combinations  precede  the  bit  string 
11001001000011111101101010  in  Chase’s  sequence  Cst ? (See  (41).) 

40.  [M22]  What  is  the  millionth  combination  in  Chase’s  sequence  Cst,  when  s = 12 
and  t = 14? 

41.  [M2 7]  Show  that  there  is  a permutation  c( 0),  c(l),  c(2),  ...  of  the  nonnegative 
integers  such  that  the  elements  of  Chase’s  sequence  Cst  are  obtained  by  complementing 
the  least  significant  s + t bits  of  the  elements  c(k)  for  0 < k < 2s+t  that  have  weight 
u(c{k))  = s.  (Thus  the  sequence  c(0),  ...,  c(2n  — 1)  contains,  as  subsequences,  all  of 
the  Cst  for  which  s + t = n,  just  as  Gray  binary  code  5(0),  . . . , g(2n  — 1)  contains  all 
the  revolving-door  sequences  Psf.)  Explain  how  to  compute  the  binary  representation 
c(k)  = ( . . . 020100)2  from  the  binary  representation  k = ( . . . 626160)2- 

42.  [HM34]  Use  generating  functions  of  the  form  '*T/S  t gstwszt  to  analyze  each  step  of 
Algorithm  C. 

43.  [20]  Prove  or  disprove:  If  s(x)  and  p(x)  denote  respectively  the  successor  and 
predecessor  of  x in  endo-order,  then  s(x  + 1)  = p(x)  + 1. 

► 44.  [M21]  Let  Ct(n)  — 1 denote  the  sequence  obtained  from  Ct{n)  by  striking  out 
all  combinations  with  ci  = 0,  then  replacing  ct  ■ ■ . ci  by  (ct  — 1) . . . (ci  — 1)  in  the 
combinations  that  remain.  Show  that  Ct(n)  — 1 is  near-perfect. 

45.  [32]  Exploit  endo-order  and  the  expansions  sketched  in  (44)  to  generate  the 
combinations  Ct  ■ . .C2C1  of  Chase’s  sequence  Ct(n)  with  a nonrecursive  procedure. 

► 46.  [33]  Construct  a nonrecursive  algorithm  for  the  dual  combinations  6S  . . . 626i  of 
Chase’s  sequence  Cst,  namely  for  the  positions  of  the  zeros  in  an-\  ■ ■ ■ aiao- 

47.  [26]  Implement  the  near-perfect  multiset  permutation  method  of  (46)  and  (47). 

48.  [M21  ] Suppose  cto,  ai , . . . , aj/_i  is  any  listing  of  the  permutations  of  the  multiset 
(si  • 1, . . . , Sd  ■ d},  where  ak  differs  from  Qjt+i  by  the  interchange  of  two  elements.  Let 
Po,  ■ ■ ■ , Pm-  1 be  any  revolving-door  listing  for  (s,  t)-combinations,  where  s = so,  t = 
Si  + - • - + Sd,  and  M = (s^f).  Then  let  Aj  be  the  list  of  M elements  obtained  by  starting 
with  aj  t Po  and  applying  the  revolving-door  exchanges;  here  a t P denotes  the  string 
obtained  by  substituting  the  elements  of  a for  the  Is  in  P,  preserving  left-right  order. 
For  example,  if  Po,  ...,  Pm-  1 is  0110,  0101,  1100,  1001,  0011,  1010,  and  if  aj  = 12, 
then  Aj  is  0120,  0102,  1200,  1002,  0012,  1020.  (The  revolving-door  listing  need  not  be 
homogeneous.) 
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Prove  that  the  list  (47)  contains  all  permutations  of  {s0  • 0,  si  • 1, . . . , Sd  ■ d},  and 
that  adjacent  permutations  differ  from  each  other  by  the  interchange  of  two  elements. 
49.  [HM23]  If  q is  a primitive  mth  root  of  unity,  such  as  e2,ri/m,  show  that 

(n\  _ / [n/mj  \ /nmodm\ 

\k)q  V [fc/mj  / Umodm/, 

► 50.  [HM25]  Extend  the  formula  of  the  previous  exercise  to  q-multinomial  coefficients 

/m  H 1-  nt  \ 

V ni, ...  ,nt  )q 

51.  [2o]  Find  all  Hamiltonian  paths  in  the  graph  whose  vertices  are  permutations  of 
{0,0,0, 1, 1, 1}  related  by  adjacent  transposition.  Which  of  those  paths  are  equivalent 
under  the  operations  of  interchanging  0s  with  Is  and/or  left-right  reflection? 

52.  [MS 7]  Generalizing  Theorem  P,  find  a necessary  and  sufficient  condition  that  all 
permutations  of  the  multiset  {.s0  • 0, . . . , Sd  • d}  can  be  generated  by  adjacent  transpo- 
sitions  djdj—i  dj —\dj . 

53.  [M46]  (D.  H.  Lehmer,  1965.)  Suppose  the  N permutations  of  {s0  ■ 0, . . . , sd  ■ d} 

cannot  be  generated  by  a perfect  scheme,  because  (N  + x)/2  of  them  have  an  even 
number  of  inversions,  where  x > 2.  Is  it  possible  to  generate  them  all  with  a sequence 
of  N + x - 2 adjacent  interchanges  aSk  -H-  aSk~i  for  1 < k < N + x - 1,  where 
x ~ 1 cases  are  spurs  with  <5*,  = S^-i  that  take  us  back  to  the  permutation  we’ve 
just  seen?  For  example,  a suitable  sequence  <5i  . . . Sg  j for  the  90  permutations  of 
{0,0, 1,1, 2, 2),  where  x = = 6>  is  234535432523451«42a*51a42a*51a4, 

where  a = 45352542345355,  if  we  start  with  050403020100  = 221100. 

54.  [MJO]  For  what  values  of  s and  t can  all  (s,  ^-combinations  be  generated  if  we 
allow  end-around  swaps  an_i  o 00  in  addition  to  adjacent  interchanges  a,j  Oj_i? 

►55.  [33]  (Frank  Ruskey,  2004.)  (a)  Show  that  all  (s,  ^-combinations  os+t_i  ...  oio0 
can  be  generated  efficiently  by  doing  successive  rotations  ajaj-1  . . . a0  <r-  Oj_i  . . . ooOj. 
(b)  What  MMIX  instructions  will  take  (as_|-t_i  . . . 0100)2  to  its  successor,  when  s+f  < 64? 

56.  [M49]  (Buck  and  Wiedemann,  1984.)  Can  all  (f, ^-combinations  o2t-i  ... aio0 
be  generated  by  repeatedly  swapping  a0  with  some  other  element? 

► 57.  [22]  (Frank  Ruskey.)  Can  a piano  player  run  through  all  possible  4-note  chords 
that  span  at  most  one  octave,  changing  only  one  finger  at  a time?  This  is  the  problem  of 
generating  all  combinations  ct  ...c  1 such  that  n > ct  > ■ ■ ■ > cx  > 0 and  ct  - c 1 < m, 
where  t = 4 and  (a)  m = 8,  n = 52  if  we  consider  only  the  white  notes  of  a piano 
keyboard;  (b)  m = 13,  n = 88  if  we  consider  also  the  black  notes. 

58.  [20]  Consider  the  piano  player’s  problem  of  exercise  57  with  the  additional  con- 
dition that  the  chords  don’t  involve  adjacent  notes.  (In  other  words,  cj+1  > cj  + 1 for 
t > j > 1.  Such  chords  tend  to  be  more  harmonious.) 

59.  [M25]  Is  there  a perfect  solution  to  the  4-note  piano  player’s  problem,  in  which 
each  step  moves  a finger  to  an  adjacent  key? 

60.  [23]  Design  an  algorithm  to  generate  all  bounded  compositions 

^ = r s + • • • + r\  + ro,  where  0 < Tj  < rrij  for  s > j > 0. 

61.  [32]  Show  that  all  bounded  compositions  can  be  generated  by  changing  only  two 
of  the  parts  at  each  step. 
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► 62.  [MS 7]  A contingency  table  is  an  m x n matrix  of  nonnegative  integers  (a,j)  having 

given  row  sums  r,  = aij  and  column  sums  Cj  = Y1T=  l where  n + ■ ■ ■ + rm  = 

Cl  + • • • + cn. 

a)  Show  that  2 x n contingency  tables  are  equivalent  to  bounded  compositions. 

b)  What  is  the  lexicographically  largest  contingency  table  for  (n, . . . , rm;  ci, . . . , cn), 
when  matrix  entries  are  read  row-wise  from  left  to  right  and  top  to  bottom,  namely 
in  the  order  (ttl  1 , U12 , . . . , Uln,^21,U22,  * ■ * , U2n , * • * > Clml  t am2 1 • • • ,Umn)7 

c)  What  is  the  lexicographically  largest  contingency  table  for  (n, . . . , rm;  ci, . . . , c„), 
when  matrix  entries  are  read  column-wise  from  top  to  bottom  and  left  to  right, 
namely  in  the  order  (un , G&21 , • • • , flm i , U12 , U22 , * * - , ^m2 , • • • ,^in,ct2n;  • • • , 

d ) What  is  the  lexicographically  smallest  contingency  table  for  (n , . . . , rm ; Ci , . . . , c„ ) , 
in  the  row-wise  and  column-wise  senses? 

e)  Explain  how  to  generate  all  contingency  tables  for  (n, . . . , rra;  Ci, . . . , cn)  in  lex- 
icographic order. 

63.  [M41]  Show  that  all  contingency  tables  for  (n, . . . ,rm;ci, . . . ,cn)  can  be  gener- 
ated by  changing  exactly  four  entries  of  the  matrix  at  each  step. 

► 64.  [M30]  Construct  a genlex  Gray  cycle  for  all  of  the  2S(S|‘)  subcubes  that  have 
s digits  and  t asterisks,  using  only  the  transformations  *0  «-+  0*,  *1  -O-  1*,  0 <-)•  1. 
For  example,  one  such  cycle  when  s = t = 2 is 

(00**,  01**,  0*1*,  0**1, 0**0, 0*0*,  *00*,  *01*,  *0*1,  *0*0,  **00,  **01, 

**11,  **10,  *1*0,  *1*1,  *11*,  *10*,  1*0*,  1**0, 1**1, 1*1*,  11**,  10**). 

65.  [M4O]  Enumerate  the  total  number  of  genlex  Gray  paths  on  subcubes  that  use 
only  the  transformations  allowed  in  exercise  64.  How  many  of  those  paths  are  cycles? 

► 66.  [22]  Given  n > t > 0,  show  that  there  is  a Gray  path  through  all  of  the  canonical 
bases  (qi,  ...  .at)  of  exercise  12,  changing  just  one  bit  at  each  step.  For  example,  one 
such  path  when  n = 3 and  t = 2 is 

001  101  101  001  001  011  010 

010’  010’  110’  110’  100’  100’  100' 

67.  [46]  Consider  the  Ising  configurations  of  exercise  13  for  which  ao  = 0.  Given  n, 
t,  and  r,  is  there  a Gray  cycle  for  these  conflgurations  in  which  all  transitions  have  the 
forms  0^1  O lO*  or  01*  +-y  lfc0?  For  example,  in  the  case  n = 9,  t = 5,  r = 6,  there  is 
a unique  cycle 

(010101110,  010110110,  011010110,  011011010,  011101010,  010111010). 

68.  [M01]  If  a is  a t-combination,  what  is  (a)  dlal  (b)  dt+xa! 

► 69.  [MSS]  How  large  is  the  smallest  set  A of  t-combinations  for  which  \dA\  < A|? 

70.  [M25]  What  is  the  maximum  value  of  KtN  — N,  for  N > 0? 

71.  [MS0]  How  many  t-cliques  can  a million-edge  graph  have? 

► 72.  [MSS]  Show  that  if  N has  the  degree-/,  combinatorial  representation  (57),  there 
is  an  easy  way  to  find  the  degree-s  combinatorial  representation  of  the  complementary 
number  M = (s|f)  — N,  whenever  N < (s|().  Derive  (63)  as  a consequence. 

73.  [M23]  (A.  J.  W.  Hilton,  1976.)  Let  A be  a set  of  s-combinations  and  B a set  of 
t-combinations,  both  contained  in  U = {0, . . . , n — 1}  where  n > s + t.  Show  that  if  A 
and  B are  cross-intersecting,  in  the  sense  that  a fl  /3  / 0 for  all  a E A and  /3  € B,  then 
so  are  the  sets  Qmus  and  Qnui  defined  in  Theorem  K,  where  M = |A|  and  N = \B\. 
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74.  [M21]  What  are  \gPNt\  and  \6QNnt  \ in  Theorem  K? 

75.  [M20]  The  right-hand  side  of  (6o)  is  not  always  the  degree-(f  - 1)  combinatorial 
representation  of  KtN,  because  v - 1 might  be  zero.  Show,  however,  that  a positive 
integer  N has  at  most  two  representations  if  we  allow  v = 0 in  (57),  and  both  of  them 
yield  the  same  value  ntN  according  to  (6o).  Therefore 


' KtN  - (* U1 1)  + (r_  2)  + • ■ ■ + (fe  - 1 + „ _ t)  for  1 < fc  < f. 


76.  [M20]  Find  a simple  formula  for  nt  (N  + 1)  - ntN. 

► 77.JM26]  Prove  the  following  properties  of  the  k functions  by  manipulating  binomial 
coefficients,  without  assuming  Theorem  K: 

a)  Kt(M  + N)  < KtM  + KtN. 

b)  Kt(M  + TV)  < max(/ctM,  N)  + Kt~iN. 

Hint:  ( t‘)  + • • • + ("J1)  + ("*)  + • • ■ + (”')  is  equal  to  (miVnn  , 

(mt  Ant\  , , /miAnA  , H \ t 1 ) + 

v t ! + •••+(  x ),  where  V and  A denote  max  and  min. 

78.  [M22]  Show  that  Theorem  K follows  easily  from  inequality  (b)  in  the  previous 
exercise.  Conversely,  both  inequalities  are  simple  consequences  of  Theorem  K.  Hint- 
Any  set  A off-combinations  can  be  written  A = Ai  +Ao0,  where  Ai  = {a  € A | 0 g a}. 

79.  [MSS]  Prove  that  if  t > 2,  we  have  M > fitN  if  and  only  if  M + A t-iM  > N. 

80.  [HM26]  (L.  Lovasz,  1979.)  The  function  (*)  increases  monotonically  from  0 to  00 
as  x increases  from  t - 1 to  00;  hence  we  can  define 


if  N = ^ \ and  x > t — 1. 

Prove  that  ntN  > ntN  for  all  integers  t > 1 and  N > 0.  Hint:  Equality  holds  when  x 
is  an  integer. 

► 81.  [ M27 ] Show  that  the  minimum  shadow  sizes  in  Theorem  M are  given  by  (64). 
82.  [HM31]  The  Takagi  function  of  Fig.  47  is  defined  for  0 < x < 1 by  the  formula 


t(x) 


00  rx 

E/ 

l._r  J 0 


rk(t)  dt, 


where  rk(t)  — (-1)1-2  is  the  Rademacher  function  of  Eq.  7.2.1.1-(i6). 

a)  Prove  that  r(x)  is  continuous  in  the  interval  [0 . . 1],  but  its  derivative  does  not 
exist  at  any  point. 

b)  Show  that  r(x)  is  the  only  continuous  function  that  satisfies 

T(^x)  = t(  1 - \x)  = \x  + |r( x)  for  0 < x < 1. 

c)  What  is  the  asymptotic  value  of  r(e)  when  e is  small? 

d)  Prove  that  r( x)  is  rational  when  x is  rational. 

e)  Find  all  roots  of  the  equation  t(x)  = 1/2. 

f)  Find  all  roots  of  the  equation  r(x)  = max0<x<i  t(x). 

83  [HM46]  Determine  the  set  R of  all  rational  numbers  r such  that  the  equation 

T{x)  - r has  uncountably  many  solutions.  If  r( x)  is  rational  and  x is  irrational,  is  it 
true  that  r(x)  e R?  ( Warning:  This  problem  can  be  addictive.) 

84.  [HM27]  If  T = (2<t  x),  prove  the  asymptotic  formula 


KtN~N  = + 


for  0 < N < T. 
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85.  [HM21]  Relate  the  functions  A , N and  N to  the  Takagi  function  t(x). 

86.  [M20]  Prove  the  law  of  spread/core  duality,  X~+  = X°~. 

87.  [M21]  True  or  false:  (a)  X C Y°  if  and  only  if  C (b)  X°+°  = 1°; 

(c)  aM  < IV  if  and  only  if  M < /3N. 

88.  [M20]  Explain  why  cross  order  is  useful,  by  completing  the  proof  of  Lemma  S. 

89.  [16]  Compute  the  a and  (3  functions  for  the  2x2x3  torus  (69). 

90.  [M22]  Prove  the  basic  compression  lemma,  (85). 

91.  [M24]  Prove  Theorem  W for  two-dimensional  toruses  T(l,m),  l <m. 

92.  [M28]  Let  x = x\  . . . x„-i  be  the  IVth  element  of  the  torus  T(mi, . . . , mn_  1),  and 
let  5 be  the  set  of  all  elements  of  T(m\, . . . ,mn-i,m)  that  are  X x\  . . . xn-i(m— 1) 
in  cross  order.  If  Na  elements  of  S have  final  component  a,  for  0 < a < m,  prove 
that  Nm- 1 = N and  Na-\  — aNa  for  1 < a < m,  where  a is  the  spread  function  for 
standard  sets  in  T(mi, . . . , m„- 1). 

93.  [M25]  (a)  Find  an  N for  which  the  conclusion  of  Theorem  W is  false  when  the 
parameters  mi,  m2,  . . . , mn  have  not  been  sorted  into  nondecreasing  order,  (b)  Where 
does  the  proof  of  that  theorem  use  the  hypothesis  that  mi  < m2  < • • • < mn? 

94.  [M20]  Show  that  the  d half  of  Corollary  C follows  from  the  q half.  Hint:  The 
complements  of  the  multicombinations  (92)  with  respect  to  U are  3211,  3210,  3200, 
3110,  3100,  3000,  2110,  2100,  2000,  1100,  1000. 

95.  [17]  Explain  why  Theorems  K and  M follow  from  Corollary  C. 

► 96.  [M22]  If  S is  an  infinite  sequence  (so,  Si,  S2, . . . ) of  positive  integers,  let 

(Sin))  = [*‘]  n(i  + * + ---  + ^); 

i= 0 

thus  (S^)  is  the  ordinary  binomial  coefficient  (£)  if  so  = si  = s2  = • • • = 1. 

Generalizing  the  combinatorial  number  system,  show  that  every  nonnegative  inte- 
ger N has  a unique  representation 

Ms,rlM?-7))+---+rr)) 

where  nt  > 1 > • • • > nx  > 0 and  { , . . . , ni}  C {so  • 0,  si  • 1,  s2  • 2, . . . }.  Use 

this  representation  to  give  a simple  formula  for  the  numbers  \dPfjt  \ in  Corollary  C. 

► 97.  [M26]  The  text  remarked  that  the  vertices  of  a convex  polyhedron  can  be  per- 
turbed slightly  so  that  all  of  its  faces  are  simplexes.  In  general,  any  set  of  combinations 
that  contains  the  shadows  of  all  its  elements  is  called  a simplicial  complex',  thus  C is  a 
simplicial  complex  if  and  only  if  a C f3  and  /3  G C implies  that  a G C,  if  and  only  if 
C is  an  order  ideal  with  respect  to  set  inclusion. 

The  size  vector  of  a simplicial  complex  Conn  vertices  is  (No,  Ni, , Nn)  when 
C contains  exactly  Nt  combinations  of  size  t. 

a)  What  are  the  size  vectors  of  the  five  regular  solids  (the  tetrahedron,  cube,  octa- 
hedron, dodecahedron,  and  icosahedron),  when  their  vertices  are  slightly  tweaked? 

b)  Construct  a simplicial  complex  with  size  vector  (1,4,  5,2,0). 

c)  Find  a necessary  and  sufficient  condition  that  a given  size  vector  (No,Ni, . . . , Nn) 
is  feasible. 

d)  Prove  that  (No, . . . , Nn)  is  feasible  if  and  only  its  “dual”  vector  (No,  ■ ■ ■ , N„)  is 
feasible,  where  we  define  Nt  = (")  — Nn~t- 
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e)  List  all  feasible  size  vectors  (N0,  Nu  N2,  N3,  N4)  and  their  duals.  Which  of  them 
are  self-dual? 

98  [30]  Continuing  exercise  97,  find  an  efficient  way  to  count  the  feasible  size  vectors 
(iVo,  Ni, . . .,Nn)  when  n < 100. 

99  [M25]  A clutter  is  a set  C of  combinations  that  are  incomparable,  in  the  sense 

that  a C and  a,/3  € C implies  a = /3.  The  size  vector  of  a clutter  is  defined  as  in 
exercise  97. 

a)  Find  a necessary  and  sufficient  condition  that  (M0,  Mu  . . . , M„)  is  the  size  vector 
ot  a clutter. 

b)  List  all  such  size  vectors  in  the  case  n = 4. 

► 100 .[M30]  (Clements  and  Lindstrdm.)  Let  A be  a “simplicial  multicomplex,”  a set 
of  submultisets  of  the  multiset  U in  Corollary  C with  the  property  that  dA  C A.  How 
large  can  the  total  weight  vA  = I a € A}  be  when  |A|  = N? 

101.  [M25]  K f(Xl  xn)  is  a Boolean  formula,  let  F(p)  be  the  probability  that 

J(xi , x n)  - i when  each  variable  x3  independently  is  1 with  probability  p 

a)  Calculate  G(p)  and  H(p)  for  the  Boolean  formulas  g(w,  x,  y,  z)  = WxzVwyzVxyz 

h(w,  x,  y,  z)  = wyz  V xyz.  y ’ 

b)  Show  that  there  is  a monotone  Boolean  function  f(w,x,y,z)  such  that  Ftp)  = 

G(p)  but  there  is  no  such  function  with  F(p)  = H{p).  Explain  how  to  test  this 
condition  in  general. 

102.  [HM35]  (F.  S.  Macaulay,  1927.)  A polynomial  ideal  I in  the  variables  {x\  x } 
is  a set  of  polynomials  closed  under  the  operations  of  addition,  multiplication  by  a 
constant,  and  multiplication  by  any  of  the  variables.  It  is  called  homogeneous  if  it 
consists  of  all  linear  combinations  of  a set  of  homogeneous  polynomials,  namely  of 
polynomials  like  xy  + z whose  terms  all  have  the  same  degree.  Let  Nt  be  the  maximum 
number  of  linearly  independent  elements  of  degree  t in  /.  For  example,  if  ,s  = 2 
the  set  of  all  a(xo  Xl,x2)(x0xj  - 2*1*2)  + f}(xo,xuxa)xoXlxl  where  a and  0 run 
!hhU^  AtP°SS1A  6 P?  y,'?rnialS  ” (x°’x'’x*h  is  a homogeneous  polynomial  ideal 

a)  Prove  that  for  any  such  ideal  / there  is  another  ideal  I'  in  which  all  homogeneous 
polynomials  of  degree  t are  linear  combinations  of  Nt  independent  monomials 
(A  monomial  is  a product  of  variables,  like  x\x2x\.) 

b)  Use  Theorem  M and  (64)  to  prove  that  Nt+1  > Nt  + KsNt  for  all  t > 0. 

c)  Show  that  Nt+l  > Nt  + KsNt  occurs  for  only  finitely  many  t.  (This  statement 

irreT1Vfent/t0  Hllbert  S basis  theorem-”  Proved  by  David  Hilbert  in  Gottinger 
Nachnchten  (1888),  450-457;  Math.  Annalen  36  (1890),  473-534.) 

► i03.  [MSB]  The  shadow  of  a subcube  <h  ...an,  where  each  0j  is  either  0 or  1 or  * is 
obtained  by  replacing  some  * by  0 or  1.  For  example, 

d0*ll*0  = {0011*0,0111*0,0*1100,0*1110}. 

f^eHslt^l  >^’  ^ ^ ^ ^ S6t  °f  ^ SUbCUb6S  - ' ■ - haVi“^  ^ 

104.  [M41]  The  shadow  of  a binary  string  a,  . . . an  is  obtained  by  deleting  one  of  its 
bits,  lor  example, 

<9110010010  = {10010010,  11010010,  11000010,  11001000,  11001010,  11001001}. 

Find  a set  PNn  such  that,  if  A is  any  set  of  N binary  strings  ai  . . . an,  \<)A\  > |Pjvn|. 
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105.  [M20]  A universal  cycle  of  t- combinations  for  {0,1,...,  ra  — 1}  is  a cycle  of 
(")  numbers  whose  blocks  of  t consecutive  elements  run  through  every  f-combination 
{ci , . . . , ct } . For  example, 

(02145061320516243152630425364103546) 

is  a universal  cycle  when  t — 3 and  n = 7. 

Prove  that  no  such  cycle  is  possible  unless  (”)  is  a multiple  of  n. 

106.  [M21]  (L.  Poinsot,  1809.)  Find  a “nice”  universal  cycle  of  2-combinations  for 
{0, 1, ... , 2m}.  Hint:  Consider  the  differences  of  consecutive  elements,  mod  (2m.  + 1). 

107.  [22]  (O.  Terquem,  1849.)  Poinsot ’s  theorem  implies  that  all  28  dominoes  of  a 
traditional  “double-six”  set  can  be  arranged  in  a cycle  so  that  the  spots  of  adjacent 
dominoes  match  each  other: 


How  many  such  cycles  are  possible? 

108.  [M31]  Find  universal  cycles  of  3-combinations  for  the  sets  {0, . . . , n — 1}  when 
n mod  3/0. 

109.  [M31]  Find  universal  cycles  of  3-multicombinations  for  {0, 1,  ...,n—  1}  when 
n mod  3/0  (namely  for  combinations  did2d3  with  repetitions  permitted).  For  exam- 
ple, 

(00012241112330222344133340024440113) 
is  such  a cycle  when  n = 5. 

► 110.  [26]  Cribbage  is  a game  played  with  52  cards,  where  each  card  has  a suit  (4k,  <>, 
Q,  or  6)  and  a face  value  (A,  2,  3,  4,  5,  6,  7,  8,  9,  10,  J,  Q,  or  K).  Its  players  must 
become  adept  at  computing  the  score  of  a 5-card  combination  C = {ci, c2, C3, C4, C5}, 
where  one  card  Ck  is  called  the  starter.  The  score  is  the  sum  of  points  computed  as 
follows,  for  each  subset  5 of  C and  each  choice  of  k:  Let  |5|  = s. 

i)  Fifteens:  If  53MC)  | c € S}  = 15,  where  (v(A),  u(2),  v(3), . . . , v(9),  i>(10),  v(J), 
u(Q),v(K))  = (1,2,3,...  ,9, 10, 10, 10, 10),  score  two  points. 

ii)  Pairs:  If  s = 2 and  both  cards  have  the  same  face  value,  score  two  points. 

iii)  Runs:  If  s > 3 and  the  face  values  are  consecutive,  and  if  C does  not  contain  a 
run  of  length  s + 1,  score  s points. 

iv)  Flushes:  If  s = 4 and  all  cards  of  S have  the  same  suit,  and  if  c*,  ^ S,  score 
4 + [cfc  has  the  same  suit  as  the  others]. 

v)  Nobs:  If  s = 1 and  Ck  S,  score  1 if  the  card  is  J of  the  same  suit  as  cjt. 

For  example,  if  you  hold  {J4k,  5#,  50, 60}  and  if  44k  is  the  starter,  you  score  4x2  for 
fifteens,  2 for  a pair,  2x3  for  runs,  plus  1 for  nobs,  totalling  17. 

Exactly  how  many  combinations  and  starter  choices  lead  to  a score  of  x points, 
for  x — 0,  1,2,  . . . ? 

► 111.  [M26]  (P.  Erdos,  C.  Ko,  and  R.  Rado.)  Suppose  A is  a set  of  r-combinations  of 
an  n-set,  with  a n /?  / 0 whenever  a,p  € A.  Show  that  |A|  < ("“J),  if  r < n/2.  Hint: 
Consider  dn~~rB,  where  B is  the  set  of  complements  of  A. 
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7.2. 1.4.  Generating  all  partitions.  Richard  Stanley’s  magnificent  book  Enu- 
merative  Combinatorics  (1986)  begins  by  discussing  The  Twelvefold  Way,  a 
2x2x3  array  of  basic  combinatorial  problems  that  arise  frequently  in  practice 
(see  Table  1),  based  on  a series  of  lectures  by  Gian-Carlo  Rota.  All  twelve  of 
these  basic  problems  can  be  described  in  terms  of  the  ways  that  a given  number 
of  balls  can  be  placed  into  a given  number  of  urns.  For  example,  there  are  nine 
ways  to  put  2 balls  into  3 urns  if  the  balls  and  urns  are  labeled: 


ssi  © ®® 

A B C A R 


TsT 


i 


(0©  ,®  0) 


@® 
A B Gj 


(The  order  of  balls  within  an  urn  is  ignored.)  But  if  the  balls  are  unlabeled, 
some  of  these  arrangements  are  indistinguishable,  so  only  six  different  ways  are 
possible: 


A 


PP 


JL 


(i) 


If  the  urns  are  unlabeled,  arrangements  like  P©).  and  ©.  ® are  essentially 
the  same,  hence  only  two  of  the  original  nine  arrangements  are  distinguishable. 
And  if  we  have  three  labeled  balls,  the  only  distinct  ways  to  place  them  into 
three  unlabeled  urns  are 


Finally,  if  neither  balls  nor  urns  are  labeled,  these  five  possibilities  reduce  to  only 
three: 


(3) 


The  Twelvefold  Way  considers  all  arrangements  that  are  possible  when  balls  and 
urns  are  labeled  or  unlabeled,  and  when  the  urns  may  optionally  be  required  to 
contain  at  least  one  ball  or  at  most  one  ball. 


Table  1 

THE  TWELVEFOLD  WAY 


balls  per  urn 

unrestricted 

< 1 

> 1 

n labeled  balls, 
m labeled  urns 

n-tuples 
of  m things 

n-permutations 
of  m things 

partitions  of  {1, ...  ,n} 
into  m ordered  parts 

n unlabeled  balls, 
to  labeled  urns 

n-multicombinations 
of  m things 

n-combinations 
of  m things 

compositions  of  n 
into  to  parts 

n labeled  balls, 
m unlabeled  urns 

partitions  of  {1 n} 

into  < to  parts 

n pigeons 
into  m holes 

partitions  of  {1. . . . , n} 
into  m parts 

n unlabeled  balls, 
m unlabeled  urns 

partitions  of  n 
into  < m parts 

n pigeons 
into  m holes 

partitions  of  n 
into  to  parts 
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We’ve  learned  about  n-tuples,  permutations,  combinations,  and  composi- 
tions in  previous  sections  of  this  chapter;  and  two  of  the  twelve  entries  in  Table  1 
are  trivial  (namely  the  ones  related  to  “pigeons”).  So  we  can  complete  our 
study  of  classical  combinatorial  mathematics  by  learning  about  the  remaining 
five  entries  in  the  table,  which  all  involve  partitions. 

Let  us  begin  by  acknowledging  that  the  word  “ partition " 
has  numerous  meanings  in  mathematics. 

Any  time  a division  of  some  object  into  subobjects  is  undertaken, 

the  word  partition  is  likely  to  pop  up. 

— GEORGE  ANDREWS,  The  Theory  of  Partitions  (1976) 

Two  quite  different  concepts  share  the  same  name:  The  partitions  of  a set 
are  the  ways  to  subdivide  it  into  nonempty,  disjoint  subsets;  thus  (2)  illustrates 
the  five  partitions  of  {1,  2,  3},  namely 

{1,2,3},  {1, 2} {3} , {1, 3} {2} , { 1 } {2,  3},  {1}{2}{3}.  (4) 

And  the  partitions  of  an  integer  are  the  ways  to  write  it  as  a sum  of  positive 
integers,  disregarding  order;  thus  (3)  illustrates  the  three  partitions  of  3,  namely 

3,  2+1,  1 + 1 + 1.  (5) 

We  shall  follow  the  common  practice  of  referring  to  integer  partitions  as  simply 
“partitions,”  without  any  qualifying  adjective;  the  other  kind  will  be  called 
“set  partitions”  in  what  follows,  to  make  the  distinction  clear.  Both  kinds  of 
partitions  are  important,  so  we’ll  study  each  of  them  in  turn. 

Generating  all  partitions  of  an  integer.  A partition  of  n can  be  defined 
formally  as  a sequence  of  nonnegative  integers  01  > a2  > • • • such  that  n — 
ai  + a2  + • • • ; for  example,  one  partition  of  7 has  m = a2  = 3,  a3  = 1,  and 
a4  = a5  = • • • = 0.  The  number  of  nonzero  terms  is  called  the  number  of  parts, 
and  the  zero  terms  are  usually  suppressed.  Thus  we  write  7 = 3 + 3 + 1,  or 
simply  331  to  save  space  when  the  context  is  clear. 

The  simplest  way  to  generate  all  partitions,  and  one  of  the  fastest,  is  to  visit 
them  in  reverse  lexicographic  order,  starting  with  ‘n’  and  ending  with  ‘11 ...  1’. 
For  example,  the  partitions  of  8 are 

8,  71,  62,  611,  53,  521,  5111,  44,  431,  422,  4211,  41111,  332,  3311, 

3221,  32111,  311111,  2222,  22211,  221111,  2111111,  11111111,  ^ 

when  listed  in  this  order. 

If  a partition  isn’t  all  Is,  it  ends  with  (x+1)  followed  by  zero  or  more  Is, 
for  some  x > 1;  therefore  the  next  smallest  partition  in  lexicographic  order 
is  obtained  by  replacing  the  suffix  (x+l)l  ...  1 by  x . . ,xr  for  some  appropri- 
ate remainder  r < x.  The  process  is  quite  efficient  if  we  keep  track  of  the 
largest  subscript  q such  that  aq  ^ lt  as  suggested  by  J.  K.  S.  McKay  [CACM 
13  (1970),  52],  and  pad  the  array  with  Is  as  suggested  by  A.  Zoghbi  and 
I.  Stojmenovic  [ International  Journal  of  Computer  Math.  70  (1998),  319-332]: 
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Algorithm  P ( Partitions  of  n in  reverse  lexicographic  order).  Given  an  integer 
n > 1,  this  algorithm  generates  all  partitions  ax  > a2  > • • • > am  > 1 with 
“i  + 02  + • ' • + am  = n and  1 < m < n.  The  value  of  a0  is  also  set  to  zero. 

PI.  [Initialize.]  Set  am  4—  1 for  n > m > 1.  Then  set  m 4—  1 and  ao  4—  0. 

P2.  [Store  the  final  part.]  Set  am  4—  n and  q 4—  m — [n  — 1], 

P3.  [Visit.]  Visit  the  partition  aia 2 . . . am.  Then  go  to  P5  if  aq  ^ 2. 

P4.  [Change  2 to  1+1.]  Set  aq  4-  1,  q 4-  q - 1,  m 4-  m + 1,  and  return  to  P3. 
(At  this  point  we  have  ak  = 1 for  q < k < n.) 

P5.  [Decrease  aq.}  Terminate  the  algorithm  if  q = 0.  Otherwise  set  x 4-  aq  - 1, 
aq  4 — or,  n 4—  m — <7  + 1 , and  m 4—  q + 1 . 

P6.  [Copy  a;  if  necessary.]  If  n < x,  return  to  step  P2.  Otherwise  set  am  4-  x, 
w<-m+l,tn-n-i:  and  repeat  this  step.  | 

Notice  that  the  operation  of  going  from  one  partition  to  the  next  is  particularly 
easy  if  a 2 is  present;  then  step  P4  simply  changes  the  rightmost  2 to  a 1 and  ap- 
pends another  1 at  the  right.  This  happy  situation  is,  fortunately,  the  most  com- 
mon case.  For  example,  nearly  79%  of  all  partitions  contain  a 2 when  n = 100. 

Another  simple  algorithm  is  available  when  we  want  to  generate  all  partitions 
of  n into  a fixed  number  of  parts.  The  following  method,  which  was  featured 
111  C.  F.  Hindenburg’s  18th-century  dissertation  [Intinitmomii  Dignitatum  Ex- 
ponents Indeterminati  (Gottingen,  1779),  73-91],  visits  the  partitions  in  colex 
order,  namely  in  lexicographic  order  of  the  reflected  sequence  oTO  . . . a2ax : 

Algorithm  H ( Partitions  of  n into  m parts).  Given  integers  n > m > 2,  this 
algorithm  generates  all  integer  m-tuples  ax  . . . am  such  that  ax  > ■ ■ ■ > am  > 1 
and  ai  H h am  = n.  A sentinel  value  is  stored  in  am+1. 

HI.  [Initialize.]  Set  ax  4-  n - m + 1 and  a3  4-  1 for  1 < j < m.  Also  set 

am+l  < 1. 


H2.  [Visit.]  Visit  the  partition  ax  . . . am.  Then  go  to  H4  if  a2  > ax  — 1. 
H3.  [Tweak  ax  and  a2.]  Set  ax  4-  ax  — 1,  a2  4-  a2  + 1,  and  return  to  H2. 


H4.  [Find  j .]  Set  j 4—3  and  s 4—  ax  + a2  — 1.  Then,  while  aj  > ax  — 1, 


s 4-  s + aj  and  j 4-  j + 1.  (Now  s = ax  + • • • + 


aj- 1 


H5.  [Increase  aj.]  Terminate  if  j > m.  Otherwise  set 


1. 


set 

• 1 and  aj  < ax  - 1.) 

1,  aj  4—  x, 


H6.  [Tweak  ax  . . . aj]  While  j > 1,  set  a,  4-  x,  s 4-  s - a:,  and  j 4-  j - 1 
Finally  set  ax  4—  s and  return  to  H2.  | 


For  example,  when  n — 11  and  m — 4 the  successive  partitions  visited  are 

8111,  7211,  6311,  5411,  6221,  5321,  4421,  4331,  5222,  4322,  3332.  (7) 

The  basic  idea  is  that  colex  order  goes  from  one  partition  ax  . . . am  to  the  next  by 
finding  the  smallest  j such  that  aj  can  be  increased  without  changing  a,+x  a 
The  new  partition  a\  . . . a'm  will  have  a'x  >•••>«'  = Uj  + 1 and  < + ■■■  + a'  = 
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+ • • • + a,j,  and  these  conditions  are  achievable  if  and  only  if  a}  < a,  — 1. 
Furthermore,  the  smallest  such  partition  a[  . . . a'm  in  colex  order  has  a'2  = ■ ■ ■ = 

a'  = cij  + 1. 

Step  H3  handles  the  simple  case  j = 2,  which  is  by  far  the  most  common. 
And  indeed,  the  value  of  j almost  always  turns  out  to  be  quite  small;  we  will 
prove  later  that  the  total  running  time  of  Algorithm  H is  at  most  a small  constant 
times  the  number  of  partitions  visited,  plus  O(m). 

Other  representations  of  partitions.  We’ve  defined  a partition  as  a sequence 

of  nonnegative  integers  axa 2 . . . with  aq  > a2  > • • • and  <q  + o2  H = n,  but 

we  can  also  regard  it  as  an  n-tuple  of  nonnegative  integers  cjc2  . . . c„  such  that 


Ci  + 2c2  + • • • + ncn  = n.  (8) 

Here  Cj  is  the  number  of  times  the  integer  j appears  in  the  sequence  aia2  . . . ; 
for  example,  the  partition  331  corresponds  to  the  counts  ci  = 1,  c2  = 0,  c3  = 2, 

C4  = c5  = c6  = c7  = 0.  The  number  of  parts  is  then  cx  +c2H |-c„.  A procedure 

analogous  to  Algorithm  P can  readily  be  devised  to  generate  partitions  in  part- 
count  form;  see  exercise  5. 

We  have  already  seen  the  part-count  representation  implicitly  in  formulas 
like  Eq.  1.2.9-(38),  which  expresses  the  symmetric  function 


as 


h 


n 


E 


%di  %d2  • • • xdn 


N>dn>->d2>d1>l 


(9) 


E 

ci,c2,...,cn>0 
c i+2c2H hnc„=n 


Sj1 

lClCi! 


2C2  c2 ! 


(10) 


where  Sj  is  the  symmetric  function  x{  + x]2  + • • • + xJN.  The  sum  in  (9)  is 
essentially  taken  over  all  n-multicombinations  of  N things,  while  the  sum  in  (10) 
is  taken  over  all  partitions  of  n.  Thus,  for  example,  h3  = + |S,1S2  + |53, 

and  when  N = 2 we  have 


x3  + x2y  + xy2  + y3  = ±(x  + y)3  + \{x  + y){x2  + y2)  + i(z3  + y3). 

Other  sums  over  partitions  appear  in  exercises  1.2.5-21,  1.2.9-10,  1.2.9-11, 
1.2.10-12,  etc.;  for  this  reason  partitions  are  of  central  importance  in  the  study  of 
symmetric  functions,  a class  of  functions  that  pervades  mathematics  in  general. 
[Chapter  7 of  Richard  Stanley’s  Enumerative  Combinatorics  2 (1999)  is  an 
excellent  introduction  to  advanced  aspects  of  symmetric  function  theory.] 

Partitions  can  be  visualized  in  an  appealing  way  by  considering  an  array 
of  n dots,  having  ax  dots  in  the  top  row  and  a2  in  the  next  row,  etc.  Such  an 
arrangement  of  dots  is  called  the  Ferrers  diagram  of  the  partition,  in  honor  of 
N.  M.  Ferrers  [see  Philosophical  Mag.  5 (1853),  199-202];  and  the  largest  square 
subarray  of  dots  that  it  contains  is  called  the  Durfee  square , after  W.  P.  Durfee 
[see  Johns  Hopkins  Univ.  Circular  2 (December  1882),  23].  For  example,  the 
Ferrers  diagram  of  8887211  is  shown  with  its  4 x 4 Durfee  square  in  Fig.  48(a). 
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.......  III!  Tig.  48.  The  Ferrers 

• • ....  diagrams  and  Durfee 

* «...  squares  of  two  conju- 

. * * gate  partitions. 

(a)  8887211  (b)  75444443 

The  Durfee  square  contains  k2  dots  when  k is  the  largest  subscript  such  that 
ak>k;  we  may  call  k the  trace  of  the  partition. 

If  a is  any  partition  aia2  . . . , its  conjugate  aT  = bxb2  ...  is  obtained  by 
transposing  its  Ferrers  diagram -that  is,  by  reflecting  the  diagram  about  the 
mam  diagonal.  For  example,  Fig.  48(b)  shows  that  (8887211)T  = 75444443 
When  /3  = a we  obviously  have  a = /3T;  the  partition  /?  has  ax  parts  and  a has 
i parts.  Indeed  there’s  a simple  relation  between  the  part-count  representation 
ci ...  cn  of  a and  the  conjugate  partition  bxb2  . . . , namely 

bj  ~bj+ 1 = c,j  for  all  j > 1.  (1:i) 

This  relation  makes  it  easy  to  compute  the  conjugate  of  a given  partition,  or  to 
write  it  down  by  inspection  (see  exercise  6). 

The  notion  of  conjugation  often  explains  properties  of  partitions  that  would 
otherwise  be  quite  mysterious.  For  example,  now  that  we  know  the  definition 
ol  a we  can  easily  see  that  the  value  of  j - 1 in  step  H5  of  Algorithm  H is 
just  the  second-smallest  part  of  the  conjugate  partition  (ax...  am)T,  if  m < n. 

herefore  the  average  amount  of  work  that  needs  to  be  done  in  steps  H4  and  H6 
is  essentially  proportional  to  the  average  size  of  the  second-smallest  part  of  a 
random  partition  whose  largest  part  is  to.  And  we  will  see  below  that  the 
second-smallest  part  is  almost  always  quite  small. 

Moreover,  Algorithm  H produces  partitions  in  lexicographic  order  of  their 
conjugates.  For  example,  the  respective  conjugates  of  (7)  are 

41111111,  4211111,  422111,  42221,  431111. 

43211,  4322,  4331,  44111,  4421,  443;  (12) 

these  are  the  partitions  of  n = 11  with  largest  part  4.  One  way  to  generate  all 
partitions  of  n is  to  start  with  the  trivial  partition  V,  then  run  Algorithm  H for 
m - 2,  3,  . . . , n m turn;  this  process  yields  all  a in  lexicographic  order  of  nT 
(see  exercise  7).  Thus  Algorithm  H can  be  regarded  as  a dual  of  Algorithm  P. 

There  is  at  least  one  more  useful  way  to  represent  partitions,  called  the  rim 
representation  [see  S.  Comet,  Numer.  Math.  1 (1959),  90-109]  Suppose  we  re 
place  the  dots  of  a Ferrers  diagram  by  boxes,  thereby  obtaining  a tableau  shape  as 
we  did  m Section  5.1.4;  for  example,  the  partition  8887211  of  Fig.  48(a)  becomes 
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The  right-hand  boundary  of  this  shape  can  be  regarded  as  a path  of  length  2 n 
from  the  lower  left  corner  to  the  upper  right  corner  of  an  n x n square,  and  we 
know  from  Table  7.2. 1.3-1  that  such  a path  corresponds  to  an  (n,  n)-combination. 
For  example,  (13)  corresponds  to  the  70-bit  string 

0. . .01001011111010001 ...  1 = 0281102110115011103127,  (14) 

where  we  place  enough  0s  at  the  beginning  and  Is  at  the  end  to  make  exactly 
n of  each.  The  0s  represent  upward  steps  of  the  path,  and  the  Is  represent 
rightward  steps.  It  is  easy  to  see  that  the  bit  string  defined  in  this  way  has 
exactly  n inversions;  conversely,  every  permutation  of  the  multiset  {n  ■ 0,  n ■ 1} 
that  has  exactly  n inversions  corresponds  to  a partition  of  n.  When  the  number 
of  distinct  parts  of  a partition  is  equal  to  t,  its  bit  string  can  be  written  in  the 
form 

Qn-qi-92 9t  4P1  Q91  4P2  Q?2  jPt  Q9t  ]n-pi -p2 Pt 

where  the  exponents  pj  and  qj  are  positive  integers.  Then  the  partition’s  stan- 
dard representation  is 

aia2  ■ ■ ■ = (pH fPt)9‘(pH fpt-i)9*-1  . . . (pi)qi,  (16) 

namely  (1+1-F5-Fl)3(l+1+5)1(1-Fl)1(l)2  = 8887211  in  our  example. 

The  number  of  partitions.  Inspired  by  a question  that  was  posed  to  him 
by  Philippe  Naude  in  1740,  Leonhard  Euler  wrote  two  fundamental  papers 
in  which  he  counted  partitions  of  various  kinds  by  studying  their  generating 
functions  [Commentarii  Academiae  Scientiarum  Petropolitanae  13  (1741),  64- 
93;  Novi  Comment.  Acad.  Sci.  Pet.  3 (1750),  125-169].  He  observed  that  the 
coefficient  of  z"  in  the  infinite  product 

(l  + z + z2-| \-zj  - 1 )(1  + z2  + z4H |-z2fc-| )(1  + z3  + z6H h z3l-\ ) . . . 

is  the  number  of  nonnegative  integer  solutions  to  the  equation  j+2k+3l-\ — n; 

and  1 + zm  + z2m  + ■■■  is  1/(1  - zm).  Therefore  if  we  write 

OO  1 OO 

p(z)  = n = &(«)*">  (w) 

m= 1 n= 0 

the  number  of  partitions  of  n is  p(n).  This  function  P(z)  turns  out  to  have  an 
amazing  number  of  subtle  mathematical  properties. 

For  example,  Euler  discovered  that  massive  cancellation  occurs  when  the 
denominator  of  P(z)  is  multiplied  out: 

(l-z)(l-z2)(l-z3) ...  =l-z-z2  + z5  + z7  -z12-z15  + z22  + z26---- 

= (~Dnz{3n2+n)/2-  (18) 

— oo<n<oo 

A combinatorial  proof  of  this  remarkable  identity,  based  on  Ferrers  diagrams, 
appears  in  exercise  5.1.1-14;  we  can  also  prove  it  by  setting  u — z and  v = z2  in 
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the  even  more  remarkable  identity  that  Jacobi  published  in  1829, 

OO 

TT  00 

= v (-d"„(s)„(t),  (l9) 

k=1  n=-oo 

because  the  left-hand  side  becomes  n^Li (1  - z3k~2)(  1 - z3fc_1)(l  - z3fc\.  gee 
exercise  5.1.1-20.  Euler  pointed  out  that,  because  of  (18),  the  partition  numbers 
tor  n > 0 satisfy  the  unusual  recurrence 

p{n)  = p(n- 1)  + p(n- 2)  - p(n- 5)  - p(n- 7)  + p(n-12)  + p(n-15) , (20) 

with  p(k)  = 0 when  k < 0;  this  recurrence  allows  us  to  compute  their  values 
more  rapidly  than  by  performing  the  power  series  calculations  in  (17): 

n = 0 1 2 3 4 5 6 7 8 9 10  11  12  13  14  15 

p(n)  =1  1 2 3 5 7 11  15  22  30  42  56  77  101  135  176 

We  know  from  Section  1.2.8  that  solutions  to  the  Fibonacci  recurrence 
/ M - /(n  - 1)  + f(n  - 2)  grow  exponentially,  with  /(n)  = 0(0")  when  /( 0) 
and  /( 1)  are  positive.  The  additional  terms  p(n-5)  - p(n-7)’  in  (20)  have  a 
dampening  effect  on  partition  numbers,  however;  in  fact,  if  we  were  to  stop  the 
recurrence  there,  the  resulting  sequence  would  oscillate  between  positive  and  neg- 
ative values.  Further  terms  l+p(n-12)  +p(n-15)’  reinstate  exponential  growth. 

The  actual  growth  rate  of  p(n)  turns  out  to  be  of  order  A^/n  for  a certain 
constant  A.  For  example,  exercise  33  proves  directly  that  p(n)  grows  at  least  as 

as  e /n'  ^nd  one  fairly  easy  way  to  obtain  a decent  upper  bound  is  to 
take  logarithms  in  (17), 


21 


In 


P(e-‘)=  •£ 


1 00  OO  m 

in  p(z)  = y in-J_  = yy^l 
y i~zm  n ’ 

m~ 1 771=1  71=1 

and  then  to  look  at  the  behavior  near  z = 1 by  setting  z = e~l  with  t > 0: 

= rl  J < ^ 1 _ c(2) 

Z-s  n etn  __  1 < n2t  t ■ (22) 

< • • • and  et  > 1,  we  have 

00  OO 

= ^2p(n)e{n~k)t  < Ylp{k)e(n~k^  = entP(e~t)  < eni+c(2)/‘  (23) 

k=n  fc=0 

for  all  t > 0.  Setting  t = y 1 £( 2)/n  gives 


m,n>l  „>1  - n>1 

Consequently,  since  p(n)  < p(n  + 1)  < p(n  + 2)  <-  • ■ • 

P(n) 

1 — e~ 


p(n)  < C,e2C'/"/y^,  where  C - y/C(2)  = n/y/6.  (24) 

We  can  obtain  more  accurate  information  about  the  size  of  lnP(e_t)  by 
using  Euler’s  summation  formula  (Section  1.2.11.2)  or  Mellin  transforms  (Sec- 
tion 5.2.2);  see  exercise  25.  But  the  methods  we  have  seen  so  far  aren’t  powerful 
enough  to  deduce  the  precise  behavior  of  P(e-‘),  so  it  is  time  for  us  to  add  a 
new  weapon  to  our  arsenal  of  techniques. 
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Euler’s  generating  function  P{z)  is  ideally  suited  to  the  Poisson  summation 
formula  [J.  Ecole  Royale  Polytechnique  12  (1823),  404-509,  §63],  according  to 
which 


f(n  + d ) 

n=  — oo 


lim 

M— »oo 


M 

^ ^ ^ 2irmid 

m——M 


e~2™'yf(y)dy , 


(25) 


whenever  / is  a “well-behaved”  function.  This  formula  is  based  on  the  fact 
that  the  left-hand  side  is  a periodic  function  of  9 , and  the  right-hand  side  is  the 
expansion  of  that  function  as  a Fourier  series.  The  function  / is  sufficiently  nice 
if,  for  example,  /^0|/(y)|  dy  < 00  and  either 

i)  f(n  + 9)  is  an  analytic  function  of  the  complex  variable  9 in  the  region 
|Sr#|  < e for  some  e > 0 and  0 < R9  < 1 and  every  n,  and  the  left-hand  side 
of  (25)  converges  uniformly  for  |30|  < e;  or 


ii)  /(#)  — 5 lime->-o(/(#  — e)  + f(8  + e))  = g{9)  — h(9)  for  all  real  numbers  9, 
where  g and  h are  monotone  increasing  and  g(± 00),  h(± 00)  are  finite. 

[See  Peter  Henrici,  Applied  and  Computational  Complex  Analysis  2 (New  York: 
Wiley,  1977),  Theorem  10. 6e.]  Poisson’s  formula  is  not  a panacea  for  summation 
problems  of  every  kind;  but  when  it  does  apply  the  results  can  be  spectacular, 
as  we  will  see. 

Let  us  multiply  Euler’s  formula  (18)  by  z1/24  in  order  to  “complete  the 
square” : 


zl/24 

P(z) 


OO 

J2  (-l)n25("+5)2. 

71—  — OO 


(26) 


Then  for  all  t > 0 we  have  e */24/P(e  *)  = X^-00  /(n)>  where 


f(y)  = e-i  ‘(y+|)2+-y;  (2?) 

and  this  function  / qualifies  for  Poisson’s  summation  formula  under  both  of  the 
criteria  (i)  and  (ii)  stated  above.  Therefore  we  try  to  integrate  e~2nmiy  f (y) , and 
that  integral  turns  out  to  be  easy  for  all  m (see  exercise  27): 


e~a(y+b)2+2ciy  jy 


—c2/a—2bci 


when  a > 0. 


(28) 


Plugging  in  to  (25),  with  9 = 0,  a = §f,  b = |,  and  c = (|  - m)7T,  yields 

OO  OO 

X!  /(n)=  g(m)  = 

71=  — OO  771=  — OO 


-2(m-i)27r2/(3t)  + ^ 


(29) 


These  terms  combine  and  cancel  beautifully,  as  shown  in  exercise  27,  giving 


0-t/ 24 


P(t 


£ (-!)"<! 


-67T2(71+i)2/t 


27 r e <d2)/t 


t P(e_47r2/t) ' 


(30) 


Surprise!  We  have  proved  another  remarkable  fact  about  P(z): 
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Theorem  D.  The  generating  function  (17)  for  partitions  satisfies  the  functional 
relation 

lnP(e-)  = ® + I,n  A _ 1+  (3l) 

when  Kit  > 0.  | 

This  theorem  was  discovered  by  Richard  Dedekind  [Crelle  83  (1877),  265-292 
§6],  who  wrote  t?(t)  for  the  function  z1/2i/P(z)  when  2 = e2™;  his  proof  was 
based  on  a much  more  complicated  theory  of  elliptic  functions.  Notice  that  when 
t is  a small  positive  number,  lnP(e~**2/t)  is  extremely  tiny;  for  example,  when 
t = 0.1  we  have  exp(-47r2/t)  » 3.5  x 10-172.  Therefore  Theorem  D tells  us 
essentially  everything  we  need  to  know  about  the  value  of  P(z)  when  2 is  near  1. 

G.  H.  Hardy  and  S.  Ramanujan  used  this  knowledge  to  deduce  the  asymp- 
totic behavior  of  p(n)  for  large  n,  and  their  work  was  extended  many  years  later 
by  Hans  Rademacher,  who  discovered  a series  that  is  not  only  asymptotic  but 
convergent  [Proc.  London  Math.  Soc.  (2)  17  (1918),  75-115;  43  (1937),  241- 
254],  The  Hardy- Ramanujan-Rademacher  formula  for  pin)  is  surely  one  of  the 
most  astonishing  identities  ever  discovered;  it  states  that 


P(n ) = 


25/433/4(n_  ^24)3/4 


E 

k=i 


Ak(n) 


l3/2 


llVn-1/24).  (32) 


Here  I3/2  denotes  the  modified  spherical  Bessel  function 


©3/2  00 

E 


(z2/  4)a 


t'o  + 5/2)  k\ 
and  the  coefficient  Ak{n)  is  defined  by  the  formula 


2z  ( cosh  z 
7 r 


sinh  2: 


(33) 


^»)  = E1^M2Kt-?))  (34) 

where  o(h,k,  0)  is  the  Dedekind  sum  defined  in  Eq.  3.3.3-(i6).  We  have 

Mn)  = l,  A2(n)  = (-1)",  A3{n)  = 2 cos  — 4n  + 1)?r ; (35) 

18 

and  in  general  Ak(n)  lies  between  -k  and  k. 

A proof  of  (32)  would  take  us  far  afield,  but  the  basic  idea  is  to  use  the 
“saddle  point  method”  discussed  in  Section  7.2.I.5.  The  term  for  k = 1 is  derived 
from  the  behavior  of  P(z)  when  2 is  near  1;  and  the  next  term  is  derived  from 
the  behavior  when  2 is  near  -1,  where  a transformation  similar  to  (31)  can  be 
applied.  In  general,  the  fcth  term  of  (32)  takes  account  of  the  way  P(z)  behaves 
when  2 approaches  e2™h'k  for  irreducible  fractions  h/k  with  denominator  k: 
every  Mi  root  of  unity  is  a pole  of  each  of  the  factors  1/(1  - zk),  1/(1  - z2k). 
1/(1  ~ z3k),  • ■ • in  the  infinite  product  for  P(z). 
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The  leading  term  of  (32)  can  be  simplified  greatly,  if  we  merely  want  a rough 
approximation: 

■n^/ln/Z 

P(n)  = TnV 3 (1  + Q(n~1/2))-  (36) 

Or,  if  we  choose  to  retain  a few  more  details, 


For  example,  p(100)  has  the  exact  value  190,569,292;  formula  (36)  tells  us  that 
p(100)  « 1.993  x 108,  while  (37)  gives  the  far  better  estimate  190,568,944.783. 

Andrew  Odlyzko  has  observed  that,  when  n is  large,  the  Hardy-Ramanujan- 
Rademacher  formula  actually  gives  a near-optimum  way  to  compute  the  precise 
value  of  p(n),  because  the  arithmetic  operations  can  be  carried  out  in  nearly 
0(l°gp(n))  = O (n1/2)  steps.  The  first  few  terms  of  (32)  give  the  main  contri- 
bution; then  the  series  settles  down  to  terms  that  are  of  order  fc-3/2  and  usually 
of  order  k~2 . Furthermore,  about  half  of  the  coefficients  Ak{n)  turn  out  to  be 
zero  (see  exercise  28).  For  example,  when  n = 106,  the  terms  for  k = 1,  2, 
and  3 are  ss  1.47  x 101107,  1.23  x 10JJ°,  and  —1.23  x 10364,  respectively.  The 
sum  of  the  first  250  terms  is  ss  1471684986 . . . 73818.01,  while  the  true  value  is 
1471684986  . . . 73818;  and  123  of  those  250  terms  are  zero. 


The  number  of  parts.  It  is  convenient  to  introduce  the  notation 


n 

TO 


(38) 


for  the  number  of  partitions  of  n that  have  exactly  to  parts.  Then  the  recurrence 


n 

TO 


n — 1 
to  — 1 


n — to 

TO 


(39) 


holds  for  all  integers  to  and  n,  because  |^_^|  counts  the  partitions  whose  smallest 
part  is  1 and  |"mm|  counts  the  others.  (If  the  smallest  part  is  2 or  more,  we  can 
subtract  1 from  each  part  and  get  a partition  of  n — m into  to  parts.)  By  similar 
reasoning  we  can  conclude  that  |m+”|  is  the  number  of  partitions  of  n into  at  most 
to  parts,  namely  into  to  nonnegative  summands.  We  also  know,  by  transposing 
Ferrers  diagrams,  that  |^|  is  the  number  of  partitions  of  n whose  largest  part 
is  to.  Thus  | "J  is  a good  number  to  know.  The  boundary  conditions 

n 

0 = Ono  and 

make  it  easy  to  tabulate  |^|  for  small  values  of  the  parameters,  and  we  obtain 
an  array  of  numbers  analogous  to  the  familiar  triangles  for  (((j , [^] , {”J,  and 
(^)  that  we’ve  seen  before;  see  Table  2.  The  generating  function  is 


= 0 for  to  < 0 or  n < 0 


(40) 


n 


(l-z)(l-z2)...(l-2m)' 


(41) 


S — ^ — * - ■ - f 
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Table  2 

PARTITION  NUMBERS 


0 

1 

2 

3 

4 

5 

6 

7 

8 

9 

10 
11 


n 

n 

n 

n 

n 

n 

n | 

n 

n , 

n 

n 

n 

0 

1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

11 

1 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 


0 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 


0 

0 

1 

1 

2 

2 

3 

3 

4 

4 

5 
5 


0 

0 

0 

1 

1 

2 

3 

4 

5 

7 

8 
10 


0 

0 

0 

0 

1 

1 

2 

3 

5 

6 
9 
11 


0 

0 

0 

0 

0 

1 

1 

2 

3 

5 

7 

10 


0 

0 

0 

0 

0 

0 

1 

1 

2 

3 

5 

7 


0 

0 

0 

0 

0 

0 

0 

1 

1 

2 

3 

5 


0 

0 

0 

0 

0 

0 

0 

0 

1 

1 

2 

3 


0 

0 

0 

0 

0 

0 

0 

0 

0 

1 

1 

2 


0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

1 

1 


0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

1 


Almost  all  partitions  of  n have  ©(V^logn)  parts.  This  fact,  discovered  by 

P.  Erdos  and  J.  Lehner  [Duke  Math.  J.  8 (1941),  335-345],  has  a very  instructive 
proof: 

Theorem  E.  Let  C = n/y/Q  and  m = ^y/^lnn  + x^  + 0(1).  Then 
1 |m  + n| 

F(x)(l  + 0(n-1/2+e)) 


m + n 
m 


p{n ) 

for  all  e > 0 and  all  fixed  x as  n 


oo,  where 


F(x)  = e /C.  (43) 

The  function  F(x)  in  (43)  approaches  0 quite  rapidly  when  x — > -00,  and  it 
rapidly  increases  to  1 when  x ->  +00;  so  it  is  a probability  distribution  function. 
Figure  49(b)  shows  that  the  corresponding  density  function  f(x)  = F'( x)  is 
largely  concentrated  in  the  region  -2  < a;  < 4.  (See  exercise  35.) 


The  values  of  I " I = 


Im-fnl 


. ...  . I m- 1 I are  shown  in  Fig.  49(a)  for  comparison 
when  n = 100;  in  this  case  —^/nlnn  & 18. 

Proof.  We  will  use  the  fact  that  |m+"|  is  the  number  of  partitions  of  n whose 
argest  part  is  < m.  Then,  by  the  principle  of  inclusion  and  exclusion,  Eq.  1.3.3- 
(29),  we  have 

m+n 

j>m  J2  >ji>m  J3>j2>ii>m 

because  p(n  — jx  jr)  is  the  number  of  partitions  of  n that  use  each  of  the 

parts  {j  1, . . . ,jr}  at  least  once.  Let  us  write  this  as 


1 


p{n) 


m+n 

m 


-l-E1  + £2-E3  + "-,  £r=  £ 


P{n~ji- 


jr) 


p(r 


(44) 
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Fig.  49.  Partitions  of  n with  m parts,  when  (a)  n = 100;  (b)  n ->  oo.  (See  Theorem  E.) 


In  order  to  evaluate  £r  we  need  to  have  a good  estimate  of  the  ratio 
p(n  — t)/p(n).  And  we’re  in  luck,  because  Eq.  (36)  implies  that 

T)(jl  — 

= exp(2 CVn  - t - ln(n  - t)  + 0((n  - t)~1/2)  - 2 C\fn  + Inn) 

= exp(-Cfn~1/2  + 0(n_1/2+2e))  if  0 < t < n1/2+e.  (45) 

Furthermore,  if  t > n1/2+e  we  have  p(n  - t)/p(n)  < p(n  - n1/2+e)/p(n)  w 
exp(-Cne),  a value  that  is  asymptotically  smaller  than  any  power  of  n.  Therefore 
we  may  safely  use  the  approximation 


P(n  ~ t) 
P(n) 


~ Ct 


t 


a — exp(— Cn  1/2), 


for  all  values  of  t > 0.  For  example,  we  have 


(46) 


j>m 


p(n  - j)  _ a 
P(n) 


m+1 


1 — a 


(l  + 0(ri 


-l/2+2e 


»+  £ 


n>j>n1/2+e 


P(n  ~ j) 

p(n) 


— Cx 

= ^-(1  + 0(n-1/2+2e))+0(ne-Cn‘), 

because  a/(l  — a)  = nl/2/C  + 0(1)  and  am  = n~1^2e^Cx  + 0(n_1).  A similar 
argument  (see  exercise  36)  proves  that,  if  r = O(logn), 


= t1  + °("',/2+2‘))  + 0(e--n).  (47) 

Finally  — and  this  is  a wonderful  property  of  the  inclusion-exclusion  princi- 
ple in  general  — the  partial  sums  of  (44)  always  “bracket”  the  true  value,  in  the 
sense  that 


1 — £1  + £2 


£27-— 1 


1 

p(n) 


m+n 

m 


< 1 — £1  + £2 


£2r-l  + £2r  (48) 


for  all  r.  (See  exercise  37.)  When  2 r is  near  Inn  and  n is  large,  the  term  £2r  is 
extremely  tiny;  therefore  we  obtain  (42),  except  with  2e  in  place  of  e.  | 
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Fig.  50.  Temperley’s  curve  (49)  for  the 
limiting  shape  of  a random  partition. 


Theorem  E tells  us  that  the  largest  part  of  a random  partition  almost 
always  is  ^y/nlnn  + 0(\/nlogloglogn),  and  when  n is  reasonably  large  the 
other  parts  tend  to  be  predictable  as  well.  Suppose,  for  example,  that  we  take 
all  the  partitions  of  25  and  superimpose  their  Ferrers  diagrams,  changing  dots 
to  boxes  as  in  the  rim  representation.  Which  cells  are  occupied  most  often? 
Figure  50  shows  the  result:  A random  partition  tends  to  have  a typical  shape 
that  approaches  a limiting  curve  as  n — > 00. 

H.  N.  V.  Temperley  [Proc.  Cambridge  Philos.  Soc.  48  (1952),  683-697] 
gave  heuristic  reasons  to  believe  that  most  parts  ak  of  a large  random  partition 
ai  . . . am  will  satisfy  the  approximate  law 


g-Cfc/VH  + e-Cak/VH  w (49) 

and  his  formula  has  subsequently  been  verified  in  a strong  form.  For  example,  a 
theorem  of  Boris  Pittel  [Advances  in  Applied  Math.  18  (1997),  432-488]  allows 
us  to  conclude  that  the  trace  of  a random  partition  is  almost  always  lj^y/n  rs 
0.54yE,  in  accordance  with  (49),  with  an  error  of  at  most  0(vn  Inn)1/2;  thus 
about  29%  of  all  the  Ferrers  dots  tend  to  lie  in  the  Durfee  square. 

If,  on  the  other  hand,  we  look  only  at  partitions  of  n with  m parts,  where 
m is  fixed,  the  limiting  shape  is  rather  different:  Almost  all  such  partitions  have 


if  m and  n are  reasonably  large.  Figure  51  illustrates  the  case  a ~ 50.  m ~ 5. 
In  fact,  the  same  limit  holds  when  m grows  with  n,  but  at  a slower  rate  than 
y/n  [see  Vershik  and  Yakubovich,  Moscow  Math.  J.  1 (2001),  457-468], 


Fig.  51.  The  limiting  shape  (50)  when  there  are  m parts. 
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The  rim  representation  of  partitions  gives  us  further  information  about  par- 
titions that  are  doubly  bounded,  in  the  sense  that  we  not  only  restrict  the  number 
of  parts  but  also  the  size  of  each  part.  A partition  that  has  at  most  m parts, 
each  of  size  at  most  Z,  fits  inside  an  m x l box.  All  such  partitions  correspond 
to  permutations  of  the  multiset  {m  -0,  / • 1}  that  have  exactly  n inversions,  and 
we  have  studied  the  inversions  of  multiset  permutations  in  exercise  5.1.2-16.  In 
particular,  that  exercise  derives  a nonobvious  formula  for  the  number  of  ways 
n inversions  can  happen: 


Theorem  C.  The  number  of  partitions  of  n that  have  no  more  than  m parts 
and  no  part  larger  than  l is 


(1  - zl+1)  (1  - zl+2)  (1  - zl+m) 

(1-2)  (1-22)  (1  — 2m) 


(5l) 


This  result  is  due  to  A.  Cauchy,  Comptes  Rendus  Acad.  Sci.  17  (Paris,  1843), 
523-531.  Notice  that  when  l -»  00  the  numerator  becomes  simply  1.  An  interest- 
ing combinatorial  proof  of  a more  general  result  appears  in  exercise  40  below.  | 


Analysis  of  the  algorithms.  Now  we  know  more  than  enough  about  the 
quantitative  aspects  of  partitions  to  deduce  the  behavior  of  Algorithm  P quite 
precisely.  Suppose  steps  PI,  . . . , P6  of  that  algorithm  are  executed  respectively 
Ti(n),  . . . , Te(n)  times.  We  obviously  have  T\[n)  = 1 and  T3(n)  = p(n);  further- 
more Kirchhoff’s  law  tells  us  that  T2(n)  = T5(n)  and  T^n)  + T5(n)  — T3(n).  We 
get  to  step  P4  once  for  each  partition  that  contains  a 2;  and  this  is  clearly  p(n— 2). 

Thus  the  only  possible  mystery  about  the  running  time  of  Algorithm  P is 
the  number  of  times  we  must  perform  step  P6,  which  loops  back  to  itself.  A 
moment’s  thought,  however,  reveals  that  the  algorithm  stores  a value  > 2 into 
the  array  ai<z2  . . . only  in  step  P2  or  when  we’ll  soon  test  n < x in  P6;  and  every 
such  value  is  eventually  decreased  by  1,  either  in  step  P4  or  step  P5.  Therefore 


T2'(n)  + T6(n)  = p(n)  - 1,  (52) 

where  T"(n)  is  the  number  of  times  step  P2  sets  am  to  a value  > 2.  Let  T2(n)  = 
T2 (n)  + T2'(n),  so  that  T2(n)  is  the  number  of  times  step  P2  sets  am  «—  1.  Then 
T2(n)  + T^(n)  is  the  number  of  partitions  that  end  in  1,  hence 


T2(n)  + T4(n)  = p(n  - 1). 


(53) 


Aha!  We’ve  found  enough  equations  to  determine  all  of  the  required  quantities: 


(Ti(n),...,T6(n))  = 

(l,  p(n)-p(n- 2),  p{n),  p(n- 2),  p(n)  - p(n- 2),  p(n-l)  - l).  (54) 


And  from  the  asymptotics  of  p(n)  we  also  know  the  average  amount  of  compu- 
tation per  partition: 


fTi(n) 

...n(n))  ~ ( 

2C 

0,  — 1.  1 - 

2 C 2 C 

+ oPi 

V p(n)  ’ 

p(n ) / \ 

\Jn 

y/n 1 sfn 

yfr)  \n) 

(55) 


where  C = 7t/\/6  « 1.283.  (See  exercise  45.)  The  total  number  of  memory 
accesses  per  partition  therefore  comes  to  only  3 + C/y/n  + 0(l/n). 
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Whoever  wants  to  go  about  generating  all  partitions 
not  only  immerses  himself  in  immense  labor, 
but  also  must  take  pains  to  keep  fully  attentive, 
so  as  not  to  be  grossly  deceived. 
LEONHARD  EULER,  De  Partitione  Numerorum  (1750) 


Algorithm  H is  more  difficult  to  analyze,  but  we  can  at  least  prove  a decent 
upper  bound  on  its  running  time.  The  key  quantity  is  the  value  of  j,  the  smallest 
subscript  for  which  dj  < ax  - 1.  The  successive  values  of  j when  m = 4 and 
n — 11  are  (2, 2, 2, 3, 2, 2, 3, 4, 2, 3, 5),  and  we  have  observed  that  j = + i 

when  bi . . . bt  is  the  conjugate  partition  (fll . . . am)T  and  rn  < n.  (See  (7)  and 
(12).)  Step  H3  singles  out  the  case  j = 2,  because  this  case  is  not  only  the  most 
common,  it  is  also  especially  easy  to  handle. 


^ Let  cm  (n)  be  the  accumulated  total  value  of  j - 1,  summed  over  all  of  the 
\m\  partitions  generated  by  Algorithm  H.  For  example,  c4(11)  = 1 + 1 + i + 2 + 

1f+ 1 + 2 + 3 + 1 + 2 + 4 = 19-  We  can  regard  cm(n)/| " | as  a good  indication 
of  the  running  time  per  partition,  because  the  time  to  perform  the  most  costly 
steps,  H4  and  H6,  is  roughly  proportional  to  j - 2.  This  ratio  cm(n)/|n|  is  not 
bounded,  because  cm(rn)  = m while  |™|  = 1.  But  the  following  theorem  shows 
that  Algorithm  H is  efficient  nonetheless: 


Theorem  H.  The  cost  measure  cm(n)  for  Algorithm  H is  at  most  3| " | + m. 
Proof  We  can  readily  verify  that  cm(n ) satisfies  the  same  recurrence  as  I" 


cm{n)  — cm_i(n  — 1)  + cm(n  — to),  form,n>l, 

if  we  artificially  define  cm(n)  = 1 when  1 < n < m;  see  (39).  But  the 
conditions  are  now  different: 


(56) 


boundary 


cm(0)  = [m  > 0]; 


co(n)  - 0. 


Table  3 shows  how  cm(n)  behaves  when  m and  n are  small. 

To  prove  the  theorem,  we  will  actually  prove  a stronger  result, 


('m  ( n ) 3 


+ 2m 


1 for  n > m > 2. 


(57) 


(58) 


Exercise  50  shows  that  this  inequality  holds  when  m < n < 2m,  so  the  proof 
will  be  complete  if  we  can  prove  it  when  n > 2m.  In  the  latter  case  we  have 


cm{n)  - ci(n  - m)  + c2(n  - m)  + c3(n  - m)  + • • • + cm(n  - m) 

< 1 + (3|n  2m|  + 3-n+m)  + (3|n + 5-n+m)  + • • ■ 

+ (3l'lmm|  +2m-l-n+m) 

= 3|  ! | +3|n2m|  + • ■• + 3|”“m|  - 3 + m2  - (m  - l)(n- m) 

= 3lml  + 2m2  ~ m - (m  - l)n  - 3 


by  induction;  and  2m2  - m - (m  - l)n  - 3 < 2m  - n - 1 because  n > 2m  + 1 . 


7.2.1.4 


GENERATING  ALL  PARTITIONS  405 


Table  3 

COSTS  IN  ALGORITHM  H 


n 

c0(n) 

ci(n) 

C2(n) 

C3(n) 

c4(n) 

cs(n) 

c6(n) 

C7(n) 

cs(n) 

c9(n) 

Cio(n) 

cn(n) 

0 

0 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

0 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

2 

0 

1 

2 

1 

1 

1 

1 

1 

1 

1 

1 

1 

3 

0 

1 

2 

3 

1 

1 

1 

1 

1 

1 

1 

1 

4 

0 

1 

3 

3 

4 

1 

1 

1 

1 

1 

1 

1 

5 

0 

1 

3 

4 

4 

5 

1 

1 

1 

1 

1 

1 

6 

0 

1 

4 

6 

5 

5 

6 

1 

1 

1 

1 

1 

7 

0 

1 

4 

7 

7 

6 

6 

7 

1 

1 

1 

1 

8 

0 

1 

5 

8 

11 

8 

7 

7 

8 

1 

1 

1 

9 

0 

1 

5 

11 

12 

12 

9 

8 

8 

9 

1 

1 

10 

0 

1 

6 

12 

16 

17 

13 

10 

9 

9 

10 

1 

11 

0 

1 

6 

14 

19 

21 

18 

14 

11 

10 

10 

11 

*A  Gray  code  for  partitions.  When  partitions  are  generated  in  part-count 
form  ci . . . c„  as  in  exercise  5,  at  most  four  of  the  cj  values  change  at  each  step. 
But  we  might  prefer  to  minimize  the  changes  to  the  individual  parts,  generating 
partitions  in  such  a way  that  the  successor  of  Gqa2  . . . is  always  obtained  by 
simply  setting  aj  <—  n3  + 1 and  a k a/t  — 1 for  some  j and  k,  as  in  the 
“revolving  door”  algorithms  of  Section  7.2. 1.3.  It  turns  out  that  this  is  always 
possible;  in  fact,  there  is  a unique  way  to  do  it  when  n = 6: 

111111,  21111,  3111,  2211,  222,  321,  33,  42,  411,  51,  6.  (59) 

And  in  general,  the  | Tn+"  partitions  of  n into  at  most  m parts  can  always  be 
generated  by  a suitable  Gray  path. 

Notice  that  a — > (3  is  an  allowable  transition  from  one  partition  to  another 
if  and  only  if  we  get  the  Ferrers  diagram  for  /?  by  moving  just  one  dot  in  the 
Ferrers  diagram  for  a.  Therefore  aT  — > ST  is  also  an  allowable  transition.  It 
follows  that  every  Gray  code  for  partitions  into  at  most  m parts  corresponds  to 
a Gray  code  for  partitions  into  parts  that  do  not  exceed  m.  We  shall  work  with 
the  latter  constraint. 

The  total  number  of  Gray  codes  for  partitions  is  vast:  There  are  52  when 
n — 7,  and  652  when  n = 8;  there  are  298,896  when  n = 9,  and  2,291,100,484 
when  n = 10.  But  no  really  simple  construction  is  known.  The  reason  is  probably 
that  a few  partitions  have  only  two  neighbors,  namely  the  partitions  dn^d  when 
1 < d < n and  d is  a divisor  of  n.  Such  partitions  must  be  preceded  and  followed 
by  {(d+l)dn/d~‘2(d—  1),  dn^d~l{d— 1)1},  and  this  requirement  seems  to  rule  out 
any  simple  recursive  approach. 

Carla  D.  Savage  [J.  Algorithms  10  (1989),  577-595]  found  a way  to  surmount 
the  difficulties  with  only  a modest  amount  of  complexity.  Let 

[n/m\ 

...  m (n  mod  m) 


= m m 


(60) 
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be  the  lexicographically  largest  partition  of  n with  parts  < m;  our  goal  will 
be  to  construct  recursively  defined  Gray  paths  L(rn.  n)  and  M(m,n)  from  the 
partition  ln  to  p(m,n),  where  L(m,n)  runs  through  all  partitions  whose  parts 
are  bounded  by  m while  M(m,  n ) runs  through  those  partitions  and  a few  more: 
M(m,  n)  also  includes  partitions  whose  largest  part  is  m + 1,  provided  that 
the  other  parts  are  all  strictly  less  than  m.  For  example,  L( 3,8)  is  11111111, 
2111111,  311111,  221111,  22211,  2222,  3221,  32111,  3311,  332,  while  M(3,8)  is 

11111111,  2111111,  221111,  22211,  2222,  3221, 

3311,  32111,  311111,  41111,  4211,  422,  332;  (6i) 

the  additional  partitions  starting  with  4 will  give  us  “wiggle  room”  in  other 
parts  of  the  recursion.  We  will  define  L(m,n)  for  all  n > 0,  but  M(m,n)  only 
for  n > 2m. 

The  following  construction,  illustrated  for  m = 5 to  simplify  the  notation, 
almost  works: 


'm 

L{ 5)  = ^ 4L(oo)R  ^ if  n < 7; 
k 5£(oo) 


M(  5)  = { 


(L(  3) 
4L(2)R 
5L(2) 
431 
44 
53 


M(  4) 

if  n — 8;  ^ 54L(4)R  ' 
55L(5) 


if  n > 9; 

(62) 


1 

rm 

5£(4)r 

5M(4)* 

6£(3) 

> if  11  < n < 13; 

6£(4) 

64£(oo)fi 

554£(4)-R 

.55 L(oo)  j 

,555£(5)  J 

if  n > 14.  (63) 


Here  the  parameter  n in  L(m,n)  and  M(m,n ) has  been  omitted  because  it  can 
be  deduced  from  the  context;  each  L or  M is  supposed  to  generate  partitions  of 
whatever  amount  remains  after  previous  parts  have  been  subtracted.  Thus,  for 
example,  (63)  specifies  that 


M(5,14)  = T(4, 14),  5M(4,9)fl,  6L(4,8),  554L(4,0)fi,  5551,(5,-1); 

the  sequence  £,(5,  —1)  is  actually  empty,  and  £,(4,0)  is  the  empty  string,  so  the 
final  partition  of  M( 5, 14)  is  554  = M(5, 14)  as  it  should  be.  The  notation  £(00) 
stands  for  L(oo,n ) = L(n,n),  the  Gray  path  of  all  partitions  of  n,  starting  with 
ln  and  ending  with  n1 . 

In  general,  £(m)  and  M (in)  are  defined  for  all  m > 3 by  essentially  the 
same  rules,  if  we  replace  the  digits  2,  3,  4,  5,  and  6 in  (62)  and  (63)  by  m-3, 
m-2,  m— 1,  m,  and  m+1,  respectively.  The  ranges  n<7,  ra  = 8,  n>9  become 
n < 2m  - 3,  n = 2m  - 2,  n > 2m  - 1;  the  ranges  11  < n < 13  and  n > 14  become 
2in  + 1 < n < 3m  — 2 and  n > 3m  — 1.  The  sequences  £(0),  £(1),  L( 2)  have 
obvious  definitions  because  the  paths  are  unique  when  m < 2.  The  sequence 
M (2)  is  in>  2in_2>  31”“3,  221ra~4,  2221"-6,  . . . , p(2,  n)  for  n > 5. 
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Theorem  S.  Gray  paths  L'(m,n)  form,n  > 0 and  M'(m,n)  forn  > 2m+l  > 5 
exist  for  all  partitions  with  the  properties  described  above,  except  in  the  case 
L'{ 4,6).  Furthermore,  L'  and  M'  obey  the  mutual  recursions  (62)  and  (63) 
except  in  a few  cases. 

Proof.  We  noted  above  that  (62)  and  (63)  almost  work;  the  reader  may  verify 
that  the  only  glitch  occurs  in  the  case  L(4,6),  when  (62)  gives 

L(4,  6)  = L(2, 6),  3L(1,3)R,  4L(1,2),  321,  33,  42 

= 111111,  21111,  2211,  222,  3111,  411,  321,  33,  42.  (64) 

If  m > 4,  we’re  OK  because  the  transition  from  the  end  of  L(m— 2,  2m— 2)  to 
the  beginning  of  (m— l)L(m— 3,  m—l)R  is  from  (m— 2)(m— 2)2  to  (m— l)(m— 3)2. 
There  is  no  satisfactory  path  L ( 4 , 6 ) , because  all  Gray  codes  through  those  nine 
partitions  must  end  with  either  411,  33,  3111,  222,  or  2211. 

In  order  to  neutralize  this  anomaly  we  need  to  patch  the  definitions  of 
L(m,n ) and  M(m,n)  at  eight  places  where  the  “buggy  subroutine”  L(4,6)  is 
invoked.  One  simple  way  is  to  make  the  following  definitions: 

L\ 4,6)  = 111111,21111,3111,411,321,33,42; 

1/(3, 5)  = 11111,2111,221,311,32.  ( 5 

Thus,  we  omit  222  and  2211  from  L( 4, 6);  we  also  reprogram  L{ 3, 5)  so  that  2111 
is  adjacent  to  221.  Then  exercise  60  shows  that  it  is  always  easy  to  “splice  in” 
the  two  partitions  that  are  missing  from  L(4,6).  | 

EXERCISES 

► 1.  [M21]  Give  formulas  for  the  total  number  of  possibilities  in  each  problem  of  The 
Twelvefold  Way.  For  example,  the  number  of  n-tuples  of  m things  is  mn.  (Use  the 
notation  (38)  when  appropriate,  and  be  careful  to  make  your  formulas  correct  even 
when  m = 0 or  n = 0.) 

► 2.  [20]  Show  that  a small  change  to  step  HI  yields  an  algorithm  that  will  generate 
all  partitions  of  n into  at  most  m parts. 

3.  [Ml  7]  A partition  ai  + • • • + am  of  n into  m parts  ai  > • • • > am  is  optimally 
balanced  if  [ai  — as  \ < 1 for  1 < i,j  < m.  Prove  that  there  is  exactly  one  such  partition, 
whenever  n > m > 1,  and  give  a simple  formula  that  expresses  the  j t h part  a.j  as  a 
function  of  j,  m,  and  n. 

4.  [M22]  (Gideon  Ehrlich,  1974.)  What  is  the  lexicographically  smallest  partition 
of  n in  which  all  parts  are  > r?  For  example,  when  n = 19  and  r = 5 the  answer  is  766. 

► 5.  [23]  Design  an  algorithm  that  generates  all  partitions  of  n in  the  part-count  form 
ci  . . . cn  of  (8).  Generate  them  in  colex  order,  namely  in  the  lexicographic  order  of 
cn  . . .c\,  which  is  equivalent  to  lexicographic  order  of  the  corresponding  partitions 
a\a2  ....  For  efficiency,  maintain  also  a table  of  links  loh  ■ ■ ■ ln  so  that,  if  the  distinct 
values  of  k for  which  Ck  > 0 are  k\  < • • • < kt , we  have 

lo  k\ , lk\  — ^2,  • . . , lki_i  — kt , Ikt  — 6. 

(Thus  the  partition  331  would  be  represented  by  C1...C7  = 1020000,  lo  = 1,  l\  = 3, 
and  I3  = 0;  the  other  links  1 2,  U,  h,  hi.  h can  be  set  to  any  convenient  values.) 
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6.  [20]  Design  an  algorithm  to  compute  byb2  . . . = (aia2  . . . )T , given  aia2 

7.  [M20]  Suppose  ay  . . ,an  and  a'y  . . . a'n  are  partitions  of  n with  ay  > ■ ■ ■ > an  > 0 
and  a'y  > • • • > a'n  > 0,  and  let  their  respective  conjugates  be  by  . . . bn  = (ay  . . . an)T. 
by  ■ . .bn  = (ay  . . . an)T.  Show  that  by  . . . bn  < b'y  . . . b'n  if  and  only  if  an  ..  .ay  < a'n  . . . a'y . 

8.  [15]  When  (pi . . . pt,  qi  ■ ■ ■ qt)  yields  the  rim  representation  of  a partition  aya2  . . . 
as  in  (15)  and  (16),  what’s  the  rim  representation  of  the  conjugate  partition  (aio2  . . . )r? 

9.  [22]  If  ay a2  . . . am  and  byb2  . . . bm  = (aya2  . . . am)T  are  conjugate  partitions,  show 
that  the  multisets  {01  + 1,  a2  + 2, . . . , am  +m}  and  {fei  + 1,  b2  +2, . . . , bm  + m}  are  equal. 

10.  [21]  Two  simple  kinds  of  binary  trees  are  sometimes  helpful  for  reasoning  about 
partitions:  (a)  a tree  that  includes  all  partitions  of  all  integers,  and  (b)  a tree  that 
includes  all  partitions  of  a given  integer  n,  illustrated  here  for  n = 8: 


Deduce  the  general  rules  underlying  these  constructions.  What  order  of  tree  traversal 
corresponds  to  lexicographic  order  of  the  partitions? 

11.  [M22]  How  many  ways  are  there  to  pay  one  euro,  using  coins  worth  1,  2,  5.  10, 
20,  50,  and/or  100  cents?  What  if  you  are  allowed  to  use  at  most  two  of  each  coin? 

► 12.  [M21]  (L.  Euler,  1750.)  Use  generating  functions  to  prove  that  the  number  of 
ways  to  partition  n into  distinct  parts  is  the  number  of  ways  to  partition  n into  odd 
parts.  For  example,  5 = 4+  l = 3 + 2;  5 = 3+  l + l = l + l + l + i + i. 

[Note:  The  next  two  exercises  use  combinatorial  techniques  to  prove  extensions  of 
this  famous  theorem.] 

► 13.  [M23]  (F.  Franklin,  1882.)  Find  a one-to-one  correspondence  a fj  between 
partitions  of  n such  that  a has  exactly  k parts  repeated  more  than  once  if  and  only  if  0 
has  exactly  k even  parts.  (For  example,  the  partition  64421111  has  two  repeated  parts 
{4, 1}  and  three  even  parts  {6,4,2}.  The  case  k = 0 corresponds  to  Euler’s  result.) 

► 14.  (M2 8]  (J.  J.  Sylvester,  1882.)  Find  a one-to-one  correspondence  between  parti- 
tions of  n into  distinct  parts  01  > a2  > • • • > om  that  have  exactly  k “gaps”  where 
% > “j+i  + 1,  and  partitions  of  n into  odd  parts  that  have  exactly  k + 1 different 
values.  (For  example,  when  k = 0 this  construction  proves  that  the  number  of  ways  to 
write  n as  a sum  of  consecutive  integers  is  the  number  of  odd  divisors  of  n.) 

15.  [ M20 ] (J.  J.  Sylvester.)  Find  a generating  function  for  the  number  of  partitions 
that  are  self-conjugate  (namely,  partitions  such  that  a = aT). 

16.  [M21]  Find  a formula  for  £m,n  p(*.  m,  n)wmzn,  where  p(k,  m,  n)  is  the  number  of 
partitions  of  n that  have  rn  parts  and  trace  k.  Sum  it  on  k to  obtain  a nontrivial  identity. 
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17.  [ M26 ] A joint  partition  of  n is  a pair  of  sequences  (oi,...,aP;  6i,...,6s)  of 
positive  integers  for  which  we  have 

ax  ^ ^ fei  > • • ■ > 6S,  and  cti  + * * * + flr  + 6i  + • • ■ + 6S  = n. 

Thus  it  is  an  ordinary  partition  if  s = 0,  and  a partition  into  distinct  parts  if  r = 0. 

a)  Find  a simple  formula  for  the  generating  function  ur+svszn , summed  over  all 
joint  partitions  of  n with  r ordinary  parts  a,  and  s distinct  parts  b3. 

b)  Similarly,  find  a simple  formula  for  vs zn  when  the  sum  is  over  all  joint  partitions 
that  have  exactly  r+s  = t total  parts,  given  the  value  of  t.  For  example,  the  answer 
when  t = 2 is  (1  + u)(l  + vz)z2/((  1 — z)(l  - z2)). 

c)  What  identity  do  you  deduce? 

► 18.  [M23]  (Doron  Zeilberger.)  Show  that  there  is  a one-to-one  correspondence  be- 
tween pairs  of  integer  sequences  (oi,  a?, . . . , ar;  &i,  &2, . . . , 6S)  such  that 

ai  > a,2  >■■■>  ar:  bi  > 62  > • • • > 6s, 

and  pairs  of  integer  sequences  (ci,  c2, . . . , cT+s\  d\,  d?, . . . , dT+s)  such  that 

ci  > C2  >■■■  > cr+s,  dj  G {0, 1}  for  1 < j < r + s, 

related  by  the  multiset  equations 

{oi,a2,...,ar}  = {cj  | dj  = 0}  and  {bi,  62, . . . , 6S}  = {cj  + r + s - j \ d3  = 1 }. 
Consequently  we  obtain  the  interesting  identity 

'y  ' Ur+S Vs Zai^ l"ar+i>lH M>a  _ y ' utVdl~>' *~dt  ^Cl^ fc(+(t-l)di-t \-dt_i 

°1  >"  >ar>0,  r>0  Ci>-  ■>ct>0,  t>0 

b\  >-->b3  >0,  s>0  di,...,dtG{0,l} 

19.  [M22]  (E.  Heine,  1847.)  Prove  the  four-parameter  identity 

(l-wxzrn)(l-wyzm)  wk(x-l )(x—z)  . . . (x-zk~1)(y-l)(y-z) . . . ( y-zk~1)zk 


n 


•j  (l-wzm)(l-wxyzrn)  (1— z)(l— z2) . . . (l-zfc)(l-u>z)(l-wz2)  . . . (l-tuzfc) 

Hint:  Carry  out  the  sum  over  either  k or  I.  in  the  formula 

k 1 kl  {z  - az)(z  - az 2) . . . (z  — azk)  (z  - 6z)(z  - 6z2) . . . (z  - bzl) 


u'v'z 


k,l>  0 


(1  — z)(l  — z2) ...  (1  — zk)  (1  — z)(l  — z2) . . . (1  — zl) 


and  consider  the  simplifications  that  occur  when  6 = auz. 

► 20.  [M21]  Approximately  how  long  does  it  take  to  compute  a table  of  the  partition 
numbers  p(n)  for  1 < n < N,  using  Euler’s  recurrence  (20)? 

21.  [M21]  (L.  Euler.)  Let  q ( n ) be  the  number  of  partitions  of  n into  distinct  parts. 
What  is  a good  way  to  compute  q(n)  if  you  already  know  the  values  of  p(l),  . . . , pfn)? 

22.  [HM21]  (L.  Euler.)  Let  <r(n)  be  the  sum  of  all  positive  divisors  of  the  positive 
integer  n.  Thus,  <r(n)  = n + 1 when  n is  prime,  and  cr(n)  can  be  significantly  larger 
than  n when  n is  highly  composite.  Prove  that,  in  spite  of  this  rather  chaotic  behavior, 
cr{n ) satisfies  almost  the  same  recurrence  (20)  as  the  partition  numbers: 

rr(n)  = cr(n— 1)  + <r(n— 2)  — cr(n— 5)  — a(n— 7)  + a(n— 12)  + <j(n— 15)  — ■ • • 

for  n > 1,  except  that  when  a term  on  the  right  is  ‘cr(0)  ’ the  value  ‘n’  is  used  instead. 
For  example,  cr(ll)  = 1 + 11  = cr(10)  + tr(9)  - cr(6)  - <r( 4)  = 18  + 13-12-7; 
<r(12)  = 1 + 2 + 3 + 4 + 6 + 12  = <r(ll)  + <r(10)  — <r(7)  — cr(5)  + 12  = 12+18-8-6+12. 
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23  [HM25]  Use  Jacobi’s  triple  product  identity  (19)  to  prove  another  formula  that 
he  discovered: 

00 

na-**)3  = l-3z  + 5z3-7z6  + 9z 10-...  = f'(-1)»(2n+1)*("J1). 

k=l  ' J 

n= 0 

24.  [M26]  (S.  Ramanujan,  1919.)  Let  A(z)  = n^l^l  - zk)4. 

a)  Prove  that  [z  ] A(z)  is  a multiple  of  5 when  n mod  5 = 4 

b)  Prove  that  [z*)A(z)B(z)*  has  the  same  property,  if  B is  any  power  series  with 
integer  coefficients. 

c)  Therefore  p(n)  is  a multiple  of  5 when  n mod  5 = 4. 

25.  [HM27]  Improve  on  (22)  by  using  (a)  Euler’s  summation  formula  and  (b)  Mellin 
transforms  to  estimate  lnP(e-‘).  Hint:  The  dilogarithm  function  Lia(*)  = x/l2  + 

x /2  +x/2,  +•••  satisfies  Lia(*)  + Li2(l-*)  = C(2)- (In*)  ln(l-ar). 

26.  [HM22]  In  exercises  5.2.2-44  and  5.2.2-51  we  studied  two  ways  to  prove  that 


£ 


,-k2/n 


= x(\/7rn-  1)  +0(r 


for  all  M > 0. 


Show  that  Poisson’s  summation  formula  gives  a much  stronger  result. 

27.  [HM21]  Prove  (28)  and  complete  the  calculations  leading  to  Theorem  D. 

2S.  | [HM42]  (D.  H Lehmer.)  Show  that  the  Hardy-  Ramanujan-Rademacher  coeffi- 
cients Ak(n)  defined  in  (34)  have  the  following  remarkable  properties: 

a)  If  k is  odd,  then  A2k(km  + 4n  + ( k 2 - l)/8)  = A2(m)Ak(n). 

b)  If  p is  prime,  pe  > 2,  and  k T 2 p,  then 

AMk2m+p2'n-(k2+p2e-l)/24)  = (-1)^=41  Ap.(m)Ak(n). 

In  tins  formula  k2  + p2e  - 1 is  a multiple  of  24  if  p or  k is  divisible  by  2 or  3- 
otherwise  division  by  24  should  be  done  modulo  pek. 

c)  If  p is  prime,  |Ape(n)|  < 2[p>2)pe/2. 

d)  If  pis  prime,  Ap.(n)  / 0 if  and  only  if  1 - 24n  is  a quadratic  residue  modulo  p 
and  either  e = 1 or  24n  modp  ^ 1. 

e)  4 he  probability  that  Ak(n)  = 0,  when  k is  divisible  by  exactly  t primes  > 5 and 
n is  a random  integer,  is  approximately  1 — 2-t. 

► 29.  [Ml 6]  Generalizing  (41),  evaluate  the  sum  V 
30.  [MI  7]  Find  closed  forms  for  the  sums 

I n — km  I 

to  — 1 


(a)  £ 


and 


(b)  £ 


n 

to  — k 


(which  are  finite,  because  the  terms  being  summed  are  zero  when  k is  large). 

3i  [M24]  (A.  De  Morgan  1843.)  Show  that  |"|  = |n/2j  and  |"|  = |(n2  + 6)/12l- 
find  a similar  formula  for  n 3 1 L 1/  Jt 

I 4 I 

32.  [M15]  Prove  that  \”\  < p(n  - to)  for  all  m,n  > 0.  When  does  equality  hold? 

33.  [. HM20 ] Use  the  fact  that  there  are  exactly  (»^)  compositions  of  n into  to  parts, 

Eq.  7.2.1  3 (9),  to  prove  a lower  bound  on  \^\.  Then  set  to  = [^H\  to  obtain  an  ele- 
mentary lower  bound  on  p(n). 
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► 34.  [ HM21 ] Show  that  |n  is  the  number  of  partitions  of  n into  m.  distinct 

parts.  Consequently 


n 

m 


77i ! (m  — 1)! 


(-o(?)) 


when  m < n1^3. 


35.  [HM21]  In  the  Erdos-Lehner  probability  distribution  (43),  what  value  of  x is 
(a)  most  probable?  (b)  the  median?  (c)  the  mean?  (d)  What  is  the  standard  deviation? 

36.  [HM24]  Prove  the  key  estimate  (47)  that  is  needed  in  Theorem  E. 

37.  [M22]  Prove  the  inclusion-exclusion  bracketing  lemma  (48),  by  analyzing  how 
many  times  a partition  that  has  exactly  q different  parts  exceeding  m is  counted  in  the 
rth  partial  sum. 

38.  [ M20 ] Given  positive  integers  Z and  m,  what  generating  function  enumerates 
partitions  that  have  exactly  m parts,  and  largest  part  Z?  (See  Eq.  (51).) 

39.  [M20]  (A.  Cauchy.)  Continuing  exercise  38,  what  is  the  generating  function  for 
the  number  of  partitions  into  m parts,  all  distinct  and  less  than  Z? 

► 40.  [M25]  (F.  Franklin.)  Generalizing  Theorem  C,  show  that,  for  0 < k < m, 

(l-zi+1)...(l-zi+fc) 

1 J (1  - z)(l  - z2) ...  (1  - zm) 

is  the  number  of  partitions  a\a2  ...  of  n into  m or  fewer  parts  with  the  property  that 
oi  < flfc+i  + Z. 

41.  [HM42]  Extend  the  Hardy  Ramanujan-Rademacher  formula  (32)  to  obtain  a 
convergent  series  for  partitions  of  n into  at  most  m parts,  with  no  part  exceeding  l. 

42.  [ HM42 ] Find  the  limiting  shape,  analogous  to  (49),  for  random  partitions  of  n 
into  at  most  Q\fn  parts,  with  no  part  exceeding  ip\/n,  assuming  that  dip  > 1. 

43.  [Ml 8]  Given  n and  k,  how  many  partitions  of  n have  ai  > <12  > • • • > a*,? 

► 44.  [M22]  How  many  partitions  of  n have  their  two  smallest  parts  equal? 

45.  [HM21]  Compute  the  asymptotic  value  of  p(n—l)/p(n),  with  relative  error  0(n~2). 

46.  [M20]  In  the  text’s  analysis  of  Algorithm  P,  which  is  larger,  Tj(n)  or  T^'in)? 

► 47.  [ HM22 ] (A.  Nijenhuis  and  H.  S.  Wilf,  1975.)  The  following  simple  algorithm, 
based  on  a table  of  the  partition  numbers  p(0),  p(l),  ...,  p(n),  generates  a random 
partition  of  n using  the  part-count  representation  ci  . . . cn  of  (8).  Prove  that  it  produces 
each  partition  with  equal  probability. 

Nl.  [Initialize.]  Set  m <—  n and  ci  . . . c„  «—  0 . . . 0. 

N2.  [Done?]  Terminate  if  m = 0. 

N3.  [Generate.]  Generate  a random  integer  M in  the  range  0 < M < mp(m). 
N4.  [Choose  parts.]  Set  s «—  0.  Then  for  j = 1,2,...,  and  for  k = 1,  2,  ... , [m/jj, 
repeatedly  set  s 4—  s + kp(m  — jk)  until  s > M. 

N5.  [Update.]  Set  c&  cjt  + j,  m «—  m — jk,  and  return  to  N2.  | 


Hint:  Step  N4,  which  is  based  on  the  identity 


00  Lm/iJ 

EE  kp(m  — jk)  = mp(m), 

3= 1 k= 1 

chooses  each  particular  pair  of  values  (j,  k)  with  probability  kp(m  — jk)/(mp(m)). 
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48.  [HM40]  Analyze  the  running  time  of  the  algorithm  in  the  previous  exercise. 

” (4f9'  (a)  ^hat  is  ‘he  generating  function  F(z)  for  the  sum  of  the  smallest  parts 

of  all  partitions  of  n?  (The  series  begins  2 + 3z2  + 5z3  + 9 24  -|_  12ZS  + ■ ■ ■ ) 

(b)  Find  the  asymptotic  value  of  (zn]F(z),  with  relative  error  OfrT1). 

50.  [HM33]  Let  c(m)  = cm(2m)  in  the  recurrence  (56),  (57). 

a)  Prove  that  cm(m  + k)  = m — k + c(k)  for  0 < k < m. 

b)  Consequently  (58)  holds  for  m < n < 2m,  if  c(m)  < 3 p(m)  for  all  m > 0. 

c Show  that  c(m)  - m is  the  sum  of  the  second-smallest  parts  of  all  partitions  of  m 

d)  Find  a one-to-one  correspondence  between  all  partitions  of  n with  second-smallest 
part  k and  all  partitions  of  numbers  < n with  smallest  part  k + 1. 

e)  Describe  the  generating  function  Y'  nc(m)zrn . 
tj  Conclude  that  c(m)  < 3 p(m)  for  all  m > 0. 

51.  [M46]  Make  a detailed  analysis  of  Algorithm  H. 

► 52.  [M21]  What  is  the  millionth  partition  generated  by  Algorithm  P when  n = 64? 

Hint:  p( 64)  = 1741630  = 1000000+1”!  + l60l  + |47l  + |3si  , ,271  1221  , iis,  , 1151 

► 53.  [M21]  What  is  the  millionth  partition  generated  by  Algorithm  H when  m = 32 

o VI A 1009  TT: 1 S' ^ I SO  I I Afi  1 1 r r,  1 , 


and  n = 100?  Hint:  999999  = |®°|  + |??|  + |soj  + |4i|  + j33|  + 1 26 1 + 1211 


► 54.  [M30]  Let  a = a1a2...  and  f3  = M2.. . be  partitions'  If  n.  We  say  that  ct 

majonzes  ft,  written  a + ft  or  ft  A a,  if  a.  + • • • + ak  > bl  + . . . + bk  for  all  />  0. 

a)  True  or  false:  a ft  ft  implies  a > fj  (lexicographically). 

b)  True  or  false:  a ft  ft  implies  ft7  ft  a7. 

c)  Show  that  any  two  partitions  of  n have  a greatest  lower  bound  a A ft  such  that 

a\  c-  “7  a"d  j ~ 7 lf  and  °nly  ,f  a A /3  + 7.  Explain  how  to  compute  a A ft. 

d)  Similarly,  explain  how  to  compute  a least  upper  bound  aW ft  such  that  7 + a and 
7 C P 11  and  only  if  7 y a V ft. 

e)  If  a has  / parts  and  ft  has  m parts,  how  many  parts  do  a A ft  and  a V ft  have? 

) True  or  false:  If  a has  distinct  parts  and  ft  has  distinct  parts,  then  so  do  a A ft 
and  aVp.  H 

► 55.  [M37]  Continuing  the  previous  exercise,  say  that  a covers  ft  if  a ft  ft  and  a + ft 

an  1 a 7 t ft  implies  that  7 = a or  7 = ft.  For  example,  Fig.  52  illustrates  the 
covering  relations  between  partitions  of  the  number  12. 

a)  Let^us  write  “ P lf  Q = aia2  . . . and  ft  = M2  • . . are  partitions  for  which 

ff  ~ T [~]  J It  1]  fT  aH  k - 1 and  SOme  1 * L Prove  that  « covers  ft 
it  and  only  if  a fc>-  (3  or  /?T  £>-  aT . 

b)  Show  that  there  is  an  easy  way  to  tell  if  a covers  ft  by  looking  at  the  rim 
representations  of  a and  ft. 

c)  Let  n = (n22)  + (7)  where  n2  > m > 0 and  n2  > 2.  Show  that  no  partition  of  n 
covers  more  than  n2  — 2 partitions. 

d)  Say  that  the  partition  p is  minimal  if  there  is  no  partition  A with  //,  + A Prove 
that  n is  minimal  if  and  only  if  ,,T  has  distinct  parts. 

e)  Suppose  o = q?q  cx.\  £-•••£>-  q,  and  o — o'r,  b-  rv'  . / v . 

/ . . : / ^ k <lllu  a “ ao  (>-  • • • ^ afc/,  where  ok  and 

ak,  are  minimal  partitions.  Prove  that  k = k'  and  ak  = a'k, 

f)  Explain  how  to  compute  the  lexicographically  smallest  partition  into  distinct  parts 
that  majorizes  a given  partition  a. 

g)  Describe  A the  lexicographically  smallest  partition  of  n into  distinct  parts.  What 
is  the  length  of  all  paths  n1  = a0  t-  ai  >■  ■ ■ ■ >-  Aj? 
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Fig.  52.  The  majorization 
lattice  for  partitions  of  12. 
(See  exercises  54-58.) 


421111111 


133211111 


13^22211 


/ 1331111111  [32221111  12222221 

14111111111  \ ^ \ > 

\ 132211111  122222111 


13211111141  1222211111 

13111111111  1222111111 
12214111111 


[111111111111 


h)  What  are  the  lengths  of  the  longest  and  shortest  paths  of  the  form  n1  = ao,  a i, 
. . . , ai  = ln,  where  a3  covers  <ij+i  for  0 < j < l? 


► 56.  [M32]  Design  an  algorithm  to  generate  all  partitions  a such  that  A X a X p, 
given  partitions  A and  p with  A X p. 

Note:  Such  an  algorithm  has  numerous  applications.  For  example,  to  generate  all 
partitions  that  have  m parts  and  no  part  exceeding  l,  we  can  let  A be  the  smallest  such 
partition,  namely  \n/m\  . . . \n/m\  as  in  exercise  3,  and  let  p be  the  largest,  namely 
((n— m+l)lm_1)  A (l  L ,l / ' J (n  mod  /)).  Similarly,  according  to  a well-known  theorem  of 
H.  G.  Landau  [Bull.  Math.  Biophysics  15  (1953),  143-148],  the  partitions  of  (™)  such 
that 


m 

Lm/2J 

m — 1 

_~2  _ 

L 2 J 

X a X (m— l)(m— 2) . . . 21 


are  the  possible  “score  vectors”  of  a round-robin  tournament,  namely  the  partitions 
ai  . . ,om  such  that  the  jth  strongest  player  wins  aj  games. 
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57.  [M22]  Suppose  a matrix  (aij)  of  Os  and  Is  has  row  sums  n = £ fly  and  column 
urns  e,  - 2^  oy.  By  permuting  rows  and  columns  we  can  assume  that  n > r2  > ■ ■ ■ 
an  ci  c2  > • •• . Then  A = nr2  . . . and  y = cic2  ...  are  partitions  of  n = V o„ 
Prove  that  such  a matrix  exists  if  and  only  if  A X fiT.  ''J 

58  [M23]  {Symmetrical  means.)  Let  a = a,  . . . am  and  /3  = 6,  . . . 6m  be  partitions 
or  n.  Prove  that  the  inequality 


m!  ' 


,«i 


^Pm 


> 


— Y- 

77),!  < 


holds  for  all  nonnegative  values  of  the  variables  (xu...,xm),  where  the  sums  range  over 
all  m!  permutations  of  (1  . . ,m},  if  and  only  if  a t 0.  (For  example,  this  inequality 

reduces  to  (Vl  + ■ ■ ■ + yn)/n  > (Vl  . . . ynf!"  in  the  special  case  m = n,  a = nO  0 

/3=n...i  ,Xj  = yyn.)  ’ 

59  [M22]  The  Gray  path  (59)  is  symmetrical  in  the  sense  that  the  reversed  sequence 
, 01,  . . . , mill  is  the  same  as  the  conjugate  sequence  (111111)T,  (21111)T 
Find  all  Gray  paths  au , . . . , ap(n)  that  are  symmetrical  in  this  way. 

8°.  [23]  Complete  the  proof  of  Theorem  S by  modifying  the  definitions  of  L(m,n) 
and  M(m,  n)  in  all  places  where  L(4,6)  is  called  in  (62)  and  (63). 

61.  [26]  Implement  a partition-generation  scheme  based  on  Theorem  S,  always  speci- 
ymg  the  two  parts  that  have  changed  between  visits. 

62  [46]  Prove  or  disprove:  For  all  sufficiently  large  integers  n and  3 < m.  < n such 

f at  ”modm  ^ °’.and  for  all  partitions  a of  n with  ffll  < rn,  there  is  a Gray  path 
forjdljpartitmns  with  parts  < m,  beginning  at  1"  and  ending  at  a,  unless  a = 1"  or 

“hS  I"  A and " “ “,ere  * °“y  code  throush  ->  “ 

► 64.  [32]  (. Binary  partitions.)  Design  a loopless  algorithm  that  visits  all  partitions  of  n 

nnworo  O tt.L  I _ 1 1 „ r _ . _ 1 


into  powers  of  2,  where  each  step  replaces  2k  + 2k  by  2k+1 


or  vice  versa. 


65.  [23]  It  is  well  known  that  every  commutative  group  of  m elements  can  be  repre- 
sented as  a discrete  torus  T(mu. . . ,mn)  with  the  addition  operation  of  7.2.1  3-(66) 

Z ‘ 'o2m;ia^  mj  18  a multiple  °f  m^+1  for  1 < 3 < n.  For  example,  when 

m 3bU  - 2 • 3 • 5 there  are  six  such  groups,  corresponding  to  the  factorizations 
(mi, 7712,7773)  _ (30,6,2),  (60,6,1),  (90,2,2),  (120,3,1),  (180,2,1),  and  (360,1,1). 

Explain  how  to  generate  all  such  factorizations  systematically  with  an  algorithm 
that  changes  exactly  two  of  the  factors  rrij  at  each  step. 

► 66.  [M25]  ( P-partitions .)  Instead  of  insisting  that  01  > a2  > ■ • • , suppose  we  want 
to  consider  all  nonnegative  compositions  of  n that  satisfy  a given  partial  order.  For 
example  P.  A MacMahon  observed  that  all  solutions  to  the  “up-down”  inequalities 
a 4 _ <i'2  _ a 3 < ai  can  be  divided  into  five  nonoverlapping  types: 

“1  > a2  > a3  > o4;  ai  > a2  > a4  > a3; 
a2  > ai  > o3  > a4;  a2  > ax  > a4  > a3;  a2  > a4  > 01  > a3. 

Each  of  these  types  is  easily  enumerated  since,  for  example,  o2  > > o4  > o3  is 

equivalent  to  a2_-  2 > ax  - 1 > a4  - ! > a3;  the  number  of  solutions  with  a3  > 0 and 

01+02+03+04  - n is  the  number  of  partitions  of  n-1-2-0-1  into  at  most  four  parts 
Explain  how  to  solve  a general  problem  of  this  kind:  Given  any  partial  order 
relation  ^ on  m elements,  consider  all  m-tuples  ox  . . . om  with  the  property  that  «,  > ofc 
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when  j -<  k.  Assuming  that  the  subscripts  have  been  chosen  so  that  j -<  k implies  j < k, 
show  that  all  of  the  desired  m-tuples  fall  into  exactly  N classes,  one  for  each  of  the  out- 
puts of  the  topological  sorting  algorithm  7.2.1.2V.  What  is  the  generating  function  for 
all  such  ai  . . . am  that  are  nonnegative  and  sum  to  n?  How  could  you  generate  them  all? 

67.  [M25]  (P.  A.  MacMahon,  1886.)  A perfect  partition  of  n is  a multiset  that  has 
exactly  n + 1 submultisets,  and  these  multisets  are  partitions  of  the  integers  0,  1,  . . . , n. 
For  example,  the  multisets  {1, 1, 1, 1, 1},  {2,2, 1},  and  {3, 1, 1}  are  perfect  partitions  of  5. 

Explain  how  to  construct  the  perfect  partitions  of  n that  have  fewest  elements. 

68.  [ M23 ] What  partition  of  n into  m parts  has  the  largest  product  ai  ...  am,  when 
(a)  m is  given;  (b)  m is  arbitrary? 

69.  [M30]  Find  all  n < 109  such  that  the  equation  x\  + X2  + ■ ■ ■ + xn  = X1X2  ■ ■ ■ xn 
has  only  one  solution  in  positive  integers  xi  > x.2  > ■ ■ ■ > xn.  (There  is,  for  example, 
only  one  solution  when  n = 2,  3,  or  4;  but  5+2+1+1+1  = 5 ■ 2 • 1 • 1 • 1 and 
3 + 3 + l + l + l = 3-  3-  l-  lland2  + 2 + 2 + l + l = 2-  2-  2-  l-l.) 

70.  [M30]  (“Bulgarian  solitaire.”)  Take  n cards  and  divide  them  arbitrarily  into  one 
or  more  piles.  Then  repeatedly  remove  one  card  from  each  pile  and  form  a new  pile. 

Show  that  ifn  = 1 + 2 + • • • + m,  this  process  always  reaches  a self-repeating  state 
with  piles  of  sizes  {m,  m — 1, . . . , 1}.  For  example,  if  n = 10  and  if  we  start  with  piles 
whose  sizes  are  {3, 3,  2,  2},  we  get  the  sequence  of  partitions 

3322  ->  42211  -A  5311  -A  442  — > 3331  ->  4222  -A  43111  — > 532  -»  4321  ->  4321 

What  cycles  of  states  are  possible  for  other  values  of  n? 

71.  [Mf6]  Continuing  the  previous  problem,  what  is  the  maximum  number  of  steps 
that  can  occur  before  n-card  Bulgarian  solitaire  reaches  a cyclic  state? 

72.  [M30]  How  many  partitions  of  n have  no  predecessor  in  Bulgarian  solitaire? 

73.  [M25]  Suppose  we  write  down  all  partitions  of  n,  for  example 

6,  51,  42,  411,  33,  321,  3111,  222,  2211,  21111,  111111 
when  n = 6,  and  change  each  jth  occurrence  of  k to  j in  each  one: 

1,  11,  11,  112,  12,  111,  1123,  123,  1212,  11234,  123456. 

a)  Prove  that  this  operation  yields  a permutation  of  the  individual  elements. 

b)  How  many  times  does  the  element  k appear  altogether? 

7. 2. 1.5.  Generating  all  set  partitions.  Now  let’s  shift  gears  and  concentrate 
on  a rather  different  kind  of  partition.  The  partitions  of  a set  are  the  ways 
to  regard  that  set  as  a union  of  nonempty,  disjoint  subsets  called  blocks.  For 
example,  we  listed  the  five  essentially  different  partitions  of  {1,2,3}  at  the 
beginning  of  the  previous  section,  in  7.2.1.4-(2)  and  7.2.1.4-(4).  Those  five 
partitions  can  also  be  written  more  compactly  in  the  form 

123,  12|3,  13|2,  1|23,  1|2|3,  (i) 

using  a vertical  line  to  separate  one  block  from  another.  In  this  list  the  elements 
of  each  block  could  have  been  written  in  any  order,  and  so  could  the  blocks 
themselves,  because  T3|2’  and  ‘31 1 2’  and  ‘2|13’  and  ‘2|31’  all  represent  the  same 
partition.  But  we  can  standardize  the  representation  by  agreeing,  for  example, 
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to  list  the  elements  of  each  block  in  increasing  order,  and  to  arrange  the  blocks  in 
increasing  order  of  their  smallest  elements.  With  this  convention  the  partitions 
of  {1,2, 3, 4}  are 

1234,  123|4,  124|3,  12|34,  12|3|4,  134|2,  13|24,  13|2|4, 

14|23,  1 1 234,  1|23|4,  14|2|3,  1|24|3,  1|2|34,  1|2|3|4,  (2) 

obtained  by  placing  4 among  the  blocks  of  (i)  in  all  possible  ways. 

Set  partitions  arise  in  many  different  contexts.  Political  scientists  and 
economists,  for  example,  often  see  them  as  “coalitions”;  computer  system  de- 
signers may  consider  them  to  be  “cache  hit  patterns”  for  memory  accesses; 
poets  know  them  as  “rhyme  schemes”  (see  exercises  34-37).  We  saw  in  Section 
2.3.3  that  any  equivalence  relation  between  objects  — namely  any  binary  relation 
that  is  reflexive,  symmetric,  and  transitive  — defines  a partition  of  those  objects 
into  so-called  “equivalence  classes.”  Conversely,  every  set  partition  defines  an 
equivalence  relation:  If  FI  is  a partition  of  {1,2,...,  n ] we  can  write 

j = k (modulo  77)  (g) 

whenever  j and  k belong  to  the  same  block  of  77. 

One  of  the  most  convenient  ways  to  represent  a set  partition  inside  a com- 
puter is  to  encode  it  as  a restricted  growth  string , namely  as  a string  aia2  . . . an 
of  nonnegative  integers  in  which  we  have 

“i  = 0 and  aj+1  < 1 + max(al5 . . . , a,)  for  1 < j < n.  (4) 

The  idea  is  to  set  aj  = ak  if  and  only  if  j = k , and  to  choose  the  smallest 
available  number  for  a.,  whenever  j is  smallest  in  its  block.  For  example,  the 
restricted  growth  strings  for  the  fifteen  partitions  in  (2)  are  respectively 

0000,  0001,  0010,  0011,  0012,  0100,  0101,  0102, 

0110,  0111,  0112,  0120,  0121,  0122,  0123.  ^ 

This  convention  suggests  the  following  simple  generation  scheme,  due  to  George 
Hutchinson  [CACM  6 (1963),  613-614]: 

Algorithm  H ( Restricted  growth  strings  in  lexicographic  order).  Given  n > 2. 
this  algorithm  generates  all  partitions  of  {l,2,...,n}  by  visiting  all  strings 
Oia2...a„  that  satisfy  the  restricted  growth  condition  (4).  We  maintain  an 
auxiliary  array  M2...6n,  where  b]+1  = 1 + max(a1). . . , a,);  the  value  of  bn  is 
actually  kept  in  a separate  variable,  m,  for  efficiency. 

Hi.  [Initialize.]  Set  a% . . . an  0 . . . 0,  bi  . . . bn_i  ■<—  1 ...  1,  and  m 1. 

H2.  [Visit.]  Visit  the  restricted  growth  string  a1...a„,  which  represents  a 
partition  into  m + [an  = to]  blocks.  Then  go  to  H4  if  an  = m. 

H3.  [Increase  an.\  Set  an  ■<—  an  + 1 and  return  to  H2. 

H4.  [Find  j.}  Set  jf-n-  1;  then,  while  a,  = bj , set  j 4-  j - 1. 
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H5.  [Increase  aj.]  Terminate  if  j = 1.  Otherwise  set  aj  <—  aj  + 1. 

H6.  [Zero  out  aj+ 1 . . . an.]  Set  m 4—  bj  + [aj  = bj]  and  j 4—  j + 1.  Then,  while 
j < n,  set  aj  4—  0,  bj  4—  m,  and  j 4—  j + 1.  Finally  set  an  4—  0 and  go  back 
to  H2.  | 


Exercise  47  proves  that  steps  H4  TI6  are  rarely  necessary,  and  that  the  loops 
in  H4  and  H6  are  almost  always  short.  A linked-list  variant  of  this  algorithm 
appears  in  exercise  2. 

Gray  codes  for  set  partitions.  One  way  to  pass  quickly  through  all  set 
partitions  is  to  change  just  one  digit  of  the  restricted  growth  string  a\ . . . an  at 
each  step,  because  a change  to  aj  simply  means  that  element  j moves  from  one 
block  to  another.  An  elegant  way  to  arrange  such  a list  was  proposed  by  Gideon 
Ehrlich  [JACM  20  (1973),  507-508]:  We  can  successively  append  the  digits 

0,  m,  m — 1,  . . . , 1 or  1,  . . . , m — 1,  to,  0 (6) 

to  each  string  ai...an_i  in  the  list  for  partitions  of  n — 1 elements,  where 
m = 1 + max(ai, . . . , an_i),  alternating  between  the  two  cases.  Thus  the  list  ‘00, 
0T  for  n = 2 becomes  ‘000,  001,  Oil,  012,  010’  for  n = 3;  and  that  list  becomes 


0000,  0001,  0011,  0012,  0010,  0110,  0112,  0111, 

0121,  0122,  0123,  0120,  0100,  0102,  0101  ^ 

when  we  extend  it  to  the  case  n = 4.  Exercise  14  shows  that  Ehrlich’s  scheme 
leads  to  a simple  algorithm  that  achieves  this  Gray-code  order  without  doing 
much  more  work  than  Algorithm  H. 

Suppose,  however,  that  we  aren’t  interested  in  all  of  the  partitions;  we  might 
want  only  the  ones  that  have  exactly  m blocks.  Can  we  run  through  this  smaller 
collection  of  restricted  growth  strings,  still  changing  only  one  digit  at  a time? 
Yes;  a very  pretty  way  to  generate  such  a list  has  been  discovered  by  Frank 
Ruskey  [ Lecture  Notes  in  Comp.  Sci.  762  (1993),  205-206],  He  defined  two 
such  sequences,  Amn  and  A'mn,  both  of  which  start  with  the  lexicographically 
smallest  m-block  string  0”_m01 . . . (m— 1).  The  difference  between  them,  if 
n > m + 1,  is  that  Amn  ends  with  01 . . . (m-l)0n_m  while  A'mn  ends  with 
Qn— m — i oi . . . (m— 1)0.  Here  are  Ruskey’s  recursive  rules,  when  1 < m < n: 


A 


m(n+ 1) 


a: 


m(n-(-l) 


A(rn- i)n(m_1)’  A£„(m-1), . . . , A*n  1,  Am„0,  if  m is  even; 

A'(m- i)n(TO_1)’  Amn(m-1), . . . , A%n  1,  Amn 0,  if  m is  odd; 

j4(m-l)n(m_1)>  - 41°>  if  TO  is  even; 

■ , Amnl,  A£n0,  if  TO  is  odd. 


(8) 

(9) 


(In  other  words,  we  begin  with  either  A(m_1)n(m-1)  or  A'(m_1)n(m-1)  and  then 
use  either  A^nj  or  Amnj,  alternately,  as  j decreases  from  to  — 1 to  0.)  Of  course 
the  base  cases  are  simply  one-element  lists, 


Ai„  = A[n  = {0"}  and  Ann  = {01 . . . (n-1)}. 


(10) 
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With  these  definitions  the  {*}  = 25  partitions  of  {1,2,  3,  4,  5}  into  three  blocks 
are 

00012,  00112,  01112,  01012,  01002,  01102,  00102, 

00122,  01122,  01022,  01222,  01212,  01202, 

01201,  01211,  01221,  01021,  01121,  00121, 

00120,  01120,  01020,  01220,  01210,  01200. 

(See  exercise  17  for  an  efficient  implementation.) 

In  Ehrlich  s scheme  (7)  the  rightmost  digits  of  oq  . . . an  vary  most  rapidly, 
but  in  Ruskey’s  scheme  most  of  the  changes  occur  near  the  left.  In  both  cases,' 
however,  each  step  affects  just  one  digit  cij , and  the  changes  are  quite  simple: 
Either  aj  changes  by  ±1,  or  it  jumps  between  the  two  extreme  values  0 and 
1 + max(ai, . . . , (i  j _ 1 ) . Under  the  same  constraints,  the  sequence  ,4)  n , A'2n,  . . . , 
runs  through  all  partitions,  in  increasing  order  of  the  number  of  blocks. 

The  number  of  set  partitions.  We’ve  seen  that  there  are  5 partitions  of 
{1,2,3}  and  15  of  {1,2, 3, 4}.  A quick  way  to  compute  these  counts  was  dis- 
covered by  C.  S.  Peirce,  who  presented  the  following  triangle  of  numbers  in  the 
American  Journal  of  Mathematics  3 (1880),  page  48: 

1 

2 1 

5 3 2 

15  10  7 5 (l2) 

52  37  27  20  15 

203  151  114  87  67  52 

Here  the  entries  wn2 , . . . , runn  of  the  nth  row  obey  the  simple  recurrence 

T^nk  = ro(n-i)fc+CTn(fc+1)  if  1 < k < n;  wnn  = m(n_i)1  if  n > 1;  (13) 

and  wn  = 1.  Peirce’s  triangle  has  many  remarkable  properties,  some  of  which 
are  surveyed  in  exercises  26-31.  For  example,  zonk  is  the  number  of  partitions 
of  {1,2,. . . ,n}  in  which  k is  the  smallest  of  its  block. 

The  entries  on  the  diagonal  and  in  the  first  column  of  Peirce’s  triangle,  which 
tell  us  the  total  number  of  set  partitions,  are  commonly  known  as  Bell  numbers , 
because  E.  T.  Bell  wrote  several  influential  papers  about  them  [AMM  41  (1934), 
411-419;  Annals  of  Math.  (2)  35  (1934),  258-277;  39  (1938),  539-557],  We  shall 
denote  Bell  numbers  by  wn,  following  the  lead  of  Louis  Comtet,  in  order  to  avoid 
confusion  with  the  Bernoulli  numbers  Bn.  The  first  few  cases  are 

n=0123456  7 8 9 10  11  12 

= 1 1 2 5 15  52  203  877  4140  21147  115975  678570  4213597 

Notice  that  this  sequence  grows  rapidly,  but  not  as  fast  as  n!;  we  will  prove  below 
that  wn  = ©(n/logn)". 

The  Bell  numbers  wn  — wn  1 for  n > 0 must  satisfy  the  recurrence  formula 

ron+1  = + (2)^n-2  - 1 = (14) 

k 
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because  every  partition  of  {1, . . . , n + 1}  is  obtained  by  choosing  k elements  of 
{1, . . . , n}  to  put  in  the  block  containing  n+1  and  by  partitioning  the  remaining 
elements  in  mn-k  ways,  for  some  k.  This  recurrence,  found  by  Yoshisuke  Matsu- 
naga  in  the  18th  century  (see  Section  7. 2. 1.7),  leads  to  a nice  generating  function, 


n(z) 


oo 

Wn 

n= 0 


n! 


Us) 


discovered  by  W.  A.  Whitworth  [Choice  and  Chance,  3rd  edition  (1878),  3.XXIV], 
For  if  we  multiply  both  sides  of  (14)  by  zn/n\  and  sum  on  n we  get 


rfc\  / 00 

n'(z)  = E <=»+■  7 = (E  rr)  (E 

Ti—O  k= 0 m= 0 


and  (15)  is  the  solution  to  this  differential  equation  with  77(0)  = 1. 

The  numbers  zun  had  been  studied  for  many  years  because  of  their  curious 
properties  related  to  this  formula,  long  before  Whitworth  pointed  out  their 
combinatorial  connection  with  set  partitions.  For  example,  we  have 


ZUn 


7 i‘"i  E Y 


Dkz 


k= 0 


k\ 


kn 

Id 


(16) 


[Mat.  Sbornik  3 (1868),  62;  4 (1869),  39;  G.  Dobihski,  Archiv  der  Math,  und 
Physik  61  (1877),  333-336;  63  (1879),  108-110].  Christian  Kramp  discussed 
the  expansion  of  ee*  in  Der  polynomische  Lehrsatz,  ed.  by  C.  F.  Hindenburg 
(Leipzig:  1796),  112-113;  he  mentioned  two  ways  to  compute  the  coefficients, 
namely  either  to  use  (14)  or  to  use  a summation  of  p(n)  terms,  one  for  each 
ordinary  partition  of  n.  (See  Arbogast’s  formula,  exercise  1.2.5-21.  Kramp, 
who  came  close  to  discovering  that  formula,  seemed  to  prefer  his  partition-based 
method,  not  realizing  that  it  would  require  more  than  polynomial  time  as  n got 
larger  and  larger;  and  he  computed  116015,  not  115975,  for  the  coefficient  of  210.) 

* Asymptotic  estimates.  We  can  learn  how  fast  zon  grows  by  using  one  of  the 
most  basic  principles  of  complex  residue  theory:  If  the  power  series  J2kLoakzk 
converges  whenever  \z\  < r,  then 


^n  — 1 


1 f qp  + g\z  + a2z2  

2iri  J zn 


(17) 


if  the  integral  is  taken  along  a simple  closed  path  that  goes  counterclockwise 
around  the  origin  and  stays  inside  the  circle  |z|  = r.  Let  f(z)  = YlT=oakZk~n 
be  the  integrand.  We’re  free  to  choose  any  such  path,  but  special  techniques 
often  apply  when  the  path  goes  through  a point  z0  at  which  the  derivative  f (z0) 
is  zero,  because  we  have 


f(z0  + ee'6)  = f{z0)  + tMe2e2ie  + 0(e3}  (l8) 

in  the  vicinity  of  such  a point.  If,  for  example,  f(z0)  and  f"(z0)  are  real  and 
positive,  say  f(z0)  = u and  f"(z0)  = 2v,  this  formula  says  that  the  value  of 
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zo  — e 


zo  + ie 


zo-ie 


zo  + 


€ 


Fig.  53.  The  behavior  of  an  analytic 
function  near  a saddle  point. 


f(z o ± e)  is  approximately  u + ve2  while  f(z0±  ie)  is  approximately  u - ve2.  If  z 
moves  from  z0  - ie  to  z0  + ie,  the  value  of  f(z)  rises  to  a maximum  value  u,  then 
tails  again;  but  the  larger  value  u + ve2  occurs  both  to  the  left  and  to  the  right  of 
this  path.  In  other  words,  a mountaineer  who  goes  hiking  on  the  complex  plane, 
when  the  altitude  at  point  2 is  5ft/ (2),  encounters  a “pass”  at  z0;  the  terrain 
looks  like  a saddle  at  that  point.  The  overall  integral  of  f(z)  will  be  the  same 
if  taken  around  any  path,  but  a path  that  doesn’t  go  through  the  pass  won’t  be 
as  nice  because  it  will  have  to  cancel  out  some  higher  values  of  f(z)  that  could 
have  been  avoided.  Therefore  we  tend  to  get  best  results  by  choosing  a path  that 
goes  through  z0,  in  the  direction  of  increasing  imaginary  part.  This  important 
technique,  due  to  P.  Debye  [Math.  Annalen  67  (1909),  535-558],  is  called  the 
“saddle  point  method.” 

Let’s  get  familiar  with  the  saddle  point  method  by  starting  with  an  example 
for  which  we  already  know  the  answer: 

(n  - 1)!  27 ri  f zn  ^ Z ' 

Our  goal  is  to  find  a good  approximation  for  the  value  of  the  integral  on  the  right 
when  n is  large.  It  will  be  convenient  to  deal  with  f(z)  = ezlzn  by  writing  it  as 
e9  where  g(z)  = z—nlnz ; then  the  saddle  point  occurs  where  g'(z0)  = 1 -n/z0 
is  zero,  namely  at  z0  = n.  If  z = n + it  we  have 


9{z)  = 5(n)  + ]T 


^“,(nW 


fc= 2 


k\ 


— n 


. f it6 

n In  n 1 

2 n 3 7i2 


4n3  5n4 


because  gW(z)  = (-l)fc(*  - 1 )\n/zk  when  k > 2.  Let’s  integrate  f(z)  on  a 
rectangular  path  from  n - im  to  n + im  to  -n  + im  to  -n  - im  ton  - im : 


1 / e2  1 fm  1 f~n 

2M  f ^ dz  = 2^  /_m  + **)  dt+^-iJn  f(t  + lm)  dt 

1 r-m  1 /*« 


im)  dt. 
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Clearly  \f(z)\  < 2 n/(n)  on  the  last  three  sides  of  this  path  if  we  choose  m = 2 n, 
because  \ez\  = eKz  and  \z\  > max(|5ftz|,  |Sz|);  so  we’re  left  with 


1 

27 ri 


/ 


1 

27T 


e3(n+it)  dt  + Q 


Now  we  fall  back  on  a technique  that  we’ve  used  several  times  before  — 
for  example  to  derive  Eq.  5.1.4-(53):  If  /(f)  is  a good  approximation  to  f(t ) 
when  t eA,  and  if  the  sums  YlteB  I/Ml  and  I/Ml  are  both  small,  then 
^2teAuc  /M  is  a g°°d  approximation  to  J2t&AuB  /M-  The  same  idea  applies  to 
integrals  as  well  as  sums.  [This  general  method,  introduced  by  Laplace  in  1782, 
is  often  called  “trading  tails”;  see  C-Math  §9.4.]  If  |f|  < n1^2+e  we  have 


eg(n+it)  = exp 


t1 


it 3 


+ • • 


^eXPl-2^  + 3n2 


2n  3n2 
f2  it3  t*_ 
4 n3 

4 


) 


+ 0(n 


5e  — 3/2 


= ^e-*2/(2n)/1  + ^ + jl 

n”  V 3n2  4n3 

And  when  jt|  > n1/2+e  we  have 


18n4 


+ 0(n9e-3/2)). 


|e9(n+it)|  < |/(„  + inl/a+e)|  = exp  (- 2 ln(l  + n2-l  ))  = 0(€  " ^ ) . 

Furthermore  the  incomplete  gamma  function 


e-t2/(2n)tk  dt  _ 2(fc  — l)/2n(fc+l)/2 


Ik  + 1 n2e\ 

V 2 ’"2~J 


= 0(n0(x)e-”2e/2) 


is  negligible.  Thus  we  can  trade  tails  and  obtain  the  approximation 


2ixnr 

en 

2-nnr> 


/: 


= -*7(2' 


7° + i/3  + i74 


",(1+£+^^+0'“-3/2»)-“ 
18^7«  + 0(»-3'S)), 


where  Ik  = e * /(2n)ffc  dt.  Of  course  /;.  = 0 when  k is  odd.  Otherwise  we 
can  evaluate  Ik  by  using  the  well-known  fact  that 


/: 


t 21  dt  = 


_ r((2Z  + l)/2) 

a(2*  + l)/2 


\/27r 


(2a)(2i+1>/2  fJL 


n^-D 


(20) 


when  a > 0;  see  exercise  39.  Putting  everything  together  gives  us,  for  all  e > 0, 
the  asymptotic  estimate 


1 

(n-  1)! 


(l+0  + 


3 

4n 


15 

18n 


+ 


0(n9e~2 


(21) 


this  result  agrees  perfectly  with  Stirling’s  approximation,  which  we  derived  by 
quite  different  methods  in  1.2.11.2-(ig).  Further  terms  in  the  expansion  of 
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g(n  + it)  would  allow  us  to  prove  that  the  true  error  in  (21)  is  only  0{n~2), 
because  the  same  procedure  yields  an  asymptotic  series  of  the  general  form 

en/(v/2^nn-1/2)(i  + Cl/n  + C2/n2  + . . . + Cm/nm  + 0(n-m- 1))  for  aU  m 

Our  derivation  of  this  result  has  glossed  over  an  important  technicality:  The 
function  In  2 is  not  single- valued  along  the  path  of  integration,  because  it  grows 
by  2tt*  when  we  loop  around  the  origin.  Indeed,  this  fact  underlies  the  basic 
mechanism  that  makes  the  residue  theorem  work.  But  our  reasoning  was  valid 
because  the  ambiguity  of  the  logarithm  does  not  affect  the  integrand  f(z)  = 
ez/zn  when  n is  an  integer.  Furthermore,  if  n were  not  an  integer,  we  could 
have  adapted  the  argument  and  kept  it  rigorous  by  choosing  to  carry  out  the 
integral  (19)  along  a path  that  starts  at  —00,  circles  the  origin  counterclockwise 
and  returns  to  —00.  That  would  have  given  us  Hankel’s  integral  for  the  gamma 
function,  Eq.  1.2. 5 ( 17);  we  could  thereby  have  derived  the  asymptotic  formula 


valid  for  all  real  x as  x — 00. 

So  the  saddle  point  method  seems  to  work  -although  it  isn’t  the  simplest 
way  to  get  this  particular  result.  Let’s  apply  it  now  to  deduce  the  approximate 
size  of  the  Bell  numbers: 


^n  — 1 

(n-  1)! 


1 

27rle 


/ 


e9(z)  dz, 


g(z)  = ez  — rc  In  2. 


(23) 


A saddle  point  for  the  new  integrand  occurs  at  the  point  z0  = £ > 0,  where 

= n.  (24) 

(We  should  actually  write  £(n)  to  indicate  that  £ depends  on  n ; but  that  would 
clutter  up  the  formulas  below.)  Let’s  assume  for  the  moment  that  a little  bird 
has  told  us  the  value  of  £.  Then  we  want  to  integrate  on  a path  where  2 = £ + it. 
and  we  have 


<?(£  + it)  = — n ^ln  £ 


(it)H+  1 
2!  £2 


{it)3  £2  - 2! 
3!  £3 


(it)4  £3  + 3! 
4!  £4 


)• 


By  integrating  on  a suitable  rectangular  path,  we  can  prove  as  above  that  the 
integral  in  (23)  is  well  approximated  by 


e9(t,)-na2t2  -nia3t3  +naiti  + ■■■  ^ 


Ik 


£fc-1+(— l)fe(A;— 1)! 

k\  ’ 


(25) 


see  exercise  43.  Noting  that  aktk  is  0{nke~k /2)  inside  this  integral,  we  obtain  an 
asymptotic  expansion  of  the  form 


^n  — 1 


eet~1(n  — 1)! 
£n_1  \/  27rn(£  + 1) 


(26) 
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where  (4  + 1 )3k'bk  is  a polynomial  of  degree  4 k in  4-  (See  exercise  44.)  For 
example, 

_ 2^4  — 3^3  — 20^2  — 184+2  , , 

bl  ~ 24(4  + 1)3  ; (27) 

_ 448  — 156£7  — 6954®  — 69645  + 1092£4  + 2916£3  + 197242  — 724+4 

2 1I52(4  + 1)« ' <3S> 

Stirling’s  approximation  (21)  can  be  used  in  (26)  to  prove  that 

exp(n(4-l  + i)  -^-Iln(4+1)-1- (2g) 

and  exercise  45  proves  the  similar  formula 

= «*(»(*  - 1 + 1)  - \ >»«  + D - 1 - j§;  + o^)2)'  (3°) 

Consequently  we  have  wn/wn^i  « ffi  - n/4-  More  precisely, 

^ - !(-<))' 

But  what  is  the  asymptotic  value  of  4?  The  definition  (24)  implies  that 
4 = Inn  - In 4 = Inn  - ln(lnn  — In 4) 

= ]„„-l„l„„  + o(tet22);  (33) 

V log  n ) 

and  we  can  go  on  in  this  vein,  as  shown  in  exercise  49.  But  the  asymptotic 
series  for  4 developed  in  this  way  never  gives  better  accuracy  than  0(l/(logn)m) 
for  larger  and  larger  m;  so  it  is  hugely  inaccurate  when  multiplied  by  n in 
formula  (29)  for  tun_ 1 or  formula  (30)  for  wn. 

Thus  if  we  want  to  use  (29)  or  (30)  to  calculate  good  numerical  approxima- 
tions to  Bell  numbers,  our  best  strategy  is  to  start  by  computing  a good  numerical 
value  for  4,  without  using  a slowly  convergent  series.  Newton’s  rootfinding 
method,  discussed  in  the  remarks  preceding  Algorithm  4.7N,  yields  the  efficient 
iterative  scheme 

4o  = In  n,  4fc+i  = 1(1  + 4o  ~ ln4fc),  (33) 

which  converges  rapidly  to  the  correct  value.  For  example,  when  n = 100  the 
fifth  iterate 

4s  = 3.38563  01402  9005018488  82443  64529  72686  74917-  (34) 

is  already  correct  to  40  decimal  places.  Using  this  value  in  (29)  gives  us  successive 
approximations 

(1.6176088053  . . . , 1.6187421339  . . . , 1.6187065391  . . . , 1.6187060254  . . . ) x 10114 

when  we  take  terms  up  to  1,  bi/n,  b^/n2,  63/n3  into  account;  the  true  value 
of  w99  is  the  115-digit  integer  16187060274460  . . . 20741. 
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Fig.  54.  The  Stirling  numbers  I™} 
are  greatest  near  m = 28  and  m = 29. 
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Now  that  we  know  the  number  of  set  partitions  wn,  let’s  try  to  figure  out 
how  many  of  them  have  exactly  to  blocks.  It  turns  out  that  nearly  all  partitions 
of  {1, . . . , n}  have  roughly  n/ £ — blocks,  with  about  £ elements  per  block.  For 
example,  Fig.  54  shows  a histogram  of  the  Stirling  numbers  { ” } when  n = 100' 
in  that  case  fa  29.54. 

We  can  investigate  the  size  of  { } by  applying  the  saddle  point  method  to 
formula  1.2.9-(23),  which  states  that 


n! 


= -r[^](ez-l)r 
to! 


n!  1 
to!  27 xi 


f 


,mln(e2-l)-(n+l)lnz 


dz. 


(35) 


Let  a - (n  + l)/m.  The  function  g(z)  = a"1  ln(ez  - 1)  - lnz  has  a saddle  point 
at  cr  > 0 when 


cr 


1 - e~° 


(36) 


Notice  that  a > 1 for  1 < to  < n.  This  special  value  cr  is  given  by 


<7  = «-/?,  p = T(ae~a),  (37) 

where  T is  the  tree  function  of  Eq.  2.3.4.4-(3o).  Indeed,  P is  the  value  between 
0 and  1 for  which  we  have 

Pe~0  = ae~“;  (38) 

the  function  xe~x  increases  from  0 to  er1  when  x increases  from  0 to  1,  then  it 
decreases  to  0 again.  Therefore  fi  is  uniquely  defined,  and  we  have 


e = 


p' 


All  such  pairs  a and  P are  obtainable  by  using  the  inverse  formulas 

0 eCT  _ a 


a = 


e°-  - 1’ 


P = 


1’ 


(39) 


(40) 


for  example,  the  values  a — In  4 and  P — In  2 correspond  to  cr  = In  2. 

We  can  show  as  above  that  the  integral  in  (35)  is  asymptotically  equivalent  to 
an  integral  of  e(«+Ds(^  dz  over  the  path  z = a +it.  (See  exercise  58.)  Exercise  56 
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proves  that  the  Taylor  series  about  z — <r, 

t\  1-/3) 


g(cr  + it)  = g(a) 


2 a2 


-E 


fe= 3 


fc! 


5(feV), 


has  the  property  that 

|5(fc)((r)|  < 2 (k  - 1)!  (1  - p)/crk  for  all  k > 0. 


(4i) 


(42) 


Therefore  we  can  conveniently  remove  a factor  of  N — (n  + 1)(1  — /?)  from  the 
power  series  (n  + 1)5(2),  and  the  saddle  point  method  leads  to  the  formula 


ra!  1 

m\  (a  — 1 3)n~rnf3m\/2iTN 


r 5i  ^2 

1 + ~rz  + ~rp:  + • 

v N N2 


■ + ^i+° 

Nl 


(jV'+O)  ^43) 


as  N — > 00,  where  (1  — ft)2kb^  is  a polynomial  in  a and  /?.  (The  quantity 
(a  — /?)n_m/?m  in  the  denominator  comes  from  the  fact  that  (ea  — l)m/<xn  = 
(a//3  - l)m/(a  - /?)",  by  (37)  and  (39).)  For  example, 


6 - /?3  - 4a/?2  - a2/?  5(2  - /?2  - a/?)2 

8(1-0)  24(1 -/?)2 


(44) 


Exercise  57  proves  that  /V  — ► 00  if  and  only  if  n — m — > 00.  An  asymptotic  ex- 
pansion for  { } similar  to  (43),  but  somewhat  more  complicated,  was  first 
obtained  by  Leo  Moser  and  Max  Wyman,  Duke  Math.  J.  25  (1957),  29-43. 

Formula  (43)  looks  a bit  scary  because  it  is  designed  to  apply  over  the 
entire  range  of  block  counts  m.  Significant  simplifications  are  possible  when  m 
is  relatively  small  or  relatively  large  (see  exercises  60  and  61);  but  the  simplified 
formulas  don’t  give  accurate  results  in  the  important  cases  when  {”}  is  largest. 
Let’s  look  at  those  crucial  cases  more  closely  now,  so  that  we  can  account  for 
the  sharp  peak  illustrated  in  Fig.  54. 

Let  = n as  in  (24),  and  suppose  m = exp(£  + r/y/n)  = ner/^"/(;  we 
will  assume  that  |r|  < n€,  so  that  m is  near  c>A  The  leading  term  of  (43)  can  be 
rewritten 


n!  1 

m!  (a  - /3)n-m/3mv/27r(n  + 1)(1  - 0) 

mn  (n+1)!  en+1  fiyn-n  e-0m  , 

m!"  {n  + l)n+1  y/2ir{n  + 1)  V ~ a)  45) 

and  Stirling’s  approximation  for  (n  + 1)!  is  evidently  ripe  for  cancellation  in  the 
midst  of  this  expression.  With  the  help  of  computer  algebra  we  find 


mn 

m\ 


( 


')); 
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and  the  relevant  quantities  related  to  a and  /3  are 


- = - + + C(£3n2c_2); 

a n ny/n 


g-/3m  _ exp  I 


3n2e_1 


1) 


\fn 


Therefore  the  overall  result  is 

{ e£+”^ } = ^ exp("(«  - 1 + 1)  - 1 - 1 


+ 0(tn 


3„2e-l 


2£  V ' 6(£  + 

The  squared  expression  on  the  last  line  is  zero  when 


(46) 


r 


£(2£  + 3) 

2(£  + 1)\/^ 


+ 0(^2n-3/2); 


thus  the  maximum  occurs  when  the  number  of  blocks  is 


m 


n 

? 


3 + 2$ 
2 + 2£ 


(47) 


By  comparing  (46)  to  (30)  we  see  that  the  largest  Stirling  number  {"}  for  a 
given  value  of  n is  approximately  equal  to  £m„/y/2n n. 

The  saddle  point  method  applies  to  problems  that  are  considerably  more 
difficult  than  the  ones  we  have  considered  here.  Excellent  expositions  of  advanced 
techniques  can  be  found  in  several  books:  N.  G.  de  Bruijn,  Asymptotic  Methods 
in  Analysis  (1958),  Chapters  5 and  6;  F.  W.  J.  Olver,  Asymptotics  and  Special 
Functions  (1974),  Chapter  4;  R.  Wong,  Asymptotic  Approximations  of  Integrals 
(2001),  Chapters  2 and  7. 


* Random  set  partitions.  The  sizes  of  blocks  in  a partition  of  {l,...,n} 
constitute  by  themselves  an  ordinary  partition  of  the  number  n.  Therefore 
we  might  wonder  what  sort  of  partition  they  are  likely  to  be.  Figure  50  in 
Section  7.2. 1.4  showed  the  result  of  superimposing  the  Ferrers  diagrams  of  all 
p(25)  — 1958  partitions  of  25;  those  partitions  tended  to  follow  the  symmetrical 
curve  of  Eq.  7.2.1 .4 -(49).  By  contrast,  Fig.  55  shows  what  happens  when  we 
superimpose  the  corresponding  diagrams  of  all  w25  « 4.6386  x 1018  partitions 
of  the  set  {1, . . . , 25}.  Evidently  the  “shape”  of  a random  set  partition  is  quite 
different  from  the  shape  of  a random  integer  partition. 

This  change  is  due  to  the  fact  that  some  integer  partitions  occur  only  a few 
times  as  block  sizes  of  set  partitions,  while  others  are  extremely  common.  For 
example,  the  partition  n = 1 + 1 + • • • + 1 arises  in  only  one  way,  but  if  n is 
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Fig.  55.  The  shape  of  a random 
set  partition  when  n = 25. 


even  the  partition  n = 2 + 2 H h 2 arises  in  (n  - l)(n  - 3) ...  (1)  ways.  When 

n = 25,  the  integer  partition 

25  = 4 + 4 + 3 + 3 + 3 + 2 + 2 + 2 + 1 + 1 


actually  occurs  in  more  than  2%  of  all  possible  set  partitions.  (This  particular 
partition  turns  out  to  be  most  common  in  the  case  n = 25.  The  answer  to 
exercise  1.2.5-21  explains  that  exactly 

n! 

c\\  l!Cl  c2!  2!C2  ...c„!n!c"  ^ 

set  partitions  correspond  to  the  integer  partition  n = Ci  ■ 1 + c2  • 2 H h cn  • n.) 

We  can  easily  determine  the  average  number  of  fc-blocks  in  a random  par- 
tition of  {1, ...  ,n}:  If  we  write  out  all  zun  of  the  possibilities,  every  particular 
/.--element  block  occurs  exactly  vjn_k  times.  Therefore  the  average  number  is 

)~w  <49> 

An  extension  of  Eq.  (31)  above,  proved  in  exercise  64,  shows  moreover  that 


- 


kt(k£  + k + 1) 

2(£  + l)2n 


+ 0 


)) 


if  k < n2/3, 


(50) 


where  £ is  defined  in  (24).  Therefore  if,  say,  k <ne,  formula  (49)  simplifies  to 


n- 

Jd 


There  are,  on  average,  about  £ blocks  of  size  1,  and  £2/2!  blocks  of  size  2,  etc. 

The  variance  of  these  quantities  is  small  (see  exercise  65),  and  it  turns  out 
that  a random  partition  behaves  essentially  as  if  the  number  of  /c-blocks  were 
a Poisson  deviate  with  mean  £fc//d.  The  smooth  curve  shown  in  Fig.  55  runs 
through  the  points  (/(fc),A?)  in  Ferrers-like  coordinates,  where 

f(k)  = £fc+1/(fc  + 1)!  + £fc+2/(Zs  + 2)!  + £fc+3/(fc  + 3)!  + • • • 


(52) 
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is  the  approximate  distance  from  the  top  line  corresponding  to  block  size  k > 0. 
(This  curve  becomes  more  nearly  vertical  when  n is  larger.) 

The  largest  block  tends  to  contain  approximately  e£  elements.  Furthermore, 
the  probability  that  the  block  containing  element  1 has  size  less  than  £ + 
approaches  the  probability  that  a normal  deviate  is  less  than  a.  [See  John 
Haigh,  J.  Combinatorial  Theory  A13  (1972),  287-295;  V.  N.  Sachkov,  Prob- 
abilistic Methods  in  Combinatorial  Analysis  (1997),  Chapter  4,  translated  from 
a Russian  book  published  in  1978;  Yu.  Yakubovich,  J.  Mathematical  Sciences  87 
(1997),  4124-4137,  translated  from  a Russian  paper  published  in  1995;  B.  Pittel, 
J.  Combinatorial  Theory  A79  (1997),  326-359.] 

A nice  way  to  generate  random  partitions  of  {1, 2, . . . , n}  was  introduced  by 
A.  J.  Stam  in  the  Journal  of  Combinatorial  Theory  A35  (1983),  231  -240:  Let 
M be  a random  integer  that  takes  the  value  m with  probability 


mn 

Pm  — j ; 
era!  wn 


(53) 


these  probabilities  sum  to  1 because  of  (16).  Once  M has  been  chosen,  generate 
a random  n- tuple  XtX2  . . . Xn,  where  each  Xt  is  uniformly  and  independently 
distributed  between  0 and  M — 1.  Then  let  i = j in  the  partition  if  and  only  if 
Xt  = Xy  This  procedure  works  because  each  set  partition  that  has  k blocks  is 
obtained  with  probability  Em>o(TO-/™n)pm  = l/wn. 


For  example,  if  n — 25  we  have 


p4  « .00000372 
p5  « .00019696 
p6  « .00313161 
p7  ss  .02110279 
p8  « .07431024 


Pq  ~ .15689865 
Pio  « .21855285 
Pn  ~ -21526871 
P12  « .15794784 
P13  « .08987171 


p 14  ~ .04093663 
Pi5  « .01531445 
Pi e « .00480507 
P17  « .00128669 
Pis  « .00029839 


P19  « .00006068 
P20  w .00001094 
P21  « .00000176 
P22  ~ .00000026 
P23  « .00000003 


and  the  other  probabilities  are  negligible.  So  we  can  usually  get  a random 
partition  of  25  elements  by  looking  at  a random  25-digit  integer  in  radix  9,  10, 
11,  or  12.  The  number  M can  be  generated  using  3.4.1  (3);  it  tends  to  be 
approximately  n/£  = e?  (see  exercise  67). 


*Partitions  of  a multiset.  The  partitions  of  an  integer  and  the  partitions  of 
a set  are  just  the  extreme  cases  of  a far  more  general  problem,  the  partitions  of 
a multiset.  Indeed,  the  partitions  of  n are  essentially  the  same  as  the  partitions 
of  {1, 1, ... , 1},  where  there  are  n Is. 

From  this  standpoint  there  are  essentially  p(n)  types  of  multisets  with  n el- 
ements. For  example,  five  different  cases  of  multiset  partitions  arise  when  n = 4: 


1234,  123 14,  124|3,  12|34,  12|3|4,  134|2,  13|24,  13|2|4, 

14|23,  14|2|3,  1|234,  1|23|4,  1|24|3,  1|2|34,  1|2|3|4; 
1123,  112|3,  113|2,  11 1 23,  11|2|3,  123|1,  12|13,  12|1|3,  13|1|2,  1|1|23,  1|1|2|3; 
1122,  1 12 1 2,  1 1 1 22,  1 1 1 2|  2,  122|1,  12|12,  12|1|2,  1|1|22,  1|1|2|2; 

1112,  111|2,  1 12|  1,  11 1 12,  1 1 1 1 12,  12|  1 1 1,  1 1 1 1 1 12; 

1111,  m|i,  ii|n,  ii|i|i,  i|i|i|i. 


(54) 
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When  the  multiset  contains  m distinct  elements,  with  nx  of  one  kind,  n2  of 
another,  . . . , and  nm  of  the  last,  we  write  p(nx,  n2, . . . , nm)  for  the  total  number 
of  partitions.  Thus  the  examples  in  (54)  show  that 


p(l,  1,1,1)  = 15,  p( 2, 1,1)  = 11,  p( 2, 2)  = 9,  p(3,l)  = 7,  p(4)  = 5.  (55) 

Partitions  with  m = 2 are  often  called  “bipartitions”;  those  with  m — 3 are 
“tripartitions”;  and  in  general  these  combinatorial  objects  are  known  as  multi- 
partitions. The  study  of  multipartitions  was  inaugurated  long  ago  by  P.  A. 
MacMahon  [ Philosophical  Transactions  181  (1890),  481-536;  217  (1917),  81- 
113;  Proc.  Cambridge  Philos.  Soc.  22  (1925),  951-963];  but  the  subject  is  so  vast 
that  many  unsolved  problems  remain.  In  the  remainder  of  this  section  and  in 
the  exercises  below  we  shall  take  a glimpse  at  some  of  the  most  interesting  and 
instructive  aspects  of  the  theory  that  have  been  discovered  so  far. 

In  the  first  place  it  is  important  to  notice  that  multipartitions  are  essentially 
the  partitions  of  vectors  with  nonnegative  integer  components,  namely  the  ways 
to  decompose  such  a vector  as  a sum  of  such  vectors.  For  example,  the  nine 
partitions  of  {1, 1,2,2}  listed  in  (54)  are  the  same  as  the  nine  partitions  of  the 
bipartite  column  vector  namely 


2 20  20  200  11  11  110  110  1100 

2’  11’  02>  Oil!  2 0 ’ 11>  10l!  002!  0011- 


(56) 


(We  drop  the  + signs  for  brevity,  as  in  the  case  of  one-dimensional  integer 
partitions.)  Each  partition  can  be  written  in  canonical  form  if  we  list  its  parts 
in  nonincreasing  lexicographic  order. 

A fairly  simple  algorithm  suffices  to  generate  the  partitions  of  any  given 
multiset.  I11  the  following  procedure  we  represent  partitions  on  a stack  that 
contains  triples  of  elements  (c,  u,  v),  where  c denotes  a component  number,  u > 0 
denotes  the  yet-unpartitioned  amount  remaining  in  component  c,  and  v denotes 
the  c component  of  the  current  part,  where  0 < v < u.  Triples  are  actually  kept 
in  three  arrays  (co,  Ci, . . . ),  (mq,  ux.  . . . ),  and  (v0,  vx, . . . ) for  convenience,  and  a 
“stack  frame”  array  (/0,  /1, . . . ) is  also  maintained  so  that  the  (l  + l)st  vector  of 
the  partition  consists  of  elements  /;  through  /;+i  — 1 in  the  c,  u,  and  v arrays. 
For  example,  the  following  arrays  would  represent  the  bipartition  f 201  i°i: 


3 

0 1 

2 3 

4 5 

6 7 

8 9 

10 

11 

Cj 

1 2 

1 2 

1 2 

1 2 

1 2 

2 

2 

Ui 

J 

9 9 

Ci 

OO 

4 6 

2 6 

1 5 

4 

1 

VJ 

3 1 

2 2 

2 0 

1 1 

1 1 

3 

1 

0 

rH 

0 

-* 

(O 

00 

t—H 

r— 1 

II 

II 

II 

II 

II 

II 

II 

< 

< 

< 

Algorithm  M ( Multipartitions  in  decreasing  lexicographic  order).  Given  a 
multiset  {nx  ■ 1. . . . , nrn  ■ m \.  this  algorithm  visits  all  of  its  partitions  using  arrays 
fofi  ■ ■ ■ fn,  c0C! . . . cmn,  U0U1  . . . umn,  and  w0^i  • • • vmn  as  described  above,  where 
n — nx  + ■ ■ ■ + nm.  We  assume  that  m > 0 and  nx , . . . , nm  > 0. 
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Ml.  [Initialize.]  Set  Cj  4—  j + 1 and  Uj  4—  Vj  4—  Tij+i  for  0 < j < m;  also  set 
/o  4-  a 4-  l 4-  0 and  /i  4-  t 4-  m.  (In  the  following  steps,  the  current 
stack  frame  runs  from  a to  b - 1,  inclusive.) 

M2.  [Subtract  v from  u.]  (At  this  point  we  want  to  find  all  partitions  of  the 
vector  u in  the  current  frame,  into  parts  that  are  lexicographically  < v. 
First  we  will  use  v itself.)  Set  j 4-  a,  k 4-  b,  and  x 4-  0.  Then  while  j <b 
do  the  following:  Set  uk  4-  -Vj.  If  uk  = 0,  just  set  x 4-  1 and  j 4-  j + 1. 
Otherwise  if  x = 0,  set  ck  4-  cj,  vk  4-  min(u,-,  uk),  x 4-  [uk  < vj],  k 4-  fc  + 1, 
j 4-  j + 1-  Otherwise  set  ck  4-  ch  vk  4-  uk,  k 4-  k + 1,  j 4-  j + 1.  (Notice 
that  x = [v  has  changed].) 

M3.  [Push  if  nonzero.]  If  k > b,  set  a 4-  b,  b 4-  k,  l 4-  l + 1,  /,+1  4-  6,  and 
return  to  M2. 

M4.  [Visit  a partition.]  Visit  the  partition  represented  by  the  l -f  1 vectors 
currently  in  the  stack.  (For  0 < k < l,  the  vector  has  v,  in  component  c7, 
for  fk  <j<  fk+ 1.) 

M5.  [Decrease  u.]  Set  j 4-  b - 1;  while  v3  = 0,  set  j 4-  j - 1.  Then  if  j = a and 
= f»  g°  to  M6.  Otherwise  set  vj  4-  v3  - 1,  and  vk  4-  uk  for  j < k < b. 
Return  to  M2. 

M6.  [Backtrack.]  Terminate  if  l = 0.  Otherwise  set  l 4-  Z - 1,  b 4-  a,  a 4-  fh 
and  return  to  M5.  | 

The  key  to  this  algorithm  is  step  M2,  which  decreases  the  current  residual  vector, 
u,  by  the  largest  permissible  part,  v;  that  step  also  decreases  v,  if  necessary,  to 
the  lexicographically  largest  vector  < v that  is  less  than  or  equal  to  the  new 
residual  amount  in  every  component.  (See  exercise  68.) 

Let  us  conclude  this  section  by  discussing  an  amusing  connection  between 
multipartitions  and  the  least-significant-digit-first  procedure  for  radix  sorting 
(Algorithm  5.2.5R).  The  idea  is  best  understood  by  considering  an  example.  See 
Table  1,  where  Step  (0)  shows  nine  4-partite  column  vectors  in  lexicographic 
order.  Serial  numbers  ®-®  have  been  attached  at  the  bottom  for  identifica- 
tion. Step  (1)  performs  a stable  sort  of  the  vectors,  bringing  their  fourth  (least 
significant)  entries  into  decreasing  order;  similarly,  Steps  (2),  (3),  and  (4)  do  a 
stable  sort  on  the  third,  second,  and  top  rows.  The  theory  of  radix  sorting  tells 
us  that  the  original  lexicographic  order  is  thereby  restored. 

Suppose  the  serial  number  sequences  after  these  stable  sorting  operations  are 
respectively  04,  0.304 , 02O3O4.  and  04  0-20:304 , where  the  a’s  are  permutations; 
Table  1 shows  the  values  of  04,  q3,  o2.  and  04  in  parentheses.  And  now  comes 
the  point:  Wherever  the  permutation  a}  has  a descent,  the  numbers  in  row  j 
after  sorting  must  also  have  a descent,  because  the  sorting  is  stable.  (These 
descents  are  indicated  by  caret  marks  in  the  table.)  For  example,  where  0:3  has 
8 followed  by  7,  we  have  5 followed  by  3 in  row  3.  Therefore  the  entries  aj  ...  <19  in 
row  3 after  Step  (2)  are  not  an  arbitrary  partition  of  their  sum;  they  must  satisfy 

ai  > a2  > <23  > <24  > 05  > 06  > (17  > as  > a9. 
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Table  1 

RADIX  SORTING  AND  MULTIPARTITIONS 


Step  (0):  Original  partition 
655432100 
32104564  2 
6 6 3 1 1 5 2 0 7 
4 2 1 3 3 1 1 2 5 

©(DO®©©©®® 


Step  (1):  Sort  row  4 
064350521 
230424156 
761160352 
5A4  3 3a2  2a 1 1 1 

@®@©©®@©© 

( 9A1  4 5A2  8a3  6 7 ) 


Step  (2):  Sort  row  3 
065251430 
2325  1 6044 
766  5a3  2a 1 10 
5 4 2 1 1 1 3 3 2 

®®©©@©@©® 
a3  = ( 1 2 5 8a7  9,3  4 6 ) 


Step  (3):  Sort  row  2 
123060554 
6A5  4 4a3a2  2 1 0 
2 5 1 0 6 7 6 3 1 
1 1 3 2 4 5 2 1 3 

©©©®®@©@® 

a2 = ( 6a4  8 9A2A 1 357) 


Step  (4):  Sort  row  1 
655  4A3A2A1  0 0 
321045642 
6 6 3 1 1 5 2 0 7 
421331125 

®©@®©©©®® 

ai  = ( 5 7 8 9A3A2A1  4 6 ) 


But  the  numbers  (ai  — 2,  a 2 — 2,  a3  — 2,  04  — 2,  as  — 1,  ag  — 1,  ay,  as,  ag)  do  form  an 
essentially  arbitrary  partition  of  the  original  sum,  minus  (4  + 6).  The  amount  of 
decrease,  4 + 6,  is  the  sum  of  the  indices  where  descents  occur;  this  number  is 
what  we  called  inda3,  the  “index”  of  a3,  in  Section  5.1.1. 

Thus  we  see  that  any  given  partition  of  an  m-partite  number  into  at  most  r 
parts,  with  extra  zeros  added  so  that  the  number  of  columns  is  exactly  r,  can 
be  encoded  as  a sequence  of  permutations  am  of  {1, . . . ,r}  such  that 

the  product  ai...  am  is  the  identity,  together  with  a sequence  of  ordinary  one- 
dimensional partitions  of  the  numbers  (n  1 - ind ax,  . . . , nm  - indam)  into  at 
most  r parts.  For  example,  the  vectors  in  Table  1 represent  a partition  of 
(26,27,31,22)  into  9 parts;  the  permutations  alt  ...,  a4  appear  in  the  table, 
and  we  have  (indaq, . . . , ind  04)  = (15, 10, 10, 11);  the  partitions  are  respectively 

26-15  = (322111100),  27-10  = (332222210), 

31-10  - (544321110),  22-11  = (221111111). 

Conversely,  any  such  permutations  and  partitions  will  yield  a multipartition 
of  («!,..., nm).  If  r and  m are  small,  it  can  be  helpful  to  consider  these 
r!m  1 sequences  of  one-dimensional  partitions  when  listing  or  reasoning  about 
multipartitions,  especially  in  the  bipartite  case.  [This  construction  is  due  to 
Basil  Gordon,  J.  London  Math.  Soc.  38  (1963),  459-464.] 

A good  summary  of  early  work  on  multipartitions,  including  studies  of 
partitions  into  distinct  parts  and/or  strictly  positive  parts,  appears  in  a paper 
by  M.  S.  Cheema  and  T.  S.  Motzkin,  Proc.  Symp.  Pure  Math.  19  (Arner.  Math. 
Soc.,  1971),  39-70. 

EXERCISES 

1.  [20]  (G.  Hutchinson.)  Show  that  a simple  modification  to  Algorithm  H will 
generate  all  partitions  of  {1, . . . , n}  into  at  most  r blocks,  given  n and  r > 2. 
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► 2.  [22]  When  set  partitions  are  used  in  practice,  we  often  want  to  link  the  elements 

of  each  block  together.  Thus  it  is  convenient  to  have  an  array  of  links  h...ln  and  an 
array  of  headers  hi  . . . ht  so  that  the  elements  of  the  jth  block  of  a t-block  partition 
are  ii  > ■ ■ ■ > ik,  where 


ii—hj,  12  — /,!,  ...,  ik  = lik_l,  and  lik  = 0. 

For  example,  the  representation  of  137|25|489|6  would  have  t = 4,  l\  . . ,lg  = 001020348 
and  hi  . . . h4  = 7596. 

Design  a variant  of  Algorithm  H that  generates  partitions  using  this  representation. 
3.  [M23]  What  is  the  millionth  partition  of  {1, . . . , 12}  generated  by  Algorithm  H? 

► 4.  [21]  If  xi . . . xn  is  any  string,  let  p(x i . . . xn)  be  the  restricted  growth  string  that 
corresponds  to  the  equivalence  relation  j = k <S=>  Xj  = xk.  Classify  each  of  the 
five-letter  English  words  in  the  Stanford  GraphBase  by  applying  this  p function;  for 
example,  p(tooth)  = 01102.  How  many  of  the  52  set  partitions  of  five  elements  are  rep- 
resentable by  English  words  in  this  way?  What’s  the  most  common  word  of  each  type? 

5.  [22]  Guess  the  next  elements  of  the  following  two  sequences:  (a)  0,  1,  1,  1,  12,  12. 
12,  12,  12,  12,  100,  121,  122,  123,  123,  . . . ; (b)  0,  1,  12,  100,  112,  121,  122,  123, 

► 6.  [25]  Suggest  an  algorithm  to  generate  all  partitions  of  {1, . . . ,n}  in  which  there 
are  exactly  c\  blocks  of  size  1,  C2  blocks  of  size  2,  etc. 

7.  [M20]  How  many  permutations  ax  . . . an  of  {l,...,n}  have  the  property  that 
flfc-i  > ak  > a,j  implies  j > k ? 

8.  [20]  Suggest  a way  to  generate  all  permutations  of  {1, . . . ,n}  that  have  exactly 
m left-to-right  minima. 

9.  [M20]  How  many  restricted  growth  strings  ai...an  contain  exactly  k3  occurrences 
of  j,  given  the  integers  k0,  kX)  . . kn-i? 

10.  [25]  A semilabeled  tree  is  an  oriented  tree  in  which  the  leaves  are  labeled  with  the 
integers  {1, . . . ,k},  but  the  other  nodes  are  unlabeled.  Thus  there  are  15  semilabeled 
trees  with  5 vertices: 


1 


1234 


Find  a one-to-one  correspondence  between  partitions  of  {l,...,n}  and  semilabeled 
trees  with  n + 1 vertices. 

►11.  [28]  We  observed  in  Section  7.2. 1.2  that  Dudeney’s  famous  problem  send -(-more  = 
money  is  a “pure”  alphametic,  namely  an  alphametic  with  a unique  solution.  His  puzzle 
corresponds  to  a set  partition  on  13  digit  positions,  for  which  the  restricted  growth 
string  p(sendmoremoney)  is  0123456145217;  and  we  might  wonder  how  lucky  he  had  to 
be  in  order  to  come  up  with  such  a construction.  How  many  restricted  growth  strings  of 
length  13  define  pure  alphametics  of  the  form  aia2a3a4  + a5a6a7as  = a9aioaliai2a13? 

12.  [MSI  ] ( The  partition  lattice.)  If  /7  and  77'  are  partitions  of  the  same  set,  we  write 
77  ■<  77'  if  x = \ y (modulo  77)  whenever  x = y (modulo  77').  In  other  words,  77  A 77' 
means  that  IT  is  a “refinement”  of  77,  obtained  by  partitioning  zero  or  more  of  the 
latter’s  blocks;  and  77  is  a “crudification”  or  coalescence  of  77',  obtained  by  merging 
zero  or  more  blocks  together.  This  partial  ordering  is  easily  seen  to  be  a lattice,  with 
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77  V 77'  the  greatest  common  refinement  of  77  and  77',  and  with  77  A 77'  their  least 
common  coalescence.  For  example,  the  lattice  of  partitions  of  {1,2, 3, 4}  is 


if  we  represent  partitions  by  restricted  growth  strings  01020304;  upward  paths  in  this 
diagram  take  each  partition  into  its  refinements.  Partitions  with  t blocks  appear  on 
level  t from  the  bottom,  and  their  descendants  form  the  partition  lattice  of  {1, ...  ,7}. 

a)  Explain  how  to  compute  77  V 77',  given  ax...an  and  a!x  . . ,a'n. 

b)  Explain  how  to  compute  77  A 77',  given  ax...an  and  o{  . . . a'n. 

c)  When  does  77'  cover  77  in  this  lattice?  (See  exercise  7.2.1.4-55.) 

d)  If  77  has  t blocks  of  sizes  si,  . . . , st,  how  many  partitions  does  it  cover? 

e)  If  77  has  t blocks  of  sizes  sx,  . . . , st,  how  many  partitions  cover  it? 

f)  True  or  false:  If  77  V 77'  covers  77,  then  77'  covers  77  A 77'. 

g)  True  or  false:  If  77'  covers  77  A 77',  then  77  V 77'  covers  77. 

h)  Let  6(  II ) denote  the  number  of  blocks  of  77.  Prove  that 

6(77) + 6(77')  < 6(77  V 77')  + 6(77  A 77'). 

13.  [ M28 ] (Stephen  C.  Milne,  1977.)  If  A is  a set  of  partitions  of  {1,  ...,n},  its 
shadow  dA  is  the  set  of  all  partitions  77'  such  that  77  covers  77'  for  some  77  g A.  (We 
considered  the  analogous  concept  for  the  subset  lattice  in  7.2.1.3-(54).) 

Let  77i,  772,  ...  be  the  partitions  of  {1, . . . ,n}  into  t blocks,  in  lexicographic  order 
of  their  restricted  growth  strings;  and  let  77{,  77a,  ...  be  the  (t  - l)-block  partitions, 
also  in  lexicographic  order.  Prove  that  there  is  a function  fnt{N)  such  that 

d{n1, . . . , 77jv}  = {77{, . . . , 77}nt(N)}  for  0 < TV  < | ™ j. 

Hint:  The  diagram  in  exercise  12  shows  that  (/43(0), . . . , /43(6))  = (0,  3, 5,  7,  7,  7,  7). 

14.  [23]  Design  an  algorithm  to  generate  set  partitions  in  Gray-code  order  like  (7). 

15.  [M21]  What  is  the  final  partition  generated  by  the  algorithm  of  exercise  14? 

16.  [16]  The  list  (11)  is  Ruskey’s  A35;  what  is  A35? 

17.  [26]  Implement  Ruskey’s  Gray  code  (8)  for  all  m-block  partitions  of  {1, ... , n}. 

18.  [MJ6]  For  which  n is  it  possible  to  generate  all  restricted  growth  strings  ax...an 
in  such  a way  that  some  a3  changes  by  ±1  at  each  step? 

19.  [28]  Prove  that  there’s  a Gray  code  for  restricted  growth  strings  in  which,  at  each 
step,  some  aj  changes  by  either  ±1  or  ±2,  when  (a)  we  want  to  generate  all  wn  strings 
“i  • • ■ +;  °r  (b)  we  want  to  generate  only  the  {^}  cases  with  max(cn, . . . ,an)  = m-  1. 
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20.  [17]  If  77  is  a partition  of  {1, ...  ,n},  its  conjugate  77r  is  defined  by  the  rule 

j = k (modulo  77t)  n+l-j  = n+l-k  (modulo  77). 

Suppose  77  has  the  restricted  growth  string  001010202013;  what  is  the  restricted  growth 
string  of  TIL  ? 

21.  [M27]  How  many  partitions  of  {1, . . . , n}  are  self-conjugate? 

22.  [M2 3]  If  X is  a random  variable  with  a given  distribution,  the  expected  value  of 
A is  called  the  nth  moment  of  that  distribution.  What  is  the  nth  moment  when  Y is 
(a)  a Poisson  deviate  with  mean  1 (Eq.  3.4.1-(4o))?  (b)  the  number  of  fixed  points  of 
a random  permutation  of  {1, . . . , m},  when  m > n (Eq.  1.3.3-(27))? 

23.  [HM30]  If  /(*)  = J2  “kxk  is  a polynomial,  let  f{w)  stand  for  '£akwk. 

a)  Prove  the  symbolic  formula  /(w  + 1)  = (For  example,  if  f(x)  is  the 

polynomial  x , this  formula  states  that  w2  + 2^  + u70  = zu3.) 

b)  Similarly,  prove  that  f(w  + k)  = zu±f(tv)  for  all  positive  integers  k. 

c)  If  p is  prime,  prove  that  xun+p  = wn  + zun+1  (modulo  p).  Hint:  Show  first  that 
x-  = xp  - X. 

d)  Consequently  wn+N  = wn  (modulo  p)  when  N = pp_1  + pp_2  + • • • -F  p -)-  1 

24.  [HM35]  Continuing  the  previous  exercise,  prove  that  the  Bell  numbers  satisfy  the 
periodic  law  wn+pC-lN  = wn  (modulo  pe),  if  p is  an  odd  prime.  Hint:  Show  that 

xp~=ge{x)  + 1 (modulo  pe,  p°-'9l{x),  . . . , and  ppe- 1(*)),  where  9j(x)  = (xp-x-l)< 

25.  [M27]  Prove  that  Wn/wn^  < wn+1/wn  < zun/mn-1  + 1. 

► 26.  [M22]  According  to  the  recurrence  equations  (13),  the  numbers  wnk  in  Peirce’s 
triangle  count  the  paths  from  @)  to  (Tl)  in  the  infinite  directed  graph 


Explain  why  each  path  from  © to  <lT)  corresponds  to  a partition  of  (1, ... , n}. 

► 27._[M?5]  A “vacillating  tableau  loop”  of  order  n is  a sequence  of  integer  partitions 
Afc  = akiak2ak3  . . . with  akl  > ak2  > ak3  > ■ ■ ■ for  0 < k < 2 n,  such  that  A0  = \2n  = e0 
and  Afc  = Afc_!  + (~l)ketk  for  1 < k < 2n  and  for  some  tk  with  0 < tk  < n;  here 
ef  denotes  the  unit  vector  0,-110n-(  when  0 < t < n,  and  e0  is  all  zeros. 

a)  List  all  the  vacillating  tableau  loops  of  order  4.  [Hint:  There  are  15  altogether.] 

b)  Prove  that  exactly  wnk  vacillating  tableau  loops  of  order  n have  t2k-i  =0. 

► 28.  [M25]  ( Generalized  rook  polynomials.)  Consider  an  arrangement  of  m H am 

square  cells  in  rows  and  columns,  where  row  k contains  cells  in  columns  1 ak 

Place  zero  or  more  “rooks”  into  the  cells,  with  at  most  one  rook  in  each  row'  and  at 
most  one  in  each  column.  An  empty  cell  is  called  “free”  if  there  is  no  rook  to  its  right 
and  no  rook  below.  For  example,  Fig.  56  shows  two  such  placements,  one  with  four 
rooks  in  rows  of  lengths  (3, 1,4, 1,5, 9, 2, 6, 5),  and  another  with  nine  on  a 9 x 9 square 

oard.  Rooks  are  indicated  by  solid  circles;  hollow  circles  have  been  placed  above  and 
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to  the  left  of  each  rook,  thereby  leaving  the  free  cells  blank. 
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Fig.  56.  Rook  placements 
and  free  cells. 


Let  R(ai, . . . , am ) be  the  polynomial  in  x and  y obtained  by  summing  xry!  over  all 
legal  rook  placements,  where  r is  the  number  of  rooks  and  / is  the  number  of  free  cells; 
for  example,  the  left-hand  placement  in  Fig.  56  contributes  x4y17  to  the  polynomial 
R(3, 1,4, 1,5, 9, 2, 6, 5). 

a)  Prove  that  we  have  R(a  i,...,am)  = R(<u, . . . , a^-i,  aj+1,  a, , aJ+2, . . . , am);  in 
other  words,  the  order  of  the  row  lengths  is  irrelevant,  and  we  can  assume  that 
ai  > • ■ ■ > am  as  in  a tableau  shape  like  7. 2. 1.4  (13). 

b)  If  ai  > • • • > am  and  if  hi  ...  bn  = (a  1 . . . am)T  is  the  conjugate  partition,  prove 
that  R{a  1, . . . ,am)  = R(bx,. . 

c)  Find  a recurrence  for  evaluating  R(a  1, . . . , am)  and  use  it  to  compute  R( 3, 2, 1). 

d)  Generalize  Peirce’s  triangle  (12)  by  changing  the  addition  rule  (13)  to 

runk(x,y)  = xw(n_1)k(x,y)  + ywn{k+1){x,y),  1 < k < n. 

Thus  w2i(x,y)  = x+y,  w32(x,y)  = x+xy+y2,  w3X(x,y ) = x2 +2xy+xy2 +y3 , etc. 
Prove  that  the  resulting  quantity  wnk(x,  y)  is  the  rook  polynomial  R(a  1, . . . , o„_  1) 
where  dj  = n — j — [j  < k], 

e)  The  polynomial  zunX  (x,  y)  in  part  (d)  can  be  regarded  as  a generalized  Bell  number 
zun(x,  y),  representing  paths  from  © to  (n)  in  the  digraph  of  exercise  26  that  have 
a given  number  of  “x  steps”  to  the  northeast  and  a given  number  of  “ y steps”  to 
the  east.  Prove  that 


wn{x,y)  = ]T  xn-1-max(ai’  -'a")2/ai+-+“'* 

a 1 ...On 

summed  over  all  restricted  growth  strings  ax  . . . a„  of  length  n. 

29.  [ M26 ) Continuing  the  previous  exercise,  let  Rr(ax, . . . , am)  = [xr]  R(ax , . . . , am) 
be  the  polynomial  in  y that  enumerates  free  cells  when  r rooks  are  placed, 

a)  Show  that  the  number  of  ways  to  place  n rooks  on  an  n x n board,  leaving  / cells 
free,  is  the  number  of  permutations  of  { 1, . . . , n}  that  have  / inversions.  Thus,  by 
Eq.  5.1.1  (8)  and  exercise  5.1.2  16,  we  have 


Rn(n, . . . ,n)  — n\y  — J"J (1  + y + ■ ■ • + yk  :). 


fc=l 


b)  What  is  Rr(n, . . . ,n),  the  generating  function  for  r rooks  on  an  m x n board? 

c)  If  ax  > ■ ■ ■ > am  > 0 and  t is  a nonnegative  integer,  prove  the  general  formula 


n^r 


,aj  — 


3 = 1 


m ft 

(t- 


(t  - k)\y 


Rm  — k (Ol , • • • ? O'm  ) • 
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[Note:  The  quantity  t\v/(t-k)\v  = is  zero  when  k >t>  0. 

Thus,  for  example,  when  t = 0 the  right-hand  side  reduces  to  i?m(ai, . . . , am).  We 
can  compute  Rm,  i?m_i,  . . . , R„  by  successively  setting  t = 0,  1,  . . . , m.\ 

d)’ffll  - “2  - ^ fl™  ^ 0 fnd  a'l  > ai  > > a'm  > 0,  show  that  we 

ave  R.{  a-j , a2  . . . ,om)  _ R(a'1,a2, a'm ) if  and  only  if  the  associated  multisets 

{aj  + 1,  a2+2, . . . , am+m}  and  {a'x+1,  a'2+2, . . . ,a'm+m } are  the  same. 

30.  [HM30]  The  generalized  Stirling  number  {”}9  is  defined  by  the  recurrence 


Thus  { "J,  is  a polynomial  in  q:  and  is  the  ordinary  Stirling  number  because 
it  satisfies  the  recurrence  relation  in  Eq.  1.2.6-(46). 

a)  Prove  that  the  generalized  Bell  number  wn{x,  y)  = R(n- 1, . . . , 1)  of  exercise  28(e) 
has  the  explicit  form 


Wn(x,»)  = . 

b)  Show  that  generalized  Stirling  numbers  also  obey  the  recurrence 


c)  Find  generating  functions  for  {”}9,  generalizing  1.2.9-(23)  and  1.2.9  (28). 

31.  [HM23]  Generalizing  (15),  show  that  the  elements  of  Peirce’s  triangle  have  a 
simple  generating  function,  if  we  compute  the  sum 


T!  Wnk 
n,k 


wn~k  z fc~1 


(n  — k)!  (k  — 1)! ' 


32.  [M2 2}  Let  S„  be  the  number  of  restricted  growth  strings  ai  . . ,a„  for  which  the 
sum  ai  + • • • + a„  is  even  minus  the  number  for  which  a,  + • • • + an  is  odd.  Prove  that 

<Sn  = (1,0, -1,-1, 0,1)  when  n mod  6 = (1,  2, 3, 4, 5, 0). 

Hint:  See  exercise  28(e). 

33.  [M21]  How  many  partitions  of  {1,2, ...  ,n}  have  1 ^ 2,  2 ^ 3,  . . . , k — 1 ^ k? 

34.  [14]  Many  poetic  forms  involve  rhyme  schemes,  which  are  partitions  of  the  lines 
of  a stanza  with  the  property  that  j = k if  and  only  if  line  j rhymes  with  line  k.  For 
example,  a “limerick”  is  generally  a 5-line  poem  with  certain  rhythmic  constraints  and 
with  a rhyme  scheme  described  by  the  restricted  growth  string  00110. 

What  rhyme  schemes  were  used  in  the  classical  sonnets  by  (a)  Guit.tone  d’ Arezzo 
c 1270)?  (b)  Petrarch  (c.  1350)?  (c)  Spenser  (1595)?  (d)  Shakespeare  (1609)? 
(e)  Elizabeth  Barrett  Browning  (1850)? 

35.  [M21]  Let  w'n  be  the  number  of  schemes  for  n-line  poems  that  are  “completely 
rhymed,”  m the  sense  that  every  line  rhymes  with  at  least  one  other.  Thus  we  have 

{w0,  TOi,  w2,  . ■ . ) = (1,  0,  1,  1,  4,  11,  41,  . . . ).  Give  a combinatorial  proof  of  the  fact 
that  TVn  + Wn+1  = SE7„. 

36.  [M22]  Continuing  exercise  35,  what  is  the  generating  function  £n  vo'rizn/n\'! 
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37.  [M18]  Alexander  Pushkin  adopted  an  elaborate  structure  in  his  poetic  novel 
Eugene  Onegin  (1833),  based  not  only  on  “masculine”  rhymes  in  which  the  sounds  of 
accented  final  syllables  agree  with  each  other  (pain-gain,  form-warm,  pun-fun,  bucks- 
crux),  but  also  on  “feminine”  rhymes  in  which  one  or  two  unstressed  syllables  also  par- 
ticipate (humor-tumor,  tetrameter-pentameter,  lecture-conjecture,  iguana-piranha). 
Every  stanza  of  Eugene  Onegin  is  a sonnet  with  the  strict  scheme  01012233455477, 
where  the  rhyme  is  feminine  or  masculine  according  as  the  digit  is  even  or  odd.  Several 
modern  translators  of  Pushkin’s  novel  have  also  succeeded  in  retaining  the  same  form 
in  English  and  German. 


How  do  I justify  this  stanza?  / These  feminine  rhymes?  My  wrinkled  muse? 
This  whole  passd  extravaganza?  / How  can  I (careless  of  time)  use 
The  dusty  bread  molds  of  Onegin  / In  the  brave  bakery  of  Reagan? 
The  loaves  will  surely  fail  to  rise  / Or  else  go  stale  before  my  eyes. 
The  truth  is,  I can’t  justify  it.  / But  as  no  shroud  of  critical  terms 
Can  save  my  corpse  from  boring  worms,  / I may  as  well  have  fun  and  try  it. 

If  it  works,  good ; and  if  not,  well,  / A theory  won't  postpone  its  knell. 

— VIKRAM  SETH,  The  Golden  Gate  (1986) 

A 14-line  poem  might  have  any  of  4 = 24,011,157  complete  rhyme  schemes, 
according  to  exercise  35.  But  how  many  schemes  are  possible  if  we  are  allowed  to 
specify,  for  each  block,  whether  its  rhyme  is  to  be  feminine  or  masculine? 

► 38.  [M30]  Let  <rk  be  the  cyclic  permutation  (1,2 ,k).  The  object  of  this  exercise 
is  to  study  the  sequences  k\k2  . . -kn,  called  a-cycles , for  which  rrkl<Tk2  ...akn  is  the 
identity  permutation.  For  example,  when  n = 4 there  are  exactly  15  cr-cycles,  namely 


1111,  1122, 1212, 1221, 1333,  2112, 2121,  2211, 2222,  2323, 3133, 3232, 3313, 3331, 4444. 

a)  Find  a one-to-one  correspondence  between  partitions  of  (1,  2, . . . ,n}  and  cr-cycles 
of  length  n. 

b)  How  many  cr-cycles  of  length  n have  1 < ki,  . . . , kn  < m,  given  m and  n? 

c)  How  many  cr-cycles  of  length  n have  ki  = j,  given  i,  j,  and  n? 

d)  How  many  cr-cycles  of  length  n have  fci,  . . . , kn  > 2? 

e)  How  many  partitions  of  {1, . . . , n}  have  1 ^ 2,  2 ^ 3,  . . . , n - 1 ^ n,  and  n ^ 1? 

39.  [HM16]  Evaluate  /0ooe-t,  tq  dt  when  p and  q are  nonnegative  integers.  Hint: 
See  exercise  1.2.5-20. 


40.  [ HM20 ] Suppose  the  saddle  point  method  is  used  to  estimate  [zn  1]ecz.  The 
text’s  derivation  of  (21)  from  (19)  deals  with  the  case  c = 1;  how  should  that  derivation 
change  if  c is  an  arbitrary  positive  constant? 

41.  [ HM21 ] Solve  the  previous  exercise  when  c = — 1. 


42.  [ HM23 ] Use  the  saddle  point  method  to  estimate  [zn  x]  e2*  with  relative  error 
0(l/n2). 

43.  [HM22]  Justify  replacing  the  integral  in  (23)  by  (25). 

44.  [HM22]  Explain  how  to  compute  61,  b2,  ...  in  (26)  from  a2,  a3,  . . . in  (25). 

► 45.  [HM23]  Show  that,  in  addition  to  (26),  we  also  have  the  expansion 


„ b'i  b'2 

1 + — + 4 + ' 


£,ns/2ir n(£  + 1)  V n 
where  b\  = -(2£4  + 9£3  + 16£2  + 6£  + 2)/(24(£  + l)3 


+ ^ + o 

nm 


(nm+l))’ 
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46.  [HM25]  Estimate  the  value  of  zunk  in  Peirce’s  triangle  when  n — > oo. 

47.  [ M21 ] Analyze  the  running  time  of  Algorithm  H. 

48.  [HM25]  If  n is  not  an  integer,  the  integral  in  (23)  can  be  taken  over  a Hankel 
contour  to  define  a generalized  Bell  number  wx  for  all  real  x > 0.  Show  that,  as  in  (16), 


1 -A  kx 

Ux  ~ p2^h  \' 


e ^ k\ 
k= 0 


► 49.  [HM35]  Prove  that,  for  large  n,  the  number  £ defined  in  Eq.  (24)  is  equal  to 

ok 


In  1 


In  In  n + ^ 

j,k>  0 


j + k 
3 + 1 


k\  ’ 


_ 1 _ In  In  n 

Inn’  Inn 


► 50.  [HM21]  If  £(n)es(n)  = n and  £(n)  > 0,  how  does  £(n  + k)  relate  to  £(n)? 

51.  [HM27]  Use  the  saddle  point  method  to  estimate  tn  = n!  [ zn ] ez+z2/ 2,  the  number 
of  involutions  on  n elements  (aka  partitions  of  {1, . . . ,n}  into  blocks  of  sizes  < 2). 

52.  [HM22]  The  cumulants  of  a probability  distribution  are  defined  in  Eq.  1.2.10- 

(23).  What  are  the  cumulants,  when  the  probability  that  a random  integer  equals  k is 
(a)  ei-‘(wktk/k\?  (b)  -i/k\? 


► 53.  [HM30]  Let  G(z ) = ^2^L0PkZk  be  the  generating  function  for  a discrete  prob- 
ability distribution,  converging  for  \z\  < 1 + <5;  thus  the  coefficients  pk  are  non- 
negative, G(l)  = 1,  and  the  mean  and  variance  are  respectively  p = G'(l)  and 
a ~ p (1)  + Cr(l)  — G(l)  .If  XU  . . . , Xn  are  independent  random  variables  having 
this  distribution,  the  probability  that  + • • • + Xn  = m is  [ zm } G{z)n,  and  we  often 
want  to  estimate  this  probability  when  m is  near  the  mean  value  // // . 

Assume  that  Po  / 0 and  that  no  integer  d > 1 is  a common  divisor  of  all 
subscripts  k with  pk  / 0;  this  assumption  means  that  m does  not  have  to  satisfy 
any  special  congruence  conditions  mod  d when  n is  large.  Prove  that 


[^n+r]G(z)n 


asnAoo, 


when  pn  + r is  an  integer.  Hint:  Integrate  G(z)n/zfln+r  on  the  circle  \z\  = 1. 

54.  [HM20]  If  q and  i3  are  defined  by  (40),  show  that  their  arithmetic  and  geometric 
means  are  respectively  = scoths  and  Vo/?  = scschs,  where  s = o/2. 

55.  [HM20]  Suggest  a good  way  to  compute  the  number  0 needed  in  (43). 

► 56.  [HM26]  Let  g(z ) = a 1 ln(ez  — 1)  — In  2 and  0 = a — 0 as  in  (37) 

a)  Prove  that  (_<r)»+ifl(»+U(ff)  = n!  _ ^ where  the  Eulerian  num. 

bers  (k)  are  defined  in  Section  5.1.3. 

b)  Prove  that  £n!  < £fc"=0  (nk)ak/3n~k  < n!  for  all  a > 0.  Hint:  See  exercise  5.1.3-25. 

c)  Now  verify  the  inequality  (42). 

57.  [HM22]  In  the  notation  of  (43),  prove  that  (a)  n+l-m  < 2N-,  (b)  N<  2(n+l-m). 

58.  [HM31]  Complete  the  proof  of  (43)  as  follows. 

a)  Show  that  for  all  0 > 0 there  is  a number  r > 2cr  such  that  r is  a multiple  of  27r 
and  |e  1 — 1|/|<t  + it\  is  monotone  decreasing  for  0 < t < r. 

b)  Prove  that  f_T  exp((n  + \)g(o  + it))  dt  leads  to  (43). 

c)  Show  that  the  corresponding  integrals  over  the  straight-line  paths  2 = t ± ir  for 
— n < t < a and  2 = — n ± it  for  — r < t < r are  negligible. 
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► 59.  [HM23]  What  does  (43)  predict  for  the  approximate  value  of  {"}? 

60.  [HM25]  (a)  Show  that  the  partial  sums  in  the  identity 

f n \ _ mn  (m  - l)n  (m  - 2)n  m 0n 

ImJ  m\  1!  (m  — 1)!  2!(m-2)!  m\ 0! 

alternately  overestimate  and  underestimate  the  final  value,  (b)  Conclude  that 

{ } = — -(l  — 0(ne~n  ))  when  m < n1_i. 

ImJ  m!  ' ~ 

(c)  Derive  a similar  result  from  (43). 

61.  [HM26]  Prove  that  if  m = n — r where  r < ne  and  e < n1/2,  Eq.  (43)  yields 


2rr! 


(i  + o(„2-1)  + o(i)). 


62.  [HM40]  Prove  rigorously  that  if  ^ei  = n,  the  maximum  { ” } occurs  either  when 
m = |_e4  — lj  or  when  m = — 1]. 

► 63.  [ M35 ] (J.  Pitman.)  Prove  that  there  is  an  elementary  way  to  locate  the  maximum 
Stirling  numbers,  and  many  similar  quantities,  as  follows:  Suppose  0 < Pj  < 1. 

a)  Let  f(z ) = (1+Pi(z— 1)) . . . {l+pn[z— 1))  and  a*,  = [ zk\  thus  ak  is  the  proba- 

bility that  k heads  turn  up  after  n independent  coin  flips  with  the  respective  prob- 
abilities pi,  . . . , pn ■ Prove  that  ak~ 1 < a k whenever  k < p = pi  + • • • +pn,  ak  / 0. 

b)  Similarly,  prove  that  afc+1  < ak  whenever  k > p and  a*,  / 0. 

c)  If  f(x)  — ao  + aix  + ••■-)-  anxn  is  any  nonzero  polynomial  with  nonnegative 
coefficients  and  with  n real  roots,  prove  that  ak-i  < ak  when  k < //  and  afc+1  < ak 
when  k > p,  where  p = /'(1)//(1).  Therefore  if  am  = max(a0, . . . , an)  we  must 
have  either  m = [pj  or  m = \p~\. 

d)  Under  the  hypotheses  of  (c),  and  with  a,j  = 0 when  j < 0 or  j > n,  show  that 
there  are  indices  s < t,  such  that  ak+i  - ak  < ak  - ak-i  if  and  only  if  s < k < t. 
(Thus,  a histogram  of  the  sequence  (a0,  oi, . . . , a„)  is  always  “bell-shaped.”) 

e)  What  do  these  results  tell  us  about  Stirling  numbers? 

64.  [ HM21 ] Prove  the  approximate  ratio  (50),  using  (30)  and  exercise  50. 

► 65.  [HM22]  What  is  the  variance  of  the  number  of  blocks  of  size  k in  a random 
partition  of  (1, . . . ,n}? 

66.  [M46]  What  partition  of  n leads  to  the  most  partitions  of  {1, . . . , n}? 

67.  [ HM20 ] What  are  the  mean  and  variance  of  M in  Stam’s  method  (53)? 

68.  [21]  How  large  can  variables  l and  b get  in  Algorithm  M,  when  that  algorithm  is 
generating  all  p(n  1, . . . , nm)  partitions  of  {m  • 1, . . . , nm  • m}? 

► 69.  [22]  Modify  Algorithm  M so  that  it  produces  only  partitions  into  at  most  r parts. 

► 70.  [M22]  Analyze  the  number  of  r-block  partitions  possible  in  the  n-element  multi- 
sets (a)  {0, . . . , 0, 1};  (b)  {1, 2, . . . , n — 1,  n — 1}.  What  is  the  total,  summed  over  r? 

71.  [M20]  How  many  partitions  of  {m  • 1, . . . , nm  • m}  have  exactly  2 parts? 

72.  [ M26 ] Can  p(n,n)  be  evaluated  in  polynomial  time? 

► 73.  [M32]  Can  p( 2, . . . , 2)  be  evaluated  in  polynomial  time  when  there  are  n 2s? 

74.  [M46]  Can  p(n, . . . , n)  be  evaluated  in  polynomial  time  when  there  are  n ns? 

75.  [HM41]  Find  the  asymptotic  value  of  p(n,  n). 

76.  [HM36]  Find  the  asymptotic  value  of  p( 2, . . . , 2)  when  there  are  n 2s. 

77.  [HM46]  Find  the  asymptotic  value  of  p(n, . . . ,n)  when  there  are  n ns. 
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78.  [20]  What  partition  of  (15, 10, 10, 11)  leads  to  the  permutations  Ql,a2  a3  and 

a.4  shown  in  Table  1? 

79.  [22]  A sequence  «i,  u2,  u3,  ...  is  called  universal  for  partitions  of  {1, . . . ,n}  if 
its  subsequences  («m+i,  um+2, . . . , um+n)  for  0 < m < wn  represent  all  possible  set 

^ltnITnUod^.the  COnvention  “J  = k if  and  only  if  Um+j  = um+k .”  For  example, 
(.11,9,0, 1,0,  2,  2)  is  a universal  sequence  for  partitions  of  {1,2,3}. 

Write  a program  to  find  all  universal  sequences  for  partitions  of  {1,2, 3, 4}  with 
the  properties  that  (i)  ui  = u2  = u3  = u4  = 0;  (ii)  the  sequence  has  restricted  growth; 
(m)  0 < Uj  < 3;  and  (iv)  u16  = «i7  = «is  = 0 (hence  the  sequence  is  essentially  cyclic). 

80.  [Jim]  Prove  that  universal  cycles  for  partitions  of  {1,2,...,  n}  exist  in  the  sense 
ol  the  previous  exercise  whenever  n > 4. 

81.  [29]  Find  a way  to  arrange  an  ordinary  deck  of  52  playing  cards  so  that  the  fol- 
lowing trick  is  possible:  Five  players  each  cut  the  deck  (applying  a cyclic  permutation) 
as  often  as  they  like.  Then  each  player  takes  a card  from  the  top.  A magician  tells 
them  to  look  at  their  cards  and  to  form  affinity  groups,  joining  with  others  who  hold 
the  same  suit:  Everybody  with  clubs  gets  together,  everybody  with  diamonds  forms 
another  group,  and  so  on.  (The  Jack  of  Spades  is,  however,  considered  to  be  a “joker”- 
its  holder,  if  any,  should  remain  aloof.) 

Observing  the  affinity  groups,  but  not  being  told  any  of  the  suits,  the  magician 
can  name  all  five  cards,  if  the  cards  were  suitably  arranged  in  the  first  place. 

82.  [22]  In  how  many  ways  can  the  following  15  dominoes,  optionally  rotated,  be 
partitioned  into  three  sets  of  five  having  the  same  sum  when  regarded  as  fractions? 

B+B+i+l+S  - B+l+i+!+l  - i+B+!+l+! 

Just  as  in  a single  body  there  are  pairs  of  individual  members, 
called  by  the  same  name  but  distinguished  as  right  and  left, 
so  when  my  speeches  had  postulated  the  notion  of  madness, 
as  a single  generic  aspect  of  human  nature, 
the  speech  that  divided  the  left-hand  portion 
repeatedly  broke  it  down  into  smaller  and  smaller  parts. 

SOCRATES,  Phaedrus  266A  (c.  370  B.C.) 

7.2. 1.6.  Generating  all  trees.  We’ve  now  completed  our  study  of  the  classical 
concepts  of  combinatorics:  tuples,  permutations,  combinations,  and  partitions. 
But  computer  scientists  have  added  another  fundamental  class  of  patterns  to 
the  traditional  repertoire,  namely  the  hierarchical  arrangements  known  as  trees. 
Trees  sprout  up  just  about  everywhere  in  computer  science,  as  we’ve  seen  in 
Section  2.3  and  in  nearly  every  subsequent  section  of  The  Art  of  Computer 
Programming.  Therefore  we  turn  now  to  the  study  of  simple  algorithms  by 
which  trees  of  various  species  can  be  explored  exhaustively. 

First  let’s  review  the  basic  connection  between  nested  parentheses  and  for- 
ests of  trees.  For  example, 

12  345  678  9a  b cdef 

(())((())((()(()))())(()(())))  m 

12  34  5 6789a  b cdef 


| Li  | 
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illustrates  a string  containing  fifteen  left  parens  ‘ (’  labeled  1,  2,  . . . , f , and  fifteen 
right  parens  ‘)  ’ also  labeled  1 through  f ; gray  lines  beneath  the  string  show  how 
the  parentheses  match  up  to  form  fifteen  pairs  12,  21,  3f , 44,  53,  6a,  78,  85,  97, 
a6,  b9,  ce,  db,  ed,  and  fc.  This  string  corresponds  to  the  forest 


in  which  the  nodes  are  (12) , (21) , (Sf) , . . . , (fc)  in  preorder  (sorted  by  first  coor- 
dinates) and  @ , (12) , (53) , . . . , (3f)  in  postorder  (sorted  by  second  coordinates). 
If  we  imagine  a worm  that  crawls  around  the  periphery  of  the  forest, 


(3) 


seeing  a ( whenever  it  passes  the  left  edge  of  a node  and  a ‘)  ’ whenever  it  passes 
a node’s  right  edge,  that  worm  will  have  reconstructed  the  original  string  (1). 
The  forest  in  (2)  corresponds,  in  turn,  to  the  binary  tree 


via  the  “natural  correspondence”  discussed  in  Section  2.3.2;  here  the  nodes  are 
©’  @>  ©>  •••>  @ in  symmetric  order,  also  known  as  inorder.  The  left 
subtree  of  node  (x)  in  the  binary  tree  is  the  leftmost  child  of  (x)  in  the  forest, 
or  it  is  an  “external  node”  □ if  @ is  childless.  The  right  subtree  of  @ in  the 
binary  tree  is  its  right  sibling  in  the  forest,  or  □ if  @ is  the  rightmost  child  in 
its  family.  Roots  of  the  trees  in  the  forest  are  considered  to  be  siblings,  and  the 
leftmost  root  of  the  forest  is  the  root  of  the  binary  tree. 
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Table  1 

NESTED  PARENTHESES  AND  RELATED  OBJECTS  WHEN  n = 4 
a\ a-2  . . . as  forest  binary  tree  di<f2d3cLi 


()()()() 

• • • • 

\ 

1111 

()()(()) 

••  I 

1102 

()(())() 

• I • 

fa 

1021 

()(()()) 

• A 

1012 

()((())) 

* i 

'jp 

1003 

(())()() 

I ” 

0211 

(())(()) 

: : 

‘^h 

0202 

(()())() 

A • 

0121 

(()()()) 

a. 

0112 

(()(())) 

0103 

((()))() 

l * 

0031 

((())()) 

I"* 

jCti" 

0022 

((()())) 

A 

0013 

(((()))) 

i 

£ 

0004 

21222324 

P1P2P3P4 

C1C2C3C4 

matching 

1357 

1234 

0000 

O 

1356 

1243 

0001 

’I) 

1347 

1324 

0010 

o* 

1346 

1342 

0011 

I'-. 

• 

1345 

1432 

0012 

• 

• , 

1257 

2134 

0100 

\f: 

1256 

2143 

0101 

lth 

. s 

1247 

2314 

0110 

V 

1246 

2341 

0111 

0 

1245 

2431 

0112 

1237 

3214 

0120 

\W' 

1236 

3241 

0121 

1235 

3421 

0122 

•O' 

1234 

4321 

0123 

A string  aia2...a2n  of  parentheses  is  properly  nested  if  and  only  if  it 
contains  n occurrences  of  ‘ (’  and  n occurrences  of  where  the  fcth  ‘ (’  precedes 
the  kth  ‘)  ’ for  1 < k < n.  The  easiest  way  to  explore  all  strings  of  nested  paren- 
theses is  to  visit  them  in  lexicographic  order.  The  following  algorithm,  which 
considers  T to  be  lexicographically  smaller  than  includes  some  refinements 
for  efficiency  suggested  by  I.  Semba  [Inf.  Processing  Letters  12  (1981),  188  192]: 
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Algorithm  P ( Nested  parentheses  in  lexicographic  order).  Given  an  integer 
n > 2,  this  algorithm  generates  all  strings  aia2 . . . a2n  of  nested  parentheses. 
PI.  [Initialize.]  Set  a2k-i  t-  ‘(’  and  a2k  t-  ‘) ’ for  1 < k < n;  also  set  a0 
and  m <—  2n  — 1.  (We  use  ao  as  a sentinel  in  step  P4.) 

P2.  [Visit.]  Visit  the  nested  string  aia2...a2n.  (At  this  point  am  — and 
ak  = ‘)’  for  m < k < 2 n.) 

P3.  [Easy  case?]  Set  am  t-  ‘)  Then  if  om_!  = set  am_i  <-  ‘ (’,  m 4-  m - 1, 
and  return  to  P2. 

P4.  [Find  j]  Set  j <—  m — 1 and  k 2n  — 1.  While  aj  = ‘ set  a3  *)’, 
ak  <—  ‘ (’,  j t—  j — 1,  and  k k — 2. 

P5.  [Increase  n.j.]  Terminate  the  algorithm  if  j = 0.  Otherwise  set  aj  <—  ‘ (’, 
m «—  2n  — 1,  and  go  back  to  P2.  | 

We  will  see  later  that  the  loop  in  step  P4  is  almost  always  short:  The  operation 
aj  ‘)’  is  performed  only  about  | times  per  nested  string  visited,  on  the  average. 

Why  does  Algorithm  P work?  Let  Apq  be  the  sequence  of  all  strings  a that 
contain  p left  parentheses  and  q > p right  parentheses,  where  (,-pa  is  properly 
nested,  listed  in  lexicographic  order.  Then  Algorithm  P is  supposed  to  generate 
Ann,  where  it  is  easy  to  see  that  Apq  obeys  the  recursive  rules 

Apq  ) Ap(q_i),  ( *f  0 < p < q ^ 0 ; A0o  = ej  (5) 

also  Apq  is  empty  if  p < 0 or  p > q.  The  first  element  of  Apq  is  )9-p()  ...  (), 
where  there  are  p pairs  ‘O’;  the  last  element  is  (p)9.  Thus  the  lexicographic 
generation  process  consists  of  scanning  from  the  right  until  finding  a trailing 
string  of  the  form  a3  . . . a2n  = )(p+1)9  and  replacing  it  by  09+1-p0  ...  O. 
Steps  P4  and  P5  do  this  efficiently,  while  step  P3  handles  the  simple  case  p = 0. 

Table  1 illustrates  the  output  of  Algorithm  P when  n = 4,  together  with  the 
corresponding  forest  and  binary  tree  as  in  (2)  and  (4).  Several  other  equivalent 
combinatorial  objects  also  appear  in  Table  1:  For  example,  a string  of  nested 
parentheses  can  be  run-length  encoded  as 

()di0d2  ()d„,  (6) 

where  the  nonnegative  integers  d\d2  ■ ■ - dn  are  characterized  by  the  constraints 

d\  + d2  + • • • + dfc  < k for  1 < k < n;  d\  + d2  + ■ ■ ■ + dn  — n.  (7) 

We  can  also  represent  nested  parentheses  by  the  sequence  zi  z2  . . . zn . which 
specifies  the  indices  where  the  left  parentheses  appear.  In  essence,  ZiZ2  . . . zn  is 
one  of  the  (2")  combinations  of  n things  from  the  set  {1,2, . . . ,2n},  subject  to 
the  special  constraints 

Zfc_  1 < Zfc  < 2k  for  1 < k < n,  (8) 

if  we  assume  that  z0  = 0.  The  z’s  are  of  course  related  to  the  d's: 

dk  = Zk+i  - Zfc  - 1 for  1 < k < n.  (9) 

Algorithm  P becomes  particularly  simple  when  it  is  rewritten  to  generate  the 
combinations  ziz2  . . . zn  instead  of  the  strings  aia2  . . . a2n.  (See  exercise  2.) 
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A parenthesis  string  can  also  be  represented  by  the  permutation  pxp2  p 
where  the  kth  right  parenthesis  matches  the  pfcth  left  parenthesis;  in  other  words’ 
the  fcth  node  of  the  associated  forest  in  postorder  is  the  pkth  node  in  preorder. 

y exercise  2.3.2-20,  node  j is  a (proper)  descendant  of  node  k in  the  forest 
if  and  only  if  j < k and  p3  > pfe,  when  we  label  the  nodes  in  postorder.  The 
inversion  table  Clc2  . . . c„  characterizes  this  permutation  by  the  rule  that  exactly 
ck  elements  to  the  right  of  k are  less  than  k (see  exercise  5.1. 1-7)-  allowable 
inversion  tables  have  ci  = 0 and 

0 < Cfc+1  < cfc  + 1 for  1 < k < n.  (10) 

Moreover,  exercise  3 proves  that  ck  is  the  level  of  the  forest’s  fcth  node  in  preorder 
(the  depth  of  the  kth  left  parenthesis),  a fact  that  is  equivalent  to  the  formula 

ck  = 2k  — 1 — zk.  (1X) 

Table  1 and  exercise  6 also  illustrate  a special  kind  of  matching , by  which  2 n 
people  at  a circular  table  can  simultaneously  shake  hands  without  interference. 

Thus  Algorithm  P can  be  useful  indeed.  But  if  our  goal  is  to  generate  all 
binary  trees,  represented  by  left  links  M2  . . . ln  and  right  links  nr2  . . . rn  the 
exicographic  sequence  in  Table  1 is  rather  awkward;  the  data  we  need  to  get 
from  one  tree  to  its  successor  is  not  readily  available.  Fortunately,  an  ingenious 
alternative  scheme  for  direct  generation  of  all  linked  binary  trees  is  also  available: 

Algorithm  B ( Binary  trees).  Given  n > 1,  this  algorithm  generates  all  binary 
trees  with  n internal  nodes,  representing  them  via  left  links  hl2...ln  and  right 
inks  rir2  ...rn,  with  nodes  labeled  in  preorder.  (Thus,  for  example,  node  1 is 
always  the  root,  and  lk  is  either  Hlor  0;  if  h = 0 and  n > 1 then  rq  = 2.) 

Bl.  [Initialize.]  Set  lk  <-  k + 1 and  rk  <-  0 for  1 < k < n;  also  set  ln  «-  rn  <-  0, 
and  set  ln+1  •<—  1 (for  convenience  in  step  B3). 

B2.  [Visit.]  Visit  the  binary  tree  represented  by  ZXZ2  ...ln  and  rqr2  ...rn. 

B3.  [Find  j.}  Set  j 4-  1.  While  l,  = 0,  set  r}  <-  0,  l3  <-  j + 1,  and  j <-  j + 1 
1 hen  terminate  the  algorithm  if  j > n. 

B4.  [Find  k and  y.}  Set  y <-  l3  and  k <-  0.  While  ry  > 0,  set  k^y  and  y <-  ry. 

B5.  [Promote  y.j  If  k > 0,  set  rk  0;  otherwise  set  l3  0.  Then  set  r <-r- 
rj-h-y,  and  return  to  B2.  | V 3' 

[See  W.  Skarbek,  Theoretical  Computer  Science  57  (1988),  153-159;  step  B3 
uses  an  idea  of  J.  Korsh.]  Exercise  44  proves  that  the  loops  in  steps  B3  and  B4 
both  tend  to  be  very  short.  Indeed,  fewer  than  9 memory  references  are  needed, 
on  the  average,  to  transform  a linked  binary  tree  into  its  successor. 

Table  2 shows  the  fourteen  binary  trees  that  are  generated  when  n = 4 
together  with  their  corresponding  forests  and  with  two  related  sequences:  Arrays 
6162  •■•e”  and  •sis2  • • • sn  are  defined  by  the  property  that  node  k in  preorder  has 
ek  children  and  sk  descendants  in  the  associated  forest.  (Thus  sk  is  the  size  of  k' s 
eft  subtree  in  the  binary  tree;  also,  sk  + 1 is  the  length  of  the  SCOPE  link  in  the 
sense  of  2.3.3-(5).)  The  next  column  repeats  the  fourteen  forests  of  Table  1 in 
the  lexicographic  ordering  of  Algorithm  P,  but  mirror-reversed  from  left  to  right. 
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Table  2 

LINKED  BINARY  TREES  AND  RELATED  OBJECTS  WHEN  n = 4 


W21314 

rir2r3r4 

binary  tree 

forest 

eie2e3e4 

S1S2S3S4 

colex  forest 

lsib/rchild 

2340 

0000 

i 

1110 

3210 

• • • • 

\ 

0340 

2000 

* 1 

0110 

0210 

: •• 

1 

do 

2040 

0300 

2010 

3010 

• : • 

2040 

3000 

lf\£h 

1 : 

1010 

1010 

A * 

0040 

2300 

** 

••  1 

0010 

0010 

i * 

2300 

0040 

A 

1200 

3200 

•• : 

0300 

2040 

* A 

0200 

0200 

: : 

jC^*b 

an 

2300 

0400 

{- 

2100 

3100 

* A 

2300 

4000 

l * 

1100 

2100 

A* 

0300 

2400 

• : * 

0100 

0100 

2000 

0340 

A. 

3000 

3000 

* i 

> 

2000 

4300 

-•s> 

A * 

2000 

2000 

V' 

2000 

3040 

: •* 

1000 

1000 

A 

0000 

2340 

\ 

• • • • 

0000 

0000 

i 

And  the  final  column  shows  the  binary  tree  that  represents  the  colex  forest;  it 
also  happens  to  represent  the  forest  in  column  4,  but  by  links  to  left  sibling  and 
right  child  instead  of  to  left  child  and  right  sibling.  This  final  column  provides  an 
interesting  connection  between  nested  parentheses  and  binary  trees,  so  it  gives 
us  some  insight  into  why  Algorithm  B is  valid  (see  exercise  19). 
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*Gray  codes  for  trees.  Our  previous  experiences  with  other  combinatorial 
patterns  suggest  that  we  can  probably  generate  parentheses  and  trees  by  making 
only  small  perturbations  to  get  from  one  instance  to  another.  And  indeed,  there 
are  at  least  three  very  nice  ways  to  achieve  this  goal. 

Consider  first  the  case  of  nested  parentheses,  which  we  can  represent  by 
the  sequences  z\Z2-..zn  that  satisfy  condition  (8).  A “near-perfect”  way  to 
generate  all  such  combinations,  in  the  sense  of  Section  7.2. 1.3,  is  one  in  which 
we  run  through  all  possibilities  in  such  a way  that  some  component  Zj  changes 
by  ±1  or  ±2  at  each  step;  this  means  that  we  get  from  each  string  of  parentheses 
to  its  successor  by  simply  changing  either  ()<+)(  or  ())<-►))(  in  the  vicinity 
of  the  j/t.h  left  parenthesis.  Here’s  one  way  to  do  the  job  when  n = 4: 

1357, 1356, 1346, 1345, 1347, 1247, 1245, 1246, 1236, 1234, 1235, 1237, 1257, 1256. 

And  we  can  extend  any  solution  for  n — 1 to  a solution  for  n,  by  taking  each 
pattern  Z\Z2  ■ ■ ■ zn-i  and  letting  zn  run  through  all  of  its  legal  values  using  endo- 
order  or  its  reverse  as  in  7 .2.1.3  (45),  proceeding  downward  from  2n  — 2 and  then 
up  to  2n  — 1 or  vice  versa,  and  omitting  all  elements  that  are  < zn_j. 

Algorithm  N ( Near-perfect  nested  parentheses).  This  algorithm  visits  all  n- 
combinations  Zi ...  zn  of  {1, ... , 2n}  that  represent  the  indices  of  left  parentheses 
in  a nested  string,  changing  only  one  index  at  a time.  The  process  is  controlled 
by  an  auxiliary  array  g1  . . . gn  that  represents  temporary  goals. 

Nl.  [Initialize.]  Set  zj  4-  2j  - 1 and  g3  4-  2j  - 2 for  1 < j < n. 

N2.  [Visit.]  Visit  the  n-combination  z± . . . zn.  Then  set  j 4—  n. 

N3.  [Find  j.}  If  zj  = gj , set  gj  4-^  0 1 (thereby  complementing  the  least 
significant  bit),  j 4-  j — 1,  and  repeat  this  step. 

N4.  [Home  stretch?]  If  9j  - Zj  is  even,  set  Zj  4-^  + 2 and  return  to  N2. 

N5.  [Decrease  or  turn.]  Set  t 4—  Zj  — 2.  If  t < 0,  terminate  the  algorithm. 
Otherwise,  if  t < Zj_i,  set  t 4—  t + 2[t  < Zj_  1]  + 1.  Finally  set  2,  4—  t and 
go  back  to  N2.  | 

[A  somewhat  similar  algorithm  was  introduced  by  D.  Rcelants  van  Baronaigien  in 
J.  Algorithms  35  (2000),  100-107;  see  also  Xiang,  Ushijima,  and  Tang,  Inf.  Proc. 
Letters  76  (2000),  169-174.  F.  Ruskey  and  A.  Proskurowski,  in  J.  Algorithms 
11  (1990),  68-84,  had  previously  shown  how  to  construct  perfect  Gray  codes 
for  all  tables  z\  ...  zn  when  n > 4 is  even,  thus  changing  some  Zj  by  only  ±1 
at  every  step;  but  their  construction  was  quite  complex,  and  no  known  perfect 
scheme  is  simple  enough  to  be  of  practical  use.  Exercise  48  shows  that  perfection 
is  impossible  when  n > 5 is  odd.] 

If  our  goal  is  to  generate  linked  tree  structures  instead  of  strings  of  paren- 
theses, perfection  of  the  2-index  changes  is  not  good  enough,  because  simple 
swaps  like  0 4->  )(  don't  necessarily  correspond  to  simple  link  manipulations.  A 
far  better  approach  can  be  based  on  the  “rotation”  algorithms  by  which  we  were 
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able  to  keep  search  trees  balanced  in  Section  6.2.3.  Rotation  to  the  left  changes 


from 


(i3) 


“Node  (a)  becomes  the  leftmost  child  of  its  right  sibling.”  Rotation  to  the  right 
is,  of  course,  the  opposite  transformation:  “The  leftmost  child  of  (b)  becomes 
its  left  sibling.”  The  vertical  line  in  (12)  stands  for  a connection  to  the  overall 
context,  either  a left  link  or  a right  link  or  the  pointer  to  the  root.  Any  or  all 
of  the  subtrees  a,  p,  or  w may  be  empty.  The  ‘ ’ in  (13),  which  represents 

additional  siblings  at  the  left  of  the  family  containing  ( 3 ),  might  also  be  empty. 

The  nice  thing  about  rotations  is  that  only  three  links  change:  The  right 
link  from  (a),  the  left  link  from  (7t),  and  the  pointer  from  above.  Rotations 
preserve  inorder  of  the  binary  tree  and  postorder  of  the  forest.  (Notice  also  that 
the  binary-tree  form  of  a rotation  corresponds  in  a natural  way  to  an  application 
of  the  associative  law 


(afi)oj  = a(fico) 


(m) 


in  the  midst  of  an  algebraic  formula.) 

A simple  scheme  very  much  like  the  classical  reflected  Gray  code  for  n-tuples 
(Algorithm  7.2.1.1H)  and  the  method  of  plain  changes  for  permutations  (Algo- 
rithm 7.2.1.2P)  can  be  used  to  generate  all  binary  trees  or  forests  via  rotations. 
Consider  any  forest  on  n - 1 nodes,  with  k roots  (A),  . . . , (A).  Then  there  are 
k+  1 forests  on  n nodes  that  have  the  same  postorder  sequence  on  the  first  n — 1 
nodes  but  with  node  (n)  last;  for  example,  when  k = 3 they  are 


obtained  by  successively  rotating  (/Q),  (A),  and  (Aj)  to  the  left.  Moreover,  at 
the  extremes  when  (rT)  is  either  at  the  right  or  at  the  top,  we  can  perform 
any  desired  rotation  on  the  other  n — 1 nodes,  because  node  (n)  isn’t  in  the 
way.  Therefore,  as  observed  by  J.  M.  Lucas,  D.  Rcelants  van  Baronaigien,  and 
F.  Ruskey  [J.  Algorithms  15  (1993),  343-366],  we  can  extend  any  list  of  the 
( n — l)-node  trees  to  a list  of  all  n-node  trees  by  simply  letting  node  (rT)  roam 
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back  and  forth.  A careful  attention  to  low-level  details  makes  it  possible  in  fact 
to  do  the  job  with  remarkable  efficiency: 

Algorithm  L ( Linked  binary  trees  by  rotations).  This  algorithm  generates  all 
pairs  of  arrays  l0li . . . ln  and  rq  . . . rn  that  represent  left  links  and  right  links  of 
n-node  binary  trees,  where  l0  is  the  root  of  the  tree  and  the  links  (lk,rk)  point 
respectively  to  the  left  and  right  subtrees  of  the  kth  node  in  symmetric  order. 
Equivalently,  it  generates  all  n-node  forests,  where  lk  and  rk  denote  the  left  child 
and  right  sibling  of  the  kth  node  in  postorder.  Each  tree  is  obtained  from  its  pre- 
decessor by  doing  a single  rotation.  Two  auxiliary  arrays  k\  . . ,kn  and  oqO\  . . . o„, 
representing  backpointers  and  directions,  are  used  to  control  the  process. 

LI.  [Initialize.]  Set  l3  <-  0,  r3  <-  j + 1,  k3  <- j - 1,  and  o3  <-  -1  for  1 < j < n- 
also  set  l0  <-  o0  <-  1,  ln  *-  rn  <-  0,  kn  <-  n - 1,  and  on  < 1. 

L2.  [Visit.]  Visit  the  binary  tree  or  forest  represented  by  l0lt . . . ln  and  r1...rn. 
Then  set  j «-  n and  p 0. 

L3.  [Find  j .]  If  o3  > 0,  set  m <-  l3  and  go  to  L5  if  m / 0.  If  o3  < 0,  set  m <-  k3: 
then  go  to  L4  if  m ± 0,  otherwise  set  p <-  j.  If  m = 0 in  either  case,  set 
°j  i °j > j j - 1,  and  repeat  this  step. 

L4.  [Rotate  left.]  Set  rm  <-  l3,  l3  <-  m,  x <-  km,  and  k3  <-  x.  If  x = 0,  set 
lp  4-  j , otherwise  set  rx  <-  j.  Return  to  L2. 

L5.  [Rotate  right.]  Terminate  if  j = 0.  Otherwise  set  l3  <—  rm,  rm  4-  j,  k3  <—  m, 
x e-  km.  If  x = 0,  set  lp  <—  m,  otherwise  set  rx  <—  m.  Go  back  to  L2.  | 

Exercise  38  proves  that  Algorithm  L needs  only  about  9 memory  references  per 
tree  generated;  thus  it  is  almost  as  fast  as  Algorithm  B.  (In  fact,  two  memory 
references  per  step  could  be  saved  by  keeping  the  three  quantities  on,  Zn,  and  kn 
in  registers.  But  of  course  Algorithm  B can  be  speeded  up  too.) 

Table  3 shows  the  sequence  of  binary  trees  and  forests  visited  by  Algorithm  L 
when  n = 4,  with  some  auxiliary  tables  that  shed  further  light  on  the  process. 
The  permutation  (?i 929394  lists  the  nodes  in  preorder,  when  they  have  been 
numbered  in  postorder  of  the  forest  (symmetric  order  of  the  binary  tree);  it 
is  the  inverse  of  the  permutation  P1P2P3P4  in  Table  1.  The  “coforest”  is  the 
conjugate  (right-to-left  reflection)  of  the  forest;  and  the  numbers  U1U2U3U4  are 
its  scope  coordinates,  analogous  to  S1S2S3S4  in  Table  2.  A final  column  shows 
the  so-called  “dual  forest.”  The  significance  of  these  associated  quantities  is 
explored  in  exercises  11-13,  19,  24,  26,  and  27. 

The  links  l0h...ln  and  rq  . . . rn  in  Algorithm  L and  Table  3 are  not  com- 
parable to  the  links  Zi . . . ln  and  rq  . . . rn  in  Algorithm  B and  Table  2,  because 
Algorithm  L preserves  inorder/postorder  while  Algorithm  B preserves  preorder. 
Node  k in  Algorithm  L is  the  fcth  node  from  left  to  right  in  the  binary  tree,  so 
lo  is  needed  to  identify  the  root;  but  node  k in  Algorithm  B is  the  fcth  node  in 
preorder,  so  the  root  is  always  node  1 in  that  case. 

Algorithm  L has  the  desired  property  that  only  three  links  change  per  step; 
but  we  can  actually  do  even  better  in  this  respect  if  we  stick  to  the  preorder 
convention  of  Algorithm  B.  Exercise  25  presents  an  algorithm  that  generates 
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Table  3 

BINARY  TREES  AND  FORESTS  GENERATED  BY  ROTATIONS  WHEN  n = 4 


tO  W2W4 

rir2r3r4 

k\k2k3k4 

binary  tree 

forest 

<71<J2<J3<14 

coforest 

^1142^3144 

dual 

10000 

2340 

0123 

\ 

• • • • 

1234 

• • • • 

0000 

i 

10003 

2400 

0122 

•• : 

1243 

I ** 

1000 

A 

10002 

4300 

0121 

* A 

1423 

A * 

2000 

40001 

2300 

0120 

A* 

4123 

A\ 

3000 

* I 

40021 

3000 

0110 

4132 

3100 

• A 

10023 

4000 

0111 

> 

* 1 

1432 

I ' 

2100 

A* 

10020 

3040 

0113 

• I ‘ 

1324 

* I * 

0100 

30010 

2040 

0103 

A * 

3124 

* A 

0200 

I I 

40013 

2000 

0100 

A 

4312 

A 

3200 

” I 

40123 

0000 

0000 

i 

4321 

i 

3210 

• • • • 

30120 

0040 

0003 

i * 

3214 

' i 

0210 

I “ 

20100 

0340 

0023 

‘Aa 

: •• 

2134 

•• : 

0010 

I ‘ 

20103 

0400 

0022 

: 1 

2143 

1 1 

1010 

A * 

40102 

0300 

0020 

4213 

3010 

* I ' 

all  linked  binary  trees  or  forests  by  changing  just  two  links  per  step,  preserving 
preorder.  One  link  becomes  zero  while  another  becomes  nonzero.  This  prune- 
and-graft  algorithm,  which  is  the  third  of  the  three  “very  nice  Gray  codes  for 
trees”  promised  above,  has  only  one  downside:  Its  controlling  mechanism  is  a bit 
trickier  than  that  of  Algorithm  L,  so  it  needs  about  40%  more  time  to  do  the  cal- 
culations when  we  include  the  cost  of  deciding  what  links  to  change  at  each  step. 
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The  number  of  trees.  There’s  a simple  formula  for  the  total  number  of  outputs 
that  are  generated  by  Algorithms  P,  B,  N,  and  L,  namely 

n _ 1 (2 n\  (2 n\  ( 2n  \ 

" n + 1 \ n ) InJ  Vn-lJ’  (15) 

we  proved  this  fact  in  Eq.  2.3.4.4-(i4).  The  first  few  values  are 


n=0123456  7 8 9 10  11  12  13 

Cn  = 1 1 2 5 14  42  132  429  1430  4862  16796  58786  208012  742900 
and  they  are  called  Catalan  numbers  because  of  some  influential  papers  written 
by  Eugene  Catalan  [Journal  de  math.  3 (1838),  508-516;  4 (1839),  95-99], 
Stirling  s approximation  tells  us  the  asymptotic  value, 


c 4"  (i  9 , 145  H55  36939 

y/n n3/2  V 8 n 128 n2  1024n3  + 32768n4  + °^n  V’  (l6) 

in  particular  we  can  conclude  that 


Cn  — k 

~c7 


when  fc  < — . 


(And  of  course  Cn^i/Cn  is  equal  to  (n+ l)/(4n  — 2),  exactly,  by  (15).) 
2. 3. 4. 4 we  also  derived  the  generating  function 


(17) 

In  Section 


C(z)  = C0  + ClZ  + C2z2  + C3z3  + ■ • • = ~'^=7TZ  (18) 

and  proved  the  important  formula 

- ^(2n+;_1)  - (2”+;_1)-(2”„+-rr^  ^ 

see  the  answer  to  exercise  2.3.4.4-33,  and  CMath  equation  (5.70). 

These  facts  give  us  more  than  enough  information  to  analyze  Algorithm  P. 
our  algorithm  for  lexicographic  generation  of  nested  parentheses.  Step  P2  is 
obviously  performed  Cn  times;  then  P3  usually  makes  a simple  change  and  goes 
back  to  P2.  How  often  do  we  need  to  go  on  to  step  P4?  Easy:  It’s  the  number 
of  times  that  step  P2  finds  m — 2n  - 1.  And  m is  the  location  of  the  rightmost 
_(’,  so  we  have  m = 2n  - 1 in  exactly  Cn^  cases.  Thus  the  probability  that 
P3  sets  m 4 m 1 and  returns  immediately  to  P2  is  (Cn  - Cn_1)/Cn  ss  3/4. 
by  (17)-On  the  other  hand  when  we  do  get  to  step  P4,  suppose  we  need  to  set 
aj  -f-  ‘)’  and  ak  <-  ‘ (’  exactly  h - 1 times  in  that  step.  The  number  of  cases 
with  h > x is  the  number  of  nested  strings  of  length  2 n that  end  with  x trivial 
pairs  ()...(),  namely  C„_x.  Therefore  the  total  number  of  times  the  algorithm 
changes  a,  and  ak  in  step  P4  is 


Cn  — 1 + Cn  — 2 + • ' ' + Ci 


= Cn{ 


Cn- 1 Cra_  2 Ci 

Cn  Cn  +'"+C~n 


by  (17);  we  have  proved  the  claim  for  efficiency  made  earlier. 


(20) 
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For  a deeper  understanding  it  is  helpful  to  study  the  recursive  structure 
underlying  Algorithm  P,  as  expressed  in  (5).  The  sequences  Apq  in  that  formula 
have  Cpq  elements,  where 


Cpq  Cp(g-l)  + C(p_i)9  , 


if0<p<g^0; 


Coo  — l; 


(21) 


and  Cpq  = 0 if  p < 0 or  p > q.  Thus  we  can  form  the  triangular  array 


C*oo 

1 

C01 

On 

1 

1 

C02 

O2 

C22 

1 

2 

2 

C03 

C13 

C23 

C33 

= 

1 

3 

5 

5 

0)4 

o14 

C24 

C34 

044 

1 

4 

9 

14 

14 

Co  5 

o15 

C25 

C35 

C45 

055 

1 

5 

14 

28 

42 

Coo 

C16 

C26 

C30 

046 

O56  066 

1 

6 

20 

48 

90 

(22) 


42 

132 


132 


in  which  every  entry  is  the  sum  of  its  nearest  neighbors  above  and  to  the  left; 

the  Catalan  numbers  Cn  = Cnn  appear  on  the  diagonal.  The  elements  of  this 

triangle,  which  themselves  have  a venerable  pedigree  going  back  to  de  Moivre 
in  1711,  are  called  “ballot  numbers,”  because  they  represent  sequences  of  p + q 
ballots  for  which  a running  tabulation  never  favors  a candidate  with  p votes  over 
an  opponent  who  receives  q votes.  The  general  formula 

q-p  + l fp  + q\  fp  + q\  fp  + q\  , , 

P ) = ( P Mp-J  <23> 


Cpq  — 


(24) 


(25) 


g+i  \ p / \ p j \p 

can  be  proved  by  induction  or  in  a variety  of  more  interesting  ways;  see  exercise  39 
and  the  answer  to  exercise  2. 2. 1-4.  Notice  that,  because  of  (19),  we  have 

Cpq  = [Zr}C(z)i-P+1. 

When  n = 4,  Algorithm  P essentially  describes  the  recursion  tree 

<1D— — 

(ysT  JS)  @r  (la 

(op  @)  (gp  (02)  Q3)  (QP  Q2)  (op  (Q2)  (§3)  (Ql)  (Q2) 
because  the  specification  (5)  implies  that  Ann  — ( A(n_l)n  and  that 

Apq  = )9  p(A(p_x)p,  )q  p ( A(p_x)(p+1),  )9  p 2(A(p_1)(p+ 2), 

. . . , ( A(p_1)?  when  0 < p < q.  (26) 

The  number  of  leaves  below  node  (pq)  in  this  recursion  tree  is  Cpq . and  node  (pq) 
appears  exactly  C'(n_9)(„_1_p)  times  on  level  n — 1 — p;  therefore  we  must  have 

^ ' C(n  — q)(n  — l—p)  Cpq  — Cn , for  0 ^ p <(  fi.  (27) 


The  fourteen  leaves  of  (25),  from  left  to  right,  correspond  to  the  fourteen  rows 
of  Table  1,  from  top  to  bottom.  Notice  that  the  entries  in  column  C1C2C3C4  of 
that  table  assign  the  respective  numbers  0000,  0001,  0010,  . . . , 0123  to  the  leaves 
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of  (25),  m accord  with  “Dewey  decimal  notation”  for  tree  nodes  (but  with  indices 
starting  at  0 instead  of  1,  and  with  an  extra  0 tacked  on  at  the  beginning). 

A worm  that  crawls  from  one  leaf  to  the  next,  around  the  bottom  of  the 
recursion  tree,  will  ascend  and  descend  h levels  when  h of  the  coordinates  cj  . . . cn 
are  changed,  namely  when  Algorithm  P resets  the  values  of  h (s  and  h )s. 
This  observation  makes  it  easy  to  understand  our  previous  conclusion  that  the 
condition  h > x occurs  exactly  C„_x  times  during  a complete  crawl. 

Yet  another  way  to  understand  Algorithm  P arises  when  we  contemplate  an 
infinite  directed  graph  that  is  suggested  by  the  recursion  (21): 


(28) 


Clearly  Cpq  is  the  number  of  paths  from  © to  (00)  in  this  digraph,  because 
of  (21).  And  indeed,  every  string  of  parentheses  in  Apq  corresponds  directly  to 
such  a path,  with  ‘ (’  signifying  a step  to  the  left  and  ‘)  ’ signifying  a step  upward. 
Algorit  hm  P explores  all  such  paths  systematically  by  trying  first  to  go  upward 
when  extending  a partial  path. 

Therefore  it  is  easy  to  determine  the  ATth  string  of  nested  parentheses  that 
is  visited  by  Algorithm  P,  by  starting  at  node  © and  doing  the  following 
calculation  when  at  node  ©:  If  p = q = 0,  stop;  otherwise,  if  N < Cp ( 1}, 

emit  “)’,  set  q «—  q — 1,  and  continue;  otherwise  set  N <-  AT  - Cp(q_xy  emit 
(’,  set  p i-  p - 1,  and  continue.  The  following  algorithm  [Frank  Ruskey,  Ph  D 
thesis  (University  of  California  at  San  Diego,  1978),  16-24]  avoids  the  need  to 
precompute  the  Catalan  triangle  by  evaluating  Cpq  on  the  fly  as  it  goes: 

Algorithm  U ( Unrank  a string  of  nested  parentheses).  Given  n and  N,  where 
1 <N<Cn,  this  algorithm  computes  the  Nth  output  a1 . . . a2n  of  Algorithm  P. 

Ul.  [Initialize.]  Set  q «-  n and  m <-  p «-  c <-  1.  While  p < n,  set  p <-  p + 1 
and  c «-  ((4p  - 2 )c)/(p  + 1). 

U2.  [Done?]  Terminate  the  algorithm  if  q = 0. 

U3.  [Go  up?]  Set  d ((9  + 1 )(q-  p)c)/((q  + p)(q-p+  1)).  (At  this  point  we 

have  1 < N < c — Cpq  and  d = Cp(q_xy)  If  N < d , set  q «-  q - 1,  c d, 

dm  t—  ‘) ’,  m <—  m + 1,  and  return  to  U2. 

U4.  [Go  left.]  Set  p p - 1,  c c — d,  N N - d,  am  4-  ‘(’,  m <-  m + 1, 

and  return  to  U3.  | 

Random  trees.  We  could  choose  a string  axa2  ...a2n  of  nested  parentheses 
at  random  by  simply  applying  Algorithm  U to  a random  integer  N between  1 
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and  Cn.  But  that  idea  isn’t  really  very  good,  when  n is  bigger  than  32  or  so,  be- 
cause Cn  can  be  quite  large.  A simpler  and  better  way,  proposed  by  D.  B.  Arnold 
and  M.  R.  Sleep  [ACM  Trans.  Prog.  Languages  and  Systems  2 (1980),  122-128], 
is  to  generate  a random  “worm  walk”  by  starting  at  @)  in  (28)  and  repeatedly 
taking  leftward  or  upward  branches  with  the  appropriate  probabilities.  The 
resulting  algorithm  is  almost  the  same  as  Algorithm  U,  but  it  deals  only  with 
nonnegative  integers  less  than  n2  + n + 1: 

Algorithm  W ( Uniformly  random  strings  of  nested  parentheses).  This  algo- 
rithm generates  a random  string  aia2  . . . a2n  of  properly  nested  (s  and  )s. 

Wl.  [Initialize.]  Set  p 4—  q 4—  n and  m 4—  1. 

W2.  [Done?]  Terminate  the  algorithm  if  q = 0. 

W3.  [Go  up?]  Let  A be  a random  integer  in  the  range  0 < X < (q+p){q—p+ 1). 
If  X < [q  + l)(g  — p),  set  q 4—  q — 1,  am  4—  ‘) m 4—  to  + 1,  and  return 
to  W2. 

W4.  [Go  left.]  Set  p p — 1,  am  4-  1 (’,  m 4—  m + 1,  and  return  to  W3.  | 

A worm’s  walk  can  be  regarded  as  a sequence  u’oWi  . . . w 2n,  where  wm  is  the 
worm’s  current  depth  after  m steps.  Thus,  wq  = 0;  wm  = wm-\  + 1 when  am  — 
1 (’;  wm  = wm_x  — 1 when  am  = ‘) and  we  have  wm  > 0,  w2n  = 0.  The  sequence 
w0w  1 . . . w30  corresponding  to  (1)  and  (2)  is  0121012321234345432321232343210. 
At  step  W3  of  Algorithm  W we  have  q+p  — 2n  + l—  m and  q — p = wm- \. 

Let’s  say  that  the  outline  of  a forest  is  the  path  that  runs  through  the  points 
( m , —Wm)  in  the  plane,  for  0 < m < 2 n,  where  WoWi  • • • W2 n is  the  worm  walk 
corresponding  to  the  associated  string  . . . a2„  of  nested  parentheses.  Figure  57 
shows  what  happens  if  we  plot  the  outlines  of  all  50-node  forests  and  darken  each 
point  according  to  the  number  of  forests  that  lie  above  it.  For  example,  w\  is 
always  1,  so  the  triangular  region  at  the  upper  left  of  Fig.  57  is  solid  black. 
But  u>2  is  either  0 or  2,  and  0 occurs  in  C49  sa  C50/4  cases;  so  the  adjacent 
diamond-shaped  area  is  a 75%  shade  of  gray.  Thus  Fig.  57  illustrates  the  shape 
of  a random  forest,  analogous  to  the  shapes  of  random  partitions  that  we’ve  seen 
in  Figs.  50,  51,  and  55  of  Sections  7.2. 1.4  and  7. 2. 1.5. 


Fig.  57.  The  shape  of  a random  50-node  forest. 

Of  course  we  can’t  really  draw  the  outlines  of  all  those  forests,  since  there 
are  C50  = 1,978,261,657,756,160,653,623,774,456  of  them.  But  with  the  help  of 
mathematics  we  can  pretend  that  we’ve  done  so.  The  probability  that  w2m  = 2fc 
is  C(m—k)(m4-k)C(n—m~.k)(n—m+k)/Cni  because  there  are  C(m—k)(vn+k)  ways  to 
start  with  m + k (s  and  m — k )s,  and  C (n_m_fc)(„_m+fc)  ways  to  finish  with 
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Fig*  58.  Locations  of  the  internal  nodes  in  a random  50-node  binary  tree. 


n (m  + k)  (s  and  n — (m  — k)  ) s.  By  (23)  and  Stirling’s  approximation,  this 
probability  is 


(2k  + l)2(n  + 1) 


(m  + k + 1 )(n  — m + k 
(2k  + l)2 


1) 


cm-k)( 


2 n — 2m 
n — m + k 


^±iL_e-e„(l  + o(< 


'(.) 
0+o(S)) 


(29) 


when  m — 0n  and  n 00,  for  0 < 0 < 1.  The  average  value  of  w2to  is  worked 
out  in  exercise  57;  it  comes  to 


= 4 


0(1  — 0)n 


1+0 


(ts)-  <30) 


and  it  is  illustrated  for  n = 50  as  a curved  line  in  Fig.  57. 

When  n is  large,  worm  walks  approach  the  so-called  “Brownian  excur- 
sion,” which  is  an  important  concept  in  probability  theory.  See,  for  example, 
Paul  Levy,  Processus  Stochastiques  et  Mouvement  Brownien  (1948),  225-237; 
Guy  Louchard,  J.  Applied  Prob.  21  (1984),  479-499,  and  BIT  26  (1986),  17- 
34;  David  Aldous,  Electronic  Communications  in  Probability  3 (1998),  79-90; 
Jon  Warren,  Electronic  Communications  in  Probability  4 (1999),  25-29;  J.-F. 
Marckert,  Random  Structures  & Algorithms  24  (2004),  118-132. 

What  is  the  shape  of  a random  binary  tree?  This  question  was  investigated 
by  Frank  Ruskey  in  SIAM  J.  Algebraic  and  Discrete  Methods  1 (1980),  43-50, 
and  the  answer  turns  out  to  be  quite  interesting.  Suppose  we  draw  a binary  tree 
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Fig.  59.  Locations  of  the  external  nodes  in  a random  50-node  binary  tree. 


as  in  (4),  with  the  mth  internal  node  at  horizontal  position  to  when  the  nodes 
are  numbered  in  symmetric  order.  If  all  of  the  50-node  binary  trees  are  drawn 
in  this  way  and  superimposed  on  each  other,  we  get  the  distribution  of  node 
positions  shown  in  Fig.  58.  Similarly,  if  we  number  the  external  nodes  from  0 
to  n in  symmetric  order  and  place  them  at  horizontal  positions  .5,  1.5,  . . . , n + .5, 
the  “fringes”  of  all  50-node  binary  trees  form  the  distribution  shown  in  Fig.  59. 
Notice  that  the  root  node  is  most  likely  to  be  either  number  1 or  number  n,  at 
the  extreme  left  or  right;  it  is  least  likely  to  be  either  [(n  + 1)/2J  or  [(n+  l)/2], 
in  the  middle. 

As  in  Fig.  57,  the  smooth  curves  in  Figs.  58  and  59  show  the  average  node 
depths;  exact  formulas  are  derived  in  exercises  58  and  59.  Asymptotically,  the 
average  depth  of  external  node  m is 


— 1 + Oy-/=J,  when  to  — On  and  n — y 00,  (31) 

for  all  fixed  ratios  9 with  0 < 9 < 1,  curiously  like  (30);  and  the  average  depth 
of  internal  node  m is  asymptotically  the  same,  but  with  ‘ — 1’  replaced  by  ‘-3’. 
Thus  we  can  say  that  the  average  shape  of  a random  binary  tree  is  approximately 
the  lower  half  of  an  ellipse,  n units  wide  and  4 \fnjn  levels  deep. 

Three  other  noteworthy  ways  to  generate  random  encodings  of  forests  are 
discussed  in  exercises  60,  61,  and  62.  They  are  less  direct  than  Algorithm  W, 
yet  they  have  substantial  combinatorial  interest.  The  first  one  begins  with  an 
arbitrary  random  string  containing  n (s  and  n ) s,  not  necessarily  nested;  each 
of  the  ( n ) possibilities  is  equally  likely.  It  then  proceeds  to  convert  every  such 
string  into  a sequence  that  is  properly  nested,  in  such  a way  that  exactly  n + 1 
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strings  map  into  each  final  outcome.  The  second  method  is  similar,  but  it  starts 
with  a sequence  of  n + 1 Os  and  n 2s,  mapping  them  in  such  a way  that  exactly 
2n  + 1 original  strings  produce  each  possible  result.  And  the  third  method 
produces  each  output  from  exactly  n of  the  bit  strings  that  contain  exactly 
n - 1 Is  and  n + 1 Os.  In  other  words,  the  three  methods  provide  combinatorial 
proofs  of  the  fact  that  C„  is  simultaneously  equal  to  (2/*) /(n+1),  (2n+1) /(2n+l), 
and  (n-i)/n-  For  example,  when  n = 4 we  have  14  = 70/5  = 126/9  = 56/4. 

If  we  want  to  generate  random  binary  trees  directly  in  linked  form,  we  can 
use  a beautiful  method  suggested  by  J.  L.  Remy  [RAIRO  Informatique  Theorique 
19  (1985),  179-195],  His  approach  is  particularly  instructive  because  it  shows 
how  random  Catalan  trees  might  actually  occur  “in  nature,”  using  a deliciously 
simple  mechanism  based  on  a classical  idea  of  Olinde  Rodrigues  [J.  de  Math. 
3 (1838),  549],  Let  us  suppose  that  our  goal  is  to  obtain  not  only  an  ordinary 
n-node  binary  tree,  but  a decorated  binary  tree,  namely  an  extended  binary  tree 
in  which  the  external  nodes  have  been  labeled  with  the  numbers  0 to  n in  some 
order.  There  are  (n  + 1)!  ways  to  decorate  any  given  binary  tree;  so  the  total 
number  of  decorated  binary  trees  with  n internal  nodes  is 

Dn  = (n+l)\Cn  = ^ = (4n-2)Dn_1.  (32) 

Remy  observed  that  there  are  4n  — 2 easy  ways  to  build  a decorated  tree  of 
order  n from  a given  decorated  tree  of  order  n - 1:  We  simply  choose  any  one 
of  the  2n  - 1 nodes  (internal  or  external)  in  the  given  tree,  say  x,  and  replace  it 
by  either 


thus  inserting  a new  internal  node  and  a new  leaf  while  moving  x and  its 
descendants  (if  any)  down  one  level. 

For  example,  here’s  one  way  to  construct  a decorated  tree  of  order  6: 


Notice  that  every  decorated  tree  is  obtained  by  this  process  in  exactly  one  way, 
because  the  predecessor  of  each  tree  must  be  the  tree  we  get  by  striking  out  the 
highest-numbered  leaf.  Therefore  Remy’s  construction  produces  decorated  trees 
that  are  uniformly  random;  and  if  we  ignore  the  external  nodes,  we  get  random 
binary  trees  of  the  ordinary,  undecorated  variety. 

One  appealing  way  to  implement  Remy’s  procedure  is  to  maintain  a table  of 
links  LqLi  . . . L2n  where  external  (leaf)  nodes  have  even  numbers  and  internal 
(branch)  nodes  have  odd  numbers.  The  root  is  node  L0;  the  left  and  right 
children  of  branch  node  2k  - 1 are  respectively  L2k- 1 and  L2k.  for  1 < k < n. 
Then  the  program  is  short  and  sweet: 
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Algorithm  R ( Growing  a random  binary  tree).  This  algorithm  constructs  the 
linked  representation  LqL\  . . . L2n  of  a uniformly  random  binary  tree  with  N 
internal  nodes,  using  the  conventions  explained  above. 

Rl.  [Initialize.]  Set  n 0 and  L0  0. 

R2.  [Done?]  (At  this  point  the  links  LqLi  . . . L2n  represent  a random  n-node 
binary  tree.)  Terminate  the  algorithm  if  n = N. 

R3.  [Advance  n.]  Let  X be  a random  integer  between  0 and  4«  + 1,  inclusive. 
Set  n <-  n + 1,  b «—  X mod  2,  k «-  [X/2J,  L2„_ {,  2 n,  L2n-i+b  «-  Lk, 

Lk  4—  2n  — 1,  and  return  to  R2.  | 

*Chains  of  subsets.  Now  that  we’ve  got  trees  and  parentheses  firmly  in  mind, 
it’s  a good  time  to  discuss  the  Christmas  tree  pattern*  which  is  a remarkable 
way  to  arrange  the  set  of  all  2"  bit  strings  of  length  n into  ( ^n"2j ) rows  and  n + 1 
columns,  discovered  by  de  Bruijn,  van  Ebbenhorst  Tengbergen,  and  Kruyswijk 
[Nieuw  Archief  voor  Wiskunde  (2)  23  (1951),  191-193]. 

The  Christmas  tree  pattern  of  order  1 is  the  single  row  ‘0  T;  and  the  pattern 
of  order  2 is 

10  • (35) 

00  01  11 

In  general  we  get  the  Christmas  tree  pattern  of  order  n + 1 by  taking  every  row 
Lo i 2 ■ ■ ■ rrj  of  the  order-n  pattern  and  replacing  it  by  the  two  rows 

cr20  . . . <7S0 

<y i0  (j\l  ...  crs_!l  crsl 

(The  first  of  these  rows  is  omitted  when  s — 1.) 

Proceeding  in  this  way,  we  obtain  for  example  the  pattern  of  order  8 that 
appears  in  Table  4 on  the  next  page.  It  is  easy  to  verify  by  induction  that 

i)  Each  of  the  2”  bit  strings  appears  exactly  once  in  the  pattern. 

ii)  The  bit  strings  with  k Is  all  appear  in  the  same  column. 

iii)  Within  each  row,  consecutive  bit  strings  differ  by  changing  a 0 to  a 1. 

If  we  think  of  the  bit  strings  as  representing  subsets  of  {1, . . . ,n},  with  1-bits 
to  indicate  the  members  of  a set,  property  (iii)  says  that  each  row  represents 
a chain  in  which  each  subset  is  covered  by  its  successor.  In  symbols,  using  the 
notation  of  Section  7.1.3,  every  row  ai  <j2  ...  crs  has  the  property  that  at  C a]+\ 
and  v{(jj+i)  = + 1 for  1 < j < s. 

Properties  (i)  and  (ii)  tell  us  that  there  are  exactly  (£)  elements  in  column  k. 
if  we  number  the  columns  from  0 to  n.  This  observation,  together  with  the  fact 
that  each  row  is  centered  among  the  columns,  proves  that  the  total  number  of 
rows  is  max0<fc<„  (]!)  = (^n”2j) ' fLS  claimed.  Let  us  call  this  number  Mn. 


* This  name  was  chosen  for  sentimental  reasons,  because  the  pattern  has  a general  shape 
not  unlike  that  of  a festive  tree,  and  because  it  was  the  subject  of  the  author’s  ninth  annual 
“Christmas  Tree  Lecture”  at  Stanford  University  in  December  2002. 


(36) 
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00000000 


Table  4 

THE  CHRISTMAS  TREE  PATTERN  OF  ORDER  8 


10101010 

10101000  10101001  10101011 
10101100 

10100100  10100101  10101101 
10100010  10100110  10101110 
10100000  10100001  10100011  10100111  10101111 
10110010 

10110000  10110001  10110011 
10110100 

10010100  10010101  10110101 
10010010  10010110  10110110 
10010000  10010001  10010011  10010111  10110111 
10111000 

10011000  10011001  10111001 
10001010  10011010  10111010 
10001000  10001001  10001011  10011011  10111011 
10001100  10011100  10111100 
10000100  10000101  10001101  10011101  10111101 
10000010  10000110  10001110  10011110  10111110 
10000000  10000001  10000011  10000111  10001111  10011111  10111111 

11001010 

11001000  11001001  11001011 
11001100 

11000100  11000101  11001101 
11000010  11000110  11001110 
11000000  11000001  11000011  11000111  11001111 
11010010 

11010000  11010001  11010011 
11010100 

01010100  01010101  11010101 
01010010  01010110  11010110 
01010000  01010001  01010011  01010111  11010111 
11011000 

01011000  01011001  11011001 
01001010  01011010  11011010 
01001000  01001001  01001011  01011011  11011011 
01001100  01011100  11011100 
01000100  01000101  01001101  01011101  11011101 
01000010  01000110  01001110  01011110  11011110 
01000000  01000001  01000011  01000111  01001111  01011111  11011111 

11100010 

11100000  11100001  11100011 
11100100 

01100100  01100101  11100101 
01100010  01100110  11100110 
01100000  01100001  01100011  01100111  11100111 
11101000 

01101000  01101001  11101001 
00101010  01101010  11101010 
00101000  00101001  00101011  01101011  11101011 
00101100  01101100  11101100 
00100100  00100101  00101101  01101101  11101101 
00100010  00100110  00101110  01101110  11101110 
00100000  00100001  00100011  00100111  00101111  01101111  11101111 

11110000 

01110000  01110001  11110001 
00110010  01110010  11110010 
00110000  00110001  00110011  01110011  11110011 
00110100  01110100  11110100 
00010100  00010101  00110101  01110101  11110101 
00010010  00010110  00110110  01110110  11110110 
00010000  00010001  00010011  00010111  00110111  01110111  11110111 
00111000  01111000  11111000 
00011000  00011001  00111001  01111001  11111001 
00001010  00011010  00111010  01111010  11111010 
ooooiooo  ooooiooi  ooooion  oooiion  ooiiion  oniioii  union 
00001100  00011100  00111100  01111100  11111100 
00000100  00000101  00001101  00011101  00111101  01111101  11111101 
00000010  00000110  00001110  00011110  00111110  01111110  11111110 
00000001  00000011  00000111  00001111  00011111  00111111  01111111  11111111 
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A set  C of  bit  strings  is  called  a clutter,  or  an  “antichain  of  subsets,”  if 
its  bit  strings  are  incomparable  in  the  sense  that  a % r whenever  a and  r are 
distinct  elements  of  C.  A famous  theorem  of  Emanuel  Sperner  [Math.  Zeitschrift 
27  (1928),  544-548]  asserts  that  no  clutter  on  {l,...,n}  can  have  more  than 
Mn  elements;  and  the  Christmas  tree  pattern  provides  a simple  proof,  because 
no  clutter  can  contain  more  than  one  element  of  each  row. 

Indeed,  the  Christmas  tree  pattern  can  be  used  to  show  that  much  more 
is  true.  Let’s  note  first  that  exactly  (”)  - rows  of  length  n + 1 - 2k  are 

present,  for  0 < k < n/2,  because  there  are  exactly  (^)  elements  in  column  k. 
For  example,  Table  4 has  one  row  of  length  9,  namely  the  bottom  row;  it  also 
has  (®)  - (®)  = 7 rows  of  length  7,  (®)  - (®)  = 20  rows  of  length  5,  (®)  - (®)  = 28 
of  length  3,  and  (®)  — (®)  = 14  of  length  1.  Moreover,  these  numbers  (£)  — (fc"1) 
appear  in  the  Catalan  triangle  (22),  because  they’re  equal  to  Ck(n_k ) according 
to  Eq.  (23). 

Further  study  reveals  that  this  Catalan  connection  is  not  simply  a co- 
incidence; nested  parentheses  are,  in  fact,  the  key  to  a deeper  understanding 
of  the  Christmas  tree  pattern,  because  the  theory  of  parentheses  tells  us  where 
an  arbitrary  bit  string  fits  into  the  array.  Suppose  we  use  the  symbols  ( and  ) 
instead  of  1 and  0,  respectively.  Any  string  of  parentheses,  nested  or  not,  can  be 
written  uniquely  in  the  form 

CTO  ) • • ■ Oip—  1 ) OLp  . . . ( CYq  (37) 

for  some  p and  q with  0 < p < q,  where  the  substrings  a0 , . . . , aq  are  properly 
nested  and  possibly  empty;  exactly  p of  the  right  parentheses  and  q - p of  the 
left  parentheses  are  “free”  in  the  sense  that  they  have  no  mate.  For  example, 
the  string 

)(())())())))(((((()(()()((())  (3g) 

has  p = 5,  g = 12,  ao  = e,  ai  = ( () ) () , (*2  = O , £*3  = e,  . . . , £*12  = ( O ) . In 
general,  the  string  (37)  is  part  of  a chain  of  length  q + 1, 

£*0)  • ■ • aq-i ) aq,  a0)  . . . a9_2)a9_i  (a,,  ...,  a0  (aq  . . . (a9,  (39) 

in  which  we  start  with  q free  )s  and  change  them  one  by  one  into  free  (s.  Every 
row  of  the  Christmas  tree  pattern  is  obtained  in  exactly  this  manner,  but  using 
1 and  0 instead  of  ( and  ) ; for  if  the  chain  cry  ...  crs  corresponds  to  the  nested 
strings  ao,  ...,  a3_i,  its  successor  chains  in  (36)  correspond  respectively  to 
ao,  •••>  ots_ 3,  as_2(as_i)  and  to  ao,  ...,  as_ 3,  as_2,  as_  1,  e.  [See  Curtis 
Greene  and  Daniel  J.  Kleitman,  J.  Combinatorial  Theory  A20  (1976),  80-88.] 

Notice  furthermore  that  the  rightmost  elements  in  each  row  of  the  pattern  — 
such  as  10101010,  10101011,  10101100,  10101101,  ...,  11111110,  11111111  in 
the  case  n = 8 — are  in  lexicographic  order.  Thus,  for  example,  the  fourteen 
rows  of  length  1 in  Table  4 correspond  precisely  to  the  fourteen  strings  of 
nested  parentheses  in  Table  1.  This  observation  makes  it  easy  to  generate  the 
rows  of  Table  4 sequentially  from  bottom  to  top,  with  a method  analogous  to 
Algorithm  P;  see  exercise  77. 
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Let  f(x i, . . . ,xn)  be  any  monotone  Boolean  function  of  n variables.  If  c = 
ai...an  is  any  bit  string  of  length  n,  we  can  write  /(cr)  = f(ai, . . . ,an)  for 
convenience.  Any  row  crj  . . . as  of  the  Christmas  tree  pattern  forms  a chain,  so 
we  have 

0 < f{a i)  < •••  < f(as)  < 1.  (4o) 

In  other  words,  there  is  an  index  t such  that  /(cry)  = 0 for  j < t and  /(cry)  = 1 
for  j > t;  we  will  know  the  value  of  /(cr)  for  all  2n  bit  strings  cr  if  we  know  the 
indices  t for  each  row  of  the  pattern. 

Georges  Hansel  [Comptes  Rendus  Acad.  Sci.  (A)  262  (Paris,  1966),  1088- 
1090]  noticed  that  the  Christmas  tree  pattern  has  another  important  property: 
If  o-j-i,  <jj , and  aj+1  are  three  consecutive  entries  of  any  row,  the  bit  string 

aj  ~ ai- 1 © aj  © aj+i  (41) 

lies  in  a previous  row.  In  fact,  cr'  lies  in  the  same  column  as  a3,  and  it  satisfies 

aj-i  Q C (jj+ 1;  (42) 

it  is  called  the  relative  complement  of  cry  in  the  interval  {cr]^1 . . crJ+1).  Hansel’s 
observation  is  easy  to  prove  by  induction,  because  of  the  recursive  rule  (36)  that 
defines  the  Christmas  tree  pattern.  He  used  it  to  show  that  we  can  deduce  the 
values  of  f (cr)  for  all  0 by  actually  evaluating  the  function  at  relatively  few  well- 
chosen  places;  for  if  we  know  the  value  of  /(cr'),  we  will  know  either  or 

/ (a j+i)  because  of  relation  (42). 

Algorithm  H ( Learning  a monotone  Boolean  function).  Let  f(xi, . . . , xn)  be  a 
Boolean  function  that  is  nondecreasing  in  each  Boolean  variable,  but  otherwise 
unknown.  Given  a bit  string  cr  of  length  n,  let  r(cr)  be  the  number  of  the  row 
m which  cr  appears  in  the  Christmas  tree  pattern,  where  1 < r(cr)  < Mn.  If 
1 < ?n  < Mn,  let  s(m)  be  the  number  of  bit  strings  in  row  m;  also  let  y(m,  k)  be 
the  bit  string  in  column  k of  that  row,  for  (n+l-s(m))/2  <k<  (n- l + s(m))/2. 
This  algorithm  determines  the  sequence  of  threshold  values  f(l),  t( 2),  . . . , t(Mn) 
such  that 

/(cr)  = l <*=>-  v{a)  > t(r(cr)),  (43) 

by  evaluating  / at  no  more  than  two  points  per  row. 

HI.  [Loop  on  m.\  Perform  steps  H2  through  H4  for  m = 1,  . . . , Mn;  then  stop. 

H2.  [Begin  row  m.)  Set  a <-  (n  + 1 - s(m))/2  and  z <- (n  - 1 + s(m))/2. 

H3.  [Do  a binary  search.]  If  2 < a + 1,  go  to  H4.  Otherwise  set  k <—  [(a  + z) /2j , 
and 

a <-  x(m,k-l)@x(m,k)<Sx(m,k  + l).  (44) 

If  k > t(r(cr)),  set  z k;  otherwise  set  a 4-  k.  Repeat  step  H3. 

H4.  [Evaluate.]  If  /(x(m,  a))  = 1,  set  t(m)  <—  a ; otherwise,  if  a = z,  set 
t(m)  <-  a + 1;  otherwise  set  t(m)  4-  z + 1 - f(x(m , 2)).  | 
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Hansel’s  algorithm  is  optimum , in  the  sense  that  it  evaluates  / at  the  fewest 
possible  points  in  the  worst  case.  For  if  / happens  to  be  the  threshold  function 

f{a)  = [v(a)>n/ 2],  (45) 

any  valid  algorithm  that  learns  / on  the  first  m rows  of  the  Christmas  tree 
pattern  must  evaluate  f(a)  in  column  \ n/2\  of  each  row,  and  in  column  [n/ 2J  +1 
of  each  row  that  has  size  greater  than  1.  Otherwise  we  could  not  distinguish  / 
from  a function  that  differs  from  it  only  at  an  unexamined  point.  [See  V.  K. 
Korobkov,  Problemy  Kibernetiki  13  (1965),  5-28,  Theorem  5.] 

Oriented  trees  and  forests.  Let’s  turn  now  to  another  kind  of  tree,  in  which 
the  parent-child  relationship  is  important  but  the  order  of  children  in  each  family 
is  not.  A11  oriented  forest  of  n nodes  can  be  defined  by  a sequence  of  pointers 
Pi  ■ ■ Pn,  where  pj  is  the  parent  of  node  j (or  p3  = 0 if  ;j  is  a root);  the  directed 
graph  on  vertices  {0,1,..., n}  with  arcs  {j  ->  pj  \ 1 < j < n}  will  have  no 
oriented  cycles.  An  oriented  tree  is  an  oriented  forest  with  exactly  one  root. 
(See  Section  2. 3. 4. 2.)  Every  n-node  oriented  forest  is  equivalent  to  an  (n  + 1)- 
node  oriented  tree,  because  the  root  of  that  tree  can  be  regarded  as  the  parent  of 
all  the  roots  of  the  forest.  We  saw  in  Section  2. 3.4.4  that  there  are  An  oriented 
trees  with  n nodes,  where  the  first  few  values  are 

n = 123456  7 8 9 10  11  12  13  14 

An  = 1 1 2 4 9 20  48  115  286  719  1842  4766  12486  32973  ’ (4  ’ 

asymptotically,  An  = cann~3/2  + 0(ann~5/2)  where  a « 2.9558  and  c r*  0.4399. 
Thus,  for  example,  only  9 of  the  14  forests  in  Table  1 are  distinct  when  we  ignore 
the  horizontal  left-to-right  ordering  and  consider  only  the  vertical  orientation. 

Every  oriented  forest  corresponds  to  a unique  ordered  forest  if  we  sort  the 
members  of  each  family  appropriately,  using  an  ordering  on  trees  introduced 
by  H.  I.  Scoins  [Machine  Intelligence  3 (1968),  43-60]:  Recall  from  (11)  that 
ordered  forests  can  be  characterized  by  their  level  codes  ci . . . cn , where  node  j 
in  preorder  appears  on  level  cj . An  ordered  forest  is  called  canonical  if  the  level 
code  sequences  for  the  subtrees  in  each  family  are  in  nonincreasing  lexicographic 
order.  For  example,  the  canonical  forests  in  Table  1 are  those  whose  level  codes 
clC2C3C4  are  0000,  0100,  0101,  0110,  0111,  0120,  0121,  0122,  and  0123.  The  level 
sequence  0112  is  not  canonical,  because  the  subtrees  of  the  root  have  respective 
level  codes  1 and  12;  the  string  1 is  lexicographically  less  than  12.  We  can  readily 
verify  by  induction  that  the  canonical  level  codes  are  lexicographically  largest, 
among  all  ways  of  reordering  the  subtrees  of  a given  oriented  forest. 

T.  Beyer  and  S.  M.  Hedetniemi  [SICOMP  9 (1980),  706-712]  noticed  that 
there  is  a remarkably  simple  way  to  generate  oriented  forests  if  we  visit  them  in 
decreasing  lexicographic  order  of  the  canonical  level  codes.  Suppose  cx . . . c„  is 
canonical,  where  > 0 and  Ck+i  = • • ■ = cn  — 0.  The  next  smallest  sequence  is 
obtained  by  decreasing  c^,  then  increasing  c‘k+\ . . . cn  to  the  largest  levels  consis- 
tent with  canonicity;  and  those  levels  are  easy  to  compute.  For  if  j = pt;  is  the 
parent  of  node  k,  we  have  Cj  = Ck  — 1 < c;  for  j < l < k,  hence  the  levels  c.j  ...  cx. 
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represent  the  subtree  currently  rooted  at  node  j.  To  get  the  largest  sequence  of 
levels  less  than  ci  . . . cn  we  therefore  replace  ck  . . . cn  by  the  first  n+l-k  elements 
of  the  infinite  sequence  (cj  . . . ck-i)°°  = c j . . . ck_iCj  . . . ck-iCj  ....  (The  effect 
is  to  remove  k from  its  current  position  as  the  rightmost  child  of  j,  then  to 
append  new  subtrees  that  are  siblings  of  j,  by  cloning  j and  its  descendants 
as  often  as  possible.  This  cloning  process  may  terminate  in  the  midst  of  the 
sequence  Cj...ck- 1,  but  that  causes  no  difficulty  because  every  prefix  of  a 
canonical  level  sequence  is  canonical.)  For  example,  to  obtain  the  successor  of 
any  sequence  of  canonical  codes  that  ends  with  23443433000000000,  we  replace 
the  final  3000000000  by  2344343234. 

Algorithm  O ( Oriented  forests ).  This  algorithm  generates  all  oriented  forests 
on  n nodes,  by  visiting  all  canonical  n-node  forests  in  decreasing  lexicographic 
order  of  their  level  codes  ci  . . . cn . The  level  codes  are  not  computed  explicitly, 
however;  each  canonical  forest  is  represented  directly  by  its  sequence  of  parent 
pointers  p\...pn,  in  preorder  of  the  nodes.  To  generate  all  oriented  trees  on 
n+  1 nodes,  we  can  imagine  that  node  0 is  the  root. 

01.  [Initialize.]  Set  pk  <—  k — 1 for  0 < k < n.  (In  particular,  this  step  makes 
Po  nonzero,  for  use  in  termination  testing;  see  step  04.) 

02.  [Visit.]  Visit  the  forest  represented  by  parent  pointers  px  . . . pn . 

03.  [Easy  case?]  If  pn  > 0,  set  pn  <—  pPa  and  return  to  step  02. 

04.  [Find  j and  A;.]  Find  the  largest  k < n such  that  pk  0.  Terminate  the 
algorithm  if  k — 0;  otherwise  set  j «—  pk  and  d 4-  A:  - j. 

05.  [Clone.]  If  = py  set  pk  4-  py  otherwise  set  pk  4-  + d.  Return  to 

step  02  if  A:  = n;  otherwise  set  A:  4-  k + 1 and  repeat  this  step.  | 

As  in  other  algorithms  we’ve  been  seeing,  the  loops  in  steps  04  and  05  tend  to 
be  quite  short;  see  exercise  88.  Exercise  90  proves  that  slight  changes  to  this 
algorithm  suffice  to  generate  all  arrangements  of  edges  that  form  free  trees. 

Spanning  trees.  Now  let’s  consider  the  minimal  subgraphs  that  “span”  a 
given  graph.  If  G is  a connected  graph  on  n vertices,  the  spanning  trees  of  G 
are  the  subsets  of  n — 1 edges  that  contain  no  cycles;  equivalently,  they  are  the 
subsets  of  edges  that  form  a free  tree  connecting  all  the  vertices.  Spanning  trees 
are  important  in  many  applications,  especially  in  the  study  of  networks,  so  the 
problem  of  generating  all  spanning  trees  has  been  treated  by  many  authors.  In 
fact,  systematic  ways  to  list  them  all  were  developed  early  in  the  20th  century 
by  Wilhelm  Feussner  [Annalen  der  Physik  (4)  9 (1902),  1304-1329],  long  before 
anybody  thought  about  generating  other  kinds  of  trees. 

In  the  following  discussion  we  will  allow  graphs  to  have  any  number  of  edges 
between  two  vertices;  but  we  disallow  loops  from  a vertex  to  itself,  because 
self-loops  cannot  be  part  of  a tree.  Feussner’s  basic  idea  was  very  simple,  yet 
eminently  suited  for  calculation:  If  e is  any  edge  of  G,  a spanning  tree  either 
contains  e or  it  doesn’t.  Suppose  e joins  vertex  u to  vertex  v.  and  suppose  it  is 
part  of  a spanning  tree;  then  the  other  n — 2 edges  of  that  tree  span  the  graph 
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G/e  that  we  obtain  by  regarding  u and  v as  identical.  In  other  words,  the 
spanning  trees  that  contain  e are  essentially  the  same  as  the  spanning  trees  of 
the  contracted  graph  G / e that  results  when  we  shrink  e down  to  a single  point. 
On  the  other  hand  the  spanning  trees  that  do  not  contain  e are  spanning  trees 
of  the  reduced  graph  G\e  that  results  when  we  eliminate  edge  e.  Symbolically, 
therefore,  the  set  5(G)  of  all  spanning  trees  of  G satisfies 


5(G)  = e 5(G  / e)  U S(G\e).  (47) 


Malcolm  J.  Smith,  in  his  Master’s  thesis  at  the  University  of  Victoria  (1997), 
introduced  a nice  way  to  carry  out  the  recursion  (47)  by  finding  all  spanning  trees 
in  a “revolving-door  Gray  code”  order:  Each  tree  in  his  scheme  is  obtained  from 
its  predecessor  by  simply  removing  one  edge  and  substituting  another.  Such 
orderings  are  not  difficult  to  find,  but  the  trick  is  to  do  the  job  efficiently. 

The  basic  idea  of  Smith’s  algorithm  is  to  generate  5(G)  in  such  a way  that 
the  first  spanning  tree  includes  a given  near  tree , namely  a set  of  n — 2 edges 
containing  no  cycle.  This  task  is  trivial  if  n = 2;  we  simply  list  all  the  edges. 
If  n > 2 and  if  the  given  near  tree  is  {ei, . . . , en_2},  we  proceed  as  follows: 
Assume  that  G is  connected;  otherwise  there  are  no  spanning  trees.  Form  G / e 1 
and  append  ei  to  each  of  its  spanning  trees,  beginning  with  one  that  contains 
{e2, . . . , e„_2};  notice  that  {e2, . . . , en_2}  is  a near  tree  of  G/e  1,  so  this  recursion 
makes  sense.  If  the  last  spanning  tree  found  in  this  way  for  G/e  j is  f\ . . . /„_  2, 
complete  the  task  by  listing  all  spanning  trees  for  G \ ei,  beginning  with  one 
that  contains  the  near  tree  {/ 1, . . . , /n_2}. 

For  example,  suppose  G is  the  graph 

P s 

G = Tr  (48) 

with  four  vertices  and  five  edges  {p,  q,  r,  s,  t}.  Starting  with  the  near  tree  {p,  q}, 
Smith’s  procedure  first  forms  the  contracted  graph 


G/p  = 


(49) 


and  lists  its  spanning  trees,  beginning  with  one  that  contains  {g}.  This  list 
might  be  qs,  qt,  ts , tr , rs;  thus  the  trees  pqs,  pqt , pts,  ptr , and  prs  span  G.  The 
remaining  task  is  to  list  the  spanning  trees  of 


G\p 


(50) 


starting  with  one  that  contains  {r,  s};  they  are  rsq,  rqt,  qts. 

A detailed  implementation  of  Smith’s  algorithm  turns  out  to  be  quite  in- 
structive. As  usual  we  represent  the  graph  by  letting  two  arcs  u — > v and  v — > u 
correspond  to  each  edge  u — v,  and  we  maintain  lists  of  “arc  nodes”  to  represent 
the  arcs  that  leave  each  vertex.  We’ll  need  to  shrink  and  unshrink  the  graph’s 
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edges,  so  we  will  make  these  lists  doubly  linked.  If  a points  to  an  arc  node  that 
represents  u — w,  then 

a © 1 points  to  the  “mate”  of  a,  which  represents  v — >u; 
ta  is  the  “tip”  of  a,  namely  v (hence  tQ$1  = u); 
ia  is  an  optional  name  that  identifies  this  edge  (and  equals  zael); 
na  points  to  the  next  element  of  u’s  arc  list; 
pa  points  to  the  previous  element  of  u’s  arc  list; 
and  la  is  a link  used  for  undeleting  arcs  as  explained  below. 

The  vertices  are  represented  by  integers  {1, . . . ,n};  and  arc  number  v - 1 is  a 
header  node  for  vertex  u’s  doubly  linked  arc  list.  A header  node  a is  recognizable 
by  the  fact  that  its  tip,  ta,  is  0.  We  let  dv  be  the  degree  of  vertex  v.  Thus,  for 
example,  the  graph  (48)  might  be  represented  by  (dud2,  d3,  d4)  = (2,  3, 3,  2)  and 
by  the  following  fourteen  nodes  of  arc  data: 

a=  0 1 2 3 4 5 6 7 8 9 10  11  12  13 

ta  = 00001213232434 
ia  = ppqqrrsstt 

na  = 5 4 6 10  9 7 8 0 13  11  12  1 3 2 

pa  = 7 11  13  12  1 0 2 5 6 4 3 9 10  8 

The  implicit  recursion  of  Smith’s  algorithm  can  be  controlled  conveniently 
by  using  an  array  of  arc  pointers  «i  . . . an_  1 . At  level  l of  the  process,  arcs 
ai  ■ ■ ■ at- 1 denote  edges  that  have  been  included  in  the  current  spanning  tree;  a/ 
is  ignored;  and  arcs  a/+i . . . a„_  1 denote  edges  of  a near  tree  on  the  contracted 
graph  ( . . . (G  / a{) . ..)/ a;_i  that  should  be  part  of  the  next  spanning  tree  visited. 

There  s also  another  array  of  arc  pointers  sj  . . . sn_ 2,  representing  stacks 
of  arcs  that  have  been  temporarily  removed  from  the  current  graph.  The  top 
element  of  the  stack  for  level  l is  sp  and  each  arc  a links  to  its  successor,  la 
(which  is  0 at  the  bottom  of  the  stack). 

An  edge  whose  removal  would  disconnect  a connected  graph  is  called  a 
bridge.  One  of  the  key  points  in  the  algorithm  that  follows  is  the  fact  that  we 
want  to  keep  the  current  graph  connected;  therefore  we  don’t  set  G <-  G \ e 
when  e is  a bridge. 

Algorithm  S (All  spanning  trees).  Given  a connected  graph  represented  with 
the  data  structures  explained  above,  this  algorithm  visits  all  of  its  spanning  trees. 

A technique  called  “dancing  links,”  which  we  will  discuss  extensively  in 
Section  7.2. 2.1,  is  used  here  to  remove  and  restore  items  from  and  to  doubly 
linked  lists.  The  abbreviation  “delete(a)”  in  the  steps  below  is  shorthand  for  the 
pair  of  operations 

nPa^na,  pna<-pa;  (51) 

similarly,  “undelete(a)”  stands  for 


Pna  t—  o,  nPa  <-  a. 


(52) 


7.2. 1.6 


GENERATING  ALL  TREES  465 


51.  [Initialize.]  Set  ax  . . . an_x  to  a spanning  tree  of  the  graph.  (See  exercise  94.) 

Also  set  x 4 0,  Z 4 1 , and  sx  4—  0.  If  n = 2,  set  v 4—  1,  e 4-  no,  and  go  to  S5. 

52.  [Enter  level  Z.]  Set  e 4—  a/+1,  u 4—  te,  and  v 4—  fe® i.  If  du  > dv,  interchange 

v 4-4  u and  set  e 4—  e © 1 . 

53.  [Shrink  e.]  (Now  we  will  make  u identical  to  v by  inserting  u’s  adjacency  list 

into  v's.  We  also  must  delete  all  former  edges  between  u and  v , including  e 
itself,  because  such  edges  would  otherwise  become  loops.  Deleted  edges  are 
linked  together  so  that  we  can  restore  them  later  in  step  S7.)  Set  k 4-  du+dv, 
f t—  nu_ i,  and  g 4—  0.  While  tf  / 0,  do  the  following:  If  tf  — v , delete(/), 
delete(/  ® 1),  and  set  k 4—  k — 2,  If  4—  g,  g 4—  /;  otherwise  set  ty®!  4—  v. 
Then  set  / 4—  n/  and  repeat  these  operations  until  = 0.  Finally  set  Ze  4—  <7, 
dv  <-  k,  g i-  v - 1,  nPf  4-  ng,  p„9  •<-  pn/  •(-  p,  ns  -f-  n/,  and  a;  e. 

54.  [Advance  Z.]  Set  Z t—  Z + 1.  If  Z < n — 1,  set  s;  f-  0 and  return  to  S2. 

Otherwise  set  e •<—  rii,-!. 

55.  [Visit..]  (The  current  graph  now  has  only  two  vertices,  one  of  which  is  v.) 
Set  an_x  4-  e and  visit  the  spanning  tree  ax . . .an_x.  (If  x = 0,  this  is  the 
first  spanning  tree  to  be  visited;  otherwise  it  differs  from  its  predecessor  by 
deleting  x and  inserting  e.)  Set  x t-  e and  e ne.  Repeat  step  S5  if  te  ^ 0. 

56.  [Decrease  Z.]  Set  Z 4 — Z — 1.  Terminate  the  algorithm  if  Z = 0;  otherwise  set 
e <r-  at,  u i — te , and  v fe®i. 

57.  [Unshrink  e.]  Set  / 4-  u - 1,  g 4-  v - 1,  ng  <-  npj,  pUg  4-  g,  nPf  4-  /, 
Fn/  /,  and  / 4—  pf.  While  tf  / 0,  set  tf®x  4-  u and  / <—  pj.  Then  set 
/ Ze,  k 4-  dv ; while  / / 0 set  fc  4-  fc  + 2,  undelete(/  ® 1),  undelete(/), 
and  set  / 4—  Z/.  Finally  set  dv  ■t-  k — du. 

58.  [Test  for  bridge.]  If  e is  a bridge,  go  to  S9.  (See  exercise  95  for  one  way 
to  perform  this  test.)  Otherwise  set  x 4-  e,  Ze  4-  s;  4-  e;  delete(e)  and 
delete(e  © 1).  Set  du  4-  du  — 1,  dv  4—  dv  — 1,  and  go  to  S2. 

59.  [Undo  level  Z deletions.]  Set  e 4-  st.  While  e / 0,  set  u 4-  fe,  u 4-  fe01, 

du  <—  du  + 1,  dv  4-  dv  + 1,  undelete(e  ® 1),  undelete(e),  and  e 4-  Ze.  Return 
to  S6.  | 

The  reader  is  encouraged  to  play  through  the  steps  of  this  algorithm  on  a small 
graph  such  as  (48).  Notice  that  a subtle  case  arises  in  steps  S3  and  S7,  if  u' s 
adjacency  list  happens  to  become  empty.  Notice  also  that  several  shortcuts  would 
be  possible,  at  the  expense  of  a more  complicated  algorithm;  we  will  discuss  such 
improvements  later  in  this  section. 

* Series-parallel  graphs.  The  task  of  finding  all  spanning  trees  becomes  espe- 
cially simple  when  the  given  graph  has  a serial  and/or  parallel  decomposition.  A 
series-parallel  graph  between  s and  t is  a graph  G with  two  designated  vertices, 
s and  Z,  whose  edges  can  be  built  up  recursively  as  follows:  Either  G consists  of 
a single  edge,  s — 1 \ or  G is  a serial  superedge  consisting  of  k > 2 series-parallel 
subgraphs  Gj  between  Sj  and  tj,  joined  in  series  with  s = and  tj  = sJ+x  for 
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1 < j < k and  tk  — t;  or  G is  a parallel  superedge  consisting  of  k > 2 series- 
parallel  subgraphs  Gj  between  s and  t joined  in  parallel.  This  decomposition  is 
essentially  unique,  given  s and  t,  if  we  require  that  the  subgraphs  G3  for  serial 
superedges  are  not  themselves  serial  superedges,  and  that  the  subgraphs  G3  for 
parallel  superedges  are  not  themselves  parallel. 

Any  series-parallel  graph  can  be  represented  conveniently  as  a tree,  with  no 
nodes  of  degree  1.  The  leaf  nodes  of  this  tree  represent  edges,  and  the  branch 
nodes  represent  superedges,  alternating  between  serial  and  parallel  from  level 
to  level.  For  example,  the  tree 


corresponds  to  the  series-parallel  graphs  and  subgraphs 


a 


if  the  top  node  A is  taken  to  be  parallel.  Edges  are  named  in  (54),  but  not 
vertices,  because  edges  are  of  prime  importance  with  respect  to  spanning  trees. 

Let’s  say  that  a near  tree  of  a series-parallel  graph  between  s and  t is  a set 
of  n 2 cycle-free  edges  that  do  not  connect  s to  t.  The  spanning  trees  and 
near  trees  of  a series-parallel  graph  are  easy  to  describe  recursively,  as  follows: 
(1)  A spanning  tree  of  a serial  superedge  corresponds  to  spanning  trees  of  all  its 
principal  subgraphs  Gj-  a near  tree  corresponds  to  spanning  trees  in  all  but  one 
of  the  Gj,  and  a near  tree  in  the  other.  (2)  A near  tree  of  a parallel  superedge 
corresponds  to  near  trees  of  all  its  principal  subgraphs  Gj ; a spanning  tree  cor- 
responds to  near  trees  in  all  but  one  of  the  Gj , and  a spanning  tree  in  the  other. 

Rules  (1)  and  (2)  suggest  the  following  data  structures  for  listing  the  span- 
ning trees  and/or  near  trees  of  series-parallel  graphs.  Let  p point  to  a node  in  a 
tree  like  (53).  Then  we  define 


tP  = 1 for  serial  superedges,  0 otherwise  (the  “type”  of  p); 
vp  = 1 if  we  have  a spanning  tree  for  p,  0 if  we  have  a near  tree; 
lp  = pointer  to  p’s  leftmost  child,  or  0 if  p is  a leaf; 
rp  ~ pointer  to  p’s  right  sibling,  wrapping  around  cyclically; 
dp  = pointer  to  a designated  child  of  p,  or  0 if  p is  a leaf. 

If  q points  to  the  rightmost  child  of  p,  its  “right  sibling”  rq  equals  lp.  And  if  q 
points  to  any  child  of  p,  rules  (1)  and  (2)  state  that 


f vp,  if  q = dp-, 
\ tp,  if  (/  / dp. 


(55) 
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(For  example,  if  p is  a branch  node  that  represents  a serial  superedge,  we  must 
have  vq  = 1 for  all  but  one  of  p's  children;  the  only  exception  is  the  designated 
child  dp.  Thus  we  must  have  a spanning  tree  for  all  of  the  subgraphs  that  were 
joined  serially  to  form  p,  except  for  one  designated  subgraph  in  the  case  that  we 
have  a near  tree  for  p.) 

Given  any  setting  of  the  designated-child  pointers  dp,  and  given  any  value 
0 or  1 for  Vp  at  the  root  of  the  tree,  Eq.  (55)  tells  us  how  to  propagate  values 
down  to  all  of  the  leaves.  For  example,  if  we  set  vA  <—  1 in  the  tree  (53),  and 
if  we  designate  the  leftmost  child  of  each  branch  node  (so  that  dA  = a,  dB  = b, 
dc  = c,  and  dB  = /),  we  find  successively 

va~  1|  VB  — 0,  Vb=  0,  Vc  = 1,  Vc=l,  Vd  = 0,  Ve  = 1,  V£>  =0,  Vf  = 0,  vg  = 1.  (56) 

A leaf  node  q is  present  in  the  spanning  tree  if  and  only  if  vq  = 1;  hence 
(56)  specifies  the  spanning  tree  aceg  of  the  series-parallel  graph  A in  (54). 

For  convenience,  let  s say  that  the  configs  of  p are  its  spanning  trees  if 
vp  — 1,  its  near  trees  if  vp  = 0.  We  would  like  to  generate  all  configs  of  the 
root.  A branch  node  p is  called  “easy”  if  vp  = tp;  that  is,  a serial  node  is  easy 
if  its  configs  are  spanning  trees,  and  a parallel  node  is  easy  if  its  configs  are 
near  trees.  If  p is  easy,  its  configs  are  the  Cartesian  product  of  the  configs  of  its 
children,  namely  all  fc-tuples  of  the  children’s  configs,  varying  independently;  the 
designated  child  dp  is  immaterial  in  the  easy  case.  But  if  p is  uneasy,  its  configs 
are  the  union  of  such  Cartesian  A:-tuples,  taken  over  all  possible  choices  of  dp. 

As  luck  would  have  it,  easy  nodes  are  relatively  rare:  At  most  one  child  of 
an  uneasy  node  (namely  the  designated  child)  can  be  easy,  and  all  children  of  an 
easy  node  are  uneasy  unless  they  are  leaves. 

Even  so,  the  tree  representation  of  a series-parallel  graph  makes  the  recursive 
generation  of  all  its  spanning  trees  and/or  near  trees  quite  straightforward  and 
efficient.  The  operations  of  Algorithm  S — shrinking  and  unshrinking,  deleting 
and  undeleting,  bridge  detection  -are  not  needed  when  we  deal  with  series- 
parallel  graphs.  Furthermore,  exercise  99  shows  that  there  is  a pleasant  way  to 
obtain  the  spanning  trees  or  near  trees  in  a revolving-door  Gray  code  order,  by 
using  focus  pointers  as  in  several  algorithms  that  we’ve  seen  earlier. 

Refinements  of  Algorithm  S.  Although  Algorithm  S provides  us  with  a simple 
and  reasonably  effective  way  to  visit  all  spanning  trees  of  a general  graph,  its 
author  Malcolm  Smith  realized  that  the  properties  of  series-parallel  graphs  can  be 
used  to  make  it  even  better.  For  example,  if  a graph  has  two  or  more  edges  that 
run  between  the  same  vertices  u and  v,  we  can  combine  them  into  a superedge; 
the  spanning  trees  of  the  original  graph  can  then  be  obtained  readily  from  those 
of  the  simpler,  reduced  graph.  And  if  a graph  has  a vertex  v of  degree  2,  so  that 
the  only  edges  touching  v are  u — v and  v — w,  we  can  eliminate  v and  replace 
those  edges  by  a single  superedge  between  u and  w.  Furthermore,  any  vertex  of 
degree  1 can  effectively  be  eliminated,  together  with  its  adjacent  edge,  by  simply 
including  that  edge  in  every  spanning  tree. 

After  the  reductions  in  the  preceding  paragraph  have  been  applied  to  a given 
graph  G,  we  obtain  a reduced  graph  G having  no  parallel  edges  and  no  vertices 
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of  degrees  1 or  2,  together  with  a set  of  m > 0 series- parallel  graphs  Si,  ....  Srn , 
representing  edges  (or  superedges)  that  must  be  included  in  all  spanning  trees 
of  G.  Every  remaining  edge  u v of  G corresponds,  in  fact,  to  a series-parallel 
graph  Suv  between  vertices  u and  v.  The  spanning  trees  of  G are  then  obtained 
as  the  union,  taken  over  all  spanning  trees  T of  G,  of  the  Cartesian  product 
of  the  spanning  trees  of  Si , . . . , Sm  and  the  spanning  trees  of  all  Suv  for  edges 
u — v in  T,  together  with  the  near  trees  of  all  Suv  for  edges  u — v that  are  in 
G but  not  in  T.  And  all  spanning  trees  T of  G can  be  obtained  by  using  the 
strategy  of  Algorithm  S. 

In  fact,  when  Algorithm  S is  extended  in  this  way,  its  operations  of  replacing 
the  current  graph  GbyG/eorG\e  typically  trigger  further  reductions,  as  new 
parallel  edges  appear  or  as  the  degree  of  a vertex  drops  below  3.  Therefore  it 
turns  out  that  the  “stopping  state”  of  the  implicit  recursion  in  Algorithm  S, 
namely  the  case  when  only  two  vertices  are  left  (step  S5),  never  actually  arises: 
A reduced  graph  G either  has  only  a single  vertex  and  no  edges,  or  it  has  at  least 
four  vertices  and  six  edges. 

The  resulting  algorithm  retains  the  desirable  revolving-door  property  of 
Algorithm  S,  and  it  is  quite  pretty  (although  about  four  times  as  long  as  the 
original),  see  exercise  100.  Smith  proved  that  it  has  the  best  possible  asymptotic 
running  time:  If  G has  n vertices,  m edges,  and  N spanning  trees,  the  algorithm 
visits  them  all  in  0(m  + n + N)  steps. 

The  performance  of  Algorithm  S and  of  its  souped-up  version  Algorithm  S' 
can  best  be  appreciated  by  considering  the  number  of  memory  accesses  that 
those  algorithms  actually  make  when  they  generate  the  spanning  trees  of  typical 
graphs,  as  shown  in  Table  5.  The  bottom  line  of  that  table  corresponds  to 
the  graph  plane-miles  (16,  0, 0, 1,  0, 0, 0)  from  the  Stanford  GraphBase,  which 
serves  as  an  organic  antidote  to  the  purely  mathematical  examples  on  the 
previous  lines.  The  random  multigraph  on  the  penultimate  line,  also  from 
the  Stanford  GraphBase,  can  be  described  more  precisely  by  its  official  name 
random-graph  ( 16,37, 1, 0, 0, 0, 0, 0, 0, 0).  Although  the  4x4  torus  is  isomorphic 
to  the  4-cube  (see  exercise  7.2.1.1-17),  those  isomorphic  graphs  yield  slightly  dif- 
ferent running  times  because  their  vertices  and  edges  are  encountered  differently 
when  the  algorithms  are  run. 

In  general  we  can  say  that  Algorithm  S is  not  too  bad  on  small  examples, 
except  when  the  graph  is  quite  sparse;  but  Algorithm  S'  begins  to  shine  when 
many  spanning  trees  are  present.  Once  Algorithm  S'  gets  warmed  up,  it  tends 
to  crank  out  a new  tree  after  every  18  or  19  mems  go  by. 

Table  5 also  indicates  that  a mathematically-defined  graph  often  has  a 
surprisingly  “round”  number  of  spanning  trees.  For  example,  D.  M.  Cvetkovic 
[Srpska  Akademija  Nauka,  Matematicheski  Institut  11  (Belgrade:  1971),  135- 
141]  discovered,  among  other  things,  that  the  n-cube  has  exactly 

22"-"— xl (?)  2(2) . . . nOD  (57) 

of  them.  Exercises  104-109  explore  some  of  the  reasons  why  that  happens. 
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Table  5 

RUNNING  TIME  IN  MEMS  NEEDED  TO  GENERATE  ALL  SPANNING  TREES 


m 

n 

N 

Algorithm  S 

Algorithm  S' 

At  per  tree 

path  P10 

9 

10 

1 

794  fi 

473  At 

794.0 

473.0 

path  P100 

99 

100 

1 

9,974  n 

5,063  a t 

9974.0 

5063.0 

cycle  Cio 

10 

10 

10 

3,480  fi 

998  At 

348.0 

99.8 

cycle  Cfioo 

100 

100 

100 

355,605  m 

10,538  At 

3556.1 

105.4 

complete  graph  K4 

6 

4 

16 

1,213  At 

1,336  /t 

75.8 

83.5 

complete  graph  ATo 

45 

10 

100,000,000 

3,759.58 

1,860.95  Mat 

37.6 

18.6 

complete  bigraph  AT5,5 

25 

10 

390,625 

23.43  MAt 

8.88  Mat 

60.0 

22.7 

4x4  grid  P4C1P4 

24 

16 

100,352 

12.01  M/i 

1.87  MAt 

119.7 

18.7 

5x5  grid  P5C1P5 

40 

25 

557,568,000 

54.68  Gat 

10.20  G/t 

98.1 

18.3 

4x4  cylinder  P4aC4 

28 

16 

2,558,976 

230.96  MAt 

49.09  MAt 

90.3 

19.2 

5x5  cylinder  P5C1C5 

45 

25 

38,720,000,000 

3,165.31  Gfi 

711.69  Gat 

81.7 

18.4 

4x4  torus  C4nC4 

32 

16 

42,467,328 

3,168.15  Mai 

823.08  Mat 

74.6 

19.4 

4-cube  P2nP2aP2nP2 

32 

16 

42,467,328 

3,172.19  Mat 

823.38  Mu 

74.7 

19.4 

random  multigraph 

37 

16 

59,933,756 

3,818.19  MAt 

995.91  Mat 

63.7 

16.6 

16  cities 

37 

16 

179,678,881 

11,772.11  MAt 

3,267.43  M/t 

65.5 

18.2 

A general  quasi-Gray  code.  Let’s  close  this  section  by  discussing  something 
completely  different,  yet  still  related  to  trees.  Consider  the  following  hybrid 
variants  of  the  two  standard  ways  to  traverse  a nonempty  forest: 


Prepostorder  traversal 
Visit  the  root  of  the  first  tree 
Traverse  the  subtrees  of  the  first 
tree,  in  postpreorder 
Traverse  the  remaining  trees, 
in  prepostorder 


Postpreorder  traversal 
Traverse  the  subtrees  of  the  first 
tree,  in  prepostorder 
Visit  the  root  of  the  first  tree 
Traverse  the  remaining  trees, 
in  postpreorder 


In  the  first  case,  every  tree  of  the  forest  is  traversed  in  prepostorder,  with  its  root 
first;  but  the  subtrees  of  those  roots  are  traversed  in  postpreorder,  with  roots 
coming  last.  The  second  variant  is  similar  but  with  ‘pre’  and  ‘post’  interchanged. 
And  in  general,  prepostorder  visits  roots  first  on  every  even-numbered  level  of 
the  forest,  but  visits  them  last  on  the  odd-numbered  levels.  For  example,  the 
forest  in  (2)  becomes 


when  we  label  its  nodes  in  prepostorder. 
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Prepostorder  and  postpreorder  are  not  merely  curiosities;  they’re  actually 
useful.  The  reason  is  that  adjacent  nodes,  in  either  of  these  orders,  are  always 
near  each  other  in  the  forest.  For  example,  nodes  k and  k + 1 are  adjacent  in  (58) 
for  k = 1,  4,  6,  8,  10,  13;  they  are  separated  by  only  one  node  when  k - 3,  12,  14; 
and  they’re  three  steps  apart  when  k = 2,  5,  7,  9,  11  (if  we  imagine  an  invisible 
super-parent  at  the  top  of  the  forest).  A moment’s  thought  proves  inductively 
that  at  most  two  nodes  can  possibly  intervene  between  prepostorder  neighbors  or 
postpreorder  neighbors  — because  postpreorder(F)  always  begins  with  the  root 
of  the  first  tree  or  its  leftmost  child,  and  prepostorder(F)  always  ends  with  the 
root  of  the  last  tree  or  its  rightmost  child. 

Suppose  we  want  to  generate  all  combinatorial  patterns  of  some  kind,  and 
we  want  to  visit  them  in  a Gray-code-like  manner  so  that  consecutive  patterns 
are  always  “close”  to  each  other.  We  can  form,  at  least  conceptually,  the  graph  of 
all  possible  patterns  p,  with  edges  p — q for  all  pairs  of  patterns  that  are  close  to 
each  other.  The  following  theorem,  due  to  Milan  Sekanina  [Spisy  Pnrodovedecke 
Fakulty  University  v Brne,  No.  412  (1960),  137-140],  proves  that  a pretty  good 
Gray  code  is  always  possible,  provided  only  that  we  can  get  from  any  pattern  to 
any  other  in  a sequence  of  short  steps: 

Theorem  S.  The  vertices  of  any  connected  graph  can  be  listed  in  a cyclic  order 
{v0,vi,. . . , un_j)  so  that  the  distance  between  vk  and  V(k+1 ) mod  n is  at  most  3, 
for  0 < k < n. 

Proof.  Find  a spanning  tree  in  the  graph,  and  traverse  it  in  prepostorder.  | 

Graph  theorists  traditionally  say  that  the  kt.h  power  of  a graph  G is  the 
graph  Gk  whose  vertices  are  those  of  G,  with  u — v in  Gk  if  and  only  if  there’s  a 
path  of  length  k or  less  from  u to  v in  G.  Thus  they  can  state  Theorem  S much 
more  succinctly,  when  n > 2:  The  cube  of  a connected  graph  is  Hamiltonian. 

Prepostorder  traversal  is  also  useful  when  we  want  to  visit  the  nodes  of  a 
tree  in  loopless  fashion,  with  a bounded  number  of  steps  between  stops: 

Algorithm  Q ( Prepostorder  successor  in  a triply  linked  forest) . If  P points  to  a 
node  in  a forest  represented  by  links  PARENT,  CHILD,  and  SIB,  corresponding  to 
each  node’s  parent,  leftmost  child,  and  right  sibling,  this  algorithm  computes  P’s 
successor  node,  Q,  in  prepostorder.  We  assume  that  we  know  the  level  L at  which 
P appears  in  the  forest;  the  value  of  L is  updated  to  be  the  level  of  Q.  If  P happens 
to  be  the  final  node  in  prepostorder,  the  algorithm  sets  Q 4-  A and  L 4 1. 

Ql.  [Pre  or  post?]  If  L is  even,  go  to  step  Q4. 

Q2.  [Continue  postpreorder.]  Set  Q 4-  SIB(P).  Go  to  Q6  if  Q ^ A. 

Q3.  [Move  up.]  Set  P 4-  PARENT (P)  and  L 4-  L - 1.  Go  to  Q7. 

Q4.  [Continue  prepostorder.]  If  CHILD (P)  = A,  go  to  Q7. 

Q5.  [Move  down.]  Set  Q 4 - CHILD(P)  and  L 4-  L + 1. 

Q6.  [Move  down  if  possible.]  If  CHILD  (Q)  / A,  set  Q 4—  CHILD(Q)  andL^L+l. 
Terminate  the  algorithm. 
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Q7.  [Move  right  or  lip.]  If  SIB(P)  ^ A,  set  Q «—  SIB(P);  otherwise  set  Q 
PARENT(P)  and  L «—  L — 1.  Terminate  the  algorithm.  | 

Notice  that,  as  in  Algorithm  2.4C,  the  link  PARENT (P)  is  examined  only  if 
SIB(P)  = A.  A complete  traversal  is  really  a worm  walk  around  the  forest, 
like  (3):  The  worm  “sees”  the  nodes  on  even-numbered  levels  when  it  passes 
them  on  the  left,  and  it  sees  the  odd-level  nodes  when  it  passes  them  on  the  right. 


EXERCISES 

1.  [15]  If  a worm  crawls  around  the  binary  tree  (4),  how  could  it  easily  reconstruct 
the  parentheses  of  (1)? 

2.  [20]  (S.  Zaks,  1980.)  Modify  Algorithm  P so  that  it  produces  the  combinations 
Z1Z2  ...  zn  of  (8)  instead  of  the  parenthesis  strings  0,10,2  . . . a2„. 

► 3.  [23]  Prove  that  (11)  converts  Z1Z2  ...  zn  to  the  inversion  table  cic2  . . . cn. 

4.  [20]  True  or  false:  If  the  strings  ai  . . . a2n  are  generated  in  lexicographic  order, 
so  are  the  corresponding  strings  di  . . . dn,  zi  . . . zn,  pi  . . .p„,  and  ci  . . . c„. 

5.  [15]  What  tables  d\...dn,  z\...zn,  pi...pn,  and  ci...cn  correspond  to  the 
nested  parenthesis  string  (1)? 

► 6.  [20]  What  matching  corresponds  to  (1)?  (See  the  final  column  of  Table  1.) 

7.  [16]  (a)  What  is  the  state  of  the  string  aia2  . . . a2n  when  Algorithm  P terminates? 
(b)  What  do  the  arrays  I1I2  . . . ln  and  rir2  . . . rn  contain  when  Algorithm  B terminates? 

8.  [15]  What  tables  l\  . . . ln,  r\  . . . rn,  e\  . . . e„,  and  si  . . . sn  correspond  to  the  ex- 
ample forest  (2)? 

9.  [M20]  Show  that  the  tables  a ...cn  and  si  . . . s„  are  related  by  the  law 


Cfc  — [si  ^ k 1 ] — f-  [s2  > k — 2]  T • • • T [s^.— 1 > 1]. 

10.  [M20]  ( Worm  walks.)  Given  a string  of  nested  parentheses  aia2...a2n,  let  Wj 

be  the  excess  of  left  parentheses  over  right  parentheses  in  aio2  . . . a,  , for  0 < j < 2 n. 
Prove  that  W0  + 10H 1-  w2„  = 2(ci  -| 1 - c„)  + n. 

11.  [11]  If  F is  a forest,  its  conjugate  FR  is  obtained  by  left-to-right  mirror  reflection. 
For  example,  the  fourteen  forests  in  Table  1 are 


“i.  • a,  • 1 1 a*,  1 \ j\,  j 

and  their  conjugates  are  respectively 

“*•  J ** , * J *,  a * , | •,  ••:,  : * a,  • j,  j 

as  in  the  colex  forests  of  Table  2.  If  F corresponds  to  the  nested  parentheses  ai<z2  . . . a2n, 
what  string  of  parentheses  corresponds  to  FR? 

12.  [15]  If  F is  a forest,  its  transpose  FT  is  the  forest  whose  binary  tree  is  obtained 
by  interchanging  left  and  right  links  in  the  binary  tree  representing  F.  For  example, 
the  transposes  of  the  fourteen  forests  in  Table  1 are  respectively 


j,  *\,  A.,  | \ A\  I I,  * l * A,  I 

What  is  the  transpose  of  the  forest  (2)? 


•:*,  •*  I 


13.  [20]  Continuing  exercises  11  and  12,  how  do  the  preorder  and  postorder  of  a 
labeled  forest  F relate  to  the  preorder  and  postorder  of  (a)  FR ? (b)  FT1 


472 


COMBINATORIAL  SEARCHING 


7.2.1. 6 


► 14.  [21]  Find  all  labeled  forests  F such  that  FRT  = FTR. 

15.  [20]  Suppose  B is  the  binary  tree  obtained  from  a forest  F by  linking  each  node 
to  its  left  sibling  and  its  rightmost  child,  as  in  exercise  2. 3. 2-5  and  the  last  column  of 
Table  2.  Let  F'  be  the  forest  that  corresponds  to  B in  the  normal  way,  via  left-child 
and  right-sibling  links.  Prove  that  F'  = FRT , in  the  notation  of  exercises  11  and  12. 

16.  [20]  If  F and  G are  forests,  let  FG  be  the  forest  obtained  by  placing  the  trees  of  F 
to  the  left  of  the  trees  of  G;  also  let  F \ G = (GTFT)T . Give  an  intuitive  explanation 
of  the  operator  |,  and  prove  that  it  is  associative. 

17.  [M^6]  Characterize  all  unlabeled  forests  F such  that  FRT  = FTR.  (See  exercise  14.) 

18.  [30]  Two  forests  are  said  to  be  cognate  if  one  can  be  obtained  from  the  other  by 
repeated  operations  of  taking  the  conjugate  and/or  the  transpose.  The  examples  in  ex- 
ercises 11  and  12  show  that  all  forests  on  4 nodes  belong  to  one  of  three  cognate  classes: 

....  xg; 

Study  the  set  of  all  forests  with  15  nodes.  How  many  equivalence  classes  of  cognate 
forests  do  they  form?  What  is  the  largest  class?  What  is  the  smallest  class?  What  is 
the  size  of  the  class  containing  (2)? 

19.  [28]  Let  F\,  F2,  ...,  FN  be  the  sequence  of  unlabeled  forests  that  correspond 
to  the  nested  parentheses  generated  by  Algorithm  P,  and  let  Gi,  G2,  ...,  GN  be 
the  sequence  of  unlabeled  forests  that  correspond  to  the  binary  trees  generated  by 
Algorithm  B.  Prove  that  Gk  = F?TR,  in  the  notation  of  exercises  11  and  12.  (The 
forest  F is  called  the  dual  of  /'  ; it  is  denoted  by  FD  in  several  exercises  below.) 

20.  [25]  Recall  from  Section  2.3  that  the  degree  of  a node  in  a tree  is  the  number  of 
children  it  has,  and  that  an  extended  binary  tree  is  characterized  by  the  property  that 
every  node  has  degree  either  0 or  2.  In  the  extended  binary  tree  (4),  the  sequence  of 
node  degrees  is  2200222002220220002002202200000  in  preorder;  this  string  of  0s  and  2s 
is  identical  to  the  sequence  of  parentheses  in  (1),  except  that  each  ‘(’has  been  replaced 
by  2,  each  “)’  has  been  replaced  by  0,  and  an  additional  0 has  been  appended. 

a)  Prove  that  a sequence  of  nonnegative  integers  bib2  ...  bj\t  is  the  preorder  degree 
sequence  of  a forest  if  and  only  if  it  satisfies  the  following  property  for  1 < k < N: 

bi  + 62  + • ■ • + bk  + / > k if  and  only  if  k < N. 

Here  / = N — bi  — b2  — ■ ■ ■ - bn  is  the  number  of  trees  in  the  forest. 

b)  Recall  from  exercise  2. 3. 4. 5— 6 that  an  extended  ternary  tree  is  characterized  by  the 
property  that  every  node  has  degree  0 or  3;  an  extended  ternary  tree  with  n internal 
nodes  has  2n  + 1 external  nodes,  hence  N = 3n  + 1 nodes  altogether.  Design  an 
algorithm  to  generate  all  ternary  trees  with  n internal  nodes,  by  generating  the 
associated  sequences  bib2  . . . Ay  in  lexicographic  order. 

► 21.  [26]  (S.  Zaks  and  D.  Richards,  1979.)  Continuing  exercise  20,  explain  how  to 

generate  the  preorder  degree  sequences  of  all  forests  that  have  N = n0  H \-nt  nodes, 

with  exactly  n3  nodes  of  degree  j.  Example:  When  n0  = 4,  m = n2  = n3  = 1,  and 
1 = 3,  and  the  valid  sequences  b1b2b3b4bs.be.b7  are 

1203000, 1230000, 1300200, 1302000, 1320000, 2013000,  2030010,  2030100,  2031000, 2103000, 
2130000, 2300010, 2300100, 2301000, 2310000, 3001200, 3002010, 3002100, 3010200, 3012000, 
3020010, 3020100, 3021000, 3100200, 3102000, 3120000, 3200010, 3200100, 3201000, 3210000. 
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► 22.  [30]  (J.  Korsh,  2004.)  As  an  alternative  to  Algorithm  B,  show  that  binary  trees 
can  also  be  generated  directly  and  efficiently  in  linked  form  if  we  produce  them  in  colex 
order  of  the  numbers  dx  . . . dn _i  defined  in  (9).  (The  actual  values  of  di  . . . dn-i  should 
not  be  computed  explicitly;  but  the  links  l\  . . . ln  and  74  . . . rn  should  be  manipulated 
in  such  a way  that  we  get  the  binary  trees  corresponding  successively  to  dxd2  • • • dn-i  = 
000...  0,  100...  0,  010...  0,  110...  0,  020...  0,  001...  0,  ...,  000...  (n-1).) 

► 23.  [25]  (a)  What  is  the  last  string  visited  by  Algorithm  N?  (b)  What  is  the  last 
binary  tree  or  forest  visited  by  Algorithm  L?  Hint:  See  exercise  40  below. 

24.  [22]  Using  the  notation  of  Table  3,  what  sequences  l0li  . . . /15,  rq  . . . r15,  Aq  . . . fc15, 
qi  ■ ■ ■ ?is,  and  «i  . . . U15  correspond  to  the  binary  tree  (4)  and  the  forest  (2)? 

► 25.  [30]  ( Pruning  and  grafting.)  Representing  binary  trees  as  in  Algorithm  B,  design 
an  algorithm  that  visits  all  link  tables  l\  . . . ln  and  74  . . . rn  in  such  a way  that,  between 
visits,  exactly  one  link  changes  from  j to  0 and  another  from  0 to  j,  for  some  index  j. 
(In  other  words,  every  step  removes  some  subtree  j from  the  binary  tree  and  places  it 
elsewhere,  preserving  preorder.) 

26.  [M31]  ( The  Kreweras  lattice.)  Let  F and  F'  be  n-node  forests  with  their  nodes 
numbered  1 to  n in  preorder.  We  write  F K F'  (“F  coalesces  F'”)  if  j and  k are 
siblings  in  F whenever  they  are  siblings  in  F',  for  1 < j < k < n.  Figure  60  illustrates 
this  partial  ordering  in  the  case  n = 4;  each  forest  is  encoded  by  the  sequence  ci . . . c„ 
of  (10)  and  (11),  which  specifies  the  depth  of  each  node.  (With  this  encoding,  j and  k 
are  siblings  if  and  only  if  c,  = ck  < Cj+i, . . . , Ck- 1.) 


Fig.  60.  The  Kreweras  lattice  of  order  4.  Each  forest  is  represented  by 
its  sequence  of  node  depths  C1C2C3C4  in  preorder.  (See  exercises  26-28.) 

a)  Let  77  be  a partition  of  {1, . . . , n}.  Show  that  there  exists  a forest  F,  with  nodes 
labeled  (1, . . . , n)  in  preorder  and  with 

j = k (modulo  77)  j is  a sibling  of  k in  F, 
if  and  only  if  77  satisfies  the  noncrossing  property 
i<j<k<l  and  i = k and  j = l (modulo  77)  implies  i=j  = k = l (modulo  77). 

b)  Given  any  two  n-node  forests  F and  F',  explain  how  to  compute  their  least  upper 
bound  F V F',  the  element  such  that  F K G and  F'  K G if  and  only  if  F V F'  K G. 

c)  When  does  F'  cover  F with  respect  to  the  relation  K?  (See  exercise  7.2.1.4-55.) 

d)  Show  that  if  F'  covers  F,  it  has  exactly  one  less  leaf  than  F. 

e)  How  many  forests  cover  F,  when  node  k has  ek  children  for  1 < k < n? 

f)  Using  the  definition  of  duality  in  exercise  19,  what  is  the  dual  of  the  forest  (2)? 
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g)  Prove  that  F K 7 holds  if  and  only  if  F'D  K F n . (Because  of  this  property,  dual 
elements  have  been  placed  symmetrically  about  the  center  of  Fig.  60.) 

h)  Given  any  two  n-node  forests  F and  F' , explain  how  to  compute  their  greatest 
lower  bound  F A F'\  that  is,  G K F and  G K F'  if  and  only  if  G K F A F' . 

i)  Does  this  lattice  satisfy  a semimodular  law  analogous  to  exercise  7.2.1.5-12(f)? 

► 27.  [M33]  ( The  Tamari  lattice.)  Continuing  exercise  26,  let  us  write  F H F'  if  the 
j th  node  in  preorder  has  at  least  as  many  descendants  in  F'  as  it  does  in  F.  for  all  j. 
In  other  words,  if  F and  F'  are  characterized  by  their  scope  sequences  s,  ...  sn  and 
s'l  • • • s'n  as  in  Table  2,  we  have  F d F'  if  and  only  sJ  < s'-  for  1 < j < n.  (See  Fig.  61.) 

0123  3210 


(a)  (b) 

Fig.  61.  The  Tamari  lattice  of  order  4.  Each  forest  is  represented  by 

its  sequences  of  (a)  node  depths  and  (b)  descendant  counts,  in  preorder. 

(See  exercises  26-28.) 

a)  Show  that  the  scope  coordinates  min(s1,  si)  min(s2,  s'2) . . . min(sn,  s'n)  define  a 
forest  that  is  the  greatest  lower  bound  of  F and  F' . (We  denote  it  by  F 1 F' .) 
Hint:  Prove  that  si . . . sn  corresponds  to  a forest  if  and  only  if  0 < k < s3  implies 
Sj+k  + k < Sj,  for  0 < j < n,  if  we  define  so  = n. 

b)  When  does  F'  cover  F in  this  partial  ordering? 

c)  Prove  that  F F'  if  and  only  if  F'D  H F° . (Compare  with  exercise  26(g).) 

d)  Explain  how  to  compute  a least  upper  bound,  FT  F,  given  F and  F' . 

e)  Prove  that  F K F'  in  the  Kreweras  lattice  implies  F H F'  in  the  Tamari  lattice. 

f)  True  or  false:  F A F'  H F J_  F' . 

g)  True  or  false:  FVF'kFT  F'. 

h)  What  are  the  longest  and  shortest  paths  from  the  top  of  the  Tamari  lattice  to  the 
bottom,  when  each  forest  of  the  path  covers  its  successor?  (Such  paths  are  called 
maximal  chains  in  the  lattice;  compare  with  exercise  7.2.1.4-55(h).) 

28.  [. M26 ] ( The  Stanley  lattice.)  Continuing  exercises  26  and  27,  let  us  define  yet 
another  partial  ordering  on  n-node  forests,  saying  that  F C /■  whenever  the  depth 
coordinates  c1...cn  and  c\  . . . c'n  satisfy  Cj  < c'  for  1 < j < n.  (See  Fig.  62.) 

a)  Prove  that  this  partial  ordering  is  a lattice,  by  explaining  how  to  compute  the 
greatest  lower  bound  FC\F  and  least  upper  bound  FUF'  of  any  two  given  forests. 

b)  Show  that  Stanley’s  lattice  satisfies  the  distributive  laws 

Fn(GUF)  = (FnG)U(Fn  H),  F U (G  n H)  = (F  U G)  n (F  U H). 
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Fig.  62.  The  Stanley  lattice  of  order  4.  Each 
forest  is  represented  by  its  sequence  of  node 
depths  in  preorder.  (See  exercises  26-28.) 


c)  When  does  F'  cover  F in  this  lattice? 

d)  True  or  false:  F C G if  and  only  if  FR  C GR. 

e)  Prove  that  F C F'  in  the  Stanley  lattice  whenever  F H F'  in  the  Tamari  lattice. 

29.  [HMS1 } The  covering  graph  of  a Tamari  lattice  is  sometimes  known  as  an  “associa- 
hedron,”  because  of  its  connection  with  the  associative  law  (14),  proved  in  exercise 
27(b).  The  associahedron  of  order  4,  depicted  in  Fig.  61,  looks  like  it  has  three  square 
faces  and  six  faces  that  are  regular  pentagons.  (Compare  with  Fig.  43  in  exercise 
7.2.1.2-60,  which  shows  the  “permutahedron”  of  order  4,  a well-known  Archimedean 
solid.)  Why  doesn’t  Fig.  61  show  up  in  classical  lists  of  uniform  polyhedra? 

30.  [MZ6]  The  footprint  of  a forest  is  the  bit  string  f\ . . . fn  defined  by 

fj  = [node  j in  preorder  is  not  a leaf]. 

a)  If  F has  footprint  fi  ...  fn,  what  is  the  footprint  of  F°1  (See  exercise  27.) 

b)  How  many  forests  have  the  footprint  10101101111110000101010001011000? 

c)  Prove  that  fj  = [dj  =0],  for  1 < j < n,  in  the  notation  of  (6). 

d)  Two  elements  of  a lattice  are  called  complementary  if  their  greatest  lower  bound 
is  the  bottom  element  while  their  least  upper  bound  is  the  top  element.  Show  that 
F and  F'  are  complementary  in  the  Tamari  lattice  if  and  only  if  their  footprints 
are  complementary,  in  the  sense  that  f[...  f'n-\  — f\  - ■ ■ fn- 1- 

► 31 . [M28]  A binary  tree  with  n internal  nodes  is  called  degenerate  if  it  has  height  n — 1. 

a)  How  many  n-node  binary  trees  are  degenerate? 

b)  We’ve  seen  in  Tables  1,  2,  and  3 that  binary  trees  and  forests  can  be  encoded  by 
various  n-tuples  of  numbers.  For  each  of  the  encodings  ci  . . . cn,  d\ . . . dn,  ei  . . . e„, 
ki  ...kn,  pi  ..  ,pn,  sj  ...  sn,  Ui  ...  un,  and  zi  ...  zn,  explain  how  to  see  at  a glance 
if  the  corresponding  binary  tree  is  degenerate. 

c)  True  or  false:  If  F is  degenerate,  so  is  Fn . 

d)  Prove  that  if  F and  F'  are  degenerate,  so  are  F A F'  = FTF'  and  FYF'  = FT F' . 

► 32.  [M30]  Prove  that  if  F H F' , there  is  a forest  F"  such  that  for  all  G we  have 

F'lG  = F if  and  only  if  FHGHF". 

Consequently  the  semidistributive  laws  hold  in  the  Tamari  lattice: 

F .LG  = F±H  implies  F1(GTF)  = F1G; 

FTG  = FTF  implies  FT  (Glff)  = FT  G. 
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► 33. 

a) 


[M27]  ( Permutation  representation  of  trees.)  Let  a be  the  cycle  (12  ...  n). 
Given  any  binary  tree  whose  nodes  are  numbered  1 to  n in  symmetric  order,  prove 
that  there  is  a unique  permutation  A of  {1, . . . , n}  such  that,  for  I < A < n. 


LLINKCA]  = 


AA , 

0, 


if  k\  < A; 
otherwise: 


RLINKfA]  = 


A' <r  A, 

0, 


if  kaX  > A; 
otherwise. 


Thus  A neatly  packs  2 n link  fields  into  a single  n-e lenient  array. 

b)  Show  that  this  permutation  A is  particularly  easy  to  describe  in  cycle  form  when 
the  binary  tree  is  the  left-sibling/right-child  representation  of  a forest  F.  What  is 
the  cycle  form  of  A (F)  when  F is  the  forest  in  (2)? 

c)  Find  a simple  relation  between  A (F)  and  the  dual  permutation  A ( F ! ) . 

d)  Prove  that,  in  exercise  26,  F'  covers  F if  and  only  if  A(F')  = ( j k)  X(F ),  where 
j and  A are  siblings  in  F. 

e)  Consequently  the  number  of  maximal  chains  in  the  Kreweras  lattice  of  order  n 
is  the  number  of  ways  to  factor  an  n-cycle  as  a product  of  n — 1 transpositions. 
Evaluate  this  number.  Hint:  See  Eq.  1.2. 6 (16). 

34.  [M25]  (R.  P.  Stanley.)  Show  that  the  number  of  maximal  chains  in  the  Stanley 
lattice  of  order  n is  (n(n  - l)/2)!/(l’I-13"-2  . . . (2 n - 5)2(2n  - 3)1). 

35.  [HM37]  (D.  B.  Tyler  and  D.  R.  Hickerson.)  Explain  why  the  denominators  of  the 
asymptotic  formula  (16)  are  all  powers  of  2. 


► 36.  [ M25 ] Analyze  the  ternary  tree  generation  algorithm  of  exercise  20(b).  Hint: 
There  are  (2n  + l)-1  (3")  ternary  trees  with  n internal  nodes,  by  exercise  2.3.4.4-11. 

► 37.  [MfO]  Analyze  the  Zaks-Richards  algorithm  for  generating  all  trees  with  a given 
distribution  n0,  nx,  n2,  . . . , nt  of  degrees  (exercise  21).  Hint:  See  exercise  2.3.4.4-32. 

38.  [M22]  What  is  the  total  number  of  memory  references  performed  by  Algorithm  L, 
as  a function  of  n ? 


39.  [22]  Prove  formula  (23)  by  showing  that  the  elements  of  Apq  in  (5)  correspond  to 
Young  tableaux  with  two  rows. 

40.  [M22]  (a)  Prove  that  Cpq  is  odd  if  and  only  if  p & (q  + 1)  = 0,  in  the  sense  that 
the  binary  representations  of  p and  q + 1 have  no  bits  in  common,  (b)  Therefore  Cn  is 
odd  if  and  only  if  n + 1 is  a power  of  2. 

41.  [M21  ] Show  that  the  ballot  numbers  have  a simple  generating  function  Cpqwpzq. 

► 42.  [ M22 ] How  many  unlabeled  forests  with  n nodes  are  (a)  self-conjugate?  (b)  self- 
transpose? (c)  self-dual?  (See  exercises  11,  12,  19,  and  26.) 

43.  [M21]  Express  Cpq  in  terms  of  the  Catalan  numbers  (C0,  Cu  C2, . . . ),  aiming  for 
a formula  that  is  simple  when  q - p is  small.  (For  example,  C{q_2)q  = Cq  - Cq-X.) 

► 44.  [M2 7]  Prove  that  Algorithm  B makes  only  8|  + 0(n-1)  references  to  memory  per 
binary  tree  visited. 

45.  [M26]  Analyze  the  memory  references  made  by  the  algorithm  in  exercise  22.  How 
does  it  compare  to  Algorithm  B? 

46.  [M30]  (Generalized  Catalan  numbers.)  Generalize  (21)  by  defining 

Cpq(x)  = Cp{q_x)(x)  + xv  pC(p_1)9(x),  if  0 < p < 7 ^ 0;  Coo(x)  = 1; 
and  Cvq(x)  = 0 if  p < 0 or  p > q-  thus  Cpq  = Cpq(  1).  Also  let  Cn(x ) = Cnn(x),  so  that 
(C0(x),C1(x),...)  = <1, 1, 1 + m,  l + 2x  + x2  + x3, 1 + 3z  + 3z2 + 3x3  + 2a;4 + X5 + X6, ...). 
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a)  Show  that  [xk]  Cpq  (x)  is  the  number  of  paths  from  @ to  (00)  in  (28)  that  have 
area  k , where  the  “area”  of  a path  is  the  number  of  rectangular  cells  above  it. 
(Thus  an  L-shaped  path  has  the  maximum  possible  area,  p(q  — p)  + (|).) 

b)  Prove  that  Cn(x)  = ZCl  + -+C"  = path  length(F)  ^ gummed  oyer  ^ 

n- node  forests  F. 

c)  If  C(x,z)  = Y.^=o  Cn(x)zn,  show  that  C(x,z)  = 1 + zC(x,  z)C(x,xz). 

d)  Furthermore,  C(x,  z)C(x,  xz) . . . C(x,  xrz)  - E^O^p+r)^)^. 

47.  [M2 7]  Continuing  the  previous  exercise,  generalize  the  identity  (27). 

48.  [M2S]  (F.  Ruskey  and  A.  Proskurowski.)  Evaluate  Cpq(x)  when  x = —1,  and  use 
this  result  to  show  that  no  “perfect”  Gray  code  for  nested  parentheses  is  possible  when 
n > 5 is  odd. 

49.  [17]  What  is  the  lexicographically  millionth  string  of  15  nested  parenthesis  pairs? 

50.  [20]  Design  the  inverse  of  Algorithm  U:  Given  a string  m . . . a2n  of  nested  paren- 
theses, determine  its  rank  JV  — 1 in  lexicographic  order.  What  is  the  rank  of  (1)? 

51.  [ M22 ] Let  Z\Z2  . . . z„  be  the  complement  of  Z1Z2  . . . zn  with  respect  to  2 n;  in  other 
words,  Zj  =2  n - Zj,  where  Zj  is  defined  in  (8).  Show  that  if  ZiZ2  ...z„  is  the  (N  + l)st 
n-combination  of  (0, 1, . . . , 2n  - 1}  generated  by  Algorithm  7.2.1.3L,  then  ziz2  . . . zn 
is  the  (N  — Kn N + l)st  n-combination  of  (1,  2, . . . , 2n}  generated  by  the  algorithm  of 
exercise  2.  (Here  Kn  denotes  the  nth  Kruskal  function,  defined  in  7.2.1.3-(6o).) 

52.  [M23]  Find  the  mean  and  variance  of  the  quantity  dn  in  Table  1,  when  nested 
parentheses  ai  . . . a2n  are  chosen  at  random. 

53.  [ M28 ] Let  X be  the  distance  from  the  root  of  an  extended  binary  tree  to  the 
leftmost  external  node,  (a)  What  is  the  expected  value  of  X,  when  all  binary  trees  with 
n nodes  are  equally  likely?  (b)  What  is  the  expected  value  of  X in  a random  binary 
search  tree,  constructed  by  Algorithm  6.2.2T  from  a random  permutation  K\  . . . Krl ? 
(c)  What  is  the  expected  value  of  A in  a random  degenerate  binary  tree,  in  the  sense 
of  exercise  31?  (d)  What  is  the  expected  value  of  2X  in  all  three  cases? 

54.  [HM29]  What  are  the  mean  and  variance  of  Ci  + • • • + c„?  (See  exercise  46.) 

55.  [HM33]  Evaluate  C'pq{  1),  the  total  area  of  all  the  paths  in  exercise  46(a). 

56.  [M23]  (Renzo  Sprugnoli,  1990.)  Prove  the  summation  formula 


m — 1 

^ ' CkCn-l-k  — ~ Cn  + 

k= 0 ^ 


2m  — n /2m\  /2n  — 2m  \ 
2n(n  + 1)  V m / V n — m /’ 


for  0 < m < n. 


57.  [M28]  Express  the  sums  Sp(a,b)  = J2k>o  (a-k)  (b2-k)  kP  in  closed  form  for  p — 0, 
1,  2,  3,  and  use  these  formulas  to  prove  (30).' 

58.  [HM34]  Let  timn  be  the  number  of  n-node  binary  trees  in  which  external  node  m 

appears  at  level  l when  the  external  nodes  are  numbered  from  0 to  n in  symmetric 
order.  Also  let  tmn  = so  that  tmn/Cn  is  the  average  level  of  external 

node  m;  and  let  t(ui,  z ) be  the  super  generating  function 

'y  ~ tmnw  z — (1  -\-w)z  + (3+4w+3tu2)22  + (9+13w+13«;2-(-9tt)3)23  + • • • . 

m,n 


Prove  that  t(w,z ) = ( C{z ) - wC(wz))/(l  - w)  - 1 + zC(z)t(w,z ) + wzC(wz)t(w,z), 
and  deduce  a simple  formula  for  the  numbers  tmn. 
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59.  [HM29]  Similarly,  let  Timn  count  all  n-node  binary  trees  in  which  internal  node  m 
appears  at  level  l.  Find  a simple  formula  for  Tm„  = lTim„. 


► 60.  [M26]  ( Balanced  strings.)  A string  a of  nested  parentheses  is  atomic  if  it  has 
the  form  (a')  where  a'  is  nested;  every  nested  string  can  be  represented  uniquely  as  a 
product  of  atoms  c*i  . . . ar.  A string  with  equal  numbers  of  left  and  right  parentheses 
is  called  balanced ; every  balanced  string  can  be  represented  uniquely  as  /?j  . . . j3r  where 
each  Pj  is  either  an  atom  or  a co-atom  (the  reverse  of  an  atom).  The  defect  of  a 
balanced  string  is  half  the  length  of  its  co-atoms.  For  example,  the  balanced  string 


(()))((())))))(()((())((())(() 


has  the  factored  form  P\P2P3PiP5P&P7ps,  — <^iO:^a3a^a§a6a^as,  with  four  atoms 
and  four  co-atoms;  its  defect  is  |a2a4a5a7|/2  = 9. 

a)  Prove  that  the  defect  of  a balanced  string  is  the  number  of  indices  k for  which  the 
fcth  right  parenthesis  precedes  the  fcth  left  parenthesis. 

b)  li  Pi  ...  pr  is  balanced,  we  can  map  it  into  a nested  string  by  simply  reversing 
its  co-atoms.  But  the  following  mapping  is  more  interesting,  because  it  produces 
unbiased  (uniformly  random)  nested  strings  from  unbiased  balanced  strings:  Let 
there  be  s co-atoms  /3q  = afx , . . . , f3is  = af) . Replace  each  co-atom  by  (;  then 
append  the  string  )ais  . . . )a'il,  where  a3  = (a'  ).  For  example,  the  string  above 
is  mapped  into  aq  (q3  ( (a6  (.a8)a'7)a'h)a'A)a2,  which  just  happens  to  equal  the 
string  (i)  illustrated  at  the  beginning  of  this  section. 

Design  an  algorithm  that  applies  this  mapping  to  a given  balanced  string  bi  . . . b2n  • 

c)  Also  design  an  algorithm  for  the  inverse  mapping:  Given  a nested  string  a = 
°i  • • • a2n  and  an  integer  l with  0 < l < n,  compute  a balanced  string  ft  = b\. . . b2n 
of  defect  l for  which  fi  i-»  a.  What  balanced  string  of  defect  11  maps  into  (i)? 

► 61.  [M26]  (Raney  s Cycle  Lemma.)  Let  b\b2  ...  6jv  be  a string  of  nonnegative  integers 
such  that  f = N — bi  - b2 bN  > 0. 

a)  Prove  that  exactly  / of  the  cyclic  shifts  bj+1  . . . bNb\  . . . bj  for  1 < j < N satisfy 
the  preorder  degree  sequence  property  in  exercise  20. 

b)  Design  an  efficient  algorithm  to  determine  all  such  j,  given  bib2  . . . bN. 

c)  Explain  how  to  generate  a random  forest  that  has  N = n0  + f nt  nodes,  with 

exactly  nj  nodes  of  degree  j.  (For  example,  we  obtain  random  n-node  f-ary  trees 
as  a special  case  of  this  general  procedure  when  N = tn  + 1,  no  = (t  — l)n+l, 
ni  = • • ■ = nt-i  = 0,  and  nt  = n.) 

62.  [22]  A binary  tree  can  also  be  represented  by  bit  strings  (lx  n ...  rn),  where 
lj  and  Tj  tell  whether  the  left  and  right  subtrees  of  node  j in  preorder  are  nonempty. 
(See  Theorem  2.3.1A.)  Prove  that  if  l\...ln  and  ri...rn  are  arbitrary  bit  strings 

with  h d F ln  + n + F r„  = n - 1,  exactly  one  cyclic  shift  (/J+1  . . ,lnh  . . ,lu 

ri+ 1 • • • rnri  . . . rj)  yields  a valid  binary  tree  representation,  and  explain  how  to  find  it. 

63.  [16]  If  the  first  two  iterations  of  Remy’s  algorithm  have  produced  lifSp  , what 

decorated  binary  trees  are  possible  after  the  next  iteration?  [ijlo] 

64.  [20]  What  sequence  of  A'  values  in  Algorithm  R corresponds  to  the  decorated 
trees  of  (34),  and  what  are  the  final  values  of  L0L 1 . . . L12 ? 

65.  [38]  Generalize  Remy’s  algorithm  (Algorithm  R)  to  f-ary  trees. 
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66.  [21]  A Schroder  tree  is  a binary  tree  in  which  every  nonnull  right  link  is  colored 
either  white  or  black.  The  number  Sn  of  n-node  Schroder  trees  is 

n = 0 1 2 3 4 5 6 7 8 9 10  11  12 

Sn  = 1 1 3 11  45  197  903  4279  20793  103049  518859  2646723  13648869 

for  small  n.  For  example,  53  = 11  because  the  possibilities  are 

(White  links  are  “hollow”;  external  nodes  have  also  been  attached.) 

a)  Find  a simple  correspondence  between  Schroder  trees  with  n internal  nodes  and 
ordinary  trees  with  n + 1 leaves  and  no  nodes  of  degree  one. 

b)  Devise  a Gray  code  for  Schroder  trees. 

67.  [M22]  What  is  the  generating  function  S(z)  = Snzn  for  Schroder  numbers? 

68.  [10]  What  is  the  Christmas  tree  pattern  of  order  0? 

69.  [20]  Are  the  Christmas  tree  patterns  of  orders  6 and  7 visible  in  Table  4,  possibly 
in  slight  disguise? 

► 70.  [20]  Find  a simple  rule  that  defines,  for  every  bit  string  o,  another  bit  string  o' 
called  its  mate,  with  the  following  properties:  (i)  a"  = a;  (ii)  ]o'\  = |<r|;  (iii)  either 
o C o'  or  o'  C o;  (iv)  v(o)  + v{o')  = |er|. 

71.  [M21]  Let  Mtn  be  the  size  of  the  largest  possible  set  5 of  n-bit  strings  with  the 
property  that,  if  o and  r are  members  of  S with  o C r,  then  i/(r)  < v(o)  + t.  (Thus, 
for  example,  M\n  = Mn  by  Sperner’s  theorem.)  Find  a formula  for  Mtn- 

► 72.  [ M28 ] If  you  start  with  a single  row  o\  02  ■ ■ ■ os  of  length  s and  apply  the  growth 
rule  (36)  repeatedly  n times,  how  many  rows  do  you  obtain? 

73.  [15]  In  the  Christmas  tree  pattern  of  order  30,  what  are  the  first  and  last  elements 
of  the  row  that  contains  the  bit  string  011001001000011111101101011100? 

74.  [ M26 ] Continuing  the  previous  exercise,  how  many  rows  precede  that  row? 

► 75.  [HM23]  Let  ( r\n ^ , r ^ , . . . , r£}1)  be  the  row  numbers  in  which  the  Christmas  tree 

pattern  of  order  n has  n — 1 entries;  for  example,  Table  4 tells  us  that  (rj8' , . . . , ) = 

(20,40,54,62,66,68,69).  Find  formulas  for  — rjn)  and  for  limn_+00  r^/Mn- 

76.  [HM46]  Study  the  limiting  shape  of  the  Christmas  tree  patterns  as  n — > 00.  Does 
it,  for  example,  have  a fractal  dimension  under  some  appropriate  scaling? 

77.  [21  ] Design  an  algorithm  to  generate  the  sequence  of  rightmost  elements  a\  . . ,an 
in  the  rows  of  the  Christmas  tree  pattern,  given  n.  Hint:  These  bit  strings  are 
characterized  by  the  property  that  01  + • • • + a*  > k/ 2 for  0 < k < n. 

78.  [20]  True  or  false:  If  ay  ...  o3  is  a row  of  the  Christmas  tree  pattern,  so  is 
erf*  . . . erf  (the  reverse  sequence  of  reverse  complements). 

79.  [M26]  The  number  of  permutations  pi-..pn  that  have  exactly  one  “descent” 
where  pk  > Pk+i  is  the  Eulerian  number  = 2n  — n — 1,  according  to  Eq.  5.1.3-(i2). 
The  number  of  entries  in  the  Christmas  tree  pattern,  above  the  bottom  row,  is  the  same. 

a)  Find  a combinatorial  explanation  of  this  coincidence,  by  giving  a one-to-one  cor- 
respondence between  one-descent  permutations  and  unsorted  bit  strings. 

b)  Show  that  two  unsorted  bit  strings  belong  to  the  same  row  of  the  Christmas  tree 
pattern  if  and  only  if  they  correspond  to  permutations  that  define  the  same  P 
tableau  under  the  Robinson-Schensted  correspondence  (Theorem  5.1.4A). 
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80.  [30]  Say  that  two  bit  strings  are  concordant  if  we  can  obtain  one  from  the  other 

via  the  transformations  010  100  or  101  110  on  substrings.  For  example,  the 

strings 

011100  B-  011010  B-  010110  B-  010101  b-  011001 

$ $ 

100110  B 100101  B 101001  b 110001 

are  mutually  concordant,  but  no  other  string  is  concordant  with  any  of  them. 

Prove  that  strings  are  concordant  if  and  only  if  they  belong  to  the  same  column 
of  the  Christmas  tree  pattern  and  to  rows  of  the  same  length  in  that  pattern. 

81.  [M30]  A biclutter  of  order  (n,n')  is  a family  S of  bit  string  pairs  (o,o'),  where 
M = n an(l  k'l  = n1 , with  the  property  that  distinct  members  (a,  o')  and  (t,t')  of  S 
are  allowed  to  satisfy  o C r and  o'  C r'  only  if  o / r and  o'  / t'. 

Use  Christmas  tree  patterns  to  prove  that  S contains  at  most  Mn+n/  string  pairs. 

► 82.  [M26]  Let  E(f)  be  the  number  of  times  Algorithm  H evaluates  the  function  /. 

a)  Show  that  Mn  < E(f)  < Mn+ 1,  with  equality  when  / is  constant. 

b)  Among  all  / such  that  E(f)  = Mn,  which  one  minimizes  /(cr)? 

c)  Among  all  / such  that  E(f)  = Mn+1,  which  one  maximizes  /(cr)? 

83.  [M20]  (G.  Hansel.)  Show  that  there  are  at  most  2>Mn  monotone  Boolean  functions 
/ ( x\ , . . . , xn)  of  n Boolean  variables. 

► 84.  [HM2 7]  (D.  Kleitman.)  Let  A be  an  m X n matrix  of  real  numbers  in  which  every 
column  v has  length  ||u||  > 1,  and  let  6 be  an  m-dimensional  column  vector.  Prove  that 
at  most  Mn  column  vectors  x = (ai, . . . ,an)T,  with  components  aj  = 0 or  1,  satisfy 
II  Ax  - 6||  < 2 • Hint:  Use  a construction  analogous  to  the  Christmas  tree  pattern. 

85.  [HM35]  (Philippe  Golle.)  Let  V be  any  vector  space  contained  in  the  set  of 
all  real  n-dimensional  vectors,  but  containing  none  of  the  unit  vectors  (1,0,...,  0), 
(0, 1, 0, . . . , 0),  ...,  (0, . . . , 0, 1).  Prove  that  V contains  at  most  Mn  vectors  whose 
components  are  all  0 or  1;  furthermore  the  upper  bound  Mn  is  achievable. 

86.  [15]  If  (2)  is  regarded  as  an  oriented  forest  instead  of  an  ordered  forest,  what 
canonical  forest  corresponds  to  it?  Specify  that  forest  both  by  its  level  codes  ci  . . . ci5 
and  its  parent  pointers  pi  . . .pi5. 

87.  [M20]  Let  F be  an  ordered  forest  in  which  the  fcth  node  in  preorder  appears  on 
level  ca,  and  has  parent  pk,  where  pk  = 0 if  that  node  is  a root. 

a)  How  many  forests  satisfy  the  condition  ck  = pk  for  1 < k < n? 

b)  Suppose  F and  F'  have  level  codes  cx  . . . cn  and  ci...c^,  respectively,  as  well 
as  parent  links  P\.  ■ .pn  and  p\  ...p'n.  Prove  that,  lexicographically,  cx  . . ,cn  < 
ci  ...c'n  if  and  only  if  Pi-..pn  < pi  . . .p'n. 

88.  [M20]  Analyze  Algorithm  O:  How  often  is  step  04  performed?  What  is  the  total 
number  of  times  pk  is  changed  in  step  05? 

89.  [Mf6]  How  often  does  step  05  set  pk  <—  Pj? 

► 90.  [M27]  If  pi  . . ,pn  is  a canonical  sequence  of  parent  pointers  for  an  oriented  forest, 
the  graph  with  vertices  (0, 1, ... , n}  and  edges  {k  — pk  | 1 < k < n}  is  a free  tree, 
namely  a connected  graph  with  no  cycles.  (See  Theorem  2.3.4. 1 A.)  Conversely,  every 
free  tree  corresponds  to  at  least  one  oriented  forest  in  this  way.  But  the  parent  pointers 
Oil  and  000  both  yield  the  same  free  tree  )~;  similarly,  012  and  010  both  yield 
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The  purpose  of  this  exercise  is  to  restrict  the  sequences  pi  . . .pn  further  so  that 
each  free  tree  is  obtained  exactly  once.  We  proved  in  2.3.4.4-(g)  that  the  number  of 
structurally  different  free  trees  on  n + 1 vertices  has  a fairly  simple  generating  function, 
by  showing  that  a free  tree  always  has  at  least  one  centroid. 

a)  Show  that  a canonical  n-node  forest  corresponds  to  a free  tree  with  a single  centroid 
if  and  only  if  no  tree  in  the  forest  has  more  than  [n/ 2J  nodes. 

b)  Modify  Algorithm  O so  that  it  generates  all  sequences  pi  . . ,pn  that  satisfy  (a). 

c)  Explain  how  to  find  all  pi  . . ,pn  for  free  trees  that  have  two  centroids. 

91.  [M3 7]  (Nijenhuis  and  Wilf.)  Show  that  a random  oriented  tree  can  be  generated 
with  a procedure  analogous  to  the  random  partition  algorithm  of  exercise  7.2.1.4-47. 

92.  [15]  Are  the  first  and  last  spanning  trees  visited  by  Algorithm  S adjacent,  in  the 
sense  that  they  have  n — 2 edges  in  common? 

93.  [20]  When  Algorithm  S terminates,  has  it  restored  the  graph  to  its  original  state? 

94.  [22]  Algorithm  S needs  to  “prime  the  pump”  by  finding  an  initial  spanning  tree 
in  step  SI.  Explain  how  to  do  that  task. 

95.  [26]  Complete  Algorithm  S by  implementing  the  bridge  test  in  step  S8. 

► 96.  [28]  Analyze  the  approximate  running  time  of  Algorithm  S when  the  given  graph 
is  simply  (a)  a path  Pn  of  length  n — 1;  (b)  a cycle  Cn  of  length  n. 

97.  [15]  Is  (48)  a series-parallel  graph? 

98.  [16]  What  series-parallel  graph  corresponds  to  (53)  if  A is  taken  to  be  serial ? 

► 99.  [30]  Consider  a series-parallel  graph  represented  by  a tree  as  in  (53),  together 
with  node  values  that  satisfy  (55).  These  values  define  a spanning  tree  or  a near  tree, 
according  as  vv  is  1 or  0 at  the  root  p.  Show  that  the  following  method  will  generate 
all  of  the  other  configs  of  the  root: 

i)  Begin  with  all  uneasy  nodes  active,  other  nodes  passive. 

ii)  Select  the  rightmost  active  node,  p,  in  preorder;  but  terminate  if  all  nodes  are 
passive. 

iii)  Change  dp  4—  Tdp , update  all  values  in  the  tree,  and  visit  the  new  config. 

iv)  Activate  all  uneasy  nodes  to  the  right  of  p. 

v)  If  dp  has  run  through  all  children  of  p since  p last  became  active,  make  node  p 
passive.  Return  to  (ii). 

Also  explain  how  to  perform  these  steps  efficiently.  Hints:  To  implement  step  (v), 
introduce  a pointer  zp;  make  node  p passive  when  dp  becomes  equal  to  zp,  and  at  such 
times  also  reset  zp  to  the  previous  value  of  dp.  To  implement  steps  (ii)  and  (iv),  use 
focus  pointers  fp  analogous  to  those  in  Algorithms  7.2.1.1L  and  7.2.1. IK. 

100.  [40]  Implement  the  text’s  “Algorithm  S’ ” for  revolving-door  generation  of  all 
spanning  trees,  by  combining  Algorithm  S with  the  ideas  of  exercise  99. 

101.  [46]  Is  there  a simple  revolving-door  way  to  list  all  nn~ 2 spanning  trees  of  the 
complete  graph  Krl?  (The  order  produced  by  Algorithm  S is  quite  complicated.) 

102.  [46]  An  oriented  spanning  tree  of  a directed  graph  Bonn  vertices,  also  known 
as  a “spanning  arborescence,”  is  an  oriented  subtree  of  D containing  n — 1 arcs.  The 
matrix  tree  theorem  (exercise  2.3.4.2-19)  tells  us  that  the  oriented  subtrees  having  a 
given  root  can  readily  be  counted  by  evaluating  an  (n  — 1)  x (n  — 1)  determinant. 

Can  those  oriented  subtrees  be  listed  in  a revolving-door  order,  always  removing 
one  arc  and  replacing  it  with  another? 
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► 103.  [HM39]  ( Sandpiles .)  Consider  any  digraph  D on  vertices  Vo,  Vi,  - . . , V„  with  e,3 
arcs  from  V to  Vj,  where  etl  = 0.  Assume  that  L)  has  at  least  one  oriented  spanning 
tree  rooted  at  Vo;  this  assumption  means  that,  if  we  number  the  vertices  appropriately, 
we  have  ei0  + ■ ■ ■ + e <«_!)  > 0 for  1 < i < n.  Let  dt  = ei0  + ■ ■ ■ + ein  be  the  total 
out-degree  of  V.  Put  xt  grains  of  sand  on  vertex  V for  0 < i < n,  and  play  the 
following  game:  If  > dl  for  any  i > 1,  decrease  xt  by  d,  and  set  Xj  A-  x.j  + el3  for 
all  j 7^  *•  (In  other  words,  pass  one  grain  of  sand  from  V through  each  of  its  outgoing 
arcs,  whenever  possible,  except  when  i = 0.  This  operation  is  called  “toppling”  V, 
and  a sequence  of  topplings  is  called  an  “avalanche.”  Vertex  V0  is  special;  instead  of 
toppling,  it  collects  particles  of  sand  that  essentially  leave  the  system.)  Continue  until 
X{  < di  for  1 < i < n.  Such  a state  x = (xi , . . . ,xn)  is  called  stable. 

a)  Prove  that  every  avalanche  terminates  in  a stable  state  after  a finite  number  of 
topplings.  Furthermore,  the  final  state  depends  only  on  the  initial  state,  not  on 
the  order  in  which  toppling  is  performed. 

b)  Let  a(x ) be  the  stable  state  that  results  from  initial  state  x.  A stable  state  is 
called  recurrent  if  it  is  a{x)  for  some  x with  xt  > dt  for  1 < i < n.  (Recurrent 
states  correspond  to  sandpiles  that  have  evolved  over  a long  period  of  time,  after 
new  grains  of  sand  are  repeatedly  introduced  at  random.)  Find  the  recurrent 
states  in  the  special  case  when  n = 4 and  when  the  only  arcs  of  D are 

Vi  -A  V0,  Vi  -A  V2,  V2  Vo,  V2  -a  Vi,  V3  -A  Vo,  V3  -A  V4,  V4  -a  V0,  V4  -a  V3. 

c)  Let  d = (di, . . . , dn).  Prove  that  x is  recurrent  if  and  only  if  x = a(x  + t),  where 
t is  the  vector  d - a(d). 

d)  Let  at  be  the  vector  (-eii,...,-ei(i_1),(ij,-ej(j+1),...1-ei„),  for  I < i < n- 
thus,  toppling  Vi  corresponds  to  changing  the  state  vector  x = (xu...,xn)  to 
x ~ ai.  Say  that  two  states  x and  x are  congruent , written  x = x' , if  x — x — 

mioi  -I hmnan  for  some  integers  mi,  ... , mn.  Prove  that  there  are  exactly  as 

many  equivalence  classes  of  congruent  states  as  there  are  oriented  spanning  trees 
in  D,  rooted  at  Vo.  Hint:  See  the  matrix  tree  theorem,  exercise  2.3.4.2-19. 

e)  If  x = x'  and  if  both  x and  x'  are  recurrent,  prove  that  x = x' . 

f)  Prove  that  every  congruence  class  contains  a unique  recurrent  state. 

g)  If  D is  balanced , in  the  sense  that  the  in-degree  of  each  vertex  equals  its  out-degree, 
prove  that  x is  recurrent  if  and  only  if  x = a(x  + a),  where  a = (e0i, . . . , e0n). 

h)  Illustrate  these  concepts  when  D is  a “wheel”  with  n spokes:  Let  there  be  3n  arcs, 
Vj  -A  Vo  and  Vj  «a  V,+i  for  1 < j < n,  regarding  Vn+i  as  identical  to  V.  Find 
a one-to-one  correspondence  between  the  oriented  spanning  trees  of  this  digraph 
and  the  recurrent  states  of  its  sandpiles. 

i)  Similarly,  analyze  the  recurrent  sandpiles  when  D is  the  complete  graph  on  n- 1-1 
vertices,  namely  when  e,3  = [i/  j]  for  0 < i,j  < n.  Hint:  See  exercise  6.4-31. 

► 104.  [HM21]  If  G is  a graph  on  n vertices  {V, . . . , V„},  with  eij  edges  between  V and 
Vj,  let  C(G)  be  the  matrix  with  entries  cl3  = -eij  + Sijdl,  where  d,  = en  + • • ■ + ein  is 
the  degree  of  V,.  Let  us  say  that  the  aspects  of  G are  the  eigenvalues  of  C(G),  namely 
the  roots  a0,  a„-i  of  the  equation  det  (al-C(G))  = 0.  Since  C(G)  is  a symmetric 
matrix,  its  eigenvalues  are  real  numbers,  and  we  can  assume  that  a0  < au  < • • • < an-i- 

a)  Prove  that  ao  = 0. 

b)  Prove  that  G has  exactly  c(G)  = c*i  . . . an-i/n  spanning  trees. 

c)  What  are  the  aspects  of  the  complete  graph  Kn7 
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105.  [HM38]  Continuing  exercise  104,  we  wish  to  prove  that  there  is  often  an  easy 
way  to  determine  the  aspects  of  G when  G has  been  constructed  from  other  graphs 
whose  aspects  are  known.  Suppose  G'  has  aspects  a'0,  a'n,_1  and  G"  has  aspects 

a'o,  . . . , what  are  the  aspects  of  G in  the  following  cases? 

a)  G = G’  is  the  complement  of  G' . (Assume  that  e'3  < [i^j]  in  this  case.) 

b)  G = G'  © G"  is  the  direct  sum  (juxtaposition)  of  G'  and  G" . 

c)  G = G'  — G"  is  the  join  of  G'  and  G" . 

d)  G = G'  □ G"  is  the  Cartesian  product  of  G'  and  G" . 

e)  G = L(G')  is  the  line  graph  of  G1 , when  G ' is  a regular  graph  of  degree  d'  (namely 
when  all  vertices  of  G'  have  exactly  d'  neighbors,  and  there  are  no  self-loops). 

f)  G = G'®G"  is  the  direct  product  (conjunction)  of  G'  and  G';,  when  G'  is  regular 
of  degree  d'  and  G"  is  regular  of  degree  d". 

g)  G = G'  eG"  is  the  strong  product  of  regular  graphs  G'  and  G". 

h)  G = G'  a G"  is  the  odd  product  of  regular  graphs  G'  and  G". 

i)  G = G'oG"  is  the  lexicographic  product  of  regular  graphs  G'  and  G". 

► 106.  [HM37]  Find  the  total  number  of  spanning  trees  in  (a)  an  m x n grid  Pm  □ Pn; 
(b)  an  mx  n cylinder  Pm  □ G„;  (c)  an  m x n torus  Gm  □ Cn.  Why  do  these  numbers 
tend  to  have  only  small  prime  factors?  Hint:  Show  that  the  aspects  of  Pn  and  C„  can 
be  expressed  in  terms  of  the  numbers  Okn  = 4 sin2  ~ . 

107.  [M24]  Determine  the  aspects  of  all  connected  graphs  that  have  n < 5 vertices 
and  no  self-loops  or  parallel  edges. 

108.  [HM40]  Extend  the  results  of  exercises  104-106  to  directed  graphs. 

109.  [M46]  Find  a combinatorial  explanation  for  the  fact  that  (57)  is  the  number  of 
spanning  trees  in  the  n-cube. 

► 110.  [ M27 ] Prove  that  if  G is  any  connected  multigraph  without  self-loops,  it  has 

c(G)  > s/( di  - 1) . .. (dn  - 1) 
spanning  trees,  where  dj  is  the  degree  of  vertex  j. 

111.  [05]  List  the  nodes  of  the  tree  (58)  in  postpreorder. 

112.  [15]  If  node  p of  a forest  precedes  node  q in  prepostorder  and  follows  it  in 
postpreorder,  what  can  you  say  about  p and  q? 

► 113.  [20]  How  do  prepostorder  and  postpreorder  of  a forest  F relate  to  prepostorder 
and  postpreorder  of  the  conjugate  forest  FR1  (See  exercise  13.) 

114.  [15]  If  we  want  to  traverse  an  entire  forest  in  prepostorder  using  Algorithm  Q, 
how  should  we  begin  the  process? 

115.  [20]  Analyze  Algorithm  Q:  How  often  is  each  step  performed,  during  the  com- 
plete traversal  of  a forest? 

► 116.  [28]  If  the  nodes  of  a forest  F are  labeled  1 to  n in  prepostorder,  say  that  node  k 
is  lucky  if  it  is  adjacent  to  node  k + 1 in  F,  unlucky  if  it  is  three  steps  away,  and 
ordinary  otherwise,  for  1 < k < n;  in  this  definition,  node  n + 1 is  an  imaginary 
super-root  considered  to  be  the  parent  of  each  root. 

a)  Prove  that  lucky  nodes  occur  only  on  even-numbered  levels;  unlucky  nodes  occur 
only  on  odd-numbered  levels. 

b)  Show  that  the  number  of  lucky  nodes  is  exactly  one  greater  than  the  number  of 
unlucky  nodes,  unless  n = 0. 
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117.  [21]  Continuing  exercise  116,  how  many  n-node  forests  contain  no  unlucky  nodes? 

118.  [. M28 ] How  many  lucky  nodes  are  present  in  (a)  the  complete  t-ary  tree  with 
- !)/(*-  1)  internal  nodes?  (b)  the  Fibonacci  tree  of  order  k,  with  Fk+1  - 1 internal 

nodes?  (See  2.3.4.5-(6)  and  Fig.  8 in  Section  6.2.1.) 

119.  [21]  The  twisted  binomial  tree  fn  of  order  n is  defined  recursively  by  the  rules 


To  = • , 


for  n > 0. 


(Compare  with  7.2.1.3-(2i);  we_ reverse  the  order  of  children  on  alternate  levels.)  Show 
that  prepostorder  traversal  of  Tn  has  a simple  connection  with  Gray  binary  code. 

120.  [22]  True  or  false:  The  square  of  a graph  is  Hamiltonian  if  the  graph  is  connected 
and  has  no  bridges. 

121.  [M3 j]  (F.  Neuman,  1964.)  The  derivative  of  a graph  G is  the  graph  G(l)  obtained 
by  removing  all  vertices  of  degree  1 and  the  edges  touching  them.  Prove  that,  when  T 
is  a free  tree,  its  square  T contains  a Hamiltonian  path  if  and  only  if  its  derivative  has 
no  vertex  of  degree  greater  than  4 and  the  following  two  additional  conditions  hold: 

i)  All  vertices  of  degree  3 or  4 in  T{,)  lie  on  a single  path. 

li)  Between  any  two  vertices  of  degree  4 in  T(,),  there  is  at  least  one  vertex  that  has 
degree  2 in  T. 


► 122.  [31]  (Dudeney’s  Digital  Century  puzzle.)  There  are  many  curious  ways  to  obtain 
the  number  100  by  inserting  arithmetical  operators  and  possibly  also  parentheses  into 
the  sequence  123456789.  For  example, 


100  — 1 + 2x3  + 4x5-64-7  + 8x9  = (1  + 2-  3- 4)  x (5 - 6-  7-  8- 9) 

= ( ( 1/ ( (2  + 3)/4  - 5 + 6))  x 7 + 8)  x 9 . 

a)  How  many  such  representations  of  100  are  possible?  To  make  this  question 
precise,  in  view  of  the  associative  law  and  other  algebraic  properties,  assume 
that  expressions  are  written  in  canonical  form  according  to  the  following  syntax: 

( expression ) -»  ( number ) | ( sum ) | ( product ) | ( quotient ) 

( sum ) -A  ( term ) + ( term ) | ( term ) - ( term ) | ( sum ) + { term ) | ( sum ) - ( term ) 

( term ) -»  ( number ) | ( product ) | ( quotient ) 

(product)  -A  (factor)  x (factor)  | (product)  x (factor)  | ((quotient))  x (factor) 
(quotient)  -A  ( factor )/( factor ) | ( product )/( factor ) | (( quotient ))/( factor ) 
(factor)  -+  (number)  | ((sum)) 

(number)  ->■  (digit) 


The  digits  used  must  be  1 through  9,  in  that  order. 

b)  Extend  problem  (a)  by  allowing  multidigit  numbers,  with  the  syntax 

(number)  ->■  (digit)  | ( number )( digit ) 

For  example,  100  = (l/(2  - 3 + 4))  x 567  - 89.  What  is  the  shortest  such 
representation?  What  is  the  longest? 

c)  Extend  problem  (b)  by  also  allowing  decimal  points: 

(number)  — ¥ (digit  string)  | .(digit  string) 

(digit  string)  -l  (digit)  | (digit  string )( digit ) 

For  example,  100  = (.1  - 2 - 34  x ,5)/(.6  - .789),  amazingly  enough. 
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123.  [21]  Continuing  the  previous  exercise,  what  are  the  smallest  positive  integers 
that  cannot  be  represented  using  conventions  (a),  (b),  (c)? 


► 124.  [40]  Experiment  with  methods  for  drawing  extended  binary  trees  that  are  in- 
spired by  simple  models  from  nature.  For  example,  we  can  assign  a value  v(x)  to  each 
node  x,  called  its  H orton- Strahler  number,  as  follows:  Each  external  (leaf)  node  has 
v(x)  = 0;  an  internal  node  with  children  (Z,r)  has  v(x)  = max(u(Z),  u(r))  + [u(Z)  = v(r)]. 
The  edge  from  internal  node  x to  its  parent  can  be  drawn  as  a rectangle  with  height 
h(v(x))  and  width  w(v(x)),  and  the  edge  rectangles  with  children  (l,r)  can  be  offset  by 
angles  9(v(l(x)),v(r(x))),  —8(v(r(x)),v(l(x))),  for  certain  functions  h,  w,  and  6.  The 
examples  in  Fig.  63  show  typical  results  when  we  choose  w(k)  = 3 + k,  h(k)  = 18 k, 
9(k,k)  = 30°,  9(j,k)  = ((k+l)/j)  x 20°  for  0 < k < j,  and  9(j,k)  = ((k-j)/k)  x 30° 
for  0 < j < k\  the  roots  appear  at  the  bottom.  Part  (a)  of  Fig.  63  is  the  binary  tree  (4); 
part  (b)  is  a random  100-node  tree  generated  by  Algorithm  R;  part  (c)  is  the  Fibonacci 
tree  of  order  11,  which  has  143  nodes;  and  part  (d)  is  a random  100-node  binary  search 
tree.  (The  trees  in  parts  (b),  (c),  and  (d)  clearly  belong  to  different  species.) 
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[This  subject]  has  a relation 
to  almost  every  species  of  useful  knowledge 
that  the  mind  of  man  can  be  employed  upon. 

— JAMES  BERNOULLI,  Ars  Conjectandi  (1713) 

7.2. 1.7.  History  and  further  references.  Early  work  on  the  generation  of 
combinatorial  patterns  began  as  civilization  itself  was  taking  shape.  The  story 
is  quite  fascinating,  and  we  will  see  that  it  spans  many  cultures  in  many  parts  of 
the  world,  with  ties  to  poetry,  music,  and  religion.  There  is  space  here  to  discuss 
only  some  of  the  principal  highlights;  but  perhaps  a few  glimpses  into  the  past 
will  stimulate  the  reader  to  dig  deeper  into  the  roots  of  the  subject,  as  the  world 
gets  ever  smaller  and  as  global  scholarship  continues  to  advance. 

Lists  of  binary  n-tuples  can  be  traced  back  thousands  of  years  to  ancient 
China,  India,  and  Greece.  The  most  notable  source  — because  it  still  is  a best- 
selling book  in  modern  translations  — is  the  Chinese  I Ohing  or  Yijing , whose 
name  means  “the  Bible  of  Changes.”  This  book,  which  is  one  of  the  five  classics 
of  Confucian  wisdom,  consists  essentially  of  26  = 64  chapters;  and  each  chapter 
is  symbolized  by  a hexagram  formed  from  six  lines,  each  of  which  is  either  - - 
(“yin”)  or  — -J“yang”).  For  example,  hexagram  1 is  pure  yang,  g;  hexagram  2 
is  pure  yin,  1 1 ; and  hexagram  64  intermixes  yin  and  yang,  with  yang  on  top:  O . 
Here  is  the  complete  list: 


4 5 6 7 8 9 10  11  12  13  14  15  16 


This  arrangement  of  the  64  possibilities  is  called  King  Wen’s  ordering,  because 
the  basic  text  of  the  I Ching  has  traditionally  been  ascribed  to  King  Wen  (c.  1100 
B.C.),  the  legendary  progenitor  of  the  Chou  dynasty.  Ancient  texts  are,  however, 
notoriously  difficult  to  date  reliably,  and  modern  historians  have  found  no  solid 
evidence  that  anyone  actually  compiled  such  a list  of  hexagrams  before  the  third 
century  B.C. 

Notice  that  the  hexagrams  of  (1)  occur  in  pairs:  Those  with  odd  numbers  are 
immediately  followed  by  their  top-to-bottom  reflections,  except  when  reflection 
would  make  no  change;  and  the  eight  symmetrical  diagrams  are  paired  with 
their  complements  (1  = 2,  27  = 28,  29  = 30,  61  = 62).  Hexagrams  that  are 
composed  from  two  trigrams  that  represent  the  four  basic  elements  heaven  (=), 
earth  (==),  fire  (z=),  and  water  (=-=)  have  also  been  placed  judiciously.  Otherwise 
the  arrangement  appears  to  be  essentially  random,  as  if  a person  untrained  in 
mathematics  kept  listing  different  possibilities  until  being  unable  to  come  up 
with  any  more.  A few  intriguing  patterns  do  exist  between  the  pairs,  but  no 
more  than  are  present  by  coincidence  in  the  digits  of  7r  (see  3.3— (i)). 
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Yin  and  yang  represent  complementary  aspects  of  the  elementary  forces  of 
nature,  always  in  tension,  always  changing.  The  I Ching  is  somewhat  analogous 
to  a thesaurus  in  which  the  hexagrams  serve  as  an  index  to  accumulated  wisdom 
about  fundamental  concepts  like  giving  (H),  receiving  (H),  modesty  (§-§),  joy 
(=s)i  fellowship  (^),  withdrawal  (^),  peace  (s|),  conflict  (hs),  organization 
(=!),  corruption  (O),  immaturity  (p),  elegance  (p),  etc.  One  can  choose 
a pair  of  hexagrams  at  random,  obtaining  the  second  from  the  first  by,  say, 
independently  changing  each  yin  to  yang  (or  vice  versa)  with  probability  1/4; 
this  technique  yields  4096  ways  to  ponder  existential  mysteries,  as  well  as  a 
Markov  process  by  which  change  itself  might  perhaps  give  meaning  to  life. 

A strictly  logical  way  to  arrange  the  hexagrams  was  eventually  introduced 
about  A.D.  1060  by  Shao  Yung.  His  ordering,  which  proceeded  lexicographically 
from  = § to  = = to  = = to  = = to  =-=  to  • • • to  §=  to  = (reading  each  hexagram  from 
bottom  to  top),  was  much  more  user-friendly  than  the  King  Wen  order,  because 
a random  pattern  could  now  be  found  quickly.  When  G.  W.  Leibniz  learned 
about  this  sequence  of  hexagrams  in  1702,  he  jumped  to  the  erroneous  conclusion 
that  Chinese  mathematicians  had  once  been  familiar  with  binary  arithmetic. 
[See  Frank  Swetz,  Mathematics  Magazine  76  (2003),  276-291.  Further  details 
about  the  I Ching  can  be  found,  for  example,  in  Joseph  Needham’s  Science  and 
Civilisation  in  China  2 (Cambridge  University  Press,  1956),  304-345;  R.  J.  Lynn, 
The  Classic  of  Changes  (New  York:  Columbia  University  Press,  1994).] 

Another  ancient  Chinese  philosopher,  Yang  Hsiung,  proposed  a system  based 
on  81  ternary  tetragrams  instead  of  64  binary  hexagrams.  His  Canon  of  Supreme 
Mystery , written  c.  2 B.C.,  has  recently  been  translated  into  English  by  Michael 
Nylan  (Albany,  New  York:  1993).  Yang  described  a complete,  hierarchical  ter- 
nary tree  structure  in  which  there  are  3 regions,  with  3 provinces  in  each  region, 
3 departments  in  each  province,  3 families  in  each  department,  and  9 short  poems 
called  “appraisals”  for  each  family,  hence  729  appraisals  in  all  — making  almost 
exactly  2 appraisals  for  every  day  in  the  year.  His  tetragrams  were  arranged  in 
strict  lexicographic  order  when  read  top-to-bottom:  =,  =,  =,  EFE.  EfE. 

• • • , =;:■  In  fact,  as  explained  on  page  28  of  Nylan’s  book,  Yang  presented  a simple 
way  to  compute  the  rank  of  each  tetragram,  as  if  using  a radix-3  number  system. 
Thus  he  would  not  have  been  surprised  or  impressed  by  Shao  Yung’s  systematic 
ordering  of  binary  hexagrams,  although  Shao  lived  more  than  1000  years  later. 

Indian  prosody.  Binary  n-tuples  were  studied  in  a completely  different  context 
by  pundits  in  ancient  India,  who  investigated  the  poetic  meters  of  sacred  Vedic 
chants.  Syllables  in  Sanskrit  are  either  short  (l)  or  long  (S),  and  the  study 
of  syllable  patterns  is  called  “prosody.”  Modern  writers  use  the  symbols  ^ 
and  instead  of  I and  5.  A typical  Vedic  verse  consists  of  four  lines  with 
n syllables  per  line,  for  some  n > 8;  prosodists  therefore  sought  a way  to  classify 
all  2"  possibilities.  The  classic  work  Chandahsastra  by  Pingala,  written  before 
A.D.  400  and  probably  much  earlier  (the  exact  date  is  quite  uncertain),  described 
procedures  by  which  one  could  readily  find  the  index  k of  any  given  pattern  of 
^s  and  — s,  as  well  as  to  find  the  fcth  pattern,  given  k.  In  other  words,  Pingala 
explained  how  to  rank  any  given  pattern  as  well  as  to  unrank  any  given  index; 
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thus  he  went  beyond  the  work  of  Yang  Hsiung,  who  had  considered  ranking  but 
not  unranking.  Pingala’s  methods  were  also  related  to  exponentiation,  as  we 
have  noted  earlier  in  connection  with  Algorithm  4.6. 3A. 

The  next  important  step  was  taken  by  a prosodist  named  Kedara  in  his 
work  Vrttaratnakara,  thought  to  have  been  written  in  the  8th  century.  Kedara 
gave  a step-by-step  procedure  for  listing  all  the  n-tuples  from ...  — to 

to  • • • to  . . . w,  essentially  Algorithm  7.2.1.1M  in  the  case  of  radix  2.  His 

method  may  well  have  been  the  first-ever  explicit  algorithm  for  combinatorial 
sequence  generation.  [See  B.  van  Nooten,  J.  Indian  Philos.  21  (1993),  31-50.] 

Poetic  meters  can  also  be  regarded  as  rhythms,  with  one  beat  for  each  w 
and  two  beats  for  each  — . An  n-syllable  pattern  can  involve  between  n and  2 n 
beats,  but  musical  rhythms  suitable  for  marching  or  dancing  generally  are  based 
on  a fixed  number  of  beats.  Therefore  it  was  natural  to  consider  the  set  of  all 
sequences  of  v—'S  and  s that  have  exactly  to  beats,  for  fixed  to.  Such  patterns 
are  now  called  Morse  code  sequences  of  length  to,  and  we  know  from  exercise 
4.5.3-32  that  there  are  exactly  Fm+1  of  them.  For  example,  the  21  sequences 
when  m = 7 are 


In  this  way  Indian  prosodists  were  led  to  discover  the  Fibonacci  sequence,  as  we 
have  observed  in  Section  1.2.8. 

Moreover,  the  anonymous  author  of  Prakrta  Paingala  (c.  1320)  discovered 
elegant  algorithms  for  ranking  and  unranking  with  respect  to  m-beat  rhythms. 
To  find  the  fcth  pattern,  one  starts  by  writing  down  m — s,  then  expresses  the 

difference  d = Fm  + l — k as  a sum  of  Fibonacci  numbers  FJ}  H 1-  Fh ; here  Fn 

is  the  largest  Fibonacci  number  that  is  < d and  Fh  is  the  largest  < d-Fh , etc., 
continuing  until  the  remainder  is  zero.  Then  beats  j - 1 and  j are  to  be  changed 
from  to  — , for  j = ji,  . . . , jt.  For  example,  to  get  the  5th  element  of  (2) 
we  compute  21  — 5 = 16  = 13  + 3 = Fj  + F, 4;  the  answer  is  -w — ^ — . 

A few  years  later,  Narayana  Pandita  treated  the  more  general  problem  of 
finding  all  compositions  of  to  whose  parts  are  < q,  where  q is  any  given  posi- 
tive integer.  As  a consequence  he  discovered  the  (/th-order  Fibonacci  sequence 
5.4.2 -(4),  which  was  destined  to  be  used  600  years  later  in  polyphase  sorting; 
he  also  developed  the  corresponding  ranking  and  unranking  algorithms.  [See 
Parmanand  Singh,  Historia  Mathematica  12  (1985),  229-244,  and  exercise  16.] 

Pingala  gave  special  code  names  to  all  the  three-syllable  meters, 
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and  students  of  Sanskrit  have  been  expected  to  memorize  them  ever  since. 
Somebody  long  ago  devised  a clever  way  to  recall  these  codes,  by  inventing 
the  nonsense  word  yamatarajabhanasalagam  Id  1 < 1 vfld  MHHd  IH  );  the  point 
is  that  the  ten  syllables  of  this  word  can  be  written 

ya  rna  ta  ra  ja  bha  na  sa  la  gam  ^ ^ 

and  each  three-syllable  pattern  occurs  just  after  its  code  name.  The  origin  of 
yama . . . lagam  is  obscure,  but  Subhash  Kak  [Indian  J.  History  of  Science  35 
(2000),  123-127]  has  traced  it  back  at  least  to  C.  P.  Brown’s  Sanskrit  Prosody 
(1869),  page  28;  thus  it  qualifies  as  the  earliest  known  appearance  of  a “de  Bruijn 
cycle”  that  encodes  binary  n-tuples. 

Meanwhile,  in  Europe.  In  a similar  way,  classic  Greek  poetry  was  based  on 
groups  of  short  and/or  long  syllables  called  “metrical  feet,”  analogous  to  bars  of 
music.  Each  basic  type  of  foot  acquired  a Greek  name;  for  example,  two  short 

syllables  ‘ww’  were  called  a pyrrhic , and  two  long  syllables  ‘ ’ were  called  a 

spondee , because  those  rhythms  were  used  respectively  in  a song  of  war  (7roppixr|) 
or  a song  of  peace  (crrcovSai.).  Greek  names  for  metric  feet  were  soon  assimilated 
into  Latin  and  eventually  into  modern  languages,  including  English: 


- arsis 

proceleusmatic 

— thesis 

— fourth  paeon 
third  paeon 

pyrrhic 

— — minor  ionic 

' — iambus 

second  paeon 

— trochee 

• — diiambus 

spondee 

■ antispast 

^ first  epitrite 

'-'-s'-'  tribrach 

— first  paeon 

— anapest 

— ss — choriambus 

amphibrach 

— - ditrochee 

bacchius 

— second  epitrite 

— dactyl 

w—.  major  ionic 

— ^ — amphimacer 

^ — third  epitrite 

^ palimbacchius 

fourth  epitrite 

molossus 

dispondee 

Alternative  names,  like  “choree”  instead  of  “trochee,”  or  “cretic”  instead  of 
“amphimacer,”  were  also  in  common  use.  Moreover,  by  the  time  Diomedes  wrote 
his  Latin  grammar  (approximately  A.D.  375),  each  of  the  32  five- syllable  feet 
had  acquired  at  least  one  name.  Diomedes  also  pointed  out  the  relation  between 
complementary  patterns;  he  stated  for  example  that  tribrach  and  molossus  are 
“ contrariusfi  as  are  amphibrach  and  amphimacer.  But  he  also  regarded  dactyl 
as  the  contrary  of  anapest,  and  bacchius  as  the  contrary  of  palimbacchius,  al- 
though the  literal  meaning  of  palimbacchius  is  actually  “reverse  bacchius.”  Greek 
prosodists  had  no  standard  order  in  which  to  list  the  individual  possibilities,  and 
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the  form  of  the  names  makes  it  clear  that  no  connection  to  a radix-two  number 
system  was  contemplated.  [See  H.  Keil,  Grammatici  Latini  1 (1857),  474-482; 
W.  von  Christ,  Metrik  der  Griechen  und  Romer  (1879),  78-79.] 

Surviving  fragments  of  a work  by  Aristoxenus  called  Elements  of  Rhythm 
(c.  325  B.C.)  show  that  the  same  terminology  was  applied  also  to  music.  And 
indeed,  the  same  traditions  lived  on  after  the  Renaissance;  for  example,  we  find 


Spondxus  — — 

Chorarus  — u 


on  page  32  of  Athanasius  Kircher’s  Musurgia  Universalis  2 (Rome:  1650),  and 
Kircher  went  on  to  describe  all  of  the  three- note  and  four-note  rhythms  of  (5). 

Early  lists  of  permutations.  We’ve  traced  the  history  of  formulas  for  counting 
permutations  in  Section  5.1.2;  but  nontrivial  lists  of  permutations  were  not 
published  until  hundreds  of  years  after  the  formula  n!  was  discovered.  The  first 
such  tabulation  currently  known  was  compiled  by  the  Italian  physician  Shabbet.ai 
Donnolo  in  his  commentary  on  the  kabbalistic  Sefer  Yetzirah,  written  in  A.D.  946. 
Table  1 shows  his  list  for  n = 5 as  it  was  subsequently  printed  in  Warsaw  (1884). 
(The  Hebrew  letters  in  this  table  are  typeset  in  a rabbinical  font  traditionally 
used  for  commentaries;  notice  that  the  letter  )3  changes  its  shape  to  0 when  it 
appears  at  the  left  end  of  a word.)  Donnolo  went  on  to  list  120  permutations 
of  the  six-letter  word  133MD,  all  beginning  with  shin  (£5);  then  he  noted  that 
120  more  could  be  obtained  with  each  of  the  other  five  letters  in  front,  making 
720  in  all.  His  lists  involved  groupings  of  six  permutations,  but  in  a haphazard 
fashion  that  led  him  into  error  (see  exercise  4).  Although  he  knew  how  many 
permutations  there  were  supposed  to  be,  and  how  many  should  start  with  a given 
letter,  he  evidently  didn  t have  an  algorithm  for  generating  them. 


Table  1 

A MEDIEVAL  LIST  OF  PERMUTATIONS 

.3W7  , *3)377  ,03’77  , 3»’77  ,’33377  , D’377  ,’33737  ,’7)337  ,7’)337  ,7)3’37  , 07’37  , D’737 
,37’)37  ,73’)37  ,3’7)37  ,’37)37  ,’73)37  ,7’3)37  ,73)3’7  ,37)3’7  ,7)33’7  ,3)37’7  , 037’7  ,073’7 

,7337’3  , 77)3’3  ,77)3’3  , 077’3  ,077’3  ,7)37*3  ,7’3373  ,’7)373  , D’773  ,’»773  ,07’73  ,7»’73 

,D7’73  ,7)3’73  , D’773  ,’33773  ,’7)373  ,7’»73  ,77*03  ,7’7)33  ,’77)33  ,7*7)33  ,7*7)33  ,’77)33 

,3*3377  , D’377  ,3)3’77  ,03’77  , ’33377  ,’3)377  ,7»3’7  , 073’7  ,3»7’7  ,037’7  ,73)3*7  , 37)3’7 

,’37)37  ,’73)37  , 7’3»7  ,3*7)37  , 73’»7  , 37*337  ,7*3337  ,7)3*37  , 07’37  , D’737  ,’33737  *7)337 
,37)37’  ,3)377’  ,73)37’  ,7)337’  ,0377’  ,0737’  ,737)3’  , 737)3*  ,773)3’  ,773»’  ,377)3’  ,377)3’ 

,3)377’  , 0737’  ,7)337’  ,73)37’  , 3)377’  , 0377’  , 77)33’  ,77)33’  ,7)373’  , 0773’  ,7)373’  ,0773’ 

,’773)3  ,’773)3  ,77’3»  ,7*73)3  ,77*3)3  , 7*73)3  ,3’77)3  , 37*7)3  ,73*7)3  , 7*37)3  , ’377)3  ,’737)3 

737’)3  , 377’»  ,773*33  ,773’»  , 377’»  ,737’»  ,37*7)3  ,73*7)3  , 7’37)3  , 3’77)3  ,’737)3  ,’377)3 


A complete  list  of  all  720  permutations  of  {a,  b,  c,  d,  e,  f}  appeared  on  pages 
668-671  of  Jeremias  Drexel’s  Orbis  Phaethon  (Munich:  1629;  also  on  pages  526- 
531  of  the  Cologne  edition  in  1631).  He  offered  it  as  proof  that  a man  with  six 
guests  could  seat  them  differently  at  lunch  and  dinner  every  day  for  a year  — 


7.2.1. 7 


HISTORY  AND  FURTHER  REFERENCES  491 


altogether  360  days,  because  there  were  five  days  of  fasting  during  Holy  Week. 
Shortly  afterwards,  Marin  Mersenne  exhibited  all  720  permutations  of  the  six 
tones  {ut, re, mi, fa, sol, la},  on  pages  111-115  of  his  Traitez  de  la  Voix  et  des 
Chants  (Volume  2 of  Harmonie  Universelle,  1636);  then  on  pages  117-128  he 
presented  the  same  data  in  musical  notation: 


I 

2 

3 

4 

5 

— i 

4-5.X  _j 

A. 

7>»  7*° 


—T. 

X 



Drexel’s  table  was  organized  lexicographically  by  columns;  Mersenne’s  tables 
were  lexicographic  with  respect  to  the  order  ut  < re  < mi  < fa  < sol  < la,  begin- 
ning with  “ut, re, mi, fa, sol, la”  and  ending  with  “la, sol, fa, mi, re, ut.”  Mersenne  also 
prepared  a “grand  et  immense”  manuscript  that  listed  all  40,320  permutations 
of  eight  notes  on  672  folio  pages,  followed  by  ranking  and  unranking  algorithms 
[Bibliotheque  nationale  de  France,  Fonds  Frangais,  no.  24256]. 

We  saw  in  Section  7. 2. 1.2  that  the  important  idea  of  plain  changes,  Algo- 
rithm 7.2.1.2P,  was  invented  in  England  a few  years  later. 

Methods  for  listing  all  permutations  of  a multiset  with  repeated  elements 
were  often  misunderstood  by  early  authors.  For  example,  when  Bhaskara  exhib- 
ited the  permutations  of  {4,  5,  5,  5, 8}  in  section  271  of  his  Lilavati  (c.  1150),  he 
gave  them  in  the  following  order: 


-t^y  yy 

s^yy  y 

y ^yy 

yqyyy 

yy  Yqy 

yys*y 

yy  y vc; 

yyyc;Y 

8-yc;yy 

Yyyqy 

-tyy  y.d 

syvyy 

syytty 

syyy  <r 

yttyqy 

yc;y'<ry 

yy  s-yc; 

yysy* 

y*y  ys 

ycjyy* 

Mersenne  used  a slightly  more  sensible  but  not  completely  systematic  order  on 
page  131  of  his  book  when  he  listed  sixty  anagrams  of  the  Latin  name  IESVS. 
When  Athanasius  Kircher  wanted  to  illustrate  the  30  permutations  of  a five- 
note  melody  on  pages  10  and  11  of  Musurgia  Universalis  2 (1650),  this  lack  of  a 
system  got  him  into  trouble  (see  exercise  5): 


***** 

*«***: 

*****: 

i*i*l: 

*****: 

***p: 

*****: 

:*p*a: 

:****£ 

:*****: 

:**§§: 

3****: 

:§***$: 

-Mm:\ 

But  John  Wallis  knew  better.  On  page  117  of  his  Discourse  of  Combinations 
(1685)  he  correctly  listed  the  60  anagrams  of  “messes”  in  lexicographic  order,  if 
we  let  m < e < s;  and  on  page  126  he  recommended  respecting  alphabetic  order 
“that  we  may  be  the  more  sure,  not  to  miss  any.” 

We  will  see  later  that  the  Indian  pundits  Sarngadeva  and  Narayana  had 
already  developed  a theory  of  permutation  generation  in  the  13th  and  14th 
centuries,  although  their  work  was  ahead  of  its  time  and  remained  obscure. 
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Seki  s list.  Takakazu  Seki  (1642  1708)  was  a charismatic  teacher  and  researcher 
who  revolutionized  the  study  of  mathematics  in  17th-century  Japan.  While  he 
was  studying  the  elimination  of  variables  from  simultaneous  homogeneous  equa- 
tions, he  was  led  to  expressions  such  as  oi b2  - a2bi  and  aib2c3  - aib3c2  + 
a2b3Ci  - a2bic3  + a3bxc2  - a3b2Ci,  which  we  now  recognize  as  determinants. 
In  1683  he  published  a booklet  about  this  discovery,  introducing  an  ingenious 
scheme  for  listing  all  permutations  in  such  a way  that  half  of  them  were  “alive” 
(even)  and  the  other  half  were  “dead”  (odd).  Starting  with  the  case  n — 2,  when 
12  was  alive  and  21  was  dead,  he  formulated  the  following  rules  for  n > 2: 

1)  Take  every  live  permutation  for  n-1,  increase  all  its  elements  by  1,  and  insert 
1 in  front.  This  rule  produces  (n-l)!/2  “basic  permutations”  of  {1, . . . ,n}. 

2)  From  each  basic  permutation,  form  2 n others  by  rotation  and  reflection: 

aia2  . . . an_ian,  a2  . . . «n-i®nai,  ...,  ana\a2  . . . an_1;  (8) 

anan-i ■ ■ ■ a2ai,  a1anan_1  . . . a2,  ...,  an_!  . . . a2aian.  (g) 

If  n is  odd,  those  in  the  first  row  are  alive  and  those  in  the  second  are  dead; 
if  n is  even,  those  in  each  row  are  alternatively  alive,  dead,  . . . , alive,  dead. 
For  example,  when  n = 3 the  only  basic  permutation  is  123.  Thus  123,  231, 
312  are  alive  while  321,  132,  213  are  dead,  and  we’ve  successfully  generated  the 
six  terms  of  a 3 x 3 determinant.  The  basic  permutations  for  n = 4 are  1234, 
1342,  1423;  and  from,  say,  1342  we  get  a set  of  eight,  namely 

+ 1342  - 3421  + 4213  - 2134  + 2431  - 1243  + 3124  - 4312,  (xo) 

alternately  alive  (+)  and  dead  (— ).  A 4 x 4 determinant  therefore  includes  the 
terms  a,\b3c±d2  — a3b^c2di  + ■ ■ • — a±b3c\d2  and  sixteen  others. 

Seki’s  rule  for  permutation  generation  is  quite  pretty,  but  unfortunately  it 
has  a serious  problem:  It  doesn’t  work  when  n > 4.  His  error  seems  to  have 
gone  unrecognized  for  hundreds  of  years.  [See  Y.  Mikami,  The  Development  of 
Mathematics  in  China  and  Japan  (1913),  191-199;  Takakazu  Seki’s  Collected 
Works  (Osaka:  1974),  18-20,  AS— — 0— ; and  exercises  7-8.] 

Lists  of  combinations.  The  earliest  exhaustive  list  of  combinations  known  to 
have  survived  the  ravages  of  time  appears  in  the  last  book  of  Susruta’s  well-known 
Sanskrit  treatise  on  medicine,  Chapter  63,  written  before  A.D.  600  and  perhaps 
much  earlier.  Noting  that  medicine  can  be  sweet,  sour,  salty,  peppery,  bitter, 
and/or  astringent,  Susruta’s  book  diligently  listed  the  (15,20,15,6,1,6)  cases 
that  arise  when  those  qualities  occur  two,  three,  four,  five,  six,  and  one  at  a time. 

Bhaskara  repeated  this  example  in  sections  110—114  of  Lilavatp  and  observed 
that  the  same  reasoning  applies  to  six-syllable  poetic  meters  with  a given  number 
of  long  syllables.  But  he  simply  mentioned  the  totals,  (6, 15,  20, 15,  6, 1),  without 
listing  the  combinations  themselves.  In  sections  274  and  275,  he  observed  that 
the  numbers  (n(n  - 1) . . . (n  - k + 1 ))/(k(k  - 1) . . . (1))  enumerate  compositions 
(that  is,  ordered  partitions)  as  well  as  combinations;  again  he  gave  no  list. 

To  avoid  prolixity  this  is  treated  in  a brief  manner; 
for  the  science  of  calculation  is  an  ocean  without  bounds. 

— BHASKARA  (c.  1150) 
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An  isolated  but  interesting  list  of  combinations  appeared  in  the  remarkable 
algebra  text  Al-Bahir  fi  1-hisab  (T he  Shining  Book  of  Calculation) , written  by 
al-Samaw’al  of  Baghdad  when  he  was  only  19  years  old  (1144).  In  the  closing 
part  of  that  work  he  presented  a list  of  (g0)  = 210  simultaneous  linear  equations 
in  10  unknowns: 


Al-Samaw’aFs  Arabic  original 


Xo 

Xofrr\ 

1 

V- 

Voim 

u 

Vo 

Aom\ 

■a* 

X\ 

VXAVXf 

J 

VXAVXo 

Equivalent  modern  notation 

(1)  X\  + X2  + £3  + x4  + x$  + xg  = 65 

(2)  X\  + X2  + X3  + £4  + £5  + £7  = 70 

(3)  Xl  + X2  + X3  + X4  + X5  + Xs  = 75 

(209)  x4  + x6  + x7  + Xs  + x9  + £io  = 91 

(210)  xs  + xe  + £7  + xs  + £9  + £10  = 100 


Each  combination  of  ten  things  taken  six  at  a time  yielded  one  of  his  equa- 
tions. His  purpose  was  evidently  to  demonstrate  that  over-determined  equations 
can  still  have  a unique  solution  — which  in  this  case  was  (xi,  X2, . . . , x10)  = 
(1,4,9,16,25,10,15,20,25,5).  [Salah  Ahmad  and  Roshdi  Rashed,  Al-Bahir  en 
Algebre  dAs-Samaw’al  (Damascus:  1972),  77-82,  X 1 A-XV \ .] 


Rolling  dice.  Some  glimmerings  of  elementary  combinatorics  arose  also  in 
medieval  Europe,  especially  in  connection  with  the  question  of  listing  all  possible 
outcomes  when  three  dice  are  thrown.  There  are,  of  course,  (|)  = 56  ways  to 
choose  3 things  from  6 when  repetitions  are  allowed.  Gambling  was  officially  pro- 
hibited; yet  these  56  ways  became  rather  well  known.  In  about  A.D.  965,  Bishop 
Wibold  of  Cambrai  in  northern  France  devised  a game  called  Ludus  Clericalis, 
so  that  members  of  the  clergy  could  enjoy  rolling  dice  while  remaining  pious. 
His  idea  was  to  associate  each  possible  roll  with  one  of  56  virtues,  according  to 
the  following  table: 


□ □□  love 

□ □□  faith 

□ □0  h°Pe 

□ □0  justice 
000  prudence 
0010  temperance 
000  courage 
000  peace 
000  chastity 
000  mercy 
000  obedience 
000  fear 
000  foresight 
000  discretion 


000  perseverance 
000  kindness 
000  modesty 
000  resignation 
000  gentleness 
000  generosity 
000  wisdom 
000  remorse 
000  joy 
000  sobriety 
000  satisfaction 
000  sweetness 
000  cleverness 
000  simplicity 


000  hospitality 
000  economy 
000  patience 
000  zeal 
000  poverty 
000  softness 
000  virginity 
000  respect 
000  Piety 
000  indulgence 
000  prayer 
000  affection 
000  judgment 
000  vigilance 


000  mortification 
000  innocence 
000  contrition 
000  confession 
000  maturity 
000  solicitude 
000  constancy 
000  intelligence 
000  sighing 
000  weeping 
000  cheerfulness 
000  compassion 
000  self-control 
000  humility 


Players  took  turns,  and  the  first  to  roll  each  virtue  acquired  it.  After  all  possibil- 
ities had  arisen,  the  most  virtuous  player  won.  Wibold  noted  that  love  ( caritas ) 
is  the  best  virtue  of  all.  He  gave  a complicated  scoring  system  by  which  two 
virtues  could  be  combined  if  the  sum  of  pips  on  all  six  of  their  dice  was  21;  for 
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example,  love  + humility  or  chastity  + intelligence  could  be  paired  in  this  way, 
and  such  combinations  ranked  above  any  individual  virtue.  He  also  considered 
more  complex  variants  of  the  game  in  which  vowels  appeared  on  the  dice  instead 
of  spots,  so  that  virtues  could  be  claimed  if  their  vowels  were  thrown. 

Wibold’s  table  of  virtues  was  presented  in  lexicographic  order,  as  above, 
when  it  was  first  described  by  Balderic  in  his  Chronicon  Cameracense,  about 
150  years  later.  [Patrologia  Latina  134  (Paris:  1884),  1007-1016.]  But  another 
medieval  manuscript  presented  the  possible  dice  rolls  in  quite  a different  order: 


(12) 


mm  000  000  000  000  000  000 

H300  000  000  000  000  000  000 

000  000  000  000  000  000  000 

000  000  000  000  000  000  000 

000  000  000  000  000  000  000 

000  000  000  000  000  000  000 

000  000  000  000  000  000  000 

000  000  000  000  000  000  000 

In  this  case  the  author  knew  how  to  deal  with  repeated  values,  but  had  a very 
complicated,  ad  hoc  way  to  handle  the  cases  in  which  all  dice  were  different.  [See 
D.  R.  Bellhouse,  International  Statistical  Review  68  (2000),  123-136.] 

An  amusing  poem  entitled  “Chaunce  of  the  Dyse,”  attributed  to  John 
Lydgate,  was  written  in  the  early  1400s  for  use  at  parties.  Its  opening  verses 
invite  each  person  to  throw  three  dice;  then  the  remaining  verses,  which  are 
indexed  in  decreasing  lexicographic  order  from  000  to  000  to  • ■ • to  000. 
give  56  character  sketches  that  light-heartedly  describe  the  thrower.  [The  full 
text  was  published  by  E.  P.  Hammond  in  Englische  Studien  59  (1925),  1-16: 
a translation  into  modern  English  would  be  desirable.] 


I pray  to  god  that  euery  wight  may  caste 
Vpon  three  dyse  ryght  as  is  in  hys  herte 
Whether  he  be  rechelesse  or  stedfaste 
So  moote  he  iawghen  outher  elles  smerte 
He  that  is  gilty  his  lyfe  to  converte 
They  that  in  trouthe  haue  suffred  many  a throwe 
Moote  ther  chaunce  fat  as  they  moote  be  knowe. 

— The  Chaunce  of  the  Dyse  (c.  1410) 

Ramon  Llull.  Significant  ripples  of  combinatorial  concepts  also  emanated 
from  an  energetic  and  quixotic  Catalan  poet,  novelist,  encyclopedist,  educator, 
mystic,  and  missionary  named  Ramon  Llull  (c.  1232-1316).  Llull’s  approach  to 
knowledge  was  essentially  to  identify  basic  principles  and  then  to  contemplate 
combining  them  in  all  possible  ways. 

For  example,  one  chapter  in  his  Ars  Conipendiosa  Inveniendi  Veritatem 
(c.  1274)  began  by  enumerating  sixteen  attributes  of  God:  Goodness,  greatness, 
eternity,  power,  wisdom,  love,  virtue,  truth,  glory,  perfection,  justice,  generosity, 
mercy,  humility,  sovereignty,  and  patience.  Then  Llull  wrote  (2J)  = 120  short 
essays  of  about  80  words  each,  considering  God’s  goodness  as  related  to  greatness, 


Ui 
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God  s goodness  as  related  to  eternity,  and  so  on,  ending  with  God’s  sovereignty  as 
related  to  patience.  In  another  chapter  he  considered  seven  virtues  (faith,  hope, 
charity,  justice,  prudence,  fortitude,  temperance)  and  seven  vices  (gluttony,  lust, 
greed,  sloth,  pride,  envy,  anger),  with  {™ ) = 91  subchapters  to  deal  with  each 
pair  in  turn.  Other  chapters  were  systematically  divided  in  a similar  way,  into 
(2)  = 28,  (j5)  = 105,  (2)  = 6,  and  (j6)  = 120  subsections.  (One  wonders  what 
might  have  happened  if  he  had  been  familiar  with  Wibold’s  list  of  56  virtues; 
would  he  have  produced  commentaries  on  all  (526)  = 1540  of  their  pairs?) 


•£>cus 


Ct-vm4- 


lillil 


Mm 

HBffei 


Fig.  64.  Illustrations  in  a manuscript  presented  by  Ramon  Llull  to 
the  doge  of  Venice  in  1280.  [From  his  Ars  Demonstrativa , Biblioteca 
Marciana,  VI  200,  folio  3V.] 


Llull  illustrated  his  methodology  by  drawing  circular  diagrams  like  those  in 
Figure  64.  The  left-hand  circle  in  this  illustration,  Deus,  names  sixteen  divine 
attributes  — essentially  the  same  sixteen  listed  earlier,  except  that  love  {amor) 
was  now  called  will  {voluntas) , and  the  final  four  were  now  respectively  simplicity, 
rank,  mercy,  and  sovereignty.  Each  attribute  is  assigned  a code  letter,  and 
the  illustration  depicts  their  interrelations  as  the  complete  graph  Ki6  on  ver- 
tices (B,  C,  D,  E,  F,  G,  H,  I,  K,  L,  M,  N,  O,  P,  Q,  R).  The  right-hand  figure,  virtutes 
et  vitia , shows  the  seven  virtues  ( b , c,  d,  e,  /,  g , h)  interleaved  with  the  seven  vices 
(■ i,k,l,m,n,o,p );  in  the  original  manuscript  virtues  appeared  in  blue  ink  while 
vices  appeared  in  red.  Notice  that  in  this  case  his  illustration  depicted  two 
independent  complete  graphs  K7,  one  of  each  color.  (He  no  longer  bothered  to 
compare  each  individual  virtue  with  each  individual  vice,  since  every  virtue  was 
clearly  better  than  every  vice.) 

Llull  used  the  same  approach  to  write  about  medicine:  Instead  of  juxta- 
posing theological  concepts,  his  Liber  Principiorum  Medicinse  (c.  1275)  con- 
sidered combinations  of  symptoms  and  treatments.  And  he  also  wrote  books 
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Fig.  65.  Llullian  illustrations 
from  a manuscript  presented  to 
the  queen  of  France,  c.  1325. 
[Badische  Landesbibliot.hek  Karls- 
ruhe, Codex  St.  Peter  perg.  92, 
folios  28v  and  39v.l 
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on  philosophy,  logic,  jurisprudence,  astrology,  zoology,  geometry,  rhetoric,  and 
chivalry  — more  than  200  works  in  all.  It  must  be  admitted,  however,  that  much 
of  this  material  was  highly  repetitive;  modern  data  compression  techniques  would 
probably  reduce  Llull’s  output  to  a size  much  less  than  that  of,  say,  Aristotle. 

He  eventually  decided  to  simplify  his  system  by  working  primarily  with 
groups  of  nine  things.  See,  for  example,  Fig.  65,  where  circle  A now  lists  only  the 
first  nine  of  God’s  attributes  (B,  C,  D,  E,  F,  G,  H,  I,  K).  The  (®)  = 36  associated 
pairs  (BC,  BD, . . . , IK)  appear  in  the  stairstep  chart  at  the  right  of  that  circle.  By 
adding  two  more  virtues,  namely  patience  and  compassion  — as  well  as  two  more 
vices,  namely  lying  and  inconsistency  — he  could  treat  virtues  vis-a-vis  virtues 
and  vices  vis-a-vis  vices  with  the  same  chart.  He  also  proposed  using  the  same 
chart  to  carry  out  an  interesting  scheme  for  voting,  in  an  election  with  nine 
candidates  [see  I.  McLean  and  J.  London,  Studia  Lulliana  32  (1992),  21-37], 

The  encircled  triangles  at  the  lower  left  of  Fig.  65  illustrate  another  key 
aspect  of  Llull  s approach.  Triangle  (B,C,D)  stands  for  (difference,  concordance, 
contrariness);  triangle  (E,  F,  G)  stands  for  (beginning,  middle,  ending);  and  trian- 
gle (H,  I,  K)  stands  for  (greater,  equal,  less).  These  three  interleaved  appearances 
of  A3  represent  three  kinds  of  three- valued  logic.  Llull  had  experimented  earlier 
with  other  such  triplets,  notably  ‘(true,  unknown,  false)’.  We  can  get  an  idea 
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of  how  he  used  the  triangles  by  considering  how  he  dealt  with  combinations  of 
the  four  basic  elements  (earth,  air,  fire,  water):  All  four  elements  are  different; 
earth  is  concordant  with  fire,  which  concords  with  air,  which  concords  with 
water,  which  concords  with  earth;  earth  is  contrary  to  air,  and  fire  is  contrary 
to  water;  these  considerations  complete  an  analysis  with  respect  to  triangle 
(B,  C,  D).  Turning  to  triangle  (E,  F,  G),  he  noted  that  various  processes  in  nature 
begin  with  one  element  dominating  another;  then  a transition  or  middle  state 
occurs,  until  a goal  is  reached,  like  air  becoming  warm.  For  triangle  (H,I,K)  he 
said  that  in  general  we  have  fire  > air  > water  > earth  with  respect  to  their 
"spheres,”  their  “velocities,”  and  their  “nobilities”;  nevertheless  we  also  have, 
for  example,  air  > fire  with  respect  to  supporting  life,  while  air  and  fire  have 
equal  value  when  they  are  working  together. 

Llull  provided  the  vertical  table  at  the  right  of  Fig.  65  as  a further  aid.  (See 
exercise  11  below.)  He  also  introduced  movable  concentric  wheels,  labeled  with 
the  letters  (B,  C,  D,  E,  F,  G,  H,  I,  K)  and  with  other  names,  so  that  many  things 
could  be  contemplated  simultaneously.  In  this  way  a faithful  practitioner  of 
the  Llullian  art  could  be  sure  to  have  all  the  bases  covered.  [Llull  may  have 
seen  similar  wheels  that  were  used  in  nearby  Jewish  communities;  see  M.  Idel, 
J.  Warburg  and  Courtauld  Institutes  51  (1988),  170-174  and  plates  16-17.] 

Several  centuries  later,  Athanasius  Kircher  published  an  extension  of  Llull’s 
system  as  part  of  a large  tome  entitled  Ars  Magna  Sciendi  sive  Combinatoria 
(Amsterdam:  1669),  with  five  movable  wheels  accompanying  page  173  of  that 
book.  Kircher  also  extended  Llull’s  repertoire  of  complete  graphs  Kn  by  provid- 
ing illustrations  of  complete  bipartite  graphs  Km^n\  for  example,  Fig.  66  is  taken 
from  page  171  of  Kircher’s  book,  and  his  page  170  contains  a glorious  picture 
°f  A 18,  is  - 

A 


© 

□ 


Fig.  66.  A'9,9  as  pre- 
sented by  Athanasius 
Kircher  in  1669. 


It  is  an  investigative  and  inventive  art. 
When  ideas  are  combined  in  all  possible  ways, 
the  new  combinations  start  the  mind  thinking  along  novel  channels 
and  one  is  led  to  discover  fresh  truths  and  arguments. 
— MARTIN  GARDNER,  Logic  Machines  and  Diagrams  (1958) 

The  most  extensive  modern  development  of  Llull-like  methods  is  perhaps 
The  Schillinger  System  of  Musical  Composition  by  Joseph  Schillinger  (New  York: 
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Carl  Fischer,  1946),  a remarkable  two- volume  work  that  presents  theories  of 
rhythm,  melody,  harmony,  counterpoint,  composition,  orchestration,  etc.,  from 
a combinatorial  perspective.  On  page  56,  for  example,  Schillinger  lists  the  24 
permutations  of  {a,b,  c,  d}  in  the  Gray-code  order  of  plain  changes  (Algorithm 
7.2.1.2P);  then  on  page  57  he  applies  them  not  to  pitches  but  rather  to  rhythms, 
to  the  durations  of  notes.  On  page  364  he  exhibits  the  symmetrical  cycle 


(2,  0,  3,  4,  2,  5,  6,  4,  0,  1,  6,  2,  3,  1,  4,  5,  3,  6,  0,  5,  1), 


(13) 


a universal  cycle  of  2-combinations  for  the  seven  objects  {0, 1,  2, 3, 4,  5,  6};  in 
other  words,  (13)  is  an  Eulerian  trail  in  K7:  All  Q = 21  pairs  of  digits  occur 
exactly  once.  Such  patterns  are  grist  for  a composer’s  mill.  But  we  can  be 
grateful  that  Schillinger’s  better  students  (like  George  Gershwin)  did  not  commit 
themselves  entirely  to  a strictly  mathematical  sense  of  aesthetics. 


Tacquet,  van  Schooten,  and  Izquierdo.  Three  additional  books  related  to 
our  story  were  published  during  the  1650s.  Andre  Tacquet  wrote  a popular  text, 
Arithmeticse  Theoria  et  Praxis  (Louvain:  1656),  that  was  reprinted  and  revised 
often  during  the  next  fifty  years.  Near  the  end,  on  pages  376  and  377,  he  gave  a 
procedure  for  listing  combinations  two  at  a time,  then  three  at  a time,  etc. 

Frans  van  Schooten’s  Exercitationes  Mathematics  (Leiden:  1657)  was  more 
advanced.  On  page  373  he  listed  all  combinations  in  an  appealing  layout 


a 

b.  ab  . 

c.  ac.  be.  abc 04/ 

d.  ad.  bd.  abd.  cd.  acd.  bed.  abed 

and  he  proceeded  on  the  next  few  pages  to  extend  this  pattern  to  the  letters  e, 
f , g,  h,  i,  k,  et  sic  in  infinitum.  On  page  376  he  observed  that  one  can  replace 
(a,  b,  c,  d)  by  (2,  3,5,  7)  in  (14)  to  get  the  divisors  of  210  that  exceed  unity: 

2 

3 6 

5 10  15  30  05) 

7 14  21  42  35  70  105  210 

And  on  the  following  page  he  extended  the  idea  to 


a 

a.  aa 

b.  ab.  aab 

c.  ac.  aac.  be.  abc.  aabc 

thereby  allowing  two  a’s.  He  didn’t  really  understand  this  extension,  though;  his 
next  example 


(16) 


a 

a.  aa 

a.  aaa  (Xy) 

b.  ab.  aab.  aaab 
b.  bb.  abb.  aabb.  aaabb 

was  botched,  indicating  the  limits  of  his  knowledge  at  the  time.  (See  exercise  13.) 
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On  page  411  van  Schooten  observed  that  the  weights  ( a,b,c,d ) = (1,2,4, 8) 
could  be  assigned  in  (14),  leading  to 


1 

2 3 

4 5 6 7 

8 9 10  11  12  13  14  15 


(18) 


after  addition.  But  he  didn’t  see  the  connection  with  radix-2  numbers. 

Sebastian  Izquierdo’s  two-volume  work  Pharus  Scientiarum  (Lyon:  1659), 
“The  Lighthouse  of  Science,”  included  a nicely  organized  discussion  of  combina- 
torics entitled  Disputatio  29,  De  Combinatione.  He  gave  a detailed  discussion  of 
four  key  parts  of  Stanley’s  Twelvefold  Way,  namely  the  n-tuples,  n-variations, 
n- multicombinations,  and  n-combinations  of  to  objects  that  appear  in  the  first 
two  rows  and  the  first  two  columns  of  Table  7.2. 1.4-  1. 

In  Sections  81-84  of  De  Combinatione  he  listed  all  combinations  of  m letters 
taken  n at  a time,  for  2 < n < 5 and  n < m < 9,  always  in  lexicographic  order; 
he  also  tabulated  them  for  to  = 10  and  20  in  the  cases  n = 2 and  3.  But  when 
he  listed  the  to-  variations  of  m things  taken  n at  a time,  he  chose  a more 
complicated  ordering  (see  exercise  14). 

Izquierdo  was  first  to  discover  the  formula  (m+"_1)  for  combinations  of  m 
things  taken  n at  a time  with  unlimited  repetition;  this  rule  appeared  in  §48— §51 
of  his  work.  But  in  §105,  when  he  attempted  to  list  all  such  combinations  in  the 
case  n = 3,  he  didn’t  know  that  there  was  a simple  way  to  do  it.  In  fact,  his 
listing  of  the  56  cases  for  m — 6 was  rather  like  the  old,  awkward  ordering  of  (12). 

Combinations  with  repetition  were  not  well  understood  until  James  Ber- 
noulli’s Ars  Conjectandi , “The  Art  of  Guessing,”  came  out  in  1713.  In  Part  2, 
Chapter  5,  Bernoulli  simply  listed  the  possibilities  in  lexicographic  order,  and 
showed  that  the  formula  (m+” ~1)  follows  by  induction  as  an  easy  consequence. 
[Niccolo  Tartaglia  had,  incidentally,  come  close  to  discovering  this  formula  in  his 
General  trattato  di  numeri,  et  misure  2 (Venice:  1556),  17r  and  69v;  so  had  the 
Maghrebi  mathematician  Ibn  Mun‘im  in  his  13th-century  Fiqh  al-Hisab .] 

The  null  case.  Before  we  conclude  our  discussion  of  early  work  on  combinations, 
we  should  not  forget  a small  yet  noble  step  taken  by  John  Wallis  on  page  110 
of  his  Discourse  of  Combinations  (1685),  where  he  specifically  considered  the 
combination  of  to  things  taken  0 at  a time:  “It  is  manifest,  That,  if  we  would 
take  None,  that  is,  if  we  would  leave  All ; there  can  be  but  one  case  thereof,  what 
ever  be  the  Number  of  things  exposed.”  Furthermore,  on  page  113,  he  knew  that 
(0)  = 1:  “(for,  here,  to  take  all,  or  to  leave  all,  is  but  one  and  the  same  case).” 

However,  when  he  gave  a table  of  n!  for  n < 24,  he  did  not  go  so  far  as  to 
point  out  that  0!  = 1,  or  that  there  is  exactly  one  permutation  of  the  empty  set. 

The  work  of  Narayana.  A remarkable  monograph  entitled  Ganita  KaumudI 
(“Lotus  Delight  of  Calculation”),  written  by  Narayana  Pandita  in  1356,  has 
recently  become  known  in  detail  to  scholars  outside  of  India  for  the  first  time, 
thanks  to  an  English  translation  by  Parmanand  Singh  [Ganita  Bharatl20  (1998), 
25-82;  21  (1999),  10-73;  22  (2000),  19-85;  23  (2001),  18-82;  24  (2002),  35-98]; 
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see  also  the  Ph.D.  thesis  of  Takanori  Kusuba,  Brown  University  (1993).  Chap- 
ter 13  of  Narayana’s  work,  subtitled  Anka  Pasa  (“Concatenation  of  Numbers”), 
was  devoted  to  combinatorial  generation.  Indeed,  although  the  97  “sutras”  of  this 
chapter  were  rather  cryptic,  they  presented  a comprehensive  theory  of  the  subject 
that  anticipated  developments  in  the  rest  of  the  world  by  several  hundred  years. 

For  example,  Narayana  dealt  with  permutation  generation  in  sutras  49-55a, 
where  he  gave  algorithms  to  list  all  permutations  of  a set  in  decreasing  colex 
order,  together  with  algorithms  to  rank  a given  permutation  and  to  unrank 
a given  serial  number.  These  algorithms  had  appeared  more  than  a century 
earlier  in  the  well-known  work  Sangltaratnakara  (“Jewel-Mine  of  Music”)  by 
Sariigadeva,  §1.4.60-71,  who  thereby  had  essentially  discovered  the  factorial 
representation  of  positive  integers.  Narayana  went  on  in  sutras  57-60  to  extend 
Sarngadeva  s algorithms  so  that  general  multisets  could  readily  be  permuted;  for 
example,  he  listed  the  permutations  of  {1, 1,2,4}  as 

1124, 1214,  2114, 1142, 1412, 4112, 1241,  2141, 1421, 4121, 2411, 4211, 
again  in  decreasing  colex  order. 

Narayana’s  sutras  88-92  dealt  with  systematic  generation  of  combinations. 
Besides  illustrating  the  combinations  of  {1, . . . , 8}  taken  3 at  a time,  namely 

(678,  578,  478,  . . . , 134,  124,  123), 

he  also  considered  a bit-string  representation  of  these  combinations  in  the  reverse 
order  ( increasing  colex),  extending  a 10th-century  method  of  Bhattotpala: 

(11100000,  11010000,  10110000,  ...,00010011,  00001011,  00000111). 

He  almost,  but  not  quite,  discovered  Theorem  7.2.1.3L. 

Permutable  poetry.  Let’s  turn  now  to  a curious  question  that  attracted 
the  attention  of  several  prominent  mathematicians  in  the  seventeenth  century, 
because  it  sheds  considerable  light  on  the  state  of  combinatorial  knowledge  in 
Europe  at  that  time.  A Jesuit  priest  named  Bernard  Bauhuis  had  composed  a 
famous  one-line  tribute  to  the  Virgin  Mary,  in  Latin  hexameter: 

Tot  tibi  sunt  dotes,  Virgo,  quot  sidera  caelo.  (ig) 

[“Thou  hast  as  many  virtues,  O Virgin,  as  there  are  stars  in  heaven”;  see 
his  Epigramrnatum  Libri  V (Cologne:  1615),  49.]  His  verse  inspired  Erycius 
Puteanus,  a professor  at  the  University  of  Louvain,  to  write  a book  entitled 
Pietatis  Thaumata  (Antwerp:  1617),  presenting  1022  permutations  of  Bauhuis’s 
words.  For  example,  Puteanus  wrote 

107  Tot  dotes  tibi,  quot  caelo  sunt  sidera,  Virgo. 

270  Dotes  tot,  caelo  sunt  sidera  quot,  tibi  Virgo. 

329  Dotes,  caelo  sunt  quot  sidera,  Virgo  tibi  tot. 

384  Sidera  quot  caelo,  tot  sunt  Virgo  tibi  dotes.  (20) 

725  Quot  caelo  sunt  sidera,  tot  Virgo  tibi  dotes. 

949  Sunt  dotes  Virgo,  quot  sidera,  tot  tibi  caelo. 

1022  Sunt  caelo  tot  Virgo  tibi,  quot  sidera,  dotes. 
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He  stopped  at  1022,  because  1022  was  the  number  of  visible  stars  in  Ptolemy’s 
well-known  catalog  of  the  heavens. 

The  idea  of  permuting  words  in  this  way  was  well  known  at  the  time;  such 
wordplay  was  what  Julius  Scaliger  had  called  “Proteus  verses”  in  his  Poetices 
Libri  Septem  (Lyon:  1561),  Book  2,  Chapter  30.  The  Latin  language  lends  itself 
to  permutations  like  (20),  because  Latin  word  endings  tend  to  define  the  function 
of  each  noun,  making  the  relative  word  order  much  less  important  to  the  meaning 
of  a sentence  than  it  is  in  English.  Puteanus  did  state,  however,  that  he  had 
specifically  avoided  unsuitable  permutations  such  as 

Sidera  tot  caelo,  Virgo,  quot  sunt  tibi  dotes,  (21) 

because  they  would  place  an  upper  bound  on  the  Virgin’s  virtues  rather  than  a 
lower  bound.  [See  pages  12  and  103  of  his  book.] 

Of  course  there  are  8!  = 40,320  ways  to  permute  the  words  of  (19).  But 
that  wasn’t  the  point;  most  of  those  ways  don’t  “scan.”  Each  of  Puteanus’s  1022 
verses  obeyed  the  strict  rules  of  classical  hexameter,  the  rules  that  had  been 
followed  by  Greek  and  Latin  poets  since  the  days  of  Homer  and  Vergil,  namely: 

i)  Each  word  consists  of  syllables  that  are  either  long  ( — ) or  short  (w). 

ii)  The  syllables  of  each  line  belong  to  one  of  32  patterns, 

{ ~}  {^"}  {™}  — {--}■  (22) 

In  other  words  there  are  six  metrical  feet,  where  each  of  the  first  four  is  either  a 
dactyl  or  a spondee  in  the  terminology  of  (5);  the  fifth  foot  should  be  a dactyl, 
and  the  last  is  either  trochee  or  spondee. 

The  rules  for  long  versus  short  syllables  in  Latin  poetry  are  somewhat  tricky 
in  general,  but  the  eight  words  of  Bauhuis’s  verse  can  be  characterized  by  the 
following  patterns: 

tot  = — , tibi  = j>,  sunt  = — , dotes  = , 

Virgo  = quot  = — , sidera  — — ww,  caelo  = . (23) 

Notice  that  poets  had  two  choices  when  they  used  the  words  ‘tibi’  or  ‘Virgo’. 
Thus,  for  example,  (19)  fits  the  hexameter  pattern 


Tot  ti-bi  sunt  do-  tes,  Vir-  go,  quot  si-de-ra  cae-lo.  (24) 

(Dactyl,  spondee,  spondee,  spondee,  dactyl,  spondee;  “dum-diddy  dum-dum 
dum-dum  dum-dum  dum-diddy  dum-dum.”  The  commas  represent  slight  pauses, 
called  “caesuras,”  when  the  words  are  read;  they  don’t  concern  us  here,  although 
Puteanus  inserted  them  carefully  into  each  of  his  1022  permutations.) 

A natural  question  now  arises:  If  we  permute  Bauhuis’s  words  at  random, 
what  are  the  odds  that  they  scan?  In  other  words,  how  many  of  the  permutations 
obey  rules  (i)  and  (ii),  given  the  syllable  patterns  in  (23)?  G.  W.  Leibniz  raised 
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this  question,  among  others,  in  his  Dissertatio  de  Arte  Combinatoria  (1666),  a 
work  published  when  he  was  applying  for  a position  at  the  University  of  Leipzig. 
At  this  time  Leibniz  was  just  19  years  old,  largely  self-taught,  and  his  under- 
standing of  combinatorics  was  quite  limited;  for  example,  he  believed  that  there 
are  600  permutations  of  {ut,  ut,  re,  mi,  fa,  sol}  and  480  of  {ut,  ut,  re,  re,  mi,  fa}, 
and  he  even  stated  that  (22)  represents  76  possibilities  instead  of  32.  [See  §5  and 
§8  in  his  Problem  6.] 

But  Leibniz  did  realize  that  it  would  be  worthwhile  to  develop  general 
methods  for  counting  all  permutations  that  are  “useful,”  in  situations  when 
many  permutations  are  “useless.”  He  considered  several  examples  of  Proteus 
verses,  enumerating  some  of  the  simpler  ones  correctly  but  making  many  errors 
when  the  words  were  complicated.  Although  he  mentioned  Puteanus’s  work,  he 
didn’t  attempt  to  enumerate  the  scannable  permutations  of  (19). 

A much  more  successful  approach  was  introduced  a few  years  later  by  Jean 
Prestet  in  his  Elemens  des  Mathematiques  (Paris:  1675),  342-438.  Prestet  gave 
a clear  exposition  leading  to  the  conclusion  that  exactly  2196  permutations  of 
Bauhuis’s  verse  would  yield  a proper  hexameter.  However,  he  soon  realized  that 
he  had  forgotten  to  count  quite  a few  cases—  including  those  numbered  270, 
384,  and  725  in  (20).  So  he  completely  rewrote  this  material  when  he  published 
Nouveaux  Elemens  des  Mathematiques  in  1689.  Pages  127-133  of  Prestet ’s  new 
book  were  devoted  to  showing  that  the  true  number  of  scannable  permutations 
was  3276,  almost  50%  larger  than  his  previous  total. 

Meanwhile  John  Wallis  had  treated  the  problem  in  his  Discourse  of  Combi- 
nations (London:  1685),  118-119,  published  as  a supplement  to  his  Treatise  of 
Algebra.  After  explaining  why  he  believed  the  correct  number  to  be  3096,  Wallis 
admitted  that  he  may  have  overlooked  some  possibilities  and/or  counted  some 
cases  more  than  once;  “but  I do  not,  at  present,  discern  either  the  one  and  other.” 

An  anonymous  reviewer  of  Wallis’s  work  remarked  that  the  true  number  of 
metrically  correct  permutations  was  actually  2580  -but  he  gave  no  proof  [Acta 
Eruditorum  5 (1686),  289].  The  reviewer  was  almost  certainly  G.  W.  Leibniz 
himself,  although  no  clue  to  the  reasoning  behind  the  number  2580  has  been 
found  among  Leibniz’s  voluminous  unpublished  notes. 

Finally  James  Bernoulli  entered  the  picture.  In  his  inaugural  lecture  as 
Dean  of  Philosophy  at  the  University  of  Basel,  1692,  he  mentioned  the  tot- 
tibi  enumeration  problem  and  stated  that  a careful  analysis  is  necessary  to 
obtain  the  correct  answer  — which,  he  said,  was  3312(1).  His  proof  appeared 
posthumously  in  the  first  edition  of  his  Ars  Conjectandi  (1713),  79-81.  [Those 
pages  were,  incidentally,  omitted  from  later  editions  of  that  famous  book,  and 
from  his  collected  works,  because  he  didn’t  actually  intend  them  for  publication; 
a proofreader  had  inserted  them  by  mistake.  See  Die  Werke  von  Jakob  Bernoulli 
3 (Basel:  Birkhauser,  1975),  78,  98-106,  108,  154-155.] 

So  who  was  right?  Are  there  2196  scannable  permutations,  or  3276,  or  3096. 
or  2580,  or  3312?  W.  A.  Whitworth  and  W.  E.  Hartley  considered  the  question 
anew  in  The  Mathematical  Gazette  2 (1902),  227-228,  where  they  each  presented 
elegant  arguments  and  concluded  that  the  true  total  was  in  fact  none  of  the 
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above.  Their  joint  answer,  2880,  represented  the  first  time  that  any  two  math- 
ematicians had  independently  come  to  the  same  conclusion  about  this  problem. 

But  exercises  21  and  22,  below,  reveal  the  truth:  Bernoulli  is  vindicated, 
and  everybody  else  was  wrong.  Moreover,  a study  of  Bernoulli’s  systematic 
and  carefully  indented  3-page  derivation  indicates  that  he  was  successful  chiefly 
because  he  adhered  faithfully  to  a discipline  that  we  now  call  the  backtrack 
method.  We  shall  study  the  backtrack  method  thoroughly  in  Section  7.2.2,  where 
we  will  also  see  that  the  tot-tibi  question  is  readily  solved  as  a special  case  of 
the  exact  cover  problem. 

Even  the  wisest  and  most  prudent  people  often  suffer  from 
what  Logicians  call  insufficient  enumeration  of  cases. 

— JAMES  BERNOULLI  (1692) 

Set  partitions.  The  partitions  of  a set  seem  to  have  been  studied  first  in  Japan, 
where  a parlor  game  called  genji-ko  (“Genji  incense”)  became  popular  among 
upperclass  people  about  A.D.  1500.  The  host  of  a gathering  would  secretly  select 
five  packets  of  incense,  some  of  which  might  be  identical,  and  he  would  burn 
them  one  at  a time.  The  guests  would  try  to  discern  which  of  the  scents  were 
the  same  and  which  were  different;  in  other  words,  they  would  try  to  guess  which 
of  the  — 52  partitions  of  {1,2, 3, 4, 5}  had  been  chosen  by  their  host. 


Jffl_ 

fill  I Irhi 


Fig.  67.  Diagrams  used  to  represent  set  partitions 
in  16th  century  Japan.  [From  a copy  in  the  collec- 
tion of  Tamaki  Yano  at  Saitama  University.] 


Soon  it  became  customary  to  represent  the  52  possible  outcomes  by  diagrams 
like  those  in  Fig.  67.  For  example,  the  uppermost  diagram  of  that  illustration, 
when  read  from  right  to  left,  would  indicate  that  the  first  two  scents  are  identical 
and  so  are  the  last  three;  thus  the  partition  is  1 2 1 345.  The  other  two  diagrams, 
similarly,  are  pictorial  ways  to  represent  the  respective  partitions  1 24  | 35  and 
1 1 24 1 35.  As  an  aid  to  memory,  each  of  the  52  patterns  was  named  after  a 
chapter  of  Lady  Murasaki’s  famous  11th-century  Tale  of  Genji,  according  to  the 
following  sequence  [Encyclopedia  Japonicae  (Tokyo:  Sanseido,  1910),  1299]: 


iiiii  inn  iioi  inn  mu  urn  urn  mo  mn  nm  m inn  inn 

mu  imi  inn  nm  nm  imi  mu  nm  mu  n mu  nm  imi 

nm  mn  mu  n mu  mu  mu  mn  mu  mn  n nm  n 

nil  mu  nil  mu  n nm  n n n n 11 

(Once  again,  as  we’ve  seen  in  many  other  examples,  the  possibilities  were  not 
arranged  in  any  particularly  logical  order.) 
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The  appealing  nature  of  these  genji-ko  patterns  led  many  families  to  adopt 
them  as  heraldic  crests.  For  example,  the  following  stylized  variants  of  (25)  were 
found  in  standard  catalogs  of  kimono  patterns  early  in  the  20th  century: 


[See  Fumie  Adachi,  Japanese  Design  Motifs  (New  York:  Dover,  1972),  150-153.] 

Early  in  the  1700s,  Takakazu  Seki  and  his  students  began  to  investigate  the 
number  of  set  partitions  ron  for  arbitrary  n,  inspired  by  the  known  result  that 
w5  = 52.  Yoshisuke  Matsunaga  found  formulas  for  the  number  of  set  partitions 
when  there  are  kj  subsets  of  size  rij  for  1 < j < t,  with  kxn  1 + • • ■ + ktnt  = n 
(see  the  answer  to  exercise  1.2.5-21).  He  also  discovered  the  basic  recurrence 
relation  7.2.1.5-(i4),  namely 

Wn+1  = (o)®7”  + + (9)ron-2  H F (n)070’  ( 2 6) 

by  which  the  values  of  wn  can  readily  be  computed. 

Matsunaga’s  discoveries  remained  unpublished  until  Yoriyuki  Arima’s  book 
Shuki  Sanpd  came  out  in  1769.  Problem  56  of  that  book  asked  the  reader  to 
solve  the  equation  “wn  — 678570’  for  n;  and  Arima’s  answer,  worked  out  in 
detail  (with  credit  duly  given  to  Matsunaga),  was  n ~ 11. 

Shortly  afterwards,  Masanobu  Saka  studied  the  number  {”}  of  ways  that 
an  n-set  can  be  partitioned  into  k subsets,  in  his  work  Sanpo-Gakkai  (1782).  He 
discovered  the  recurrence  formula 

{ D = fc{fc}+<L-i}’  (2?) 

and  tabulated  the  results  for  n < 11.  James  Stirling,  in  his  Methodus  Differen- 
tial^ (1730),  had  discovered  the  numbers  {"}  in  a purely  algebraic  context;  thus 
Saka  was  the  first  person  to  realize  their  combinatorial  significance. 

An  interesting  algorithm  for  listing  set  partitions  was  subsequently  devised 
by  Toshiaki  Honda  (see  exercise  24).  Further  details  about  genji-ko  and  its  rela- 
tion to  the  history  of  mathematics  can  be  found  in  Japanese  articles  by  Tamaki 
Yano,  Sugaku  Seminar  34,11  (Nov.  1995),  58-61;  34,12  (Dec.  1995),  56-60. 

Set  partitions  remained  virtually  unknown  in  Europe  until  much  later,  ex- 
cept for  three  isolated  incidents.  First,  George  and/or  Richard  Puttenham 
published  The  Arte  of  English  Poesie  in  1589,  and  pages  70-72  of  that  book 
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contain  diagrams  similar  to  those  of  genji-ko.  For  example,  the  seven  diagrams 


were  used  to  illustrate  possible  rhyme  schemes  for  5-line  poems,  “whereof  some 
of  them  be  harsher  and  unpleasaunter  to  the  eare  then  other  some  be.”  But  this 
visually  appealing  list  was  incomplete  (see  exercise  25). 

Second,  an  unpublished  manuscript  of  G.  W.  Leibniz  from  the  late  1600s 
shows  that  he  had  tried  to  count  the  number  of  ways  to  partition  {l,...,n} 
into  three  or  four  subsets,  but  with  almost  no  success.  He  enumerated  {”}  by 
a very  cumbersome  method,  which  would  not  have  led  him  to  see  readily  that 
{”}  = 2n_1  - 1.  He  attempted  to  compute  {”}  and  {"}  only  for  n < 5,  and 
made  several  numerical  slips  leading  to  incorrect  answers.  [See  E.  Knobloch, 
Studia  Leibnitiana  Supplementa  11  (1973),  229-233;  16  (1976),  316-321.] 

The  third  European  appearance  of  set  partitions  had  a completely  different 
character.  John  Wallis  devoted  the  third  chapter  of  his  Discourse  of  Combina- 
tions (1685)  to  questions  about  “aliquot  parts,"  the  proper  divisors  of  numbers, 
and  in  particular  he  studied  the  set  of  all  ways  to  factorize  a given  integer.  This 
question  is  equivalent  to  the  study  of  multiset  partitions;  for  example,  the  factor- 
izations of  p3q2r  are  essentially  the  same  as  the  partitions  of  {p,p,p,  q,  q,  r},  when 
p,  q,  and  r are  prime  numbers.  Wallis  devised  an  excellent  algorithm  for  listing 
all  factorizations  of  a given  integer  n,  essentially  anticipating  Algorithm  7.2.1.5M 
(see  exercise  28).  But  he  didn’t  investigate  the  important  special  cases  that  arise 
when  n is  the  power  of  a prime  (equivalent  to  integer  partitions)  or  when  n is 
squarefree  (equivalent  to  set  partitions).  Thus,  although  Wallis  was  able  to  solve 
the  more  general  problem,  its  complexities  paradoxically  deflected  him  from  dis- 
covering partition  numbers,  Bell  numbers,  or  Stirling  subset  numbers,  or  from  de- 
vising simple  algorithms  that  would  generate  integer  partitions  or  set  partitions. 

Integer  partitions.  Partitions  of  integers  arrived  on  the  scene  even  more  slowly. 
We  saw  above  that  Bishop  Wibold  (c.  965)  knew  the  partitions  of  n into  exactly 
three  parts  < 6.  So  did  Galileo,  who  wrote  a memo  about  them  (c.  1627)  and  also 
studied  their  frequency  of  occurrence  as  rolls  of  three  dice.  [“Sopra  le  scoperte  de 
i dadi,  in  Galileo’s  Opere,  Volume  8,  591-594;  he  listed  partitions  in  decreasing 
lexicographic  order.]  Thomas  Harriot,  in  unpublished  work  a few  years  earlier, 
had  considered  up  to  six  dice  [see  J.  Stedall,  Historia  Math.  34  (2007),  398], 

Mersenne  listed  the  partitions  of  9 into  any  number  of  parts,  on  page  130  of 
his  Traitez  de  la  Voix  et  des  Chants  (1636).  With  each  partition  9 = cq  + ■ • • + ak 
he  also  computed  the  multinomial  coefficient  9!/ (cq ! . . . a /.■!);  as  we’ve  seen  earlier, 
he  was  interested  in  counting  various  melodies,  and  he  knew  for  example  that 
there  are  9!/ (3!  3!  3!)  = 1680  melodies  on  the  nine  notes  {a,  a,  a,  b,  b,  6,  c,  c,  c}. 
But  he  failed  to  mention  the  cases  8 + 1 and  3 + 2+1  + 1 + 1 + 1,  probably 
because  he  hadn  t listed  the  possibilities  in  any  systematic  way. 

Leibniz  considered  two-part  partitions  in  Problem  3 of  his  Dissertatio  de 
Arte  Combinatoria  (1666),  and  his  unpublished  notes  show  that  he  subsequently 
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spent  considerable  time  trying  to  enumerate  the  partitions  that  have  three  or 
more  summands.  He  called  them  “discerptions,”  or  (less  frequently)  “divul- 
sions”  — in  Latin  of  course  — or  sometimes  “sections”  or  “dispersions”  or  even 
“partitions.”  He  was  interested  in  them  primarily  because  of  their  connection 
with  the  monomial  symmetric  functions  xaT  x“2  ....  But  his  many  attempts 
led  to  almost  total  failure,  except  in  the  case  of  three  summands,  when  he  almost 
(but  not  quite)  discovered  the  formula  for  |"|  in  exercise  7.2.1.4-31.  For  example, 
he  carelessly  counted  only  21  partitions  of  8,  forgetting  the  case  2 + 2 + 2 + 1 + 1; 
and  he  got  only  26  for  p(9),  after  missing  3 + 2 + 2 + 2,  3 + 2 + 2+1  + 1, 
2 + 2 + 2 + 1 + 1 + 1,  and  2 + 2+1  + 1 + 1 + 1 + 1—  in  spite  of  the  fact  that 
he  was  trying  to  list  partitions  systematically  in  decreasing  lexicographic  order. 
[See  E.  Knobloch,  Studia  Leibnitiana  Supplementa  11  (1973),  91-258;  16  (1976), 
255-337;  Historia  Mathematica  1 (1974),  409-430.] 

Abraham  de  Moivre  had  the  first  real  success  with  partitions,  in  his  paper 
A Method  of  Raising  an  infinite  Multinomial  to  any  given  Power,  or  Extracting 
any  given  Root  of  the  same”  [Philosophical  Transactions  19  (1697),  619-625  and 
Fig.  5],  He  proved  that  the  coefficient  of  zm+n  in  ( az  + bz2  + cz3  + ■ • • )m  has 
one  term  for  each  partition  of  n;  for  example,  the  coefficient  of  zm+6  is 

(?)  am~6b6  + 5(™)  am~5b4c  + 4(7)  am~4b3d  + 6(7)  am~4b2c 2 

+ 3(?)  am~3b2e  + 6 (™)am~3bcd  + 2 {™)am~2bf  + (7)am“3c3 

+ 2(7)a™-W  (™)a™~2d2  + (7)a"-V  (29) 

If  we  set  a = 1,  the  term  with  exponents  bic:>dkel  . . . corresponds  to  the  partition 
with  i Is,  j 2s,  k 3s,  l 4s,  etc.  Thus,  for  example,  when  n = 6 he  essentially 
presented  the  partitions  in  the  order 

111111,  11112,  1113,  1122,  114,  123,  15,  222,  24,  33,  6.  (30) 

He  explained  how  to  list  the  partitions  recursively,  as  follows  (but  in  different 
language  related  to  his  own  notation):  For  k = 1,  2,  . . . , n,  start  with  k and 
append  the  (previously  listed)  partitions  of  n-k  whose  smallest  part  is  > k. 

[My  solution]  was  ordered  to  be  published  in  the  Transactions, 
not  so  much  as  a matter  relating  to  Play, 
but  as  containing  some  general  Speculations 
not  unworthy  to  be  considered  by  the  Lovers  of  Truth. 

— ABRAHAM  DE  MOIVRE  (1717) 
P.  R.  de  Montmort  tabulated  all  partitions  of  numbers  < 9 into  < 6 parts 
in  his  Essay  d Analyse  sur  les  Jeux  de  Hazard  (1708),  in  connection  with  dice 
problems.  His  partitions  were  listed  in  a different  order  from  (30);  for  example. 

111111,  21111,  2211,  222,  3111,  321,  33,  411,  42,  51,  6.  (31) 

He  probably  was  unaware  of  de  Moivre’s  prior  work. 

So  far  almost  none  of  the  authors  we’ve  been  discussing  actually  bothered 
to  describe  the  procedures  by  which  they  generated  combinatorial  patterns.  We 
can  only  infer  their  methods,  or  lack  thereof,  by  studying  the  lists  that  they 
actually  published.  Furthermore,  in  rare  cases  such  as  de  Moivre’s  paper  where  a 
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tabulation  method  was  explicitly  described,  the  author  assumed  that  all  patterns 
for  the  first  cases  1,  2,  . . . , n — 1 had  been  listed  before  it  was  time  to  tackle 
the  case  of  order  n.  No  method  for  generating  patterns  “on  the  fly,”  moving 
directly  from  one  pattern  to  its  successor  without  looking  at  auxiliary  tables, 
was  actually  explained  by  any  of  the  authors  we  have  encountered,  except  for 
Kedara  and  Narayana.  Today’s  computer  programmers  naturally  prefer  methods 
that  are  more  direct  and  need  little  memory. 

Roger  Joseph  Boscovich  published  the  first  direct  algorithm  for  partition 
generation  in  Giornale  de’  Letterati  (Rome,  1747),  on  pages  393-404  together 
with  two  foldout  tables  facing  page  404.  His  method,  which  produces  for  n — 6 
the  respective  outputs 

111111,  11112,  1122,  222,  1113,  123,  33,  114,  24,  15,  6,  (32) 

generates  partitions  in  precisely  the  reverse  order  from  which  they  are  visited  by 
Algorithm  7.2.1.4P;  and  his  method  would  indeed  have  been  featured  in  Section 
7.2. 1.4,  except  for  the  fact  that  the  reverse  order  turns  out  to  be  slightly  easier 
and  faster  than  the  order  that  he  had  chosen. 

Boscovich  published  sequels  in  Giornale  de’  Letterati  (Rome,  1748),  12-27 
and  84-99,  extending  his  algorithm  in  two  ways.  First,  he  considered  generating 
only  partitions  whose  parts  belong  to  a given  set  S , so  that  symbolic  multinomials 
with  sparse  coefficients  could  be  raised  to  the  mth  power.  (He  said  that  the  gcd 
of  all  elements  of  5 should  be  1;  in  fact,  however,  his  method  could  fail  if  1 ^ S.) 
Second,  he  introduced  an  algorithm  for  generating  partitions  of  n into  to  parts, 
given  to  and  n.  Again  he  was  unlucky:  A slightly  better  way  to  do  that  task, 
Algorithm  7.2.1.4H,  was  found  subsequently,  diminishing  his  chances  for  fame. 

Hindenburg’s  hype.  The  inventor  of  Algorithm  7.2.1.4H  was  Carl  Friedrich 
Hindenburg,  who  also  rediscovered  Narayana’s  Algorithm  7.2.1.2L,  a winning 
technique  for  generating  multiset  permutations.  Unfortunately,  these  small  suc- 
cesses led  him  to  believe  that  he  had  made  revolutionary  advances  in  mathemat- 
ics although  he  did  condescend  to  remark  that  other  people  such  as  de  Moivre, 
Euler,  and  Lambert  had  come  close  to  making  similar  discoveries. 

Hindenburg  was  a prototypical  overachiever,  extremely  energetic  if  not  in- 
spired. He  founded  or  cofounded  Germany’s  first  professional  journals  of  math- 
ematics (published  1786-1789  and  1794-1800),  and  contributed  long  articles  to 
each.  He  served  several  times  as  academic  dean  at  the  University  of  Leipzig, 
where  he  was  also  the  Rector  in  1792.  If  he  had  been  a better  mathematician, 
German  mathematics  might  well  have  flourished  more  in  Leipzig  than  in  Berlin 
or  Gottingen. 

But  his  first  mathematical  work,  Beschreibung  einer  ganz  neuen  Art,  nach 
einem  bekannten  Gesetze  fortgehende  Zahlen  durch  Abzahlen  oder  Abmessen 
bequem  und  sicher  zu  finden  (Leipzig:  1776),  amply  foreshadowed  what  was  to 
come:  His  “ganz  neue”  (completely  new)  idea  in  that  booklet  was  simply  to  give 
combinatorial  significance  to  the  digits  of  numbers  written  in  decimal  notation. 
Incredibly,  he  concluded  his  monograph  with  large  foldout  sheets  that  contained 
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a table  of  the  numbers  0000  through  9999  — followed  by  two  other  tables  that 
listed  the  even  numbers  and  odd  numbers  separately(l). 

Hindenburg  published  letters  from  people  who  praised  his  work,  and  invited 
them  to  contribute  to  his  journals.  In  1796  he  edited  Sammlung  combinatorisch- 
analytischer  Abhandlungen , whose  subtitle  stated  (in  German)  that  de  Moivre’s 
multinomial  theorem  was  “the  most  important  proposition  in  all  of  mathematical 
analysis.  About  a dozen  people  joined  forces  to  form  what  became  known  as 
Hindenburg  s Combinatorial  School,  and  they  published  thousands  of  pages  filled 
with  esoteric  symbolism  that  must  have  impressed  many  nonmathematicians. 

The  work  of  this  School  was  not  completely  trivial  from  the  standpoint 
of  computer  science.  For  example,  H.  A.  Rothe,  who  was  Hindenburg’s  best 
student,  noticed  that  there  is  a simple  way  to  go  from  a Morse  code  sequence  to 
its  lexicographic  successor  or  predecessor.  Another  student,  J.  K.  Burckhardt, 
observed  that  Morse  code  sequences  of  length  n could  also  be  generated  easily 
by  first  considering  those  with  no  dashes,  then  one  dash,  then  two,  etc.  Their 
motivation  was  not  to  tabulate  poetic  meters  of  n beats,  as  it  had  been  in  India, 
but  rather  to  list  the  terms  of  the  continuant  polynomials  K(x i,  X2,  ■ ■ ■ , xn), 
Eq.  4.5.3— (4).  [See  Archiv  der  reinen  und  angewandten  Mathematik  1 (1794), 
154-195.]  Furthermore,  on  page  53  of  Hindenburg’s  1796  Sammlung  cited  above, 
G.  S.  Kliigel  introduced  a way  to  list  all  permutations  that  has  subsequently 
become  known  as  Ord-Smit.h’s  algorithm;  see  Eqs.  (23)-(26)  in  Section  7.2. 1.2. 

Hindenburg  believed  that  his  methods  deserved  equal  time  with  algebra, 
geometry,  and  calculus  in  the  standard  curriculum.  But  he  and  his  disciples 
were  combinatorialists  who  only  made  combinatorial  lists.  Burying  themselves 
in  formulas  and  formalisms,  they  rarely  discovered  any  new  mathematics  of  real 
interest.  Eugen  Netto  has  admirably  summarized  their  work  in  M.  Cantor’s 
Geschichte  der  Mathematik  4 (1908),  201-219:  “For  a while  they  controlled 
the  German  market;  however,  most  of  what  they  dug  up  soon  sank  into  a not- 
entirely-deserved  oblivion.” 

The  sad  outcome  was  that  combinatorial  studies  in  general  got  a bad  name. 
Gosta  Mittag-Leffler,  who  assembled  a magnificent  library  of  mathematical  lit- 
erature about  100  years  after  Hindenburg’s  death,  decided  to  place  all  such 
work  on  a special  shelf  marked  “Dekadenter.”  And  this  category  still  persists 
in  the  library  of  Sweden’s  Institut  Mittag-Leffler  today,  even  as  that  institute 
attracts  world-class  combinatorial  mathematicians  whose  research  is  anything 
but  decadent. 

Looking  on  the  bright  side,  we  may  note  that  at  least  one  good  book  did 
emerge  from  all  of  this  activity.  Andreas  von  Ettingshausen’s  Die  combina- 
torische  Analysis  (Vienna:  1826)  is  noteworthy  as  the  first  text  to  discuss  com- 
binatorial generation  methods  in  a perspicuous  way.  He  discussed  the  general 
principles  of  lexicographic  generation  in  §8,  and  applied  them  to  construct  good 
ways  to  list  all  permutations  (§11),  combinations  (§30),  and  partitions  (§41— §44) . 

Where  were  the  trees?  We’ve  now  seen  that  lists  of  tuples,  permutations, 
combinations,  and  partitions  were  compiled  rather  early  in  human  history,  by 


7.2.1. 7 


HISTORY  AND  FURTHER  REFERENCES  509 


interested  and  interesting  researchers.  Thus  we’ve  accounted  for  the  evolution 
of  the  topics  studied  in  Sections  7.2. 1.1  through  7. 2. 1.5,  and  our  story  will  be 
complete  if  we  can  trace  the  origins  of  tree  generation,  Section  7.2. 1.6. 

But  the  historical  record  of  that  topic  before  the  advent  of  computers  is 
virtually  a blank  page,  with  the  exception  of  a few  19th-century  papers  by  Arthur 
Cayley.  Cayley’s  major  work  on  trees,  originally  published  in  1875  and  reprinted 
on  pages  427-460  of  his  Collected  Mathematical  Papers,  Volume  4,  was  climaxed 
by  a large  foldout  illustration  that  exhibited  all  the  free  trees  with  9 or  fewer 
unlabeled  vertices.  Earlier  in  that  paper  he  had  also  illustrated  the  nine  oriented 
trees  with  5 vertices.  The  methods  he  used  to  produce  those  lists  were  quite 
complicated,  completely  different  from  Algorithm  7.2.1.60  and  exercise  7.2. 1.6- 
90.  All  free  trees  with  up  to  10  vertices  were  listed  many  years  later  by  F.  Harary 
and  G.  Prins  [Acta  Math.  101  (1958),  158-162],  who  also  went  up  to  n — 12  in 
the  cases  of  free  trees  with  no  nodes  of  degree  2 or  with  no  symmetries. 

The  trees  most  dearly  beloved  by  computer  scientists  — binary  trees  or  the 
equivalent  ordered  forests  or  nested  parentheses  — are  however  strangely  absent 
from  the  literature.  We  saw  in  Section  2. 3. 4. 5 that  many  mathematicians  of  the 
1700s  and  1800s  had  learned  how  to  count  binary  trees,  and  we  also  know  that 
the  Catalan  numbers  Cn  enumerate  dozens  of  different  kinds  of  combinatorial 
objects.  Yet  nobody  seems  to  have  published  an  actual  list  of  the  C4  = 14 
objects  of  order  4 in  any  of  these  guises,  much  less  the  C5  = 42  objects  of 
order  5,  before  1950.  (Except  indirectly:  The  42  genji-ko  diagrams  in  (25)  that 
have  no  intersecting  lines  turn  out  to  be  equivalent  to  the  5-node  binary  trees 
and  forests.  But  this  fact  was  not  learned  until  the  20th  century.) 

There  are  a few  isolated  instances  where  authors  of  yore  did  prepare  lists  of 
C3  = 5 Catalan-related  objects.  Cayley,  again,  was  first;  he  illustrated  the  binary 
trees  with  3 internal  nodes  and  4 leaves  as  follows  in  Philosophical  Magazine  18 
(1859),  374-378: 

A A A A A <33) 

(That  same  paper  also  illustrated  another  species  of  tree,  equivalent  to  so-called 
weak  orderings.)  Then,  in  1901,  E.  Netto  listed  the  five  ways  to  insert  parentheses 
into  the  expression  ‘a  + b + c + d’: 

( a+b)  + (c+d ),  [(a+5)+c]+d,  [a+(6+c)]+d,  a+[(b+c)+d],  a+[6+(c+d)].  (34) 

[■ Lehrbuch  der  Combinatorik , §122.]  And  the  five  permutations  of  {+1, +1, +1, 
— — 1}  whose  partial  sums  are  nonnegative  were  listed  in  the  following  way 

by  Paul  Erdos  and  Irving  Kaplansky  [Script a Math.  12  (1946),  73-75]: 

1 + 1 + 1 — 1 — 1 — 1,  l + l-l  + l-l-l,  l + l-l-l  + l-l, 

1 — 1+1+1  — 1 — 1,  1 — 1 + 1 — 1 + 1-1.  (35) 

Even  though  only  five  objects  are  involved,  we  can  see  that  the  orderings  in  (33) 
;m<l  (34)  were  basically  catch-as-catch-can;  only  (35),  which  matches  Algorithm 
7.2.1.6P,  was  systematic  and  lexicographic. 
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We  should  also  note  briefly  the  work  of  Walther  von  Dyck,  since  many  recent 
papers  use  the  term  Dyck  words”  to  refer  to  strings  of  nested  parentheses.  Dyck 
was  an  educator  known  for  co-founding  the  Deutsches  Museum  in  Munich,  among 
other  things.  He  wrote  two  pioneering  papers  about  the  theory  of  free  groups 
[Math.  Annalen  20  (1882),  1-44;  22  (1883),  70-108],  Yet  the  so-called  Dyck 
words  have  at  best  a tenuous  connection  to  his  actual  research:  He  studied  the 
words  on  1 , . . . ,xk, x^.1}  that  reduce  to  the  empty  string  after  repeatedly 

erasing  adjacent  letter-pairs  of  the  forms  xixi  1 or  x~lxp,  the  connection  with 
parentheses  and  trees  arises  only  when  we  limit  erasures  to  the  first  case,  x^1 . 

Thus  we  may  conclude  that,  although  an  explosion  of  interest  in  binary  trees 
and  their  cousins  occurred  after  1950,  such  trees  represent  the  only  aspect  of  our 
story  whose  historical  roots  are  rather  shallow. 

After  1950.  Of  course  the  arrival  of  electronic  computers  changed  everything. 
The  first  computer-oriented  publication  about  combinatorial  generation  methods 
was  a note  by  C.  B.  Tompkins,  “Machine  attacks  on  problems  whose  variables 
are  permutations”  [Proc.  Symp.  Applied  Math.  6 (1956),  202-205].  Thousands 
more  were  destined  to  follow. 

Several  articles  by  D.  H.  Lehmer,  especially  his  “Teaching  combinatorial 
tricks  to  a computer”  in  Proc.  Symp.  Applied  Math.  10  (1960),  179-193,  proved 
to  be  extremely  influential  in  the  early  days.  [See  also  Proc.  1957  Canadian 
Math.  Congress  (1959),  160-  173;  Proc.  IBM  Scientific  Computing  Symposium 
on  Combinatorial  Problems  (1964),  23-30;  and  Chapter  1 of  Applied  Combina- 
torial Mathematics,  edited  by  E.  F.  Beckenbach  (Wiley,  1964),  5-31.]  Lehmer 
represented  an  important  link  to  previous  generations.  For  example,  Stanford’s 
library  records  show  that  he  had  checked  out  Netto’s  Lehrbuch  der  Combinatorik 
in  January  of  1932. 

The  main  publications  relevant  to  particular  algorithms  that  we’ve  studied 
have  already  been  cited  in  previous  sections,  so  there  is  no  need  to  repeat  them 
here.  But  textbooks  and  monographs  that  first  put  pieces  of  the  subject  together 
in  a coherent  framework  were  also  of  great  importance.  Three  books,  in  partic- 
ular, were  especially  noteworthy  with  respect  to  establishing  general  principles: 

• Elements  of  Combinatorial  Computing  by  Mark  B.  Wells  (Pergamon  Press, 
1971),  especially  Chapter  5. 

• Combinatorial  Algorithms  by  Albert  Nijenhuis  and  Herbert  S.  Wilf  (Aca- 
demic Press,  1975).  A second  edition  was  published  in  1978,  containing 
additional  material,  and  Wilf  subsequently  wrote  Combinatorial  Algorithms: 
An  Update  (Philadelphia:  SIAM,  1989). 

• Combinatorial  Algorithms:  Theory  and  Practice  by  Edward  M.  Reingold, 
Jurg  Nievergelt,  and  Narsingh  Deo  (Prentice-Hall,  1977),  especially  the 
material  in  Chapter  5. 

Robert  Sedgewick  compiled  the  first  extensive  survey  of  permutation  generation 
methods  in  Computing  Surveys  9 (1977),  137  164,  314.  Carla  Savage’s  survey 
article  about  Gray  codes  in  SIAM  Review  39  (1997),  605-629,  was  another 
milestone. 
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We  noted  above  that  algorithms  to  generate  Catalan-counted  objects  were 
not  invented  until  computer  programmers  developed  an  appetite  for  them.  The 
first  such  algorithms  to  be  published  were  not  cited  in  Section  7.2. 1.6  because 
they  have  been  superseded  by  better  techniques;  but  it  is  appropriate  to  list 
them  here.  First,  H.  I.  Scoins  gave  two  recursive  algorithms  for  ordered  tree 
generation,  in  the  same  paper  we  have  cited  with  respect  to  the  generation  of 
oriented  trees  [Machine  Intelligence  3 (1968),  43-60].  His  algorithms  dealt  with 
binary  trees  represented  as  bit  strings  that  were  essentially  equivalent  to  Polish 
prefix  notation  or  to  nested  parentheses.  Then  Mark  Wells,  in  Section  5.5.4  of  his 
book  cited  above,  generated  binary  trees  by  representing  them  as  noncrossing 
set  partitions.  And  Gary  Knott  [CACM  20  (1977),  113-115]  gave  recursive 
ranking  and  unranking  algorithms  for  binary  trees,  representing  them  via  the 
inorder-to-preorder  permutations  qi  . . . qn  of  Table  7.2. 1.6-3. 

Algorithms  to  generate  all  spanning  trees  of  a given  graph  have  been  pub- 
lished by  numerous  authors  ever  since  the  1950s,  motivated  originally  by  the 
study  of  electrical  networks.  Among  the  earliest  such  papers  were  works  of 
N.  Nakagawa,  IRE  Trans.  CT-5  (1958),  122-127;  W.  Mayeda,  IRE  Trans. 
CT-6  (1959),  136-137,  394;  H.  Watanabe,  IRE  Trans.  CT-7  (1960),  296-302; 
S.  Hakimi,  J.  Franklin  Institute  272  (1961),  347-359. 

A recent  introduction  to  the  entire  subject  can  be  found  in  Chapters  2 
and  3 of  Combinatorial  Algorithms:  Generation,  Enumeration,  and  Search  by 
Donald  L.  Kreher  and  Douglas  R.  Stinson  (CRC  Press,  1999). 

Frank  Ruskey  is  preparing  a book  entitled  Combinatorial  Generation  that 
will  contain  a thorough  treatment  and  a comprehensive  bibliography.  He  has 
made  working  drafts  of  several  chapters  available  on  the  Internet. 


EXERCISES 


Many  of  the  exercises  below  ask  a modern  reader  to  find  and/or  to  correct  errors  in 
the  literature  of  bygone  days.  The  point  is  not  to  gloat  over  how  smart  we  are  in  the 
21st  century;  the  point  is  rather  to  understand  that  even  the  pioneers  of  a subject  can 
stumble.  One  good  way  to  learn  that  a set  of  ideas  is  not  really  as  simple  as  it  might 
seem  to  today’s  computer  scientists  and  mathematicians  is  to  observe  that  some  of  the 
world’s  leading  thinkers  had  to  struggle  with  the  concepts  when  they  were  new. 

1.  [15]  Does  the  notion  of  “computing”  arise  in  the  I Chingl 

► 2.  [M30]  ( The  genetic  code.)  DNA  molecules  are  strings  of  “nucleotides”  on  the 

4-letter  alphabet  {T,  C,  A,  G},  and  most  protein  molecules  are  strings  of  “amino  acids”  on 
the  20-letter  alphabet  (A,  C,  D,  E,  F,  G,  H,  I,  K,  L,  M,  N,  P,  Q,  R,  S,  T,  V,  W,  Y}.  Three 
consecutive  nucleotides  xyz  form  a “codon,”  and  a strand  xiy1zix2y2Z2  ...  of  DNA 
specifies  the  protein  f(x\,y\,  zi)f(x2,y2,  z2) ... , where  f(x,y,  z)  is  the  element  in  row  z 
and  column  y of  matrix  x in  the  array 
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L S 
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/L  P H R\ 
L P H R 
L P Q R 
\L  P Q R / 
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(Here  (T,  C,  A,  G)  — (1,2, 3, 4);  for  example,  /(CAT)  is  the  element  in  row  1 and  column  3 
of  matrix  2,  namely  H.)  Encoding  proceeds  until  a codon  leads  to  the  stopper 
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a)  Show  that  there  is  a simple  way  to  map  each  codon  into  a hexagram  of  the  I Ching, 
with  the  property  that  the  21  possible  outcomes  {A,  C,  D, . . . , W,  Y,  -}  correspond 
to  21  consecutive  hexagrams  of  the  King  Wen  ordering  (i). 

b)  Is  that  a sensational  discovery? 

3.  [20]  What  is  the  millionth  meter  that  has  30  beats,  in  colex  ordering  analogous 

to  (2)?  What  is  the  rank  of  ^ w ? 

4.  [19]  Analyze  the  imperfections  of  Donnolo’s  list  of  permutations  in  Table  1. 

5.  [16]  What’s  wrong  with  Kircher’s  list  of  five-note  permutations  in  (7)? 

6.  [25]  Mersenne  published  a table  of  the  first  64  factorials  on  pages  108-110  of  his 
Traitez  de  la  Voix  et  des  Chants  (1636).  His  value  for  64!  was  « 2.2  x 10s9;  but  it  should 
have  been  « 1.3  x 1089.  Find  a copy  of  his  book  and  try  to  figure  out  where  he  erred. 

7.  [20]  What  permutations  of  {1,  2, 3, 4,  5}  are  “alive”  and  “dead”  according  to  Seki’s 
rules  (8)  and  (9)? 

► 8.  [M27]  Make  a patch  to  (9)  so  that  Seki’s  procedure  will  be  correct. 

9.  [15]  From  (11),  deduce  the  Arabic  way  to  write  the  Arabic  numerals  (0, 1, ... , 9). 

► 10.  [HM27]  In  Ludus  Clericalis,  what  is  the  expected  number  of  times  the  three  dice 
are  rolled  before  all  possible  virtues  are  acquired? 

11.  [21]  Decipher  LI  nil  s vertical  table  at  the  right  of  Fig.  65.  What  20  combinatorial 
objects  does  it  represent?  Hint:  Don’t  be  misled  by  typographic  errors. 

12.  [M20]  Relate  Schillinger’s  universal  cycle  (13)  to  the  universal  cycle  of  Poinsot  in 
exercise  7.2.1.3-106. 

13.  [21]  What  should  van  Schooten  have  written,  instead  of  (17)?  Give  also  the 
corresponding  tableau  for  combinations  of  the  multiset  (a,  a,  a,  b,  b,  c}. 

► 14.  [20]  Complete  the  following  sequence,  from  §95  of  Izquierdo’s  De  Combinations 

ABC  ABD  ABE  ACD  ACE  ACB  ADE  ADB  ADC  AEB 

15.  [15]  If  all  n-combinations  xi...xn  of  {l,...,m}  with  repetition  are  listed  in 
lexicographic  order,  with  x\  < ■ ■ • < xn,  how  many  of  them  begin  with  the  number  j? 

16.  [20]  (Narayana  Pandita,  1356.)  Design  an  algorithm  to  generate  all  compositions 
of  n into  parts  < q,  namely  all  ordered  partitions  n = ai  + • • ■ + at,  where  1 < a3  < q 
for  1 <j<t  and  t is  arbitrary.  Illustrate  your  method  when  n = 7 and  q = 3. 

17.  [HM27]  Analyze  the  algorithm  of  exercise  16. 

18.  [10]  Trick  question:  Leibniz  published  his  Dissertatio  de  Arte  Combinatoria  in 
1666.  Why  was  that  a particularly  auspicious  year,  permutat.ionwise? 

19.  [17]  In  which  of  Puteanus’s  verses  (20)  is  ‘tibi’  treated  as  instead  of 

20.  [M25]  To  commemorate  the  visit  of  three  illustrious  noblemen  to  Dresden  in  1617, 
a poet  published  1617  permutations  of  the  hexameter  verse 

Dant  tria  jam  Dresdae,  ceu  sol  dat,  lumina  lucem. 

“Three  give  now  to  Dresden,  as  the  sun  gives,  lights  to  light.”  [Gregor  Kleppis,  Proteus 
Poeticus  (Leipzig:  1617).]  How  many  permutations  of  those  words  would  actually  scan 
properly?  Hint.  The  verse  has  dactyls  in  the  first  and  fifth  feet,  spondees  elsewhere. 
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21.  [HM30]  Let  f (p,  q,r;  s,t)  be  the  number  of  ways  to  make  ( op,oq,or ) by  concate- 
nating the  strings  {s  ■ o,  t • oo},  when  p+q  + r = s + 2t.  For  example,  /( 2, 3, 2;  3,  2)  = 5 
because  the  five  ways  are 

(ojo,  o|oo,  oo),  (ojo,  oo|o,  oo),  (oo,  o]o|o,  oo),  (oo,  o|oo,  o|o),  (oo,  oo|o,  ojo). 

a)  Show  that  f(p,q,r\ s,  t)  = [upvqwrzs]  l/((  1 - zu  - u2)(l  - zv  - v2)(l  — zw  - w2)). 

b)  Use  the  function  / to  enumerate  the  scannable  permutations  of  (19),  subject  to 
the  additional  condition  that  the  fifth  foot  doesn’t  begin  in  the  middle  of  a word. 

c)  Now  enumerate  the  remaining  cases. 

► 22.  [ M40 ] Look  up  the  original  discussions  of  the  tot-tibi  problem  that  were  published 
by  Prestet,  Wallis,  Whitworth,  and  Hartley.  What  errors  did  they  make? 

23.  [20]  What  order  of  the  52  genji-ko  diagrams  corresponds  to  Algorithm  7.2.1.5H? 

► 24.  [23]  Early  in  the  1800s,  Toshiaki  Honda  gave  a recursive  rule  for  generating  all  par- 
titions of  {1, . . . ,n}.  His  algorithm  produced  them  in  the  following  order  when  n = 4: 

1111  nil  mi  mi  mi  nn  nn  im  mi  im  im  im  i im  im 

Can  you  guess  the  corresponding  order  for  n = 5?  Hint:  See  (26). 

25.  [15]  The  16th-century  author  of  The  Arte  of  English  Poesie  was  interested  only  in 
rhyme  schemes  that  are  “complete”  in  the  sense  of  exercise  7.2.1.5-35;  in  other  words, 
every  line  should  rhyme  with  at  least  one  other.  Furthermore,  the  scheme  should 
be  “indecomposable”  in  the  sense  of  exercise  7.2.1.2-100:  A partition  like  1 2 1 345 
decomposes  into  a 2-line  poem  followed  by  a 3-line  poem.  And  the  scheme  shouldn’t 
consist  trivially  of  lines  that  all  rhyme  with  each  other.  Under  these  conditions,  is  (28) 
a complete  list  of  5-line  rhyme  schemes? 

► 26.  [HM25]  How  many  n-line  rhyme  schemes  satisfy  the  constraints  of  exercise  25? 

► 27 . [HM31]  The  set  partition  14 1 25  1 36  can  be  represented  by  a genji-ko  diagram  such 
as  M but  every  such  diagram  for  this  partition  must  have  at  least  three  places  where 
lines  cross,  and  crossings  are  sometimes  considered  undesirable.  How  many  partitions 
of  {1,...  ,n}  have  a genji-ko  diagram  in  which  the  lines  cross  at  most  once? 

► 28.  [25]  Let  a,  b,  and  c be  prime  numbers.  John  Wallis  listed  all  possible  factorizations 
of  a3b2c  as  follows:  cbbaaa,  cbbaa  ■ a,  cbaaa  ■ b,  bbaaa  ■ c,  cbba  ■ aa,  cbba  ■ a ■ a,  cbaa  ■ ba , 
cbaa  ■ b ■ a,  bbaa  ■ ca,  bbaa  ■ c ■ a,  caaa  ■ bb,  caaa  ■ b ■ b,  baaa  ■ cb,  baaa  ■ c ■ b,  ebb  ■ aaa, 
ebb  ■ aa  ■ a,  ebb  ■ a ■ a ■ a,  eba  ■ baa,  eba  ■ ba  ■ a,  eba  ■ aa  ■ b,  eba  ■ b ■ a ■ a,  bba  ■ caa,  bba  ■ ca  ■ a, 
bba  -aa-c,  bba  ■ c-a-a , caa  -bb-  a,  caa  -ba-b,  caa  ■ b-b-a , baa  -cb-a,  baa  ■ ca-b,  baa  ■ ba-c , 
baa  ■ c ■ b ■ a,  aaa  ■ cb  ■ b,  aaa  ■ bb  ■ c,  aaa  ■ c • b ■ b,  cb  ■ ba  ■ aa,  cb  • ba  ■ a ■ a,  cb  ■ aa  ■ b ■ a, 
cb  ■ b ■ a ■ a ■ a,  bb  ■ ca  ■ aa,  bb  ■ ca  ■ a ■ a,  bb  ■ aa  ■ c ■ a,  bb  ■ c ■ a ■ a ■ a,  ca  ■ ba  ■ ba,  ca  ■ ha  ■ b ■ a, 
ca  ■ aa  ■ b ■ b,  ca  ■ b ■ b ■ a ■ a,  ba  ■ ba  ■ c ■ a,  ba  ■ aa  ■ c ■ b,  ba  ■ c ■ b ■ a ■ a,  aa  ■ c • b ■ b ■ a, 
c ■ b ■ b ■ a ■ a ■ a.  What  algorithm  did  he  use  to  generate  them  in  this  order? 

► 29.  [24]  In  what  order  would  Wallis  have  generated  all  factorizations  of  the  number 
abode  = 5 • 7 • 11  • 13  • 17?  Give  your  answer  as  a sequence  of  genji-ko  diagrams. 

30.  [M20]  What  is  the  coefficient  of  a'^a^  . . . zm+n  in  (ao-z  + a\z2  + 02 z3  + • • • )m? 
(See  (29).) 

31.  [20]  Compare  de  Moivre’s  and  de  Montmort’s  orders  for  partitions,  (30)  and  (31), 
with  Algorithm  7.2.1.4P. 

32.  [21]  (R.  J.  Boscovich,  1748.)  List  all  partitions  of  20  for  which  all  parts  are  1,  7, 
or  10.  Also  design  an  algorithm  that  lists  all  such  partitions  of  any  given  integer  n > 0. 
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Answer  not  a fool  according  to  his  folly, 
lest  thou  also  be  like  unto  him. 

— Proverbs  26:4 


NOTES  ON  THE  EXERCISES 

1.  A moderately  easy  problem  for  a mathematically  inclined  reader. 

2.  The  author  will  reward  you  if  you  are  first  to  report  an  error  in  the  statement  of 
an  exercise  or  in  its  answer. 

3.  See  H.  Poincare,  Rendiconti  Circ.  Mat.  Palermo  18  (1904),  45-110;  R.  H.  Bing, 
Annals  of  Math.  (2)  68  (1958),  17-37;  G.  Perelman,  arXiv:math.DG/0211159  0303109 
0307245. 


SECTION  7 


1.  Following  the  hint,  we’ll  want  the  second  ‘4m-4’  to  be  immediately  followed  by  the 
first  ‘2m- 1’.  The  desired  arrangements  can  be  deduced  from  the  first  four  examples, 
given  in  hexadecimal  notation:  231213,  46171435623725,  86a31bl368597a425b2479. 
ca8e531f  1358ac7db9e6427f 2469bd.  [R.  O.  Davies,  Math.  Gazette  43  (1959),  253-255.] 

2.  Such  arrangements  exist  if  and  only  if  n mod  4 = 0 or  1 . This  condition  is  necessary 
because  there  must  be  an  even  number  of  odd  items.  And  it  is  sufficient  because  we 
can  place  00  in  front  of  the  solutions  in  the  previous  exercise. 

Notes:  This  question  was  first  raised  by  Marshall  Hall  in  1951,  and  solved  the 
following  year  by  F.  T.  Leahy,  Jr.,  in  unpublished  work  [Armed  Forces  Security  Agency 
report  343  (28  January  1952)].  It  was  independently  posed  and  resolved  by  T.  Skolem 
and  T.  Bang,  Math.  Scandinavica  5 (1957),  57-58.  For  other  intervals  of  numbers,  see 
the  complete  solution  by  J.  E.  Simpson,  Discrete  Math.  44  (1983),  97  104. 

3.  Yes.  For  example,  the  cycle  (0072362435714165)  can’t  be  broken  up. 

4.  The  kth  occurrence  of  b is  in  position  J from  the  left,  and  the  Jfcth  occurrence 
of  a is  in  position  [k<f>2 J.  Clearly  [kc/>2 J - [k<t>\  = k,  because  02  = 0 + 1.  (The  integers 
[k<j> J form  the  “spectrum”  of  </>-,  see  exercise  3.13  of  CMath.) 

5.  2n  — k — 1 of  the  (2”)  equally  likely  pairs  of  positions  satisfy  the  stated  condition. 
If  these  probabilities  were  independent  (but  they  aren’t),  the  value  of  2 Ln  would  be 


n((2n-l-*)/(22")) 


k=  1 


(2  n)!2n(n  — 1) 
n!(2n)'1+1(2n  — l)n+i 


exp(nln~+\n^^+0(n  J)). 
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6.  (a)  When  the  products  are  expanded,  we  obtain  a polynomial  of  (2 n — 2)!/(n  — 2)! 
terms,  each  of  degree  4n.  There’s  a term  xf...  x2n  for  each  Langford  pairing;  every 
other  term  has  at  least  one  variable  of  degree  1.  Summing  over  xi, . . . , X2„  € { — 1,  +1} 
therefore  cancels  out  all  the  bad  terms,  but  gives  22"  for  the  good  terms.  An  extra  factor 
of  2 arises  because  there  are  2 L„  Langford  pairings  (including  left-right  reversals). 

(b)  Let  fk  = Yl2jLik  'xjXj+k+i  be  the  main  part  of  the  fcth  factor.  We  can  run 
through  all  4n  cases  x i . . . . , x2„  G { — 1,  +1}  in  Gray-code  order  (Algorithm  7.2.1.1L), 
negating  only  one  of  the  Xj  each  time.  A change  in  xt  causes  at  most  two  adjustments 
to  each  fk]  so  each  Gray-code  step  costs  0{n). 

We  needn’t  compute  the  sum  exactly;  it  suffices  to  work  mod  2N , where  2N  com- 
fortably exceeds  22n  + 1Ln.  Even  better,  when  n = 24,  would  be  to  do  the  computations 
mod  260  — 1,  or  mod  both  230  — 1 and  230  + 1.  One  can  also  save  [n/2]  bits  of  precision 
by  exploiting  the  fact  that  /*,  = k + 1 (modulo  2). 

(c)  The  third  equality  is  actually  valid  only  when  n mod  4 = 0 or  3;  but  those  are 

the  interesting  n’s.  The  sum  can  be  carried  out  in  n phases,  where  phase  p for  p < n 
involves  the  cases  where  x„_i  = xn+2,  x„-2  = x„+3,  xn-P+i  = xn+p,  xn-p  = xn  = 
Xn+i  = +1,  and  xn+p+1  = —1;  it  has  an  outer  loop  that  chooses  (xn_p+i, . . . ,xn-i)  in 
all  2P_1  ways,  and  an  inner  loop  that  chooses  (xi, . . . , xn_p_i,  xn+p+2, . . .,x2n)  in  all 
22”  2p  - ways.  (The  inner  loop  uses  Gray  binary  code,  preferably  with  “organ-pipe 
order”  to  prioritize  the  subscripts  so  that  x\  and  x2n  vary  most  rapidly.  The  outer 
loop  need  not  be  especially  efficient.)  Phase  n covers  the  2n_1  palindromic  cases  with 
Xj  = x2n+i-j  for  1 < j < n and  xn  = xn+i  = +1.  If  sp  denotes  the  sum  in  phase  p, 
then  si  -t h s„_i  + |s„  = 2 2n~2Ln. 

A substantial  fraction  of  the  terms  turn  out  be  zero.  For  example,  when  n = 16, 
zeros  appear  about  76%  of  the  time  (in  408,838,754  cases  out  of  229  + 214).  This  fact  can 
be  used  to  avoid  many  multiplications  in  the  inner  loop.  (Only  fi,  f3,  ...  can  be  zero.) 

7 . Let  dk  be  the  number  of  incomplete  pairs  after  k characters  have  been  read;  thus 
do  = d2n  = 0,  and  dk  = djt_i  ± 1 for  1 < k < 2 n.  The  largest  such  sequence  in  which  dk 
never  exceeds  6 is  (d0,  di, . . . , d2n)  = (0, 1, 2, 3, 4,  5, 6,  5, 6, . . . , 5, 6,  5, 4,  3, 2, 1, 0),  which 
has  YlkL i dk  = lln  — 30.  But  i dk  = I2k=i(k  + 1)  = ("J1)  + n in  any  Langford 
pairing.  Hence  (’“t1)  + n < 11  n — 30,  and  n < 15.  (In  fact,  width  6 is  also  impossible 
when  n = 15.  The  largest  and  smallest  possible  width  are  unknown  in  general.) 

8.  There  are  no  solutions  when  n = 4 or  n = 7.  When  n = 8 there  are  four: 


r 


n 


I 


1317538642572468;  1418634753268257;  4275248635713168;  5286235743681417. 


LL 


j 


L 


J I I L 


=u 


(This  problem  makes  a pleasant  mechanical  puzzle,  using  gadgets  of  width  k + 1 and 
height  \k/2]  for  piece  k.  In  his  original  note  [Math.  Gazette  42  (1958),  228],  C.  Dudley 
Langford  illustrated  similar  pieces,  and  exhibited  a planar  solution  for  n = 12.  The 
question  can  be  cast  as  an  exact  cover  problem,  with  nonprimary  columns  representing 
places  where  two  gadgets  are  not  allowed  to  intersect;  see  exercise  7.2.2.1-00.  Jean 
Brette  has  devised  a somewhat  similar  puzzle,  based  on  Skolem’s  variant  of  the  problem 
and  using  width  instead  of  planarity;  he  gave  a copy  to  David  Singmaster  in  1992.) 

9.  Just  three  ways:  181915267285296475384639743,  191218246279458634753968357, 
191618257269258476354938743  (and  their  reversals).  [First  found  in  1969  by  G.  Baron; 
see  Combinatorial  Theory  and  Its  Applications  (Budapest:  1970),  81-92.  The  “dancing 
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links  method  of  Section  7.2.2. 1 resolves  this  question  by  traversing  a search  tree  that 
has  only  360  nodes,  given  an  exact  cover  problem  with  132  rows.] 

10.  For  example,  let  A = 12,  K = 8,  Q = 4,  J = 0,  * = 4,  <?  = 3,  0 = 2,  * = 1;  add. 

[In  this  connection,  orthogonal  latin  squares  equivalent  to  Fig.  1 were  implicitly 
present  already  in  medieval  Islamic  talismans  illustrated  by  Ibn  al-Hajj  in  his  Kitab 
Shumus  al- Anwar  (Cairo:  1322);  he  also  gave  a 5x5  example.  See  E.  Doutte,  Magie 
et  Religion  dans  l’Afrique  du  Nord  (Algiers:  1909),  193-194,  214,  247;  W.  Ahrens, 
Der  Islam  7 (1917),  228-238.  See  also  an  article  on  the  history  of  latin  squares  being 
prepared  by  Lars  D.  Andersen.] 


/ dyH  aSl  b/3>  cal  \ 
c/3l  baN  ayl  dSi 
aai  dpi  cSH  byl 
\ bSl  cyi  dal  aPH ) 


[Joseph  Sauveur  presented  the  earliest  known  ex- 
ample of  such  squares  in  Memoires  de  l’Academie 
Royale  des  Sciences  (Paris,  1710),  92-138,  §83.] 


12.  If  n is  odd,  we  can  let  Mtj  = (i  - j)  mod  n.  But  if  n is  even,  there  are  no 

transversals:  For  if  {(f0  + 0)  mod  n, ... , (tB_1+„-l)  mod  n}  is  a transversal,  we  have 
Lfc=o  **  = £fc=o(*fc  + k ) (modulo  n),  hence  ^Lo  k = - 1)  is  a multiple  of  n. 

13.  Replace  each  element  l by  [Z/5J  to  get  a matrix  of  0s  and  Is.  Let  the  four  quarters 
be  named  (c  D );  then  A and  1)  each  contain  exactly  k Is,  while  B and  C each  contain 
exactly  k 0s.  Suppose  the  original  matrix  has  ten  disjoint  transversals.  If  k < 2,  at  most 
four  of  them  go  through  a 1 in  A or  D,  and  at  most  four  go  through  a 0 in  B or  C.  Thus 
at  least  two  of  them  hit  only  0s  in  A and  D,  only  Is  in  B and  C.  But  such  a transversal 
has  an  even  number  of  0s  (not  five),  because  it  intersects  A and  D equally  often. 

Similarly , a latin  square  of  order  4m  + 2 with  an  orthogonal  mate  must  have  more 
than  m intruders  in  each  of  its  (2m  + 1)  x (2m  + 1)  submatrices,  under  all  renamings 
of  the  elements.  [H.  B.  Mann,  Bull.  Amer.  Math.  Soc.  (2)  50  (1944),  249-257.] 

14.  Cases  (b)  and  (d)  have  no  mates.  Cases  (a),  (c),  and  (e)  have  respectively  2,  6, 
and  12265168(1),  of  which  the  lexicographically  first  and  last  are 


(a)  (a)  (c) 

0456987213  0691534782  0362498571 

1305629847  1308257964  1408327695 

2043798165  2169340578  2673519408 

3289176504  3250879416  3521970846 

4518263790  4587902631.  4890253167 
5167432089’  5412763890’  5736841920’ 
6894015372  6945081327  6259784013 

7920341658  7836425109  7915602384 

8731504926  8723196045  8147036259 

9672850431  9074618253  9084165732 


(c)  (e)  (e) 

0986271435  0214365897  0987645321 

1354068792  1025973468  1795402638 

2741853960  2690587143  2506913874 

3572690814  3857694201  3154067289 

4630789251 . 4168730925  4231850967 
5218947306’  5473829016’  5348276190' 
6095324178  6942158730  6820394715 

7869512043  7309216584  7069128543 

8407136529  8531402679  8412739056 

9123405687  9786041352  9673581402 


Notes:  Squares  (a),  (b),  (c),  and  (d)  were  obtained  from  the  decimal  digits  of  n,  e, 
7,  and  <j>,  by  discarding  each  digit  that  is  inconsistent  with  a completed  latin  square. 
Although  they  aren’t  truly  random,  they’re  probably  typical  of  10  x 10  latin  squares 
in  general,  roughly  half  of  which  appear  to  have  orthogonal  mates.  Parker  constructed 
square  (e)  in  order  to  obtain  an  unusually  large  number  of  transversals;  it  has  5504  of 
them.  (Euler  had  studied  a similar  example  of  order  6,  therefore  “just  missing”  the 
discovery  of  a 10  x 10  pair.) 

15.  Parker  was  dismayed  to  discover  that  none  of  the  mates  of  square  14(e)  are 
orthogonal  to  each  other.  With  J.  W.  Brown  and  A.  S.  Hedayat  [J.  Combinatorics  Inf. 
and  System  Sci.  18  (1993),  113-115],  he  later  found  two  10  x 10s  that  have  four  disjoint 
common  transversals  (but  not  ten).  [See  also  B.  Ganter,  R.  Mathon,  and  A.  Rosa, 
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Congressus  Numerantium  20  (1978),  383-398;  22  (1979),  181-204.]  While  pursuing 
an  idea  of  L.  Weisner  [Canadian  Math.  Bull.  6 (1963),  61—63],  the  author  accidentally 
noticed  some  squares  that  come  even  closer  to  a mutually  orthogonal  trio:  The  square 
below  is  orthogonal  to  its  transpose;  and  it  has  five  diagonally  symmetric  transver- 
sals, in  cells  (0,po),  ...,  (9,p9)  for  po...pg  = 0132674598,  2301457689,  3210896745, 
4897065312,  and  6528410937,  which  are  almost  disjoint:  They  cover  49  cells. 


L 


/ 0234567891 \ 
3192708546 
6528139407 
8753241960 
1689473025 
4970852613 
5047986132 
9416320758 
7361095284 
V 2805614379/ 


/ 0368145972 \ 
2157690438 
3925874160 
4283907615 
5712489306 
6034758291 
7891326054 
8549061723 
9406213587 
V 1670532849/ 


Extensive  computations  by  B.  D.  McKay,  A.  Meynert,  and  W.  Myrvold  [J.  Comb. 
Designs  15  (2007),  98-119]  prove  that  no  10  x 10  latin  square  with  nontrivial  symmetry 
has  two  mates  orthogonal  to  each  other.  Three  mutually  orthogonal  latin  squares  are 
known  to  exist  for  all  orders  n > 10  [see  S.  M.  P.  Wang  and  R.  M.  Wilson,  Congressus 
Numerantium  21  (1978),  688;  D.  T.  Todorov,  Ars  Combinatoria  20  (1985),  45-47]. 

16.  See  R.  A.  Brualdi  and  H.  J.  Ryser,  Combinatorial  Matrix  Theory  (Cambridge 
University  Press,  1991),  §8.2. 

17.  (a)  Let  there  be  3n  columns  r3-,  c,  , Vj  for  0 < j < n,  and  n2  rows;  row  (i,j)  has 
1 in  columns  r,,  cj,  and  vi,  where  l = Ltj,  for  0 < i,j  < n. 

(b)  Let  there  be  4n2  columns  nj,  ctj,  xtj,  y,,}  for  0 < i,  j < n,  and  n3-n2  + n rows; 
row  (i,j,  k)  has  1 in  columns  rik,  cjk,  x^,  and  ylk,  where  l = , for  0 < i,j,  k < n 

and  (i  = k or  j > 0). 

18.  Given  an  orthogonal  array  A with  rows  Ai  for  1 < i < to,  define  latin  square 
Li  = (Ljjfc)  for  1 < i < to  - 2 by  setting  Lijk  = Aiq  when  A(m_1)?  = j and  Amq  = k, 
for  0 < j,  k < n.  (The  value  of  q is  uniquely  determined  by  the  values  of  j and  k.) 
Permuting  the  columns  of  the  array  does  not  change  the  corresponding  latin  squares. 

This  construction  can  also  be  reversed,  to  produce  orthogonal  arrays  of  order  n 
from  mutually  orthogonal  latin  squares  of  order  n.  In  exercise  11,  for  example,  we  can 
let  a = a = N = 0,  6 = = 3 = 1,  c = 7 = 5 = 2,  and  d = 5 = 1 = 3,  obtaining 


A = 


/ 3012210303211 230 \ 
2310102301323201 
0123103223013210 
0000111122223333 
\0123012301230123/ 


(The  concept  of  an  orthogonal  array  is  mathematically  “cleaner”  than  the  concept 
of  orthogonal  latin  squares,  because  it  accounts  better  for  the  underlying  symmetries. 
Notice,  for  example,  that  annxn  matrix  L is  a latin  square  if  and  only  if  it  is  orthogonal 
to  two  particular  non-latin  squares,  namely 


Z1 

1 . . 

• M 

Z1 

2 . 

..  n \ 

2 

2 . . 

. . 2 

and  L _L 

1 

2 . 

. . n 

\n 

n 

. . nj 

ll 

2 .. 

. . n) 

L _L 
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Therefore  Latin  squares,  Graeco-Latin  squares,  Hebraic-Graeco-Latin  squares,  etc.,  are 

equivalent  to  orthogonal  arrays  of  depth  3,  4,  5, Moreover,  the  orthogonal  arrays 

considered  here  are  merely  the  special  case  t = 2 and  A = 1 of  a more  general  concept 
of  n-ary  m x An*  arrays  having  “strength  t"  and  “index  A,”  introduced  by  C.  R.  Rao 
in  Proc.  Edinburgh  Math.  Soc.  8 (1949),  119-125;  see  the  book  Orthogonal  Arrays  by 
A.  S.  Hedayat,  N.  J.  A.  Sloane,  and  J.  Stufken  (Springer,  1999).) 

19.  We  can  rearrange  the  columns  so  that  the  first  row  is  0nln  . . . (n— l)*1.  Then  we 
can  renumber  the  elements  of  the  other  rows  so  that  they  begin  with  01 . . . (n— 1).  The 
elements  in  each  remaining  column  must  then  be  distinct,  in  all  rows  but  the  first. 

To  achieve  the  upper  bound  when  n = p,  let  each  column  be  indexed  by  two 
numbers  x and  y,  where  0 < x,  y < p,  and  put  the  numbers  y,  x,  ( x + y)  mod  p, 
(x  + 2 y)  mod p,  . . . , ( x + (p-l)y)  mod  p into  that  column.  For  example,  when  p = 5 we 
get  the  following  orthogonal  array,  equivalent  to  four  mutually  orthogonal  latin  squares: 

/ 0 0 0 0 0 1 1111222223333344444\ 
0123401234012340123401234 
0123412340234013401240123 
0123423401401231234034012  ' 

0123434012123404012323401 
\01234401233401 223401 12340/ 

[Essentially  the  same  idea  works  when  n is  a prime  power,  using  the  finite  field  GF(pe); 
see  E.  H.  Moore,  American  Journal  of  Mathematics  18  (1896),  264-303,  §15(1).  These 
arrays  are  equivalent  to  finite  projective  planes-,  see  Marshall  Hall,  Jr.,  Combinatorial 
Theory  (Blaisdell,  1967),  Chapters  12  and  13.] 

20.  Let  u = e27rl^n , and  suppose  ai  . . . an 2 and  b\  . . . bn 2 are  the  vectors  in  different 
rows.  Then  ai&i  + • • • + an2&n2  = E0<j,fc<n  uj+k  = 0 because  EEo  = 0. 

21.  (a)  To  show  that  equality-or-parallelism  is  an  equivalence  relation,  we  need  to 
verify  the  transitive  law:  If  L ||  M and  M ||  N and  L / IV,  then  we  must  have  L ||  N. 
Otherwise  there  would  be  a point  p with  L C\  N = {p},  by  (ii);  and  p would  lie  on  two 
different  lines  parallel  to  M,  contradicting  (iii). 

(b)  Let  {L 1, . . . , Ln}  be  a class  of  parallel  lines,  and  assume  that  M is  a line  of 
another  class.  Then  each  Lj  intersects  M in  a unique  point  pj ; and  every  point  of  M 
is  encountered  in  this  way,  because  every  point  of  the  geometry  lies  on  exactly  one  line 
of  each  class,  by  (iii).  Thus  M contains  exactly  n points. 

(c)  We’ve  already  observed  that  every  point  belongs  to  m lines  when  there  are  m 
classes.  If  lines  L,  M,  and  N belong  to  three  different  classes,  then  M and  N have  the 
same  number  of  points  as  the  number  of  lines  in  L’s  class.  So  there’s  a common  line 
size  n,  and  in  fact  the  total  number  of  points  is  n “ . (Of  course  n might  be  infinite.) 

22.  Given  an  orthogonal  array  A of  order  n and  depth  to,  define  a geometric  net  with 
n points  and  to  classes  of  parallel  lines  by  regarding  the  columns  of  A as  points;  line  j 
of  class  k is  the  set  of  columns  where  symbol  j appears  in  row  k of  A. 

All  finite  geometric  nets  with  m > 3 classes  arise  in  this  way.  But  a geometric  net 
with  only  one  class  is  trivially  a partition  of  the  points  into  disjoint  subsets.  A geometric 
net  with  m = 2 classes  has  nn'  points  ( x,x *),  where  there  are  n lines  = constant’  in 
one  class  and  n!  lines  'x  = constant’  in  the  other.  [For  further  information,  see  R.  H. 
Bruck,  Canadian  J.  Math.  3 (1951),  94-107;  Pacific  J.  Math.  13  (1963),  421-457.] 

23.  (a)  If  d(x,y ) < t and  d(x',y)  < t and  x ^ x , then  d(x,  x)  < 2 1.  Thus  a code  with 
distance  > 2 1 between  codewords  allows  the  correction  of  up  to  t errors  — at  least  in 
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principle,  although  the  computations  might  be  complex.  Conversely,  if  d(x,x')  < 2 1 
and  x / x1,  there’s  an  element  y with  d(x,y)  < t and  d(x',y)  < t;  hence  we  can’t 
reconstruct  x uniquely  when  y is  received. 

(b,c)  Let  m = r + 2,  and  observe  that  a set  of  b 2 b-ary  m-tuples  has  Hamming 
distance  > m — 1 between  all  pairs  of  elements  if  and  only  if  it  forms  the  columns  of  a 
5-ary  orthogonal  array  of  depth  m.  [See  S.  W.  Golomb  and  E.  C.  Posner,  IEEE  Trans. 
IT-10  (1964),  196-208.  The  literature  of  coding  theory  often  denotes  a code  C(b,n,r) 
of  distance  d by  the  symbol  ( n + r,bn,d)b . Thus,  a b-ary  orthogonal  array  of  depth  m 
is  essentially  an  (m,  b2,m  — 1)&  code.] 

24.  (a)  Suppose  Xj  / x'  for  1 < j < l and  Xj  = x'  for  l < j < N.  We  have  x = x 
if  l = 0.  Otherwise  consider  the  parity  bits  that  correspond  to  the  m lines  through 
point  1.  At  most  l — 1 of  those  bits  correspond  to  lines  that  touch  the  points  {2, . . . , l}. 
Hence  x'  has  at  least  m — (l—  1)  parity  changes,  and  d(x,  x')  > l + (m  — (/  — 1))  = m + 1. 

(b)  Let  lpi , . . . , lprn  be  the  index  numbers  of  the  lines  through  point  p.  After 
receiving  a message  y\  . . ■ Vn+r,  compute  xp  for  1 < p < N by  taking  the  majority 
value  of  the  m + 1 “witness  bits”  {ypo,  ■ • • , yPm},  where  yp0  = yp  and 

UPk  = (VN+Ipk  + J2{V]  I j ^ P and  point  j lies  on  line  lpk})  mod  2,  for  1 < k < m. 

This  method  works  because  each  received  bit  j/j  affects  at  most  one  of  the  witness  bits. 

For  example,  in  the  25-point  geometry  of  exercise  19,  suppose  the  parity  bit 
X26+5 i+j  of  each  codeword  corresponds  to  line  j of  row  i,  for  0 < i < 5 and  0 < 
j < 5;  thus  x26  = Xi  ® x2  © x3  © x4  ffi  x5,  x2t  = x6  ® x7  © x8  © x9  ® xio,  . . . , 
X55  = X5  ® xe  ® X12  © xis  © X24 • Given  message  y\  . . . j/55,  we  decode  bit  xi  (say)  by 
computing  the  majority  of  the  seven  bits  1/1 , y26  ©2/2  ©J/3  ©2/4  ©2/5, 2/31  ©J/e©yi  1 ©2/ie  ©2/21 , 
2/36  ffi  2/10  © 2/14  ffi  2/18  © 2/22,  2/41  © 2/9  © 2/12  © 2/20  ffi  2/23,  2/46  ffi  2/8  © 2/15  ffi  2/17  © 2/24, 
2/5i  © 2/7  © 2/13  © 2/19  © 2/25-  [Section  7.1.2  explains  how  to  calculate  majority  functions 
efficiently.  Notice  that  we  can  eliminate  the  last  10  bits  if  we  only  wish  to  correct  up  to 
two  errors,  and  the  last  20  if  single-error  correction  is  sufficient.  See  M.  Y.  Hsiao,  D.  C. 
Bossen,  and  R.  T.  Chien,  IBM  J.  Research  and  Development  14  (1970),  390-394.] 

25.  By  considering  anagrams  of  {1,  e,  a,  s,  t}  (see  exercise  5-21),  we’re  led  to  the  square 

stela 
telas 
elast  , 
laste 
astel 

and  the  cyclic  rotations  of  its  rows.  Here  telas  are  Spanish  fabrics;  elast  is  a prefix 
meaning  flexible;  and  laste  is  an  imperative  Chaucerian  verb.  (Of  course  just  about 
every  pronounceable  combination  of  five  letters  has  been  used  to  spell  or  misspell 
something  somewhere,  at  some  point  in  history.) 

26.  “every  night,  young  video  buffs  catch  rerun  fever  forty  years  after  those 
great  shows  first  aired.”  [Robert  Leighton,  GAMES  16,6  (December  1992),  34,  47.] 

27.  (0, 4, 163, 1756, 3834)  for  k = (1,  2, 3, 4,  5);  mamma  and  esses  give  a “full  house.” 

28.  Yes,  38  pairs  altogether.  The  “most  common”  solution  is  needs  (rank  180)  and 
offer  (rank  384).  Only  three  cases  differ  consistently  by  +1  (adder  beefs,  sheer 
tiffs,  sneer  toffs).  Other  memorable  examples  are  ghost  hints  and  strut  rusts. 
One  word  of  the  pair  ends  with  the  letter  s except  in  four  cases,  such  as  robed  spade . 
[See  Leonard  J.  Gordon,  Word  Ways  23  (1990),  59-61.] 
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29.  There  are  18  palindromes,  from  level  (rank  184)  to  dewed  (rank  5688).  Some  of 
the  34  mirror  pairs  are  ‘devil  lived’,  ‘knits  stink’,  ‘smart  trams’,  ‘faced  decaf’. 

30.  Among  105  such  words  in  the  SGB,  first,  below,  floor,  begin,  cells,  empty, 
and  hills  are  the  most  common;  abbey  and  pssst  are  lexicographically  first  and  last. 
(If  you  don’t  like  pssst,  the  next-to-last  is  mossy.)  Only  37  words,  from  mecca  to 
zoned,  have  their  letters  in  reverse  order;  but  they  are,  of  course,  wrong  answers. 

31.  The  middle  word  is  the  average  of  the  other  two,  so  the  extreme  words  must  be 
congruent  mod  2;  this  observation  reduces  the  number  of  dictionary  lookups  by  a factor 
of  about  32.  There  are  119  such  triples  in  W0RDS(5757),  but  only  two  in  W0RDS(2000): 
marry,  photo,  solve;  risky,  tempo,  vague.  [Word  Ways  25  (1992),  13-15.] 

32.  The  only  reasonably  common  example  seems  to  be  peopleless. 

33.  chief,  fight,  right,  which,  ouija,  jokes,  ankle,  films,  hymns,  known,  crops, 
pique,  quart,  first,  first,  study,  mauve,  vowel,  waxes,  proxy,  crazy,  pizza.  (The 
idea  is  to  find  the  most  common  word  in  which  x is  followed  by  ( x + 1)  mod  26,  for 
x = a.  (0),  x = b (1),  . . . , x = z (25).  We  also  minimize  the  intervening  distance,  thus 
preferring  bacon  to  the  more  common  word  black.  In  the  one  case  where  no  such  word 
exists,  crazy  seems  most  rational.  See  OMNI  16,8  (May  1994),  94.) 

34.  The  top  two  (and  total  number)  in  each  category  are:  pssst  and  pf fft  (2),  schwa 
and  schmo  (2),  threw  and  throw  (36),  three  and  spree  (5),  which  and  think  (709), 
there  and  these  (234),  their  and  great  (291),  whooo  and  wheee  (3),  words  and  first 
(628),  large  and  since  (376),  water  and  never  (1313),  value  and  radio  (84),  would 
and  could  (460),  house  and  voice  (101),  quiet  and  queen  (25),  queue  only  (1),  abbhh 
and  ankhs  (4),  angle  and  extra  (20),  other  and  after  (227),  agree  and  issue  (20), 
along  and  using  (124),  above  and  alone  (92),  about  and  again  (58),  adieu  and  aquae 
(2),  earth  and  eight  (16),  eagle  and  ounce  (8),  outer  and  eaten  (42),  eerie  and 
audio  (4),  (0),  ouija  and  aioli  (2),  (0),  (0);  years  and  every  are  the  most  common  of 
the  868  omitted  words.  [To  fill  the  three  holes,  Internet  usage  suggests  ooops,  ooooh, 
and  ooooo . See  P.  M.  Cohen,  Word  Ways  10  (1977),  221-223.] 

35.  Consider  the  collection  WORDS (n)  for  n = 1,2,...,  5757.  The  illustrated  trie,  rooted 
at  s,  first  becomes  possible  when  n reaches  978  (the  rank  of  stalk).  The  next  root 
letter  to  support  such  a trie  is  c,  which  acquires  enough  branching  in  its  descendants 
when  n = 2503  (the  rank  of  craze).  Subsequent  breakthroughs  occur  when  n = 2730 
(bulks),  3999  (ducky),  4230  (panty),  4459  (minis),  4709  (whooo),  4782  (lardy),  4824 
(herem),  4840  (firma),  4924  (ridgy),  5343  (taxol). 

(A  breakthrough  occurs  when  a top-level  trie  acquires  Ilorton-Strahler  number  4; 
see  exercise  7.2.1.6-124.  Amusing  sets  of  words,  suggestive  of  a new  kind  of  poetry,  arise 
also  when  the  branching  is  right-to-left  instead  of  left-to-right:  black,  slack,  crack, 
track,  click,  slick,  brick,  trick,  blank,  plank,  crank,  drank,  blink,  clink,  brink, 
drink.  In  fact,  right-to-left  branching  yields  a complete  ternary  trie  with  81  leaves: 
males,  sales,  tales,  files,  miles,  piles,  holes,  . . . , tests,  costs,  hosts,  posts.) 

36.  Denoting  the  elements  of  the  cube  by  a k for  1 < i,j,k  < 5,  the  symmetry 
condition  is  = atkj  = ajik  = a,jki  = ajbij  = cikji-  In  genera!  an  nxnxn  cube  has 
3 n2  words,  obtained  by  fixing  two  coordinates  and  letting  the  third  range  from  1 to  n; 
but  the  symmetry  condition  means  that  we  need  only  ("+1)  words.  Hence  when  n = 5 
the  number  of  necessary  words  is  reduced  from  75  to  15.  [Jeff  Grant  was  able  to  find  75 
suitable  words  in  the  Oxford  English  Dictionary;  see  Word  Ways  11  (1978),  156-157.] 

Changing  (stove,  event)  to  (store,  erect)  or  (stole,  elect)  gives  two  more. 
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37.  The  densest  part  of  the  graph,  which  we  might  call  its  “bare  core,”  contains  the 
vertices  named  bares  and  cores,  which  each  have  degree  25. 

38.  tears  — ► raise  — > aisle  — ► smile;  the  second  word  might  also  be  reals.  [Going 
from  tears  to  smile  as  in  (n)  was  one  of  Lewis  Carroll’s  first  five-letter  examples.  He 
would  have  been  delighted  to  learn  that  the  directed  rule  makes  it  more  difficult  to  go 
from  smile  to  tears,  because  four  steps  are  needed  in  that  direction.) 

39.  Always  spanning,  never  induced. 

40.  (a)  2e , (b)  2n,  one  for  each  subset  of  E or  V . 

41.  (a)  n = 1 and  n = 2;  Po  is  undefined,  (b)  n = 0 and  n = 3. 

42.  G has  65/2  edges  (hence  it  doesn’t  exist). 

43.  Yes:  The  first  three  are  isomorphic  to  Fig.  2(e).  [The  left-hand  diagram  is,  in  fact, 
identical  to  the  earliest  known  appearance  of  the  Petersen  graph  in  print:  See  A.  B. 
Kempe,  Philosophical  Transactions  177  (1886),  1 70,  especially  Fig.  13  in  §59.)  But 
the  right-hand  graph  is  definitely  different;  it  is  planar,  Hamiltonian,  and  has  girth  3. 

44.  Any  automorphism  must  take  a corner  point  into  a corner  point,  because  three 
distinct  paths  of  length  2 can  be  found  only  between  certain  pairs  of  non-corner  points. 
Therefore  the  graph  has  only  the  eight  symmetries  of  C4. 

45.  All  edges  of  this  graph  connect  vertices  of  the  same  row  or  adjacent  rows.  Therefore 
we  can  use  the  colors  0 and  2 alternately  in  even-numbered  rows,  1 and  3 alternately  in 
odd-numbered  rows.  The  neighbors  of  NV  form  a 5-cycle,  hence  four  colors  are  necessary. 

46.  (a)  Every  vertex  has  degree  > 2,  and  its  neighbors  have  a well-defined  cyclic  order 
corresponding  to  the  incoming  lines.  If  u — v and  u — w,  where  v and  w are  cyclically 
consecutive  neighbors  of  u,  we  must  have  v — w.  Thus  all  points  in  the  vicinity  of  any 
vertex  u belong  to  a unique  triangular  region. 

(b)  The  formula  holds  when  n = 3.  If  n > 3,  shrink  any  edge  to  a point;  this 
transformation  removes  one  vertex  and  three  edges.  (If  u — v shrinks,  suppose  it  was 
part  of  the  triangles  x — u — v — x and  y — u — v — y.  We  lose  vertex  v and 
edges  {x  — v,u  — v,y  — u};  all  other  edges  of  the  form  w — v become  w — u.) 

47.  A planar  diagram  would  divide  the  plane  into  regions,  with  either  4 or  6 vertices  in 
the  boundary  of  each  region  (because  AY.3  has  no  odd  cycles).  If  there  are  /4  and  /6  of 
each  kind,  we  must  have  4/4  + 6 /6  = 18,  since  there  are  9 edges;  hence  (/4,  /6)  = (3, 1) 
or  (0, 3).  We  could  also  triangulate  the  graph  by  adding  /4  + 3/6  more  edges;  but  then 
it  would  have  at  least  15  edges,  contradicting  exercise  46. 

[The  fact  that  AY, 3 is  nonplanar  goes  back  to  a puzzle  about  connecting  three 
houses  to  three  utilities  (water,  gas,  and  electricity),  without  crossing  pipes.  Its  origin 
is  unknown;  H.  E.  Dudeney  called  it  “ancient”  in  Strand  46  (1913),  110.) 

48.  If  u,  v,  w are  vertices  and  u — v,  we  must  have  d(w,u ) ^ d(w,v)  (modulo  2); 
otherwise  shortest  paths  from  w to  u and  from  w to  v would  yield  an  odd  cycle.  After 
w is  colored  0,  the  procedure  therefore  assigns  the  color  d(w,  v ) mod  2 to  each  new 
uncolored  vertex  v that  is  adjacent  to  a colored  vertex  u\  and  every  vertex  v with 
d(w,v)  < 00  is  colored  before  a new  w is  chosen. 

49.  There  are  only  three:  A4,  A3>3,  and  fT/T  (which  is  C^). 

50.  The  graph  must  be  connected,  because  the  number  of  3-colorings  is  divisible  by 
3r  when  there  are  r components.  It  must  also  be  contained  in  a complete  bipartite 
graph  A'm,n,  which  can  be  3-colored  in  3(2m  + 2"  — 2)  ways.  Deleting  edges  from 
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Km,n  does  not  decrease  the  number  of  colorings;  hence  2m  + 2n  - 2 < 8,  and  we  have 
{m,n}  = {1, 1},  {1,2},  {1,3},  or  {2,2}.  So  the  only  possibilities  are  the  claw  A"13  and 
the  path  P4. 

51.  A 4-cycle  pi  — L\  — p2  — L2  — pi  would  correspond  to  two  distinct  lines 
{Li,L2}  with  two  common  points  {pi,p2},  contradicting  (ii).  So  the  girth  is  at  least  6. 

If  there  s only  one  class  of  parallel  lines,  the  girth  is  oo;  if  there  are  two  classes, 
with  n<n  members,  it  is  8,  or  oo  if  n = 1.  (See  answer  22.)  Otherwise  we  can  find  a 
6-cycle  by  making  a triangle  from  three  lines  that  are  chosen  from  different  classes. 

52.  If  the  diameter  is  d and  the  girth  is  g,  then  d > [g/ 2J,  unless  g = oo. 

53.  happy  (which  is  connected  to  tears  and  sweat,  but  not  to  world). 

54.  (a)  It’s  a single,  highly  connected  component.  (Incidentally,  this  graph  is  the 
line  graph  of  the  bipartite  graph  in  which  one  part  corresponds  to  the  initial  letters 
{A,  C,  D,  F,  G, . . . , W}  and  the  other  to  the  final  letters  {A,  C,D,E,H, . . . , Z}.) 

(b)  Vertex  WY  is  isolated.  The  other  vertices  with  in-degree  zero,  namely  FL,  GA, 
PA,  UT,  WA.  WI,  and  WV,  form  strong  components  by  themselves;  they  all  precede  a giant 
strong  component,  which  is  followed  by  each  of  the  remaining  single- vertex  strong 
components  with  out-degree  zero:  AZ,  DE,  KY,  ME,  NE,  NH,  NJ,  NY,  OH,  TX. 

(c)  Now  the  strong  component  {GU}  precedes  {UT};  NH,  OH,  PA,  WA,  WI,  and  WV  join 
the  giant  strong  component;  {FM}  precedes  it;  {AE}  and  {WY}  follow  it. 

55‘  (^)  “ C21) C?),  where  N = m + • ■ • + nk. 

56.  True.  Note  that  Jn  is  simple,  but  it  doesn’t  correspond  to  any  multigraph. 

57.  False,  in  the  connected  digraph  u — > w < — v.  (But  u and  v are  in  the  same  strongly 
connected  component  if  and  only  if  d(u,v)  < 00  and  d(v,u)  < 00;  see  Section  2.3. 4.2.) 

58.  Each  component  is  a cycle  whose  order  is  at  least  (a)  3 (b)  1. 

59.  (a)  By  induction  on  n,  we  can  use  straight  insertion  sorting:  Suppose  v\  — > ■ • > 

Vn-i-  Then  either  vn  tru  or  vn-\  — >vn  or  Vk-i  — >v„  — tujt,  where  k is  minimum 
such  that  vn  — >Vk ■ [L.  Redei,  Acta  litterarum  a c scientiarum  7 (Szeged,  1934),  39-43.] 

(b)  15:  01234,  02341,  02413,  and  their  cyclic  shifts.  [The  number  of  such  oriented 
paths  is  always  odd;  see  T.  Szele,  Matematikai  es  Fizikai  Lapok  50  (1943),  223-256.] 

(c)  Yes.  (By  induction:  If  there’s  only  one  place  to  insert  vn  as  in  part  (a),  the 
tournament  is  transitive.) 

60.  Let  A = {x  | u—>x),  B = {x  \ x—+v},  C = {x  | »-+x}.  If  v <£  A and  4nB  = 0 
we  have  |A|  + |£?|  = |AU  J3|  < n — 2,  because  u £ AU B and  v £ AUB.  But  |J3|  + |C|  = 
n - 1;  hence  |A|  < |Cj.  [H.  G.  Landau,  Bull.  Math.  Biophysics  15  (1953),  148.] 

1 1 2,  2 ^2;  then  A = (g[)  and  Ak  = (J^)  for  all  integers  k. 

62.  (a)  Suppose  the  vertices  are  {1, ... , n }.  Each  of  the  n!  terms  aipi  . . . anPn  in  the 
expansion  of  the  permanent  is  the  number  of  spanning  permutation  digraphs  that  have 
arcs  j > pj . (b)  A similar  argument  shows  that  det  A is  the  number  of  even  spanning 
permutation  digraphs  minus  the  number  of  odd  ones.  [See  F.  Harary,  SIAM  Review  4 
(1962),  202—210,  where  permutation  digraphs  are  called  “linear  subgraphs.”] 

63.  Let  v be  any  vertex.  If  g = 2t+l,  at  least  d{d- l)fc~1  vertices  x satisfy  d(v,x)  = k, 
for  1 < k < t.  If  g = 2t  + 2 and  v'  is  any  neighbor  of  v,  there  also  are  at  least  (d  - 1)* 
vertices  x for  which  d(v,  x)  = t + 1 and  d(v' , x)  = t. 

64.  To  achieve  the  lower  bound  in  answer  63,  every  vertex  v must  have  degree  d.  and 
the  d neighbors  of  v must  all  be  adjacent  to  the  remaining  d-  1 vertices.  This  graph 
is,  in  fact,  Kdtd. 
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65.  (a)  By  answer  63,  G must  be  regular  of  degree  d,  and  there  must  be  exactly  one 
path  of  length  < 2 between  any  two  distinct  vertices. 

(b)  We  may  take  Ai  = d , with  x\  = (1 . . . I)7  . All  other  eigenvectors  satisfy 


Jxj  = (0  . . . 0)J 


hence  A2 


-f-  A j — d - 


(c)  If  A2  = • • • = A m = (— l + -\/4d- 
nust  have  m — 1 

(d)  If  4d  - 3 = s- 


1 for  1 < j < N. 


-3)/2  and  Am+i  — • • • = Ajv  = ( — 1 — \/4d— 3)/2, 

we  must  have  m - 1 = N - m.  With  this  value  we  find  Ai  H 1-  A n = d - d2/ 2. 

s2  and  m is  as  in  (c),  the  eigenvalues  sum  to 

"2  1 * " 1 ' (s2  + 3)2 


+ 3 , , s - 1 

— + (m- 1)  — 


16 


+ 1 


— m ^ 


s+1 


which  is  15/32  plus  a multiple  of  s.  Hence  s must  be  a divisor  of  15. 

[These  results  are  due  to  A.  J.  Hoffman  and  R.  R.  Singleton,  IBM  J.  Research  and 
Development  4 (1960),  497  504,  who  also  proved  that  the  graph  for  d = 7 is  unique.] 

66.  Denote  the  50  vertices  by  [a,  6]  and  (a,  6)  for  0 < a,  6 < 5,  and  define  three  kinds 
of  edges,  using  arithmetic  mod  5: 


[a,  6]  — [a + 1,6];  (a,  6)  — (a + 2, 6);  (a,  6) — [a  + 6c,  c]  for  0 < a,  6,  c < 5. 

[See  W.  G.  Brown,  Canadian  J.  Math.  19  (1967),  644-648;  J.  London  Math.  Soc.  42 
( 1967),  514-520.  Without  the  edges  of  the  first  two  kinds,  the  graph  has  girth  6 and  cor- 
responds to  a geometric  net  as  in  exercise  51,  using  the  orthogonal  array  in  answer  19.] 

67.  Certain  possibilities  have  been  ruled  out  by  Michael  Aschbacher  in  Journal  of 
Algebra  19  (1971),  538-540. 

68.  If  G has  s automorphisms,  it  has  n\/s  adjacency  matrices,  because  there  are  s 
permutation  matrices  P such  that  P~AP  = A. 

69.  First  set  IDEG(u)  «-  0 for  all  vertices  v.  Then  perform  (31)  for  all  v,  also  setting 
u t—  TIP  (a)  and  IDEG(u)  IDEG(u)  + 1 in  the  second  line  of  that  mini-algorithm. 

To  do  something  “for  all  v"  using  the  SGB  format,  first  set  v <-  VERTICES (<7) ; 
then  while  v < VERTICES  (3)  + N (<7) , do  the  operation  and  set  v <—  v + 1. 

70.  Step  B1  is  performed  once  (but  it  takes  O(n)  units  of  time).  Steps  (B2,  B3,  . . . , 
B8)  are  performed  respectively  (n  + 1,  n,  n,m  + n,  m,  m,  n)  times,  each  with  0(1)  cost. 

71.  Many  choices  are  possible.  Here  we  use  32-bit  pointers,  all  relative  to  a symbolic 
address  Pool,  which  lies  in  the  Data_Segment.  The  following  declarations  provide  one 
way  to  establish  conventions  for  dealing  with  basic  SGB  data  structures. 

VSIZE  IS  32  ; ASIZE  IS  24  Node  sizes 

ARCS  IS  0 ; COLOR  IS  8 ;LINK  IS  12  Offsets  of  vertex  fields 

TIP  IS  0 ;NEXT  IS  4 Offsets  of  arc  fields 


arcs  GREG  Pool+ARCS  ; color  GREG  Pool+COLOR  ;link  GREG  Pool+LINK 
tip  GREG  Pool+TIP  :next  GREG  Pool+NEXT 


u GREG 

; v GREG 

; w GREG  ; s G. 

AlgB 

BZ 

n, Success 

MUL 

$0,n, VSIZE 

ADDU 

v,v0,$0 

SET 

w,v0 

1H 

STT 

mone , color  ,w 

ADDU 

w.w, VSIZE 

CMP 

> 

t* 

O 

PBNZ 

$0,  IB 

;a  GREG  ;mone  GREG  -1 

Exit  if  the  graph  is  null. 
Bl.  Initialize. 
v <—  v0  + n. 
w <—  Vo. 

COLOR(ui)  «■ 1. 

w •<—  w + 1. 

Repeat  until  w = v. 


2 .mv 
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OH 

SUBU 

w,w, VSIZE 

3H 

LDT 

$0, color, w 

PBNN 

$0,2F 

STC0 

0,link,w 

SET 

s,w 

4H 

SET 

u,  s 

LDTU 

s , link, s 

LDT 

$1, color, u 

NEG 

$1,1, $1 

LDTU 

a, arcs ,u 

5H 

BZ 

a,8F 

5H 

LDTU 

v,tip,a 

6H 

LDT 

$0, color, v 

CMP 

$2,$0,$1 

PBZ 

$2,7F 

BNN 

$0 .Failure 

STT 

$1 , color , v 

STTU 

s , link, v 

SET 

s,v 

7H 

LDTU 

a, next , a 

PBNZ 

a,5B 

8H 

PBNZ 

s ,4B 

2H 

CMP 

$0,w, vO 

PBNZ 

$0,0B 

Success 

L0C 

a 

w <—  w — 1. 

B3.  Color  w if  necessary. 

To  B2  if  COLOR(iu)  > 0. 

COLOR(ui)  <-  0,  LINK ('io ) <-  A. 

S i — W . 

B4.  Stack  =>  u.  Set  u <—  s. 
s <-  LINK(s). 

$1 <-  1 - COLOR(u). 
d i — ARCS  ( u ) . 

B5.  Done  with  u?  To  B8  if  a = A. 
v «-  TIP(a). 

B6.  Process  v. 

(Here  the  program  is  slightly  clever) 
To  B7  if  COLORS)  = 1 - COLOR(u) . 
Fail  if  COLOR(u)  = CDLOR(u). 

COLOR(w)  <-  1 - COLOR (u) . 

LINK(r)  «-  s. 

S <r-  V. 

B7.  Loop  on  a.  Set  a <—  NEXT  (a) . 

To  B5  if  a / A. 

B8.  Stack  nonempty?  To  B4  if  s / A. 
B2.  Done? 

If  w / v0,  decrease  w and  go  to  B3. 
(Successful  termination)  | 


72.  (a)  This  condition  clearly  remains  invariant  as  vertices  enter  or  leave  the  stack. 

(b)  Vertex  v has  been  colored  but  not  yet  explored,  because  the  neighbors  of  every 
explored  vertex  have  the  proper  color. 

(c)  Just  before  setting  s <-  « in  step  B6,  set  PARENT (u)  <-  u,  where  PARENT  is 
a new  utility  field.  And  just  before  terminating  unsuccessfully  in  that  step,  do  the 
following:  “Repeatedly  output  NAME(u)  and  set  u <-  PARENT («) , until  u = PARENT! y) ; 
then  output  NAME(u)  and  NAME(u) .” 


73.  Kw.  (And  random.graph(10, 100, 0, 1, 1, 0, 0, 0, 0, 0)  is  J10.) 

74.  badness  has  out-degree  22;  no  other  vertices  have  out-degree  > 20. 


75.  Let  the  parameters  (ni , ri2,  ri3,  ri4,p,  w,  o)  be  respectively  (a)  (n,  0, 0, 0,  — 1,  0, 0); 
(b)  (n,  0,0, 0,1, 0,0);  (c)  (n,  0, 0, 0, 1, 1,  0);  (d)  (n,  0, 0, 0, -1, 0, 1);  (e)  (n,  0,0, 0, 1, 0, 1); 
(f)  (n,  0,0,0, 1,1,1);  (g)  (m,n,  0,0, 1,0,0);  (h)  (m,  n,  0,  0, 1,  2, 0);  (i)  (m,  n,  0, 0, 1, 3, 0); 
(j)  (m,n,  0, 0,-1, 0, 0);  (k)  (■ m,n , 0, 0, 1, 3, 1);  (1)  (n,  0,  0, 0,  2,  0, 0);  (m)  (2 ,-n,  0, 0, 1, 0. 0). 


76.  Yes,  for  example  from  Ci  and  C2  in  answer  75(c).  (But  no  self-loops  can  occur 
when  p < 0,  because  arcs  x — >y  = x + kS  are  generated  for  k = 1,  2,  . . . until  y is  out 
of  range  or  y = x.) 


77.  Suppose  x and  y are  vertices  with  d(x,y)  > 2.  Thus  x-/-y;  and  if  v is  any  other 
vertex  we  jnust  have  either  v -j—  x or  v -f-  y.  These  facts  yield  a path  of  length  at 
most  3 in  G between  any  two  vertices  u and  v. 

78.  (a)  The  number  of  edges,  Q)/2,  must  be  an  integer.  The  smallest  examples  are 
K0,  Ki,  Pi,  Cg,  and  W. 
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(b)  If  q is  any  odd  number,  we  have  u — v if  and  only  if  pq  (u)  -j-  pq  (v) . Therefore 
ipq  cannot  have  two  fixed  points,  nor  can  it  contain  a 2-cycle. 

(c)  Such  a permutation  of  V also  defines  a permutation  p of  the  edges  of  Kn, 

taking  {u,u}  i->  <^({u,u})  = {( p(u),p(v)},  and  it’s  easy  to  see  that  the  cycle  lengths 

of  p are  all  even.  If  p has  t cycles,  we  obtain  2*  self-complementary  graphs  by  painting 
the  edges  of  each  cycle  with  alternating  colors. 

(d)  In  this  case  p has  a unique  fixed  point  v,  and  G'  = G\v  is  self-complementary. 
Suppose  p has  r cycles  in  addition  to  (u);  then  p has  r cycles  involving  the  edges  that 
touch  vertex  v,  and  there  are  2r  ways  to  extend  G'  to  a graph  G. 

[- References : H.  Sachs,  Publicationes  Mathematical  9 (Debrecen,  1962),  270-288; 
G.  Ringel,  Archiv  der  Mathematik  14  (1963),  354-358.] 

79.  Solution  1,  by  H.  Sachs,  with  <^>=(12...  4k):  Let  u — v when  u,  > v > 0 and 
u + v mod  4 < 1;  also  0 — v when  v mod  2 = 0. 

Solution  2,  with  p = (ai  b\  ci  d\) . . . ( ak  bk  Ck  dk),  where  aj  = 4 j — 3,  bj  = 4 j — 2, 

Cj  = 4 j - 1,  and  d3  = 4 j:  Let  0 — b3  — aj  — Cj  — dj  — 0 for  1 < j < k,  and 

ai  °j  — h — dj  — a — Cj  — di  — bj  — at , for  1 < i < j < k. 

80.  (Solution  by  G.  Ringel.)  Let  p be  as  in  answer  79,  solution  2.  Let  E0  be  the  3 k 
edges  bj  aj  Cj  dj  for  1 < j < k;  let  E\  be  the  8 (£)  edges  between  {a, , bi , c, , di } 
and  {bj,dj}  for  1 < i < j < k\  let  E2  be  the  8(j)  edges  between  {a,,  bi,  Ci,  di}  and 
{dj,Cj}  for  1 < i < j < k.  In  case  (a),  E0  U E\  gives  diameter  2;  E0  U E2  gives  diam- 
eter 3.  Case  (b)  is  similar,  but  we  add  2k  edges  bj  — 0 — dj  to  Ex,  a j — 0 — Cj  to  E2 . 

81.  C3,  K3,  D = o — and  DT  = <x — c^zio.  (The  converse  DT  of  a digraph  D 
is  obtained  by  reversing  the  direction  of  its  arcs.  There  are  16  nonisomorphic  simple 
digraphs  of  order  3 without  loops,  10  of  which  are  self-converse,  including  C3  and  K3.) 

82.  (a)  True,  by  definition,  (b)  True:  If  every  vertex  has  d neighbors,  every  edge 
u v has  d—  1 neighbors  u — w and  d—  1 neighbors  w — v.  (c)  True:  {ai,bj}  has 
rn  + n — 2 neighbors,  for  0 < i < m and  0 < j < n.  (d)  False:  L ( R 1 . 1 , 2 ) has  5 vertices 
and  8 edges,  (e)  True,  (f)  True:  The  only  nonadjacent  edges  are  {0,1}  -j-  {2,3}, 
{0,2}  —j—  {1,3},  {0,3}  -f-  {1,2}.  (g)  True,  for  all  n > 0.  (h)  False,  unless  G has  no 
isolated  vertices. 

83.  It  is  the  Petersen  graph.  [A.  Kowalewski,  Si tz u ngs beri elite  der  Akademie  der 
Wissenschaften  in  Wien,  Mathematisch-Nat.  Klasse,  Abteilung  Ha,  126  (1917),  67-90.] 

84.  Yes:  Let  p({au,bv})  = {a(u+u)  mod  3 ,b(u-v)  mod  3}  for  0 < u,  v < 3. 

85.  Let  the  vertex  degrees  be  {dx, . . .,dn}.  Then  G has  i(di  + •••  + <£„)  edges,  and 
L(G)  has  2 (di(dx  — 1)  + • • • + dn(dn  — 1)).  Thus  G and  L(G)  both  have  exactly  n edges 
if  and  only  if  ( dx  — 2)2  + • • • + ( dn  — 2) 2 = 0.  Consequently  exercise  58  gives  the  answer. 
[See  V.  V.  Menon,  Canadian  Math.  Bull.  8 (1965),  7-15.] 

86.  If  G = then  G = = L(G). 

87.  (a)  Yes,  easily.  [In  fact,  R.  L.  Brooks  has  proved  that  every  connected  graph  with 
maximum  vertex  degree  d > 2 is  d-colorable,  except  for  the  complete  graph  Kd+ 1;  see 
Proc.  Cambridge  Phil.  Soc.  37  (1941),  194-197.] 

(b)  No.  There’s  essentially  only  one  way  to  3-color  the  edges  of  the  outer  5-cycle 
in  Fig.  2(e);  this  forces  a conflict  on  the  inner  5-cycle.  [Petersen  proved  this  in  1898.] 

88.  One  cycle  doesn’t  use  the  center  vertex,  and  there  are  (n  - l)(n  - 2)  cycles  that  do 
(namely,  one  for  every  ordered  pair  of  distinct  vertices  on  the  rim).  We  don’t  count  C0. 
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(A  O ON  (AJ  J\  (A  J J\  (AO  0\ 

89.  Both  sides  equal  , , , , respectively. 

\OOCj  \J  J Cj  \OOCj  \JJCJ 

90.  K4  and  K4\  ATi,i,2  and  ^1,1,2;  A'2j2  = C4  and  772,2;  Ah, 3 and  Ah, 3;  Ah  © Ah, 2 
and  its  complement;  all  graphs  Ka  are  cographs  by  (47).  Missing  is  P4  = T\.  (All 
connected  subgraphs  of  a cograph  have  diameter  < 2;  Ws  is  a cograph,  but  not  W6.) 

91-  (a)  m ; (b)  X;  (c)  X;  (dj_D;  (e)  X;  (f)  I I;  (g)  X-  (In  general  we 
have  K2  a H = (Ah  □ H)  U (Ah  <S>  H ) , and  K2  o H = H — H . Thus  the  coincidences 
A'2  a H = Ah  □ H and  Ah  o H = K2mH  occur  if  and  only  if  H is  a complete  graph.) 

Mnemonics:  Our  notations  GaH  and  G 0 H nicely  match  diagrams  (a)  and  (c), 
as  suggested  by  J.  Nesetril,  Lecture  Notes  in  Comp.  Sci.  118  (1981),  94-102.  His 
analogous  recommendation  to  write  G x H for  (b)  is  also  tempting;  but  it  wasn’t 
adopted  here,  because  hundreds  of  authors  have  used  G x H to  denote  GaH. 

92-  (»)□;  (b)X;  (d)gj;  (e)M- 

93.  Km  El  Kn  = Km  O Kn  =*  Kmn  ■ 

94.  No;  they’re  induced  subgraphs  of  K26  □ Ah 6 □ K26  □ K2&  □ A'26  • 

95.  (a)  du  + dv.  (b)  dudv.  (c)  dudv  © du  © dv . (d)  du(n  dv)®(m — du)dv.  (e)  dun®dv. 

96.  (a)  AuB  = A&I+I&B.  (b)  AmB  = AaB+A®B.  (c)  AaB  = A®J+J®B-2A®B. 
(d)  AoB  = A®J  + I®B.  (Formulas  (a),  (b),  and  (d)  define  graph  products  of  arbitrary 
digraphs  and  multigraphs.  Formula  (c)  is  valid  in  general  for  simple  digraphs;  but 
negative  entries  can  occur  when  A and  B contain  values  >1.) 

Historical  notes:  The  direct  product  of  matrices  is  often  called  the  Kronecker 
product,  because  K.  Hensel  [Crelle  105  (1889),  329-344]  said  he  had  heard  it  in  Kro- 
necker’s  lectures;  however,  Kronecker  never  actually  published  anything  about  it.  Its 
first  known  appearance  was  in  a paper  by  J.  G.  Zehfuss  [Zeitschrift  fiir  Math,  und  Physik 
3 (1858),  298-301],  who  proved  that  det(A®B)  = (det  A)n(det  B)m  when  m = m'  and 
n = n . The  basic  formulas  (A®B)T  = AT  ®BT,  (A  ® B)(A'  0 B')  = AA'  ®BB',  and 
(A  ® B)_1  = A-1  ® B_1  are  due  to  A.  Hurwitz  [Math.  Annalen  45  (1894),  381-404]. 

97.  Operations  on  adjacency  matrices  prove  that  (G  © G')  □ H = (GaH)®  (G'  a H)\ 
(G®G’)®H  = (GmH)®  (G’  ei  H)\  (G  ® G')  o H = (G  o H)  ® (G'  o H).  Since 
GaH  = HaG,G®H  = H®G,  and  G El  H = H El  G,  we  also  have  right-distributive 
laws  G □ (H  © H')  =*  (G  □ H)  © (G  □ H ');  G ® (H  © H')  “ (G  ® H)  © (G  ® H')- 
Gm(H  © H')  = (GmH)  © (GeiH').  The  lexicographic  product  satisfies  G o H = Go  H; 

also  Km°H  = H II.  hence  K,„  o K „ = Kn n.  Furthermore  G o K„  = Gm  A'„ : 

Km  ® Kn  = Km  D A n — L(Km,n)  • 

98.  There  are  kl  components  (because  of  the  distributive  laws  in  the  previous  exercise, 
and  the  facts  that  GaH  and  G^H  are  connected  when  G and  H are  connected). 

99.  Every  path  from  (u,v)  to  (u',v')  in  GaH  must  use  at  least  dG(u,u')  “G- steps” 
and  at  least  dH(v,v')  UH- steps”;  and  that  minimum  is  achievable.  Similar  reasoning 
shows  that  doHif((w,  v),  (u1  ,v'))  = ma x(dG(u,  u'),dH(v,  v')). 

100.  If  G and  H are  connected,  and  if  each  of  them  has  at  least  two  vertices,  G 0 H 
is  disconnected  if  and  only  if  G and  H are  bipartite.  The  “if”  part  is  easy;  conversely, 
if  there’s  an  odd  cycle  in  G,  we  can  get  from  (it,  v)  to  (u',i/)  as  follows:  First  go  to 
(it",  n'),  where  u"  is  any  vertex  of  G that  happens  to  be  expedient.  Then  walk  an  even 
number  of  steps  in  G from  it"  to  u . while  alternating  in  H between  v'  and  one  of  its 
neighbors.  [P.  M.  Weichsel,  Proc.  Amer.  Math.  Soc.  13  (1962),  47-52.] 
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101.  Choose  vertices  u and  v with  maximum  degree.  Then  du  + dv  = dudv  by 
exercise  95;  so  either  G = H = Ku  or  du  = dv  = 2.  In  the  latter  case,  G = Pm 
or  Cm,  and  H = Pn  or  Cn.  But  Ga  H is  connected,  so  G or  H must  be  nonbipartite, 
say  G.  Then  G □ H is  nonbipartite,  so  H must  also  be  nonbipartite;  thus  G = Cm 
and  H = Cn,  with  m and  n both  odd.  The  shortest  cycle  in  Cm  □ Cn  has  length 
min (m,  n);  in  Cm  <8>  Cn  it  has  length  ma x(m,n);  hence  m = n.  Conversely,  if  n > 3 
is  odd,  we  have  Cn  □ Cn  = Cn  ® Cn,  under  the  isomorphism  that  takes  (u,v)  n- 

((“  + v)  mod  n>  (u  - v)  mod  n)  for  0 < u,  v < n.  [D.  J.  Miller,  Canadian  J.  Math  20 
(1968),  1511-1521.] 

102.  Pm  R Pn.  (It  is  planar  only  when  min(m,  n)  < 2 or  m = n = 3.) 
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104.  Edges  must  be  created  in  a somewhat  circuitous  order,  to  maintain  the  tableau 
shape.  Variables  i and  r delimit  the  available  rows  in  column  t.  For  example,  the 

second  part  of  exercise  103  begins  with  i <—  1,  t 4-  8,  r «—  1;  then  9 1,  i <—  2 t <—  6 

r*~  3;  then  9 — 3,  9 — 2,  i <-  4,  t <-  4,  r <-  8;  then  9 — 8. 

105.  Notice  that  dk  C k if  and  only  if  Ck  A k.  When  <7;.  j>  k we  have 

Cl  H + ck  = fc2+min(Mfc+1)  + min(Mk+2)  + hmin(fc,dn); 

therefore  the  condition  di  + • • • + dk  < C\  + • • • + ck  — k is  equivalent  to 

di  + • ■ ■ + dk  < f(k),  where  f(k)  = k(k-l)  + min(fc, dk+1)  + ■ • • + min(fc, dn).  (*) 

K k>s  we  have  f(k  + 1)  - f(k)  = 2 k-  dk+1  > dk+1-  hence  (*)  holds  for  1 < k < n 
if  and  only  if  it  holds  for  1 < k < s.  Condition  (*)  was  discovered  by  P.  Erdos  and 
T.  Gallai  [Matematikai  Lapok  11  (1960),  264-274],  It  is  obviously  necessary,  if  we 
consider  the  edges  between  {1, . . . , k}  and  {fc+1, . . . , n}. 

Let  ak  = di  H f dk  - c\  - ■ • • — ck  + k,  and  suppose  that  we  reach  ak  > 0 in 

step  H2  for  some  k < s.  Let  Aj,  Cj,  Dj,  N,  and  S be  the  numbers  that  correspond  to 

ai , cl>  dj,  n,  and  s before  steps  H3  and  H4;  thus  N = n + 1,  Dj  = dj  + (0  or  1),  etc. 
We  want  to  prove  that  AK  > 0 for  some  K < S. 

Steps  H3  and  H4  have  removed  row  N and  the  bottommost  remaining  q cells 
in  column  t,  for  some  t > S and  q > 0,  together  with  the  rightmost  cells  in  rows  1 

through  p.  If  p > 0 we  have  Ct+1  = p.  Let  r = DN  = p + q,  u = Ct,  and  v = ct.  Notice 

that  Dj  = t for  p < j < u.  and  Cj  = N for  1 < j < r;  also  Aj  = aj  for  1 < j < p. 

If  k is  minimal  we  have  1 < ak  < dk  - ck  + 1,  hence  ck  < dk.  If  Dk  > t then 

k < p and  Ak  = ak.  If  Dk  < t it  follows  that  Ak  = ak  + r - min(fc,r)  > ak,  because 
k < Dk.  Thus  we  may  assume  that  Dk  = t. 

Suppose  t > 5;  hence  u < S.  For  k < j < u we  have  dj  > Dj-l=t-l> 
dk  — 1 > ck  — 1 > Cj  — 1.  Thus  au  > ak  > 0.  But  Au  = au,  because  r < u < S < t. 
We  may  therefore  assume  that  t = S.  Suppose  k < t;  then  ck  = dk  = t,  because 
S — ck  < dk  < t.  But  r = t leads  to  ck  = N — 1 and  a contradiction;  and  r < t leads 
to  u = t,  from  which  it  follows  that  At  > At = at- 1 - 1 > 0. 
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(Deep  breath.)  OK;  we’ve  reduced  the  problem  to  cases  with  k — t = S.  Hence 
t = s < ct  < dt  < Dt  = t,  and  we  have  at  = at_ 1 + 1.  Consequently  at-i  = 0. 

In  fact  we  can  show  by  induction  on  t — j that  aj  = 0 for  p < j < t:  If  a^+i  = 0 
then  0 < aj  = Cj+i  — t — 1 > g — 1 > 0,  because  Cj+i  > t + q when  p < j < t — 1. 

If  p < t — 1 , this  argument  proves  that  q = 1 and  cr  = N — 1 = f + 1 . We  conclude 
that,  regardless  of  p,  we  must  have  q = l,  N = t + 2,  Dj  = t + 1 for  1 <3  <P,Dj  = t 
for  p < j < t + 1,  and  Dn  = p + 1.  Algorithm  H does  actually  change  this  “good” 
sequence  into  a “bad”  one;  but  Di  + ■ ■ ■ + Dn  = 2p  + t(t  + 1)  + 1 is  odd. 

106.  False  in  the  trivial  cases  when  d < 1 and  n > d + 2.  Otherwise  true:  In  fact,  the 
first  n — 1 edges  generated  in  step  H4  contain  no  cycles,  so  they  form  a spanning  tree. 

107.  The  permutation  p of  exercise  78  takes  a vertex  of  degree  d into  a vertex  of  degree 
n—\  — d.  And  ip2  is  an  automorphism  that  pairs  up  two  vertices  of  equal  degree,  except 
for  a possible  fixed  point  of  degree  (n  — l)/2. 

(Conversely,  a somewhat  intricate  extension  of  Algorithm  H will  construct  a self- 
complementary graph  from  every  graphical  sequence  that  satisfies  these  conditions, 
provided  that  d(n-i)/2  = (n  — l)/2  when  n is  odd.  See  C.  R.  J.  Clapham  and  D.  J. 
Kleitman,  J.  Combinatorial  Theory  B20  (1976),  67-74.) 

108.  We  may  assume  that  d±  > •••  > the  in-degrees  d need  not  be  in  any 

particular  order.  Apply  Algorithm  H to  the  sequence  dx  . . ,dn  = . . . d„ , but  with 

the  following  changes:  Step  H2  becomes  “[Done?]  Terminate  successfully  if  di  = n = 0: 
terminate  unsuccessfully  if  d\  > n.”  In  step  H3,  change  “j  <—  dn”  to  “j  <—  d~ ,”  and 
terminate  unsuccessfully  if  j > ci . In  step  H4,  omit  “cj  <—  Cj  — 1” ; change  “n  — m” 
to  “m — yn and  set  n t—  n — 1 just  before  returning  to  H2.  An  argument  like  Lemma 
M and  Corollary  H justifies  this  approach. 

(Exercise  7.2.1.4-57  proves  that  such  digraphs  exist  if  and  only  if  d + • • • + d„  = 
+ ■ ■ ■ + d„  and  . . .d~  = {d[, . . . , d'n},  where  d\  > • • • > d'n  and  d[  . . . d'n  is 
majorized  by  the  conjugate  partition  cx  . . ,cn  = ( d + . . . d„)T . The  variant  where  loops 
v — >v  are  forbidden  is  harder;  see  D.  R.  Fulkerson,  Pacific  J.  Math.  10  (1960),  831-836.) 

109.  It’s  the  same  as  exercise  108,  if  we  put  d£  = <4  [k  < m]  and  d(T  = <4  [k  > m]. 

110.  There  are  p vertices  of  degree  d — di  and  q vertices  of  degree  d—  1,  where  p+q  = n. 
Case  1,  d = 2k  + 1.  Make  u — v whenever  ( u — v ) mod n € (2, 3 ,k  + 1, 

n — k — l,...,n  — 3,n  — 2};  also  add  the  p/2  edges  1 — 2,  3 — 4,  . . . , (p — 1 ) — p. 

Case  2,  d = 2k.  Make  u — v whenever  ( u — v)  mod  n € {2, 3, . . . , k,  n — k, . . . , 
n — 3,  n — 2);  also  add  the  edges  1 — 2,  . . . , (q—  1)  — q,  as  well  as  the  path  or  cycle 
(q  = 0?  n:  q ) — (?+l)  — • • ■ — (n— 1)  — n.  [D.  L.  Wang  and  D.  J.  Kleitman,  in 
Networks  3 (1973),  225-239,  have  proved  that  such  graphs  are  highly  connected.] 

111.  Suppose  N = n + n'  and  V'  = {n  + 1, . . . , N}.  We  want  to  construct  e*,  = d — dk 
edges  between  k and  V',  and  additional  edges  within  V',  so  that  each  vertex  of  V‘  has 
degree  d.  Let  s = e i + ■ • ■ + en.  This  task  is  possible  only  if  (i)  n'  > max(ei, . . . , e„); 

(ii)  n'd  > s;  (iii)  n'd  < s + n'(n'  — 1);  and  (iv)  (n  + n')d  is  even. 

Such  edges  do  exist  whenever  n satisfies  (i)-(iv):  First,  s suitable  edges  be- 
tween V and  V'  can  be  created  by  cyclically  choosing  endpoints  (n+1,  n+2, . . . , n+n' , 
n+1,...),  because  of  (i).  This  process  assigns  either  \_s/n'\  or  [s/n']  edges  to  each 
vertex  of  V';  we  have  [s/n']  < d by  (ii),  and  d — [s/n'J  < n'  — 1 by  (iii).  Therefore 
the  additional  edges  needed  inside  V'  are  constructible  by  exercise  110  and  (iv). 

The  choice  n'  = n always  works.  Conversely,  if  G = Kn{V)  \ {1  — 2},  condition 

(iii)  requires  n > n when  n > 4.  [P.  Erdos  and  P.  Kelly,  AMM  70  (1963),  1074-1075.] 


7 


ANSWERS  TO  EXERCISES  529 


112.  The  uniquely  best  triangle  in  the  miles  data  is 

748  746  748 

Saint  Louis,  MO  Toronto,  ON  Winston-Salem,  NC  Saint  Louis,  MO. 

113.  By  Murphy’s  Law,  it  has  n rows  and  m columns;  so  it’s  n x m,  not  m x n. 

114.  A loop  in  a multigraph  is  an  edge  {a,  a}  with  repeated  vertices,  and  a multigraph 
is  a 2-uniform  hypergraph.  Thus  we  should  allow  the  incidence  matrix  of  a general  hy- 
pergraph to  have  entries  greater  than  1 when  an  edge  contains  a vertex  more  than  once. 
(A  pedant  would  probably  call  this  a “multihypergraph.”)  With  these  considerations 
in  mind,  the  incidence  matrix  and  bipartite  graph  corresponding  to  (26)  are 


115.  The  element  in  row  e and  column  / of  BTB  is  )Tu  bvebvf\  so  BTB  is  21  plus  the 
adjacency  matrix  of  L(G).  Similarly,  BBT  is  D plus  the  adjacency  matrix  of  G,  where 
D is  the  diagonal  matrix  with  dvv  = degree  of  v.  (See  exercises  2.3.4.2-18,  19,  and  20.) 

116.  KinU  = Km'1  © K„  \ generalizing  (38),  for  all  r > 1. 

117.  The  nonisomorphic  multisets  of  singleton  edges  for  m = 4 and  V = {0, 1,2}  are 
{{0},{0},{0},{0»,  {{0},{0},{0},{1}},  {{0},{0},{1},{1}},  and  {{0},  {0},  {1},  {2}}. 
The  answer  in  general  is  the  number  of  partitions  of  m into  at  most  n parts,  namely 

n 1 1 using  the  notation  explained  in  Section  7. 2. 1.4.  (Of  course,  there’s  little  reason  to 
think  of  partitions  as  1-uniform  hypergraphs,  except  when  answering  strange  exercises.) 

118.  Let  d be  the  sum  of  the  vertex  degrees.  The  corresponding  bipartite  graph  is 
a forest  with  m + n vertices,  d edges,  and  p components.  Hence  d = m + n — p,  by 
Theorem  2.3.4. 1A. 

119.  Then  there’s  an  additional  edge,  containing  all  seven  vertices. 

120.  We  could  say  that  (hyper)arcs  are  arbitrary  sequences  of  vertices,  or  sequences 
of  distinct  vertices.  But  most  authors  seem  to  define  hyperarcs  to  be  A — tv,  where  A 
is  an  unordered  set  of  vertices.  When  the  best  definition  is  found,  it  will  probably  be 
the  one  that  has  the  most  important  practical  applications. 

121.  x(H)  = |F|  — a(I(H)T)  is  the  size  of  a minimum  cover  of  V by  sets  of  F. 

122.  (a)  One  can  verify  that  there  are  just  seven  3-element  covers,  namely  the  vertices 
of  an  edge;  so  there  are  seven  4-element  independent  sets,  namely  the  complements  of 
an  edge.  We  can’t  two-color  the  hypergraph,  because  one  color  would  need  to  be  used 
4 times  and  the  other  three  colors  would  be  an  edge.  (Hypergraph  (56)  is  essentially 
the  projective  plane  with  seven  points  and  seven  lines.) 

(b)  Since  we’re  dualizing,  let’s  call  the  vertices  and  edges  of  the  Petersen  graph 
“points”  and  “lines”;  then  the  vertices  and  edges  of  the  dual  are  lines  and  points, 
respectively.  Color  red  the  five  lines  that  join  an  outer  point  to  an  inner  point.  The 
other  ten  lines  are  independent  (they  don’t  contain  all  three  of  the  lines  touching  any 
point);  so  they  can  be  colored  green.  No  set  of  eleven  lines  can  be  independent,  because 
no  four  lines  can  touch  all  ten  points.  (Thus  the  Petersen  dual  is  a bipartite  hypergraph, 
in  spite  of  the  fact  that  it  contains  cycles  of  length  5.) 

123.  They  correspond  to  n x n latin  squares,  whose  entries  are  the  vertex  colors. 

124.  Four  colors  easily  suffice.  If  it  were  3-colorable,  there  must  be  four  vertices  of 
each  color,  since  no  five  vertices  are  independent.  Then  two  opposite  corners  must  have 
the  same  color,  and  a contradiction  arises  quickly. 
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125.  The  Chvatal  graph  is  the  smallest  such  graph  with  g = 4.  G.  Brinkmann  found 
the  smallest  with  g = 5:  It  has  21  vertices  dj,  bj,  Cj  for  0 < j < 7,  with  edges 
aj  — dj+2,  dj  — bj,  dj  — bj+ 1,  bj  — Cj,  bj  — Cj+ 2,  Cj  — Cj+3  and  subscripts  mod  7. 
M.  Meringer  showed  that  there  must  be  at  least  35  vertices  if  g > 5.  B.  Griinbaum 
conjectured  that  g can  be  arbitrarily  large;  but  no  further  constructions  are  known. 
[See  AMM  77  (1970),  1088-1092;  Graph  Theory  Notes  of  New  York  32  (1997),  40-41.] 

126.  When  m and  n are  even,  both  Cm  and  Cn  are  bipartite,  and  4-coloring  is  easy. 
Otherwise  a 4-coloring  is  impossible.  When  m = n = 3,  a 9-coloring  is  optimum  by 
exercise  93.  When  m = 3 and  n = 4 or  5,  at  most  two  vertices  are  independent;  it’s 
easy  to  find  an  optimum  6-  or  8-coloring.  Otherwise  we  obtain  a 5-coloring  by  painting 
vertex  (j,  k)  with  (dj  + 2 bk)  mod  5,  where  periodic  sequences  {dj)  and  ( bk ) exist  with 
period  lengths  m and  n,  respectively,  such  that  dj  — dj+i  = ±1  and  bk  — bk+i  = ±1 
for  all  j and  k.  [K.  Vesztergombi,  Acta  Cybernetica  4 (1978),  207-212.] 

127.  (a)  The  result  is  true  when  n = 1.  Otherwise  let  H = G\v,  where  v is  any  vertex. 
Then  H = G\v,  and  we  have  x{H)  + x(H)  < n by  induction.  Clearly  x(G)  < x{H)  + 1 
and  x{G)  < x(H)  + 1;  so  there’s  no  problem  unless  equality  holds  in  all  three  cases. 
But  that  can’t  happen;  it  implies  that  x(H)  fs  d and  x(H)  < n — 1 — d,  where  d is  the 
degree  of  v in  G.  [E.  A.  Nordhaus  and  J.  W.  Gaddum,  AMM  63  (1956),  175-177.] 

To  get  equality,  let  G = Ka®  Kb,  where  ab  > 0 and  d + b = n.  Then  we  have  G = 
Ka  — Kb,  x(G)  = d,  and  x(G)  = 6+1.  [All  graphs  for  which  equality  holds  have  been 
found  by  H.-J.  Finck,  Wiss.  Zeit.  der  Tech.  Hochschule  Ilmenau  12  (1966),  243-246.] 
(b)  A fc-coloring  of  G has  at  least  [n/fc]  vertices  of  some  color;  those  vertices  form 
a clique  in  G.  Hence  x(G)x(G)  > x(G)[n/x(G)]  > n.  Equality  holds  when  G = Kn. 
(From  (a)  and  (b)  we  deduce  that  x(G)+x(G)  > 2y/n  and  x(G)x(G)  < |(n+l)2.) 

128.  x(Gd  H)  = max(x(G),  x(H)).  This  many  colors  is  clearly  necessary.  And  if  the 
functions  a(u)  and  b(v)  color  G and  H with  the  colors  (0, 1 ,k  — 1},  we  can  color 
GnH  with  c{u,  v ) = (a(u)  + b(v))  mod  k. 

129.  A complete  row  or  column  (16  cases);  a complete  diagonal  of  length  4 or  more 
(18  cases);  a 5-cell  pattern  {(x,  y),  (x—d,  y—a),  (x—a,  y+a),  (x+a,  y—d),  (x+a,  y+a)} 
for  a € {1,  2, 3}  (36  + 16  + 4 cases);  a 5-cell  pattern  {(x,y),  (x—a,  y),  (x+a,  y),  (x,  y—a), 
(x,y+a)}  for  a G {1,2,3}  (36  +16  + 4 cases);  a pattern  containing  four  of  those 
five  cells,  when  the  fifth  lies  off  the  board  (24  + 32  + 24  cases);  or  a 4-cell  pattern 
{(x,y),(x+a,y),(x,y+a),(x+a,y+a)}  for  a G {1,3, 5, 7}  (49  + 25  + 9 + 1 cases). 
Altogether  310  maximal  cliques,  with  respectively  (168, 116, 4, 4, 18)  of  size  (4, 5, 6,  7,  8). 

130.  If  graph  G has  p maximal  cliques  and  graph  H has  q,  then  the  join  G — H has 
pq,  because  the  cliques  of  G — H are  simply  the  unions  of  cliques  from  G and  H. 
Furthermore,  the  empty  graph  Kn  has  n maximal  cliques  (namely  its  singleton  sets). 

Thus  the  complete  fc-partite  graph  with  part  sizes  {ni,  ...,«*},  being  the  join  of 
empty  graphs  of  those  sizes,  has  m ...  rife  maximal  cliques. 

131.  Assume  that  n > 1.  In  a complete  fc-partite  graph,  the  number  n i . . . rik  is  maxi- 
mized when  each  part  has  size  3,  except  perhaps  for  one  or  two  parts  of  size  2.  (See  exer- 
cise 7.2.1.4-68(a).)  So  we  must  prove  that  N(n)  cannot  be  larger  than  this  in  any  graph. 

Let  m(v ) be  the  number  of  maximal  cliques  that  contain  vertex  v.  If  u—j—v  and 
m(u)  < m(v),  construct  the  graph  G'  that  is  like  G except  that  u is  now  adjacent  to 
all  the  neighbors  of  v instead  of  to  its  former  neighbors.  Every  maximal  clique  U in 
either  graph  belongs  to  one  of  three  classes: 

i)  u G U‘,  there  are  m(u ) of  these  in  G and  m(v)  of  them  in  G' . 
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ii)  v € U;  there  are  m(y)  of  these  in  G and  also  in  G' . 

iii)  u £ U and  v U;  such  maximal  cliques  in  G are  also  maximal  in  G' . 

Therefore  G'  has  at  least  as  many  maximal  cliques  as  G.  And  we  can  obtain  a complete 
/■.'-partite  graph  by  appropriately  repeating  the  process. 

[This  argument,  due  to  Paul  Erdos,  was  presented  by  J.  W.  Moon  and  L.  Moser 
in  Israel  J.  Math.  3 (1965),  23-25.] 

132.  The  strong  product  of  cliques  in  G and  //  is  a clique  in  Ox// . by  exercise  93;  hence 
uj(GsiH)  > u}(G)tv(H)  = x(G)x(H).  On  the  other  hand,  colorings  a(u ) and  b(v)  of  G 
and  H lead  to  the  coloring  c(u,v)  = ( a(u),b(v ))  of  G&H-,  hence  y (GbH)  < x(G)y(H) 
And  uo(G^H)  < X(G®H). 

133-  (a)  24;  (b)  60;  (c)  3;  (d)  6;  (e)  6;  (f)  4;  (g)  5;  (h)  4;  (i)  K2®C12;  (j)  18;  (k)  12. 
(1)  \es,  of  degree  5.  (m)  No.  [In  fact,  Markus  Chimani  used  branch-and-cut  methods 
in  2009  to  prove  that  it  cannot  be  drawn  with  fewer  than  12  crossings.]  (n)  Yes;  in  fact, 
it  is  4-connected  (see  Section  7.4.1).  (o)  Yes;  we  consider  every  graph  to  be  directed, 
with  two  arcs  for  each  edge,  (p)  Of  course  not.  (q)  Yes,  easily. 

[The  musical  graph  represents  simple  modulations  between  key  signatures.  It 
appears  on  page  73  of  Graphs  by  R.  J.  Wilson  and  J.  J.  Watkins  (1990).] 

134.  By  rotating  and/or  swapping  the  inner  and  outer  vertices,  we  can  find  an  auto- 
morphism that  takes  any  vertex  into  C.  If  C is  fixed,  we  can  interchange  the  inner  and 
outer  vertices  of  any  subset  of  the  remaining  11  pairs,  and/or  do  a left-right  reflection. 
Therefore  there  are  24  x 211  x 2 = 98,304  automorphisms  altogether. 

135.  Let  w = e2nl/6,  and  define  the  matrices  Q = (qij),  S = (s0-),  where  q%]  = 

[/  = (*  + 1)  mod  12]  and  Sij  = a/J,  for  0 < i,j  < 12.  By  exercise  96(b),  the  adjacency 

matrix  of  the  musical  graph  K2mC12  is  A = (J })  <g,(/+Q  + Q- ) _/.  Let  T be  the  matrix 
(i  -i)  ® then  T~AT  is  a diagonal  matrix  D whose  first  12  entries  are  1 + 4 cos  if 
for  0 < j < 12,  and  whose  other  12  entries  are  -1.  Therefore  A2,n  = TDmT~ , and  it 
follows  that  the  number  of  2m-step  walks  from  C to  (C,  G,  D,  A,  E,  B,  F8)  respectively  is 

Cm  = ^j(25m  + 2(13  + 4^3)m  + 32m+1  + 2(13  - 4^)"*  + 16); 

Cm  = ^(25m  + Vs(13  + 4y/3)m  - ^(13  - 4^/3)™  - 1); 

Dm  = (25m  + (13  + 4'/3)m  + (13  - 4^3)™  - 3); 

Am  = ^(25m  -32m+1  +2); 

Em  = Tj(25m  - (13  + 4v/3)m  - (13  - 4V3)m  + 1); 

Bm  = ^(25m  - \/3(13  + 4\/3)m  + y/3(13  - 4s/3)m  - 1); 

= ^(25m  - 2(13  + 4v/3)m +32m+1  - 2(13  - 4\/3)m); 

also  am  = Cm— 1,  dm=Fm=em  = Gm,  etc.  In  particular,  (C6,  G6,  D6,  A6,  E6,  B6,  Fg)  = 
(15462617,  14689116,  12784356,  10106096,  7560696,  5655936,  5015296),  so  the  desired 
probability  is  15462617/512  ~ 6.33%.  Asm  oo,  the  probabilities  are  all  ^h+O(0.8m). 

136.  No.  Only  two  Cayley  graphs  of  order  10  are  cubic,  namely  K2nC$  (whose  vertices 
can  be  written  {e,a,  a2 ,q3 ,a4 , /3,  /3a,  /3a2 , /3a3 , /3a4}  where  a5  = /32  = (a/3)2  = e)  and 
the  graph  with  vertices  {0, 1, ...  ,9}  and  arcs  v -y  (v±l)  mod  10,  v ->  (u  + 5)  mod  10. 
[See  D.  A.  Holton  and  J.  Sheehan,  The  Petersen  Graph  (1993),  exercise  9.10.  Inciden- 
tally, the  SGB  graphs  raman(p,  q,t,0)  are  Cayley  graphs.] 
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137.  Let  [x,  y]  denote  the  label  of  (x,  y);  we  want  [x,  y\  = [x  + a,y  + b\  = [x  + c,y  + d] 
for  all  x and  y.  If  A is  the  matrix  (“  *),  the  operation  of  adding  t times  the  bottom 
row  of  A to  the  top  row  changes  A to  the  matrix  A'  = ( J ‘ ) A = ( ^ ),  where 
a1  = a + tc,  b1  = b + td , c'  = c,  d'  = d.  The  new  condition  [x,  y]  = [x  + a',  y + b'\  = 
[x  + c',y  + d’}  is  equivalent  to  the  old;  and  gcd(a',  6',  c',  d')  = gcd(a,  b,  c,  d).  Similarly 
we  can  premultiply  A by  ( \ ° ) without  really  changing  the  problem. 

We  can  also  operate  on  columns,  replacing  A by  A"  = Ail  \ ) = h"  ),  where 
a = a,  b =ta  + b,c  = c,  d = tc+d.  This  operation  does  alter  the  problem,  but  only 
slightly:  If  we  find  a labeling  that  satisfies  [x,  r/J  = [x  + a" , y + b"\  = [x  + c",  y + d "]] 
for  all  x and  y,  then  we’ll  have  [x,y]  = [x  + a,  y + b\  = [x  + c,  x + d]  if  [x,y\  = [x,r/  + tx]]. 
Similarly  we  can  postmultiply  A by  (J  °);  the  problem  remains  almost  the  same. 

A series  of  such  row  and  column  operations  will  reduce  A to  the  simple  form 
UAV  = (J  °),  where  U and  V are  integer  matrices  with  det  U = det  V = 1.  Further- 
more, if  we  have  V = ( “ ^ ),  a labeling  for  the  reduced  problem  that  satisfies  the  simple 
conditions  [x,  i/J]  = [x  + 1,  y]  = [x,  y + nj  will  provide  a solution  to  the  original  labeling 
problem  if  we  define  [x,  y]  = [ax  + 7 y,  /3x  + dj/J. 

Finally,  the  reduced  labeling  problem  is  easy:  We  let  [x,  2/]  = y mod  n.  Thus  the 
desired  answer  is  to  set  p = /3,  q = 6. 

138.  Proceeding  as  before,  but  with  a k x k matrix  A,  row  and  column  operations 
will  reduce  the  problem  to  a diagonal  matrix  UAV.  The  diagonal  entries  (di,. . . ,dk) 
are  characterized  by  the  condition  that  dx  . . . dj  is  the  greatest  common  divisor  of 
the  determinants  of  all  j x j submatrices  of  A.  [This  is  “Smith  normal  form”;  see 
H.  J.  S.  Smith,  Philosophical  Transactions  151  (1861),  293-326,  §14.]  If  the  labeling 
[x{|  satisfies  the  reduced  problem,  the  original  problem  is  satisfied  by  [x]  = [xV].  The 
number  of  elements  in  the  generalized  torus  is  n = det  A = d\  . . . dk- 

The  reduced  problem  has  a simple  solution  as  before  if  d\  = • • ■ = dk  _ x = 1.  But 
in  general  the  reduced  labeling  will  be  an  r-dimensional  ordinary  torus  of  dimensions 
(dk-r+i,  ■ ■ ■ , dk),  where  dk-r+i  > dk-r  = 1.  (Here  do  = 1;  we  might  have  r = k.) 

In  the  requested  example,  we  find  d\  = 1,  d2  = 2,  d3  = 10,  n = 20;  indeed, 

/ 1 -2  0 \ / 3 1 l\/l  5 6\  /I  0 0 \ 

UAV  = 0 1 -1  1 3 10  1 1 = 0 2 0. 

\-l  -1  4/  \l  1 3/  \0  0 1/  \0  0 10/ 

Each  point  ( x,y,z ) now  receives  a two-dimensional  label  («,  v)  = ((5x  + y)  mod  2, 
(6 x + y + z)  mod  10).  The  six  neighbors  of  (u,  v)  are  ((u±  1)  mod  2,v),  ((u±  1)  mod  2, 
(v  ± 1)  mod  10),  (u,  (v  ± 1)  mod  10).  It’s  a multigraph,  since  the  first  two  neighbors  are 
identical;  but  it’s  not  the  same  as  the  multigraph  C2  KlCio,  which  has  degree  8. 

[Generalized  toruses  are  essentially  the  Cayley  graphs  of  Abelian  groups;  see 
exercise  136.  They  have  been  proposed  as  convenient  interconnection  networks,  in 
which  case  it  is  desirable  to  minimize  the  diameter  when  k and  n are  given.  See  C.  K. 
Wong  and  D.  Coppersmith,  JACM  21  (1974),  392-402;  C.  M.  Fiduccia,  R.  W.  Forcade, 
and  J.  S.  Zito,  SIAM  J.  Discrete  Math.  11  (1998),  157-167.] 

139.  (This  exercise  helps  clarify  the  distinction  between  labeled  graphs  G,  in  which  the 
vertices  have  definite  names,  and  unlabeled  graphs  H such  as  those  in  Fig.  2.)  If  NH  is 
the  number  of  labeled  graphs  on  (1, 2, . . . , h}  that  are  isomorphic  to  H,  and  if  U is  any 
h-element  subset  of  V,  the  probability  that  G \ U is  isomorphic  to  H is  NH /2h(h~1)/2 . 
Therefore  the  answer  is  (") NH /2h(h~l)/2 . We  need  only  figure  out  the  value  of  NH, 
which  is:  (a)  1;  (b)  h\/ 2;  (c)  ( h — 1)  !/2;  (d)  h\/a , where  H has  a automorphisms. 
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14°.  (a)  #(K3:Wn)  = n—1  and  #(P3:Wn)  = Q" J)  for  n > 5;  also  #(K^:W8)  = 7. 

_ (b)  G Js  Proportional  if  and  only  if  #(K3:G)  = #(K^:G)  = §Q)  and  #(Py.G)  = 
#{Py.G)  = |Q).  If  G has  e edges,  we  have  (ra  — 2)e  = 3#(K3:G)+2#(P3:G)+#(P3:G), 
because  every  pair  of  vertices  appears  in  n- 2 induced  subgraphs.  If  G has  degree 
sequence  dx  . . . dn,  we  have  di  +•  ■•  + <*„  = 2e,  Q1)  +•  • •+  Q")  = 3 #(P3:G)  + #(P3:G), 
anddi(ro  1— di)  + - • --\-dn(n—  1— dn)  = 2ff[P3:G)+2ff(P3:G).  Therefore  a proportional 
graph  satisfies  (*)  unless  n = 2.  (The  exercise  should  have  excluded  that  case.) 

Conversely,  if  G satisfies  (*)  and  has  the  correct  #(K3:G),  it  also  has  the  correct 
#(P3:G),  #(P3:G),  and  #(K3:G). 

[References:  S.  Janson  and  J.  Kratochvil,  Random  Structures  & Algorithms  2 
(1991),  209-224.  In  J.  Combinatorial  Theory  B47  (1989),  125-145,  A.  D.  Barbour, 
M.  Karonski,  and  A.  Rucinski  had  shown  that  the  variance  of  #{H:G)  is  proportional 
j0  ^H  n 2’  n2h  3’  or  n2h  4>  where  the  first  case  occurs  when  H does  not  have 
2 (2)  edges,  and  the  third  case  occurs  when  H is  a proportional  graph.] 

141.  Only  8 degree  sequences  d1...d8  satisfy  (*):  73333333(1/2),  65433322(26/64), 
64444222  (2/10),  64443331  (8/22),  55543222  (8/20),  55533331  (2/10),  55444321  (26/64),' 
and  44444440  (1/2).  Each  degree  sequence  is  shown  here  with  statistics  (A h/N),  where 
N nonisomorphic  graphs  have  that  sequence  and  iVi  of  them  are  proportional.  The  last 
three  cases  are  complements  of  the  first  three.  No  graph  of  order  8 is  both  proportional 
and  self-complementary.  Maximally  symmetric  examples  of  the  first  five  cases  are  W8, 


142.  The  hint  follows  as  in  the  previous  answer;  (n  - 3 )#(A3:G)  and  (n  - 3)#(P3:G) 
can  also  be  expressed  in  terms  of  four-vertex  counts.  Furthermore,  a graph  with  e 
edges  has  (2)  = #(P3  C G)  + #(K2  © K2  C G),  because  any  two  edges  form  either  P3 
or  K2  © K2;  in  this  formula,  jf(P3  C G)  counts  not-necessarily-induced  subgraphs. 

^ We  have  #(P3  C G)  = ^(P3:G)  + 3 #(K3:G),  and  a similar  formula  expresses 
#(^2  © K 2 C G)  in  terms  of  induced  counts.  Thus  an  extraproportional  graph  must 
be  proportional  and  satisfy  e = f Q),  #(P3  C G)  = f Q),  #(K2  © K2  C G)  = §("). 
But  these  values  contradict  the  formula  for  Q). 

143.  Consider  the  graph  whose  vertices  are  the  rows  of  A,  and  whose  edges  u v 

signify  that  rows  u and  v agree  except  in  one  column,  j.  Label  such  an  edge  j. 

If  the  graph  contains  a cycle,  delete  any  edge  of  the  cycle,  and  repeat  the  process 
until  no  cycles  remain.  Notice  that  the  label  on  every  deleted  edge  appears  elsewhere  in 
its  cycle;  hence  the  deletions  don’t  affect  the  set  of  edge  labels.  But  we’re  left  with  fewer 
than  m < n edges,  by  Theorem  2.3.4. 1 A;  so  there  are  fewer  than  n different  labels. 
[See  J.  A.  Bondy,  J.  Combinatorial  Theory  B12  (1972),  201-202.] 

144.  Let  G be  the  graph  on  vertices  {l,...,m},  with  edges  i — j if  and  only  if 
* / xu  / x.ji  * for  some  l.  This  graph  is  fc-colorable  if  and  only  if  there  is  a completion 
with  at  most  k distinct  rows.  Conversely,  if  G is  a graph  on  vertices  {1, . . . ,n},  with 
adjacency  matrix  A,  the  n x n matrix  X = A + *(J  - I — A)  has  the  property  that 
i j if  and  only  if  * 7^  xu  ^ Xji  ^ * for  some  l.  [See  M.  Sauerhoff  and  I.  Wegener, 
IEEE  Trans.  CAD-15  (1996),  1435-1437.] 
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145.  Set  c 4 — 0 and  repeat  the  following  operations  for  1 < j < n:  If  c = 0,  set  x a} 
and  c <—  1;  otherwise  if  x = a,j,  set  c <—  c + 1;  otherwise  set  c <—  c — 1.  Then  x is 
the  answer.  The  idea  is  to  keep  track  of  a possible  majority  element  x,  which  occurs 
c times  in  nondiscarded  elements;  we  discard  a3  and  one  x whenever  finding  x / o.j . 
[See  Automated  Reasoning  (Kluwer,  1991),  105-117.  Extensions  to  find  all  elements 
that  occur  more  than  n/k  times,  in  0(n  log  k)  steps,  have  been  discussed  by  J.  Misra 
and  D.  Gries,  Science  of  Computer  Programming  2 (1982),  143-152.] 

SECTION  7.1.1 

1.  (Solution  by  C.  Sartena.)  He  was  describing  the  implication  x =>  y,  with  “it” 
standing  respectively  for  y,  x,  x,  y,  y,  x.  (Other  solutions  are  possible.) 

2.  The  Earth  operation  corresponding  to  the  Pincusian  x o y is  x o y-,  its  truth  table 
is  therefore  the  reverse  of  the  complement  of  the  truth  table  for  o.  Hence  the  respective 
answers  are  T,  V,  C,  L,  D,  R,  =,  A,  A,  ®,  R,  D,  L,  C,  V,  J_.  (Any  identity  involving  the 
16  operations  of  Table  1 implies  a corresponding  dual  identity  obtained  by  substituting 
the  Pincusian  equivalents.  For  example,  each  of  De  Morgan’s  laws  (n)  and  (12)  is  the 
dual  of  the  other,  as  are  the  identities  (3),  (4)  relating  = and  ®.  In  this  sense  = can 
be  considered  to  be  just  as  useful  as  its  dual,  ©.) 

3.  (a)  V;  (b)  A;  (c)  L;  (d)  =.  [Many  formulas  actually  work  out  better  if  we  use  — 1 
for  truth  and  +1  for  falsehood,  even  though  this  convention  seems  a bit  immoral;  then 
x ■ y corresponds  to  ©.  Notice  that  (xyz)  = sign(x  + y + z),  with  either  convention.] 

4.  [TVans.  Arner.  Math.  Soc.  14  (1913),  481-488.]  (a)  Start  with  the  truth  tables  for 
L and  R;  then  compute  truth  table  a A (3  bitwise  from  each  known  pair  of  truth  tables 
a and  (3,  generating  the  results  in  order  of  the  length  of  each  formula  and  writing  down 
a shortest  formula  that  leads  to  each  new  4-bit  table: 


_L : (x  A (xA  x ))  A (x  A (x  A x)) 
A : (x  A y)  A (x  A y) 

D : (x  A (x  A y))  A (x  A (x  A y)) 
L : x 

C : (y  A (x  A x))  A (y  A (x  A a;)) 
R:  y 

©:  (y  A (x  A x))  A (x  A (x  A y)) 
V : (y  A y)  A {x  A x) 


V : (x  A (x  A a:))  A ((y  Ay)  A (x  A x)) 
= : (x  A y)  A ((y  A y)  A (a:  A a:)) 

R:  yAy 
C : y A (x  A x) 

[:  1A1 
D : xA(xA  y) 

A : x Ay 


T : x A(x  Ax) 

(b)  In  this  case  we  start  with  four  tables  _L,  T,  L,  R,  and  we  prefer  formulas  with  fewer 
occurrences  of  variables  whenever  there’s  a choice  between  formulas  of  a given  length: 


_L : 0 

A : {x  A y)  A 1 
D:  ((j/A1)Ai)A1 
L : x 

C:  (y  A (x  A 1))  A 1 
R:  y 

©:  (y  A (x  A 1))  A {(y  A 1)  A x) 
(y  A1)A(iA  1) 


V:  1 A ({y  A 1)  A (x  A 1)) 

= : {x  A y)  A ((y  A 1)  A (x  A 1)) 
R:  jAl 
C : y A (1  A 1) 

[:  lAl 
D : (jAl)Ai 
A:  x Ay 
T:  1 


V 
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5.  (“)  _L:  iCi;  A:  (x  C y)  C y\  D:  y Ci;  L:  x;  C:xCy;  R:  y;  the  other  10 
cannot  be  expressed,  (b)  With  constants,  however,  all  16  are  possible: 

V:  y C(xCl) 

A:  (yC  1)  Ci  = : (ycx)c((xcy)d) 

D:  «Ca!  R : y Cl 

X_  C:  (xCy)Cl 

C:  xCy  [:  iCl 

R ' y _ _ _ _ D : (y  C x)  C 1 

©:  ((j/C  a:)^C  ((xC  y)  C 1))  C 1 A : ((j/CllCxlCl 

V:  (y  C (x  C 1))  C 1 T:  x 

[B.  A.  Bernstein,  University  of  California  Publications  in  Mathematics  1 (1914),  87-96.] 

6.  (a)  _L,  A,  L,  R,  ©,  V,  =,  T.  (b)  _L,  L,  R,  ©,  =,  T.  [Notice  that  all  of  these  operators 
are  associative.  In  fact,  the  stated  identity  implies  the  associative  law  in  general:  First 
we  have  (i).  {x  o y)  o ((z  o y)  o w)  = ((x  o z)  o (z  o y))  o ((z  o y)  o w)  = (x  o z)  o w,  and 
similarly  (ii)  (x  o [y  o z))  o (y  o w)  = x o (z  o w).  Furthermore  (iii)  (x  o y)  o (z  o w)  = 
(IO!/)o((zoy)o(j/oio))  = (xoz)o(yow)  by  (i).  Thus  (xoz)ow  = (xoz)  o ((zoz)  o w)  = 
(x  o (z  o z))  o (z  o w)  = x o (z  o w)  by  (i),  (iii),  (ii).  The  free  system  generated  by 
{a?i, . . . , xn}  has  exactly  n + 2 nn2  distinct  elements,  namely  {xj  | 1 < j < n}  and 
{xi  o Xjj  o • • • o Xjr  o Xk  \ r > 0 and  1 < i,  k < n and  1 < ji  < ■ ■ ■ < jT  < n}.] 

7.  Equivalently,  we  want  the  identity  y o (x  o y)  = x,  which  holds  only  for  © and  =. 
[Jevons  noticed  this  property  of  © in  Pure  Logic  §151,  but  he  did  not  pursue  the  matter. 
We  will  investigate  general  systems  of  this  nature,  called  “gropes,”  in  Section  7.2.3.] 

8.  (U,A,C},S0),  ({T,  V,  d},  Si),  ({l,  L},  S0 nft),  ({©,=, R>,  S2),  ({5,  v),  So n52), 
({C,A},5i  ns2),  and  (R.any^,  where  S0  = {□  | 0 □ 0 = 0),  Sj  = {□  | 1 □ 1 = 1},  and 
•S2  — {a  | xoy  = xuy}  = {l,R,  L,R).  Thus  92  of  the  256  pairs  are  left-distributive.  [This 
problem  and  those  of  exercise  6 were  first  treated  by  E.  Schroder  in  §55  of  his  posthu- 
mously published  Vorlesungen  iiber  die  Algebra  der  Logik  2,  2 (1905).  He  expressed  the 
answer  by  saying  in  essence  that  the  respective  truth  tables  ( pqrs , wxyz ) of  (o,a)  must 
satisfy  the  relation  ( (pq  V r s)  A z)  V ( {pq  V fs)  A w)  V ( (pq  V rs)  A ( (w  = z)  V (x  = y)) ) = 0.] 

9.  (a)  False;  (xffij/JVz  = (xVz)®(|/Vz)ffiz.  (b)  True,  because  the  identity  obviously 
holds  when  z = 0 and  when  z = 1.  (c)  True;  it’s  also  (x  © y)  V (x  © z)  = 1 - [x  = y = z], 

10.  The  first  stage  of  decomposition  (16)  yields  the  functions  with  truth  tables  g = 
10100011  and  h = 10100011  © 10010011  = 00110000;  and  the  process  continues  in  a 
similar  way,  yielding  l + j/  + xz  + u)  + wy  + wx  + wxz  (modulo  2). 

11.  The  stated  term  is  present  if  and  only  if  f(xx , . . . ,x„)  is  true  an  odd  number  of 
times  when  x\  — x\  — x5  = X7  = xg  = xi0  = • • • = 0.  (There  are  2k  such  cases  when 
we  set  all  but  k variables  to  zero.)  In  other  words  the  multilinear  representation  can 
be  expressed  in  a suggestive  notation  like 

f(x,y,z)  = (/ooo  +/00.Z  + f0,0y  + fo„yz  + f*oox  + f,0txz  + f,t0xy  + f„*xyz)  mod  2 
illustrated  here  for  n = 3,  where  f.,0  = /( 1, 1,0)© /( 1,0, 0)  © /( 0, 1,0)© /( 0,  0, 0),  etc. 

12.  (a)  Substitute  1 - w for  w,  etc.,  in  (23),  getting  1 - y - xz  + 2 xyz  - w + wy  + 
wx  © wxz  — 2 wxyz.  [Some  authors  have  called  this  the  “Zhegalkin  polynomial”;  but 
I.  I.  Zhegalkin  himself  always  worked  modulo  2.  Other  names  in  the  literature  are 
“availability  polynomial,”  “reliability  polynomial,”  “characteristic  polynomial.”] 
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(b)  The  corresponding  coefficients  for  an  arbitrary  n-ary  function  can  be  as  large 
as  2n  1 in  absolute  value  (and  this,  by  induction,  is  the  maximum).  For  example,  the 
integer  multilinear  representation  of  xi  © • • • © xn  over  the  integers  turns  out  to  be 
ei  — 2e2  + 4e3  — • • • + (— 2)n  1en,  where  e*,  is  the  fcth  elementary  symmetric  function 
of  {zi, . . . , xn}.  The  formula  in  the  previous  answer  becomes 

fixiy>z)  = fooo  + foo*z  + fo*oy  + fo**yz  + f*ooX  + f*o*xz  + f,*0xy  + f*,,xyz 

over  the  integers,  where  we  now  have  /,»0  = /( 1,1,0)-  /( 1, 0, 0)  - /( 0, 1,0)  + /( 0, 0, 0), 
etc.  This  expansion  is  a disguised  form  of  the  Hadamard  transform,  Eq.  4. 6. 4^(38). 

(c,d)  The  polynomial  is  the  sum  of  its  minterms  like  xi(l  - :r2)(l  - x3)x4.  Each 
minterm  is  nonnegative  for  0 < xlt . . . , xn  < 1,  and  the  sum  of  all  minterms  is  1. 

(e)  df/dxj  = h(x)  - g(x),  where  h(x)  > g(x)  by  (d).  (See  exercise  21.) 

13.  In  fact,  F is  precisely  the  integer  multilinear  representation  (see  exercise  12). 

14.  Let  rj  = pj/(  1 -pj).  We  want  /(0,0,0)  = 0 and  /(l,  1, 1)  = 1 nr2r3  > 1, 
/( 0,0,1)  = 0 and  /( 1, 1,0)  = 1 +>  r,r2  > r3,  /(0, 1,0)  = 0 and  /(1,0, 1)  = 1 <+>  nr3  > 
r2,  /( 0, 1, 1)  = 0 and  /(1,0,0)  = l»n>  r2r3.  So  we  get  (a)  ( x\x2x3)\  (b)  x4\  (c)  x3. 

15.  Exercise  1.2.6-10  tells  us  that  (x)  mod  2 = [x  & k = k].  Hence,  for  example,  (£)  = 
x4  A x2  A xi  (modulo  2)  when  x = ( xn  . . .a;i)2;  and  we  can  obtain  every  term  in  a 
multilinear  representation  like  (19)  in  this  way.  Moreover,  we  needn’t  work  mod  2, 
because  the  interpolating  polynomial  (£)  (15“x)  represents  i4Ai3Ai2A  x\  exactly. 

16.  Yes,  or  even  by  +,  because  different  minterms  can’t  be  simultaneously  true.  (But 
we  can’t  do  that  in  ordinary  disjunctive  normal  forms  like  (25).  See  exercise  35.) 

17.  The  binary  operation  A is  not  associative,  so  an  expression  like  x A y A 2 must  be 
interpreted  as  a ternary  operation.  Quick’s  notation  is  fine  if  one  understands  NAND  to 
be  an  n-ary  operation,  being  careful  to  note  that  the  NAND  of  a single  variable  x is  x. 

18.  If  not,  we  could  set  u\  i • • • i — us  i — 1 and  Vi  i — • • • i — vt  4 — 0,  making  f both 
true  and  false.  (And  if  we  consider  applying  the  distributive  law  (2)  repeatedly  to  a 
DNF  until  it  becomes  a CNF,  we  find  that  the  converse  is  also  true:  The  disjunction 
Vi  V • • • V vt  is  implied  by  / if  and  only  if  it  has  a literal  in  common  with  every  implicant 
of  /,  if  and  only  if  it  has  a literal  in  common  with  every  prime  implicant  of  /,  if  and 
only  if  it  has  a literal  in  common  with  every  implicant  of  some  DNF  for  /.) 

19.  The  maximal  subcubes  contained  in  0010,  0011,  0101,  0110,  1000,  1001,  1010,  and 
1011  are  0*10,  0101,  *01*,  and  10**;  so  the  answer  is  (tuVyVz)  A(mViVj/Vz)A(iVj)  A 
(wVx).  (This  CNF  is  also  shortest.) 

20.  True.  The  corresponding  maximal  subcube  is  contained  in  some  maximal  subcubes 
/ and  g , and  their  intersection  can’t  be  larger.  (This  observation  is  due  to  Samson 
and  Mills,  whose  paper  is  cited  in  answer  31  below.) 

21.  By  Boole’s  law  (20),  we  see  that  an  n-ary  function  / is  monotone  if  and  only  if  its 
(n  ~ l)-ary  projections  g and  h are  monotone  and  satisfy  g < h.  Therefore 

f = (g  A xn)  V {h /\  xn)  = (g  Axn)  V (<?  Ai„)  V (h  Ai„)  = gV(hAxn), 

so  we  can  do  without  complementation.  The  constants  0 and  1 disappear  unless  the 
function  is  identically  constant.  Conversely,  any  expression  built  up  from  A and  V is 
obviously  monotone. 

Note  on  terminology:  Strictly  speaking,  we  should  say  “monotone  nondecreasing” 
instead  of  simply  “monotone,”  if  we  want  to  preserve  the  language  of  classical  math- 
ematics, because  a decreasing  function  of  a real  variable  is  also  said  to  be  monotonic. 
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(See,  for  example,  the  “run  test”  in  Section  3.3.2G.)  But  “nondecreasing”  is  quite 
a mouthful;  so  researchers  who  work  extensively  on  Boolean  functions  have  almost 
unanimously  opted  to  assume  that  “monotone”  automatically  implies  nondecreasing, 
in  a Boolean  context.  Similarly,  the  mathematical  term  “positive  function”  normally 
refers  to  a function  whose  value  exceeds  zero;  but  authors  who  write  about  “positive 
Boolean  functions”  are  referring  to  the  functions  that  we  are  calling  monotone.  Since 
a monotone  function  is  order-preserving,  some  authors  have  adopted  the  term  isotone ; 
but  that  word  has  already  been  coopted  by  physicists,  chemists,  and  musicologists. 

A Boolean  function  like  x V y,  which  becomes  monotone  if  some  subset  of  its  vari- 
ables is  complemented,  is  called  unate.  Theorem  Q obviously  applies  to  unate  functions. 

22.  Both  g and  g ® h must  be  monotone,  and  g(x)  A h(x)  = 0. 

23.  x A (nVy)  A (vVz)  A ( wVz ).  (Corollary  Q applies  also  to  conjunctive  prime  forms 
of  monotone  functions.  Therefore,  to  solve  any  problem  of  this  kind,  we  need  only 
apply  the  distributive  law  (2)  until  no  A occurs  within  a V,  then  remove  any  clause 
that  contains  all  the  variables  of  another.) 

24.  By  induction  on  k,  the  similar  tree  with  V at  the  root  gives  a function  with  22r't/2l  -1 
prime  implicants  of  length  21A/2J,  whHe  the  tree  with  A gives  42l'l/2J-i  Qf  length  2 T*/2l . 
When  k = 6,  for  example,  the  4'  = 214  prime  implicants  in  the  A case  have  the  form 

a'(0to0*0o0to0o)2  A X(OtoOtooltoOl)2  A ^(Oto ltoiOtoio)2  A X (ot0 lt0l  1*01 1 )2 

A a:(it1o«10otioo)2  A a;(i«1Oii0itioi)2  A a:(iiiif11Oii1o)2  A a;(it1it11it111)2, 

with  the  t’s  either  0 or  1.  [For  further  information  about  such  Boolean  functions,  see 
D.  E.  Knuth  and  R.  W.  Moore,  Artificial  Intelligence  6 (1975),  293-326;  V.  Gurvich 
and  L.  Khachiyan,  Discrete  Mathematics  169  (1997),  245-248.] 

25.  Let  an  be  the  answer.  Then  a2  = a3  = 2,  a4  = 3,  and  a„  = an_2  +an_3  for  n > 4, 
because  the  prime  implicants  when  n > 4 are  either  pn_2  Arn_r  or  pn_3  Aa:„_2  Ai„  for 
some  prime  implicant  pk  in  the  fc-variable  case.  (These  prime  implicants  correspond  to 
minimal  vertex  covers  of  the  path  graph  Pn.  They  are  shellable,  in  the  sense  of  exercise 
35,  when  listed  in  lexicographic  order.  We  have  an  = (7 Pn  + 10P„+i  + Pn+2)/23  when 
Pn  is  the  Perrin  number  of  exercise  7.1.4-15.) 

26.  (a)  Let  Xj  = [j  £ J],  Then  f{x)  = 0 and  g{x)  = 1.  (This  fact  was  exercise  18.) 

(b)  Suppose,  for  example,  that  k £ J £ G and  k £ and  assume  that  test 

(a)  has  been  passed.  Let  x,  = [j  £ J and  j / k].  Then  f(x)  = 1;  and  g(x)  = 0,  because 
every  J'  £ Q with  J'  ^ J contains  an  element  ^ J . 

(c)  Again  assume  that  condition  (a)  has  been  ruled  out.  If,  say,  |J|  > |.F|,  let 
Xj  = [j  is  the  smallest  element  of  I n J,  for  some  I £ JF].  Then  f (x)  = 1,  g(x)  = 0. 

(d)  Now  we  assume  that  \JI€:FI  - Uj€g  J-  Each  1 e ? stands  for  2n-l7l  vectors 
where  f(x)  = 0;  similarly,  each  J £ § stands  for  2n~\J\  vectors  where  g(x)  = 1.  If  the 
sum  s is  less  than  2n,  we  can  compute  s = so  + si,  where  so  counts  the  contributions 
to  s when  xn  = 0.  If  so  < 2n_1,  set  xn  <—  0;  otherwise  sj  < 2n_1,  so  we  set  xn  4—  1. 
Then  we  set  n 4—  n — 1;  eventually  all  Xj  are  known,  and  f(x)  = 1,  g(x)  = 0. 

27.  Let  m = min({|/|  | I £ T}  U {|J|  | J £ Q})  be  the  length  of  the  shortest  prime 
clause  or  implicant.  Then  N ■ 2n_m  > X^/e^2n-m  + zLjgs  2nHJl  > 2n;  so  we  have 
m <\gN.  If,  say,  |/|  = m,  some  index  k must  appear  in  at  least  1/m  of  the  members 
J £ G,  because  each  J intersects  I.  This  observation  proves  the  hint. 

Now  let  A(0)  = A(l)  = 1 and  A(v)  = 1 + A(v  — 1)  + A([pwJ)  for  v > 1.  Then 
A(|^|  |<5|)  is  an  upper  bound  on  the  number  of  recursive  calls  (the  number  of  times  XI 
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is  performed).  Letting  B( v)  = A(v)  + 1,  we  have  B(v)  = B(v  - 1)  ~\~  B([pv\)  for  v > 1, 
hence  B(v)  < B(v  — k)  + kB([pv J)  for  v > k.  Taking  k = v — [pw J shows  that  B(v)  < 
((1  — p)v  + 2)B(\_pv\)\  hence  B(v)  = 0(((  1 — p)v  + 2)f)  when  plv  < 1,  namely  when  t > 
lnu/ln(l/p)  = 0((log u)(log N)).  Consequently  A(|Jr||tl|)  < A(N2/ 4)  = Aro(losiv)2. 

In  practice  the  algorithm  will  run  much  faster  than  the  pessimistic  bounds  just 
derived.  Since  the  prime  clauses  of  a function  are  the  prime  implicants  of  its  dual, 
this  problem  is  essentially  the  same  as  verifying  that  one  given  DNF  is  the  dual  of 
another.  Moreover,  if  we  start  with  f(x)  = 0 and  repeatedly  find  minimal  x's  where 
f(x)  = g(x)  = 0,  we  can  “grow”  / until  we’ve  obtained  the  dual  of  g. 

The  ideas  presented  here  are  due  to  M.  L.  Fredman  and  L.  Khachiyan,  J.  Algo- 
rithms 21  (1996),  618-628,  who  also  presented  refinements  that  reduce  the  running 
time  to  N°(1osn/1o&1o&n').  No  polynomial-time  algorithm  is  known;  yet  the  problem  is 
unlikely  to  be  NP-complete,  because  we  can  solve  it  in  less-than-exponential  time. 

28.  This  result  is  obvious  once  understood,  but  the  notations  and  terminology  can 
make  it  confusing;  so  let’s  consider  a concrete  example:  If,  say,  y1  = y4  = y6  = 1 and 
the  other  yk  are  zero,  the  function  g is  true  if  and  only  if  the  prime  implicants  p4,  p4, 
and  p6  cover  all  the  places  where  / is  true.  Thus  we  see  that  there  is  a one-to-one 
correspondence  between  every  implicant  of  g and  every  DNF  for  / that  contains  only 
prime  implicants  pj.  In  this  correspondence,  the  prime  implicants  of  g correspond  to 
the  “irredundant”  DNFs  in  which  no  pj  can  be  left  out. 

Numerous  refinements  of  this  principle  have  been  discussed  by  R.  B.  Cutler  and 
S.  Muroga,  IEEE  Transactions  C-36  (1987),  277-292. 

29.  Bl.  [Initialize.]  Set  k <—  k1  0.  (Similar  methods  are  discussed  in  exercise  5-19.) 

B2.  [Find  a zero.]  Increase  k zero  or  more  times,  until  either  k = m (terminate) 
or  Vk  & 2J  = 0. 

B3.  [Make  k1  > k.]  If  k’  < k,  set  k!  «—  k + 1. 

B4.  [Advance  k1 .}  Increase  k1  zero  or  more  times,  until  either  k'  = m (terminate) 
or  vy  >vk  + 2j. 

B5.  [Skip  past  a big  mismatch.]  If  vk  © vk’  > 2J+1,  set  k •<-  k'  and  return  to  B2. 

B6.  [Record  a match.]  If  vy  = vk  + 2\  output  ( k , k'). 

B7.  [Advance  k.\  Set  k <—  k + 1 and  return  to  B2.  | 

(Steps  B3  and  B5  are  optional,  but  recommended.) 

30.  The  following  algorithm  keeps  variable-length,  sorted  lists  in  a stack  5 whose  size 
will  never  exceed  2m  + n.  When  the  topmost  entry  of  the  stack  is  St  = s,  the  topmost 
list  is  the  ordered  set  Ss  < Ss+i  < • ■ ■ < St- 1-  Tag  bits  are  maintained  in  another 
stack  T,  having  the  same  size  as  S (after  the  initialization  step). 

PI.  [Initialize.]  Set  Tk  <—  0 for  0 < k < m.  Then  for  0 < j < n,  apply  the  /-buddy 
scan  algorithm  of  exercise  29,  and  set  Tk  «—  Tk  + 2^,  Ty  •<—  Ty  + 2j  for  all 
pairs  ( k,k ')  found.  Then  set  s <—  t 0 and  repeat  the  following  operations 
until  s = m:  If  Ts  = 0,  output  the  subcube  (0,vs)  and  set  s t—  s + 1;  otherwise 
set  St  <—  va,  Tt  <—  Ts,  t t—  t + 1,  s 4-  s + 1.  Finally  set  A <—  0 and  St  <—  0. 

P2.  [Advance  A.]  (At  this  point  stack  S contains  u(A)  + 1 lists  of  subcubes. 
Namely,  if  A = 2ei  + • • • + 2er  with  e\  > • ■ • > er  > 0,  the  stack  contains 
the  6-values  of  all  subcubes  (a,  6)  C V whose  a-values  are  respectively  0,  2ei , 
2ei  +262 , . . . , A,  except  that  subcubes  whose  tags  are  zero  do  not  appear.  All 
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of  these  lists  are  nonempty,  except  possibly  the  last.  We  will  now  increase  A 
to  the  next  relevant  value.)  Set  j <—  0.  If  St  — t (that  is,  if  the  topmost  list  is 
empty),  increase  j zero  or  more  times  until  j > n or  .4  & 2J  / 0.  Then  while 
j < n and  A & 23  4 0,  set  t 4 St  — 1,  A 4 — A — 23 , and  j 4 — j — 1.  Terminate 
the  algorithm  if  j > n ; otherwise  set  A <-  A + 2j . 

P3.  [Generate  list  A.]  Set  r 4—  t,  s 4—  St,  and  apply  the  j-buddy  scan  algorithm  of 
exercise  29  to  the  r - s numbers  Ss  < ■ ■■  < Sr-i.  For  all  pairs  (k,  k')  found, 
set  x <-  (Tfc  & Tfc/)  - 23;  and  if  x = 0,  output  the  subcube  (A,  Sk),  otherwise 

set  t <—  t + 1,  St  4-  Sk,  Tt  4-  x.  Finally  set  t 4—  t + 1,  St  4-  r + 1,  and  go 

back  to  step  P2.  | 

This  algorithm  is  based  in  part  on  ideas  of  Eugenio  Morreale  [IEEE  Trans.  EC-16 
(1967),  611—620;  Proc.  ACM  Nat.  Conf.  23  (1968),  355—365].  The  running  time  is 
essentially  proportional  to  mn  (for  step  PI)  plus  the  total  number  of  subcubes  contained 
in  V . If  m < 2n(l  — e),  and  if  V is  chosen  at  random  with  size  m,  exercise  34  shows 

that  the  average  total  number  of  subcubes  is  at  most  O (log  log  n/  log  log  log  n)  times 

the  average  number  of  maximal  subcubes;  hence  the  average  running  time  in  most  cases 
will  be  nearly  proportional  to  the  average  amount  of  output  produced.  On  the  other 
hand,  exercises  32  and  116  show  that  the  amount  of  output  might  be  huge. 

31.  (a)  Let  c = cn_1  . . ,c0,  c = cn_x  . . . c'0,  c"  = 4[-i  ■ ■ . Cg.  There  must  be  some  j 
with  Cj  / * and  Cj  ^ c";  otherwise  c"  C c.  Similarly  there  must  be  some  k with  c'k  ^ * 
and  4 / 4'-  If  J 4 k,  there  would  be  a point  xn-t  ...x0  <E  c"  that  is  in  neither  c 
nor  c , because  we  could  let  Xj  = Cj  and  xk  = c'k.  Hence  j = k.  and  the  value  of  j is 
uniquely  determined.  Furthermore  it’s  easy  to  see  that  c'  = cr  And  if  i / j,  we  have 
either  Cj  = * or  c{  = c" , and  either  c[  = * or  c[  = c" . 

(b)  This  statement  is  an  obvious  consequence  of  (a). 

(c)  First  we  prove  that  the  parenthesized  remark  in  step  E2  is  true  whenever  that 
step  is  encountered.  It’s  clearly  true  when  j = 0.  Otherwise,  let  c C V be  a j-eube, 
and  suppose  c — eg  \J  c%  where  eg  and  ci  are  ( j — l)-cubes.  On  the  preceding  execution 
of  step  E2  we  had  cQ  C c'0  G C and  cx  C c[  6 C for  some  Cg  and  c', ; hence  either 
c C Co  U Ci  or  c C Co  or  c C cx . In  each  case,  c is  now  contained  in  some  element  of  C . 

Secondly,  we  prove  that  the  outputs  in  step  E3  are  precisely  the  maximal  j-cubes 
contained  in  V:  Let  c C F be  any  fc-cube.  If  c is  maximal,  then  c will  be  in  C when 
we  reach  step  E3  with  j = k,  and  it  will  be  output.  If  c isn’t  maximal,  it  has  a buddy 
c C F,  which  is  a fc-cube  contained  in  some  subcube  c"  € C when  we  reach  E3.  Since 
c 2 c , the  consensus  c U c will  be  a ( j + l)-cube  of  C' , and  c will  not  be  output. 

References:  The  notion  of  consensus  was  first  defined  by  Archie  Blake  in  his  Ph.D. 
dissertation  at  the  University  of  Chicago  (1937);  see  J.  Symbolic  Logic  3 (1938),  93, 
112-113.  It  was  independently  rediscovered  by  Edward  W.  Samson  and  Burton  E.  Mills 
[Air  Force  Cambridge  Research  Center  Tech.  Report  54-21  (Cambridge,  Mass.:  April 
1954),  54  pp.[  and  by  W.  V.  Quine  [AMM  62  (1955),  627-631].  The  operation  is  also 
sometimes  called  the  resolvent,  since  J.  A.  Robinson  used  it  in  a more  general  form  (but 
with  respect  to  clauses  rather  than  implicants)  as  the  basis  of  his  “resolution  principle” 
for  theorem  proving  [JACM  12  (1965),  23-41],  Algorithm  E is  due  to  Ann  C.  Ewing, 
J.  Paul  Roth,  and  Eric  G.  Wagner,  AIEE  Transactions,  Part  1,  80  (1961),  450-458. 

32.  (a)  Change  the  definition  of  U in  exercise  31  to  the  following  associative  and 
commutative  operation  on  the  four  symbols  A = {0, 1,  *,  •},  for  all  a € A and  x € {0, 1}: 

*Ua  = aU*  = a,  •Ua  = aU»  = iUi  = *,  and  iUi  = i. 


540  ANSWERS  TO  EXERCISES 


7.1.1 


Also  let  h(0)  — 0,  /i(l)  — 1,  h(*)  = *,  and  h(»)  = *.  Then  c = h(c i LI  • • • U cm), 
computed  componentwise,  is  the  generalized  consensus  if  and  only  if  this  subcube  is 
contained  in  ci  U • ■ ■ U cm.  [See  P.  Tison,  IEEE  Transactions  EC-16  (1967),  446-456.] 

(b)  For  example,  let  Cj  = [The  final  component  is  superflu- 

ous. All  solutions  have  been  characterized  by  R.  H.  Sloan,  B.  Szorenyi,  and  G.  Turan, 
in  SIAM  J.  Discrete  Math.  21  (2008),  987-998.] 

(c)  By  (a),  every  prime  implicant  corresponds  uniquely  to  the  subset  of  implicants 
that  it  “meets.”  [A.  K.  Chandra  and  G.  Markowsky,  Discrete  Math.  24  (1978),  7-11.] 

(d)  For  example,  (y lAah)  V (y2Ax1Ax2)  V • • • V (ym AxxA  • • • Axm-iAxm)  as’ in  (b). 
[.T.-M.  Laborde,  Discrete  Math.  32  (1980),  209-212.] 

33.  (cl)  (m_2n  — fc  )/ Cm)-  (b)  We  must  exclude  the  cases  when  xi  A • • • A xJ-1  A x3  A 
Xj+\  A • • • A Xk  is  also  an  implicant.  By  the  inclusion-exclusion  principle,  the  answer  is 


?(?)<-*>■( 


2n  - (1+  l)2n~k 
m-  (l  + l)2n~k 


it  simplifies  to  (2”m"1  1)/(2r^)  when  k = n. 


34.  (a)  We  have  c(m,n)  = X]  c(m,n),  where  c,  (m,n)  = 2n~j  (n)  I2"-23)  / 12")  is  the 
average  number  of  implicants  with  n — j literals  (the  average  number  of  subcubes  of 
dimension  j in  the  terminology  of  exercise  30).  Clearly  c0(m,n)  = m,  and 


c1(m,n) 


nm(m  — 1)  mn  ( m\  1 
2(2"  - 1)  - ~2~  \2"/  - 2m’ 


similarly  Cj(m,n)  < m/(22j\n2i  1 ').  Also  p(m,n)  = £jPj(m,n),  where  we  have 


(b)  Notice  that  t = [lg  lgn  - lg  lg(2n/m)  + lg(4/3)J  < lglgn  + 0(l)  is  quite  small. 
We  will  repeatedly  use  the  fact  that  (£:/£)/  Q < aL„,  and  indeed  that 


I2n-j-2t 
V m — j ■ 2* 


Qmn(l  + 0(j222t/m)) 


is  an  extremely  good  approximation  when  j isn’t  too  large.  To  establish  the  hint, 
note  that  £\<t  Cj(m,  n)/ct(m,  n)  = O^tct-rim^n) / ct(m,n))  = 0(t2/{n^Tn))  = 

O ( (log log n) 2/n1/3) ; and  ct+j(m,n)/ct(m,n)  = 0{{n/{2t)Y  a2^1).  Consequently  we 
have  c(m,n)/ct(m,n)  ~ 1 + §(f +j)amn,  where  the  second  term  dominates  when  amn 
is  in  the  upper  part  of  its  range.  Furthermore 


z(T)<-»-“™(‘+°(^)) 


(1  ctmn)  + 0(n2amn(l  + amn)n22t/m) 


has  an  exponentially  small  error  term,  because  (1  + Qmn)n  = 0(e"1/3)  < m.  Therefore 
P(m,n)/ct(m,n)  is  asymptotically  e_nQ,m"  + ± 
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(c)  Here  amn  — 2 « n Mnft/lnt);  so  c(m,n)/ct(m,n)  = 1 + 0(t  Mogt), 

p(m,n)/ct(m,n)  = t 1 In t+^t  1 lnt  + Oft-1  loglogt).  We  conclude  that,  in  this  case, 

c(mi  n ) _ 2 lglgn  / ^ / log  log  log  log  n \ N 

p(m,n)  31glglgn\  V log  log  log  n ))' 

(d)  If  na.mn  < In  t — hi  In  t,  we  have  p(m,  n)/c(m,  n ) > pt(m,  n)/c(m,  n)  > t-1  In  t+ 
0(t  logt)  . On  the  other  hand  if  namn  > lnt  — lnlnt,  we  have  p(m,  n) / c(m,  n)  > 
pt+i(m,n)/c(m,n)  > ~t~l  In  t + 0{t~l  log  log  t). 

[The  means  c(m,n)  and  p(rri,  n),  and  the  variance  of  c(m,  n),  were  first  studied 
by  F.  Mileto  and  G.  Putzolu,  IEEE  Trans.  EC-13  (1964),  87-92;  JACM  12  (1965), 
364-375.  Detailed  asymptotic  information  about  implicants,  prime  implicants,  and 
irredundant  DNFs  of  random  Boolean  functions,  when  each  value  f(x i, . . . , xn)  is  inde- 
pendently equal  to  1 with  probability  p(n),  has  been  derived  by  Karl  Weber,  Elekt.ro- 
nische  Informationsverarbeitung  und  Kybernetik  19  (1983),  365-374,  449-458,  529-534.] 

35.  (a)  By  rearranging  coordinates  we  can  assume  that  the  pth  subcube  is  0'cl“*';,  so 
that  Bp  = 0*1“0”  and  Sp  = lfc0“+u.  Then  all  points  of  *klu*v  are  still  covered,  by 
induction  on  p,  because  all  points  of  *J  1 1 J 1 u *v  have  been  covered  for  1 < j < k. 

(b)  The  jth  and  fcth  subcubes  differ  in  every  coordinate  position  where  Bj  & Sk 
is  nonzero.  On  the  other  hand  if  Bj  & Sk  is  zero,  the  point  Sk  of  subcube  k lies  in  a 
previous  subcube,  by  (a),  because  we  have  Sk  D Bj. 

(c)  From  the  list  1100,  1011,  0011  (with  the  bits  of  each  Sk  underlined)  we  obtain 
the  orthogonal  DNF  (xiAx2)  V {Xl Ax2 Ax3 Ax4)  V (ij Ax3Ax4). 

(d)  There  are  eight  solutions;  for  example,  (01100,00110,00011, 11010, 11000). 

(e)  (001100, 011000, 0001 10, 110010, 110000,  010011, 000011)  is  a symmetrical  solu- 
tion. And  there  are  many  more  possibilities;  for  example,  42  permutations  of  the  bit 
codes  {110000,011000,001100,000110,000011,110010,011010}  are  shellings. 

[The  concept  of  a shelling  for  monotone  Boolean  functions  was  introduced  by 
Michael  O.  Ball  and  J.  Scott  Provan,  Operations  Research  36  (1988),  703-715,  who 
discussed  many  significant  applications.] 

36.  If  j < k we  have  Bj  = al f3  and  Bk  = 067  for  some  strings  a,  /3,  7.  Form  the 
sequence  x0  = a I7,  Xl  = x'0,  . . . , x,  = x',^,  where  x,  = a00l71.  We  have  f(x0)  = 1 
since  xo  D Bk,  but  f(xi)  = 0 since  xi  C Bj.  So  the  string  Xi,  where  f(Xi)  = 1 and 
f(xi+ 1)  = •••  = }{xi)  = 0,  is  in  B.  It  precedes  Bk  and  proves  that  Bj&iSk  D 0^10^. 

[This  construction  and  parts  of  exercise  35  are  due  to  E.  Boros,  Y.  Crama,  O.  Ekin, 
P.  L.  Hammer,  T.  Ibaraki,  and  A.  Kogan,  SIAM  J.  Discrete  Math.  13  (2000),  212-226.] 

37.  The  shelling  order  (000011,  001101,  001100,  110101,  110100,  110001,  110000) 
generalizes  to  all  n.  There  also  are  interesting  solutions  not  based  on  shelling,  like 
the  cyclically  symmetrical  (110***,  1110**,  **110*,  **1110,  0***11,  10**11,  111111). 

For  the  lower  bound,  assign  the  weight  wx  = - Uj=i(x2j-i  + x2j  - 3x2j-ix2j ) to 
each  point  x,  and  notice  that  the  sum  of  wx  over  all  x in  any  subcube  is  0 or  ±1.  (It 
suffices  to  verify  this  curious  fact  for  each  of  the  nine  possible  subcubes  when  n = 1.) 
Now  choose  a set  of  disjoint  subcubes  that  partition  the  set  F = {x  \ f(x)  = 1};  we 
have 

1 ^ H = I] w*  53  [xec]  = 

C chosen  C chosen  z€C  x£F  C chosen  x£F 


542  ANSWERS  TO  EXERCISES 


7.1.1 


There  are  (£)2n  k vectors  x with  exactly  k pairs  x2j-ix2j  = 1 and  nonzero  weight. 
Their  weight  is  (-l)*-1,  and  they  lie  in  F except  when  k = 0.  Hence  eFwx  = 

J2k>0  — l)fc_1  = 2n  — (2  — l)n. 

[See  M.  O.  Ball  and  G.  L.  Nemhauser,  Mathematics  of  Operations  Research  4 
(1979),  132-143.] 

38.  Certainly  not;  a DNF  is  satisfiable  if  and  only  if  it  has  at  least  one  implicant.  The 
hard  problem  for  a DNF  is  to  decide  whether  or  not  it  is  a tautology  (always  true). 

39.  Associate  variables  y%,  . . . , with  each  internal  node  in  preorder,  so  that  every 
tree  node  corresponds  to  exactly  one  variable  of  F.  For  each  internal  node  y.  with 
children  (l,r)  and  labeled  with  the  binary  operator  o,  construct  four  3CNF  clauses 
coo  A coi  A cio  A cix,  where 

Cpq  = (y™N  V lpN  V rqN) 

and  N denotes  complementation  (so  that  x0N  = x and  x1N  = x).  These  clauses  state  in 
effect  that  y = lor;  for  example,  if  o is  A,  the  four  clauses  are  (y  V l V r)  A (y  V / V f)  A 
(yVlVr)A(j/VlVf).  Finally,  add  one  more  clause,  (j/i  V yx  V j/i),  to  force  F = 1. 


Every  higher  number  can  be  formed  by  mere  complications  of  threes. 
. . . Take  the  quadruple  fact  that  A sells  B to  C for  the  price  D. 

This  is  a compound  of  two  facts: 
first,  that  A makes  with  C a certain  transaction,  which  we  may  name  E; 
and  second,  that  this  transaction  E is  a sale  of  B for  the  price  D. 

— CHARLES  S.  PEIRCE,  A Guess  at  the  Riddle  (1887) 

40.  Following  the  hint,  A says  ‘u  < v ® v < it’  and  B says  ‘u<vAv<w=>u<w\ 
So  4AB  says  that  there’s  a linear  ordering  of  the  vertices,  u\  < u2  < • • • < un.  (There 
are  n\  ways  to  satisfy  A A B.)  Now  C says  that  quvw  is  equivalent  to  u < v < w;  so  D 
says  that  u and  w are  not  consecutive  in  the  ordering,  when  u-j-w.  Thus  4ABACAD 
is  satisfiable  if  and  only  if  there  is  a linear  ordering  in  which  all  nonadjacent  vertices 
are  nonconsecutive  (that  is,  in  which  all  consecutive  vertices  are  adjacent). 

41.  Solution  0:  ‘[m<n]’  is  such  a formula,  but  it  is  not  in  the  spirit  of  this  exercise. 
Solution  1:  Let  xik  mean  that  pigeon  j occupies  hole  k.  Then  the  clauses  are 

{xji  V ■ ■ • V Xjn)  for  1 < j < m and  (xik  V xjk)  for  1 < i < j < m and  1 < k < n. 
[See  S.  A.  Cook  and  R.  A.  Reckhow,  J.  Symbolic  Logic  44  (1979),  36-50;  A.  Haken, 
Theoretical  Comp.  Sci.  39  (1985),  297-308.] 

Solution  2:  Assume  that  n = 2*  and  let  pigeon  j occupy  hole  (xji  . . . Xjt)2.  The 
clauses  (( xn  ®Xji)  V • • • V (xu  ©x^t))  for  1 < i < j < m can  be  put  into  the  CNF  form 
(Vij i V • • • V ytjt ) as  in  exercise  39,  by  introducing  auxiliary  clauses  (ikjk  V xik  V Xjk)  A 
(yijk  V xik  V Xjk)  A (yijk  V xik  V xjk)  A (yijk  V xik  V xjk).  The  total  size  of  this  CNF  is 
©(m2  logn),  compared  to  0(m2n)  in  Solution  1.  If  n is  not  a power  of  2,  O(mlogn) 
additional  clauses  of  size  O(logn)  will  rule  out  inappropriate  values. 

42.  (iVt/)A(zVi)A(i/V2)A(zV  z). 

43.  Probably  not,  because  every  3SAT  problem  can  be  converted  to  this  form.  For 
example,  the  clause  (ari  V x2  V x2)  can  be  replaced  by  (x\  V y V x3)  A (y  V x2)  A (y  V x2 ), 
where  y is  a new  variable  (essentially  equivalent  to  x2). 
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44.  Suppose  f(x)  = f(y)  = 1 implies  f(x&ry)  = 1 and  also  that,  say,  c = x\ Va^Va^VaU 
is  a prime  clause  of  /.  Then  c'  — xi  V X2  V X3  V X4  is  not  a clause;  otherwise  c A c'  — 
X2  V x3  V 0:4  would  also  be  a clause,  contradicting  primality.  So  there’s  a vector  y with 
f(y)  = 1 and  y\  = 1,  J/2  = 0,  j/3  = y4  = 1.  Similarly,  there’s  a 2 with  f(z)  = 1 and 
21  = 0,  22  = 1,  23  = 24  = 1.  But  then  /(y  & 2)  = 1,  and  c isn’t  a clause.  The  same 
argument  works  for  a clause  c that  has  a different  number  of  literals,  as  long  as  at  least 
two  of  the  literals  aren’t  complemented. 

45.  (a)  A Horn  function  f(x  1, . . . ,xn)  is  indefinite  if  and  only  if  it  is  unequal  to  the 
definite  Horn  function  g(x i,...,x„)  = f(x i,...,xn)  V (aq  A •••  Ai„).  So  / g is  a 
one-to-one  correspondence  between  indefinite  and  definite  Horn  functions,  (b)  If  / is 
monotone,  its  complement  / is  either  identically  1 or  an  indefinite  Horn  function. 

46.  Algorithm  C puts  88  pairs  xy  in  the  core:  When  x = a,  b,  c,  0,  or  1,  the  following 
character  y can  be  anything  but  (.  When  x = (,  *,  /,  +,  -,  we  can  have  y = (,  a,  b,  c, 
0,  1;  also  y = - when  x = (,  +,  or  -.  Finally,  the  legitimate  pairs  beginning  with  x = ) 
are  )+,  )-,  )*,  )/,  )). 

47.  The  order  in  which  Algorithm  C brings  vertices  into  the  core  is  a topological  sort, 
since  all  predecessors  of  k are  asserted  before  the  algorithm  sets  TRUTH (xk)  <-  1.  But 
Algorithm  2.2.3T  uses  a queue  instead  of  a stack,  so  the  ordering  it  actually  produces 
is  usually  different  from  that  of  Algorithm  C. 

48.  Let  1 be  a new  variable,  and  change  every  indefinite  Horn  clause  to  a definite 
one  by  ORing  in  this  new  variable.  (For  example,  lw  V y’  becomes  ‘ui  V y V _L’,  namely 
‘w  A y =>■  _L’;  definite  Horn  clauses  stay  unchanged.)  Then  apply  Algorithm  C.  The 
original  clauses  are  unsatisfiable  if  and  only  if  _L  is  in  the  core  of  the  new  clauses.  The 
algorithm  can  therefore  be  terminated  as  soon  as  it  is  about  to  set  TRUTH  (_!_)  4-  1. 

(J.  If.  Quick  thought  of  another  solution:  We  could  apply  Algorithm  C to  the 
function  g constructed  in  the  answer  to  exercise  45(a),  because  / is  unsatisfiable  if  and 
only  if  every  variable  Xj  is  in  the  core  of  g.  However,  indefinite  clauses  of  / such  as  w\/y 
become  many  different  clauses  (wVyVz)  A(wVy  Vr)  A(wVy  V«)  A(wVy  Vu)  A • • • of  g, 
one  for  each  variable  not  in  the  original  clause.  So  Quick’s  suggestion,  which  might 
sound  elegant  at  first  blush,  could  increase  the  number  of  clauses  by  a factor  of  Q(n).) 

49.  We  have  / < g if  and  only  if  / A g is  unsatisfiable,  if  and  only  if  /Ac  is  unsatisfiable 

for  every  clause  c of  g.  But  c is  an  AND  of  literals,  so  we  can  apply  exercise  48. 
[See  H.  Kleine  Biining  and  T.  Lettmann,  Aussagenlogik:  Deduktion  und  Algorithmen 
(1994),  §5.6,  for  further  results  including  an  efficient  way  to  test  if  g is  a “renaming” 
of  /,  namely  to  determine  whether  or  not  there  exist  constants  (yi, . . . ,yn)  such  that 
f(x  1,  • ■■,xn)=  g(  xi  yn).] 

50.  See  Gabriel  Istrate,  Random  Structures  & Algorithms  20  (2002),  483-506. 

51.  If  vertex  v is  marked  A,  introduce  the  clauses  =>  A+{v)  and  B~(v);  if  it  is 
marked  B,  introduce  =>  A ( v ) and  =>  B+(v).  Otherwise  let  v have  k outgoing  arcs 
v -»  «i,  . . . , v -»  uk.  Introduce  the  clauses  A~(uj)  =>  B+(v)  and  B~{uj)  =>  A+(v)  for 
l < j < k.  Also,  if  v is  not  marked  C,  introduce  the  clauses  A+(ui)  A • • • A A+(uk)  => 
B ( v ) and  B+(u  1)  A • • • A B+(uk)  =>  A~ (v).  All  forcing  strategies  are  consequences  of 
these  clauses.  Exercise  2.2.3-28  and  its  answer  provide  further  information. 

Notice  that,  in  principle,  Algorithm  C can  therefore  be  used  to  decide  whether  or 
not  the  game  of  chess  is  a forced  victory  for  the  white  pieces  — except  for  the  annoying 
detail  that  the  corresponding  digraph  is  larger  than  the  physical  universe. 
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52.  With  best  play,  the  results  (see  exercise  51)  are: 


n (a) 

2 0 wins 

3 0 wins 

4 first  player  wins 

5 second  player  wins 

6 second  player  wins 

7 1 loses  if  first 

8 draw 

9 draw 


(b) 

second  player  wins 
first  player  wins 
first  player  wins 
draw 

second  player  wins 
second  player  wins 
draw 
draw 


(c) 

1 wins 

first  player  wins 
first  player  wins 
draw 

1 loses  if  first 
1 loses  if  first 
draw 
draw 


(d) 

second  player  wins 
first  player  wins 
first  player  wins 
1 loses  if  first 
1 loses  if  first 
1 loses  if  first 
1 loses  if  first 
1 loses  if  first 


(Here  “1  loses  if  first”  means  that  the  game  is  a draw  if  player  0 plays  first,  otherwise 
0 can  win.)  Comments:  In  (a),  player  1 has  a slight  disadvantage,  because  f(x)  = 0 
when  xi  . . . xn  is  a palindrome.  This  small  difference  affects  the  result  even  when  n = 7. 
Although  player  1 would  seem  to  be  better  off  playing  Os  in  the  left  half  of  the  board, 
it  turns  out  that  his/her  first  move  when  n = 4 must  be  to  *1**;  the  alternative,  *0**, 
draws.  Game  (b)  is  essentially  a race  to  see  who  can  eliminate  the  last  *.  In  game  (c), 
a random  choice  of  x\  . . . xn  makes  f(x)  = 1 with  probability  Fn+2/ 2n  = ©((</»/ 2)");  in 
game  (d),  this  probability  approaches  zero  more  slowly,  as  0(1/ log  n).  Still,  player  1 
does  better  in  (c)  than  in  (d)  when  n = 2,  5,  8,  and  9;  no  worse  in  the  other  cases. 

53.  (a)  She  should  switch  either  day  1 or  day  2 to  day  3. 

(b,f)  Several  possibilities;  for  example,  change  day  2 to  day  3. 

(c)  This  case  is  illustrated  in  Fig.  6;  change  either  Desert  or  Excalibur  to  Aladdin. 

(d)  Change  either  Caesars  or  Excalibur  to  Aladdin. 

(e)  Change  either  Bellagio  or  Desert  to  Aladdin. 

Of  course  Williams,  who  doesn’t  appear  in  the  cycle  (42),  bears  no  responsibility 
whatever  for  the  conflicts. 


54.  If  x and  x are  both  in  S,  then  u G S <*==>  u G 5,  because  the  existence  of  paths 
from  x to  x and  x to  x and  1 to  a and  u to  x implies  the  existence  of  paths  from  u 
to  x and  x to  u,  hence  from  u to  u and  u to  u. 

55.  (a)  Necessary  and  sufficient  conditions  for  successfully  renaming  a clause  such  as 
X!  V x2  V x3  V x4  are  ( yi  V y2)  A (yi  V y3)  A (yi  V y4 ) A (y2  V y3)  A (y2  V y4)  A ( y3  V y4). 
A similar  set  of  (!j)  clauses  of  length  2 in  the  variables  {yi, . . . , yn  } corresponds  to  any 
clause  of  length  k in  {*1, . . . , x„}.  [H.  R.  Lewis,  JACM  25  (1978),  134-135.] 

(b)  A given  clause  of  length  k > 3 in  {x4, . . . ,xn}  can  be  converted  into  3 (k  - 2) 
clauses  of  length  2,  instead  of  the  (^)  clauses  above,  by  introducing  k — 3 new  variables 
{t2, . . . , tk- 2},  illustrated  here  for  the  clause  x\  V x2  V x3  V x4  V x$: 


(2/1V2/2)  A (yiVt2)  A ( y2\lt2 ) A (t2\/y3)  A (t2Vt3)  A (j/3Vt3)  A ( t3\/y4 ) A ( t3Vy5 ) A (j/4Vys). 

In  general,  the  clauses  from  xi  V • ■ • V xk  are  (tj-i  V yj)  A (tj-i  V tj)  A (yj  V tj)  for 
1 < j < k,  but  with  ti  replaced  by  y3  and  tk- 1 replaced  by  yk\  change  yj  to  yj  if  Xj 
appears  instead  of  Xj.  Do  this  for  each  given  clause,  using  different  auxiliary  variables 
tj  for  different  clauses;  the  result  is  a formula  in  2CNF  that  has  length  < 3 m and  is 
satisfiable  if  and  only  if  Horn  renaming  is  possible.  Now  apply  Theorem  K. 
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[See  B.  Aspvall,  J.  Algorithms  1 (1980),  97-103.  One  consequence,  noted  by  H. 
Kleine  Biining  and  T.  Lettmann  in  Aussagenlogik:  Deduktion  und  Algorithmen  (1994), 
Theorem  5.24,  is  that  any  satisfiable  formula  in  2CNF  can  be  renamed  to  Horn  clauses. 
Notice  that  two  CNFs  for  the  same  function  may  give  different  outcomes;  for  example, 
(x  V y V z)  A (x  V y V z)  A (x  V z)  A (y  V z)  is  actually  a Horn  function,  but  the  clauses 
in  this  representation  cannot  be  converted  to  Horn  form  by  complementation.] 

56.  Here  f(x,  y,  z)  corresponds  to  the  digraph  shown  below  (analogous  to  Fig.  6),  and 
it  can  also  be  simplified  to  y A (x  V z).  Each  vertex  is  a strong  component.  So  the 
formula  is  true  with  respect  to  the  quantifiers  333,  33V,  V33; 
false  in  the  other  cases  V3V,  (any)V(any).  In  general  the  eight 
possibilities  can  be  arranged  at  the  corners  of  a cube,  with  each 
change  from  3 to  V making  the  formula  more  likely  to  be  false. 

57.  Forming  the  digraph  as  in  Theorem  K,  we  can  prove  that  the  quantified  formula 
holds  if  and  only  if  (i)  no  strong  component  contains  both  x and  x;  (ii)  there  is  no  path 
from  one  universal  variable  x to  another  universal  variable  y or  to  its  complement  y; 
(iii)  no  strong  component  containing  a universal  variable  x also  contains  an  existential 
variable  v or  its  complement  v,  when  ‘3d’  appears  to  the  left  of  ‘V.t’.  These  three 
conditions  are  clearly  necessary,  and  they  are  readily  tested  as  the  strong  components 
are  being  found. 

To  show  that  they  are  sufficient,  notice  first  that  if  S is  a strong  component  with 
only  existential  literals,  condition  (i)  allows  us  to  set  them  all  equal  as  in  Theorem  K. 
Otherwise  S has  exactly  one  universal  literal,  Uj  = Xj  or  Uj  = x3\  all  other  literals 
in  S are  existential  and  declared  to  the  right  of  Xj , so  we  can  equate  them  to  u3 . And 
all  paths  into  S in  such  a case  come  from  purely  existential  strong  components,  whose 
value  can  be  set  to  0 because  the  complements  of  such  strong  components  cannot  also 
lead  into  S;  for  if  v and  v imply  Uj,  then  Uj  implies  v and  v. 

[Information  Proc.  Letters  8 (1979),  121-123.  By  contrast,  M.  Krom  had  proved 
in  J.  Symbolic  Logic  35  (1970),  210-216,  that  an  analogous  problem  in  first-order 
predicate  calculus  (where  parameterized  predicates  take  the  place  of  simple  Boolean 
variables,  and  quantification  is  over  the  parameters)  is  actually  unsolvable  in  general.] 

58.  We  can  assume  that  each  clause  is  definite,  by  introducing  ‘_L’  as  in  exercise  48  and 
placing  ‘VJL’  at  the  left.  Call  the  universal  variables  xo,  Xi,  . . . , xm  (where  xo  is  _L)  and 
call  the  existential  variables  yi,  . . . , yn.  Let  lu  -<  v'  mean  that  variable  u appears  to  the 
left  of  variable  v in  the  list  of  quantifiers.  Remove  Xj  from  any  clause  whose  unbarred 
literal  is  yk  when  y k -<  xj.  Then,  for  0 < j < m,  let  Cj  be  the  core  of  the  Horn  clauses 
when  the  additional  clauses  (x0)  A • • ■ A (xj- 1)  A (xj+ 1)  A • • • A (xm)  A /\{( yk)  \ yk  -<  x3 
and  yk  £ Co}  are  appended.  (In  other  words,  Cj  tells  us  what  can  be  deduced  when 
all  the  x’s  except  x3  are  assumed  to  be  true.)  We  claim  that  the  given  formula  is  true 
if  and  only  if  Xj  £ Cj,  for  0 < j < m. 

To  prove  this  claim,  note  first  that  the  formula  is  certainly  false  if  Xj  € Cj  for 
some  j.  (When  yk  G Co  and  yk  -<  Xj  and  Xi  = 1 for  * / j we  must  set  yk  t—  1.) 
Otherwise  we  can  choose  each  yk  to  make  the  formula  true,  as  follows:  If  yk  Co,  set 
yk  t—  0;  otherwise  set  yk  «—  /\{xj  \ yk  £ Cj}.  Notice  that  yk  depends  on  Xj  only  when 
Xj  -<  yk.  Each  clause  c with  unbarred  literal  Xj  is  now  true:  For  if  Xj  = 0,  some  yk 
appears  in  c for  which  yk  £ Cj,  because  Xj  ^ C,;  hence  yk  = 0.  And  each  clause  c with 
unbarred  literal  yk  is  also  true:  If  yk  = 0,  we  either  have  yk  ^ Co,  in  which  case  some 
yi  in  c is  ^ Co,  hence  yi  = 0;  or  yk  £ Co  \ Cj  for  some  j,  in  which  case  some  Xj  = 0 
and  either  Xj  appears  in  c or  some  yi  appears  in  c where  yi  ^ Cj,  making  yi  = 0. 
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[This  solution  is  due  to  T.  Dahlheimer.  See  M.  Karpinski,  H.  Kleine  Biining,  and 
P.  H.  Schmitt,  Lecture  Notes  in  Comp.  Sci.  329  (1988),  129-137;  H.  Kleine  Biining, 
K.  Subramani,  and  X.  Zhao,  Lecture  Notes  in  Comp.  Sci.  2919  (2004),  93-104.] 

59.  By  induction  on  n:  Suppose  /( 0,x2,  ...,*„)  leads  to  the  quantified  results  yi,  ..., 

2/2"-i  ! while  f{l,x2,  ...,xn)  leads  similarly  to  zlt  . . . , z2„_i.  Then  3x1f(x1,x2,  ...,xn) 
leads  to  yi  V zi,  ...,  y2„-i  V z2„-i,  and  \/x1f{x1,x2,...,xn)  leads  to  i/i  A z,(  ..., 
V-i "-i  ■ Now  use  the  fact  that  (yVz)  + (yAz)  = y + z.  [See  Proc.  Mini-Workshop 

on  Quantified  Boolean  Formulas  2 (QBF-02)  (Cincinnati:  May  2002),  1-16.] 

60.  Both  (a)  and  (b).  But  (c)  is  always  0;  (d)  is  always  1;  (e)  is  {xyz);  (f)  is  xVyV  z. 

61.  True  — indeed  obviously  so,  when  w — 0,  and  when  w = 1. 

62.  Since  {xltx2,x3}  C {0, 1},  we  can  assume  by  symmetry  that  xl  equals  x2.  Then 
either  f(xi,  xi,  x3,  x4, . . . , xn)  = f(xi,  xi,  xi,  x4, . . . , xn)  or  f(xi,  xi , x3,  x4, . . . , xn)  = 
f(x 3,xi,X3,x4, . . . ,xn),  assuming  only  that  / is  monotone  in  its  first  three  variables. 

63.  {xyz)  — {xxyyz).  Note:  Emil  Post  proved,  in  fact,  that  a single  subroutine  for  any 
nontrivial  monotone  self-dual  function  will  suffice  to  compute  them  all.  (By  induction 
on  n,  at  least  one  appropriate  way  to  call  such  an  n-ary  subroutine  will  yield  {xyz).) 

64.  [ FOCS  3 (1962),  149-157.]  (a)  If  / is  monotone  and  self-dual,  Theorem  P says 
that  f{x)  = Xk  or  f(x)  = (fi{x)f2{x)f3{x)).  The  condition  therefore  holds  either 
immediately  or  by  induction.  Conversely,  if  the  condition  holds  it  implies  that  / is 
monotone  (when  x and  y differ  in  just  one  bit)  and  self-dual  (when  they  differ  in  all  bits). 

(b)  We  merely  need  to  show  that  it  is  possible  to  define  / at  one  new  point 
without  introducing  a conflict.  Let  x be  the  lexicographically  smallest  point  where 
f(x)  is  undefined.  If  f(x)  is  defined,  set  f(x)  = f{x).  Otherwise  if  f(x')  = 1 for  some 
x'  C x,  set  f{x)  = 1;  otherwise  set  f(x)  = 0.  Then  the  condition  still  holds. 

65.  If  T is  maximal  intersecting,  we  have  (i)  Jf  6 J =>  X £ F,  where  X is  the 
complementary  set  {1,  2, . . . , n}  \ X;  (ii)  X eF  and  X C Y V £ F,  because 
FC{Y}  is  intersecting;  and  (hi)  X$F  =>  X € F,  because  F U {.Y}  must  contain 
an  element  Y C X.  Conversely,  one  can  prove  without  difficulty  that  any  family  F 
satisfying  (i)  and  (ii)  is  intersecting,  and  maximal  if  it  also  satisfies  (iii). 

Punch  line:  All  three  statements  are  simple,  in  the  language  of  Boolean  functions: 
(i)  f(x)  = 1 =►  fix)  = 0;  (ii)  * C y =>  f{x)  < f{y)-  (iii)  f{x)  = 0 =►  f{x)  = 1. 

66.  [T.  Ibaraki  and  T.  Kameda,  IEEE  Transactions  on  Parallel  and  Distributed  Sys- 
tems 4 (1993),  779-794.]  Every  family  with  the  property  that  Q C Q'  implies  Q = Q' 
clearly  corresponds  to  the  prime  implicants  of  a monotone  Boolean  function  /.  The 
further  condition  that  Q n Q'  / 0 corresponds  to  the  further  relation  f{x)  < fix), 
because  fix)  = f{x)  = 1 holds  if  and  only  if  x and  x both  make  prime  implicants  true. 

If  coteries  C and  C'  correspond  in  this  way  to  functions  / and  /',  then  C dominates 
C if  and  only  if  / / /'  and  fix)  < fix)  for  all  x.  Then  /'  is  not  self-dual,  because 
there  is  an  x with  fix)  = 0,  f{x)  = 1;  and  we  have  fix)  — 0,  hence  f{x)  = 0. 

Conversely,  if  f is  not  self-dual,  there’s  a y with  fiy)  = f'{y)  = 0.  If  y = 0 . 0, 

coterie  C'  is  empty,  and  dominated  by  every  other  coterie.  Otherwise  define  fix)  = 
f ix)  V[iD  y].  Then  / is  monotone,  and  f)x)  < f)x)  for  all  x;  so  it  corresponds  to  a 
coterie  that  dominates  C' . 

67.  (a)  By  induction,  if  T(tioo)  = K(toio)  = 1 we  have  a Y of  Is,  because  the  Ys  in 
tioo  and  tow  either  intersect  or  are  adjacent.  Similarly,  if  Y(t10o)  = Y(foio)  = 0 we  do 
not  have  a Y of  Is,  because  there  is  a Y of  0s. 


7.1.1 


ANSWERS  TO  EXERCISES  547 


(b)  This  formula  follows  from  (a)  and  the  fact  that  ( tabc)def  = f(0+d)(6+e)(c+/)  = 
( tdefjabc ■ [Schensted  stated  the  results  of  this  exercise,  and  those  of  exercises  62  and  69, 
in  a 28-page  letter  sent  to  Martin  Gardner  on  21  January  1979.  Milnor  had  written  to 
Gardner  on  26  March  1957  about  a corresponding  game  called  “Triangle.”] 


68. 


o 

oo 

Here  is  one  of  the  258,594  solutions  for  n = 15  that  has  59  black  stones:  0°.*0\°0  • 

(The  answers  for  1 < n < 15  are  respectively  2,  3,  4,  6,  8,  11,  14,  18,  23,  27,  00.*o0.0«0oVo 

33,  39,  45,  52,  59.  The  prime  implicants  for  these  functions  can  be  o*o*o°/o0o*o*o°»*o 

represented  by  fairly  small  ZDDs;  see  Section  7.1.4.)  o°o%0oV»0o0»0»*o0»*o°o 

o**ooo»#ooo*»o 

0t00*#000«i00#0 

oo*#*o****o#*«oo 


69.  The  proof  of  Theorem  P shows  that  we  need  only  prove  Y(T)  < f(x).  A Y in  T 
means  that  we’ve  got  at  least  one  variable  in  each  pj.  Therefore  f(x i, . . . , xn)  = 0,  and 
/(x i, . . . ,x„)  = 1. 

70.  Self-duality  of  g is  obvious  for  arbitrary  t when  / is  self-dual:  g(x)  = (/(x)  V 
[x  = t])A[x^t]  = (f(x)\/[x  = i])A[x^t]  = (/(x)A[x/t])v([x  = t]A[x^i])  = g(x). 

Let  x — xi  ...  Xj-iOxj+i  . . . x„  and  y = x i . . . Xj-ilXj+i  . . . x„;  for  monotonicity 
we  must  prove  that  g(x)  < g(y).  If  x — t or  y = t,  we  have  g(x)  = 0;  if  x = t or  y = t, 
we  have  g(y)  = 1;  otherwise  g(x)  = f(x ) < f(y)  = g(y).  [European  J.  Combinatorics 
16  (1995),  491-501;  discovered  independently  by  J.  C.  Bioch  and  T.  Ibaraki,  IEEE 
Transactions  on  Parallel  and  Distributed  Systems  6 (1995),  905-914.] 

71.  (( xyz)uv ) = ((( xyz)uv)uv ) = ((( yuv)x(zuv))uv ) — ((yuv)(xuv)((zuv)uv))  = 
( (xuv)  ( yuv ) (zuv)) . 

72.  For  (58),  v = (uvu)  = u.  For  (59),  (uyv)  = ( vu(xuy ))  = (( vux)uy ) = (xuy)  = y. 
And  for  (60),  ( xyz ) = (( xuv)yz ) = (x{uyz)(vyz))  = (xyy)  = y. 

73.  (a)  If  d(u,v ) = d{u,x)  + d(x,v),  we  obviously  obtain  a shortest  path  of  the  form 
u — • • • — x — ■ • • — v.  Conversely,  if  [uxv],  let  u — • • • — x — • • • — v be  a 
shortest  path,  with  l steps  to  x followed  by  m steps  to  v.  Then  d(u,  v)  = l + m > 
d(u,  x)  + d(x,  v)  > d(u,  v). 

(b)  For  all  z,  ( zxu ) = (z(vux) (yux))  = (( zvy)ux ) 6 {(yux) , (vux)}  = {u,x}. 

(c)  We  can  assume  that  d(x,u)  > d(x,v)  > 0.  Let  u y — v be  a shortest 

path,  and  let  w = (xuy).  Then  (raw)  = (v (vux) (wux))  = ((vvw)ux)  = (vux)  — x, 
soi  £ [«>..«].  We  have  [uwy\,  because  d(u,y)  < d(u,v)  and  w 6 [u..y\.  If  w / u 
we  have  d(w,v)  < d(u,v)\  hence  [wxv\,  hence  [uxv].  If  ui  = u we  have  x — u by  (b). 
But  d(x,u)  > d(x,v)\  therefore  x — v,  and  [uxv]. 

(d)  Let  y = (uxv).  Since  y € [u..x],  we  have  d(u,x)  = d(u,y)  + d(y,x)  by  (a) 
and  (c).  Similarly,  d(u,v)  = d(u,y)  +d(y,v)  and  d(x,v)  = d(x,y)  + d(y,v).  But  these 
three  equations,  together  with  d(u,v)  = d(u,x)  + d(x,v),  yield  d(x,y)  = 0.  [Proc. 
Amer.  Math.  Soc.  12  (1961),  407-414.] 

74.  w = (yxw)  = (yx(zxw))  = (yx(zx(yzw)))  = (( yxz)x(yzw ))  = ( x(xyz)(wyz ))  = 
((xxw)yz)  = (xyz)  by  (55),  (55),  (55),  (52),  (51),  (53),  and  (50). 

75.  (a)  If  ui  = (xxy)  we  have  [xwx]  by  (iii),  hence  w = x by  (i). 

(b)  Axiom  (iii)  and  part  (a)  tell  us  that  [xxy]  is  always  true.  So  we  can  set  x = y 
in  (ii)  to  conclude  that  [uxv]  4=>  [ vxu ].  The  definition  of  (xyz)  in  (iii)  is  therefore 
perfectly  symmetrical  between  x,  y,  and  z. 

(c)  By  the  definition  of  (uxv)  in  (iii),  we  have  x = (uxv)  if  and  only  if  [uxx],  [uxv], 
and  [xxv] . But  we  know  that  [uxx]  and  [xxv]  are  always  true. 
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• hlS,St6p  and  subsequent  steps,  we  will  construct  one  or  more  auxiliary 

points  of  M and  then  use  Algorithm  C to  derive  every  consequence  of  the  betweenness 
relations  that  are  known.  (The  axioms  have  the  convenient  form  of  Horn  clauses  ) For 
example,  here  we  define  * = ( xyv >,  so  that  we  know  [uxy],  [uyv],  [**„],  [**,],  and 
[yzv\.  From  these  hypotheses  we  deduce  [uzy]  and  [uzv\.  So  z = (uyv)  = y. 

(e)  The  hinted  construction  implies,  among  other  things,  [utv],  [utz],  [vtz\,  [uwv  1, 
[muz],  H;  hence  t _ w.  (A  computer  program  is  helpful  here.)  Adding  the  hypothe- 
ses [rre],  [rwz],  [swz]  now  yields  [xyz]  as  desired;  it  also  turns  out  that  r=pands  = 9. 

(f)  Let  r = (ynv),  s = (; zuv ),  t = {xyz),  p = ( xrs ),  q = (tuv);  then  \pqp]  flows  out. 
[Proc.  Amer.  Math.  Soc.  5 (1954),  801-807.  For  early  studies  of  betweenness  axioms, 
see  E.  V.  Huntington  and  J.  R.  Kline,  Trans.  Amer.  Math.  Soc.  18  (1917),  301-325.] 

76.  Axiom  (i)  obviously  holds,  and  axiom  (ii)  follows  from  commutativity  and  („). 

e answer  to  exercise  74  derives  (iii)  from  the  identity  (xyz)  = (x(xyz)(wyz))-  so  we 

need  only  verify  thaUormula:  (x(xyz)(Wyz))  = ((yxz)x(wyz))  = (((yxz)xz)x(lyz))  = 
(( yxz)x(zx(wyz )))  - ( x(xyz)(z(xyz)w » = (( x(xyz)z)(xyz)w ) = {(xyz)(xyz)w) . 

Notes:  The  original  treatment  of  median  algebra  by  Birkhoff  and  Kiss  in  Bull, 
mer.  Math.  Soc.  53  (1947),  749-752,  assumed  (50),  (51),  and  the  short  distributive 
aw  (53)-  he  fact  that  associativity  (52)  actually  implies  distributivity  was  not  realized 
untfl  many  years  later;  M.  Kolibiar  and  T.  Marcisova,  Matematicky  Casopis  24  (1974) 
179  185  proved  it  via  Sholander’s  axioms  as  in  this  exercise.  A mechanical  derivation 

of  (53)  from  (50)— (52)  was  found  in  2005  by  R.  Veroff  and  W.  McCune  using  an 
extension  of  the  Otter  theorem  prover. 

77.  (a)  In  coordinate  r~s  of  the  labels,  suppose  l(r)  has  a 0 and  l(s)  has  a 1;  then 
the  left  vertices  have  0 in  that  coordinate.  If  u — v — where  u and  v!  are  on  the  left 
but  u is  on  the  right,  (uu  v)  lies  on  the  left.  But  [u  . . v]  n [u' . . v]  = {«},  unless  u = u' . 

(b)  This  statement  is  obvious,  by  Corollary  C. 

(c)  Suppose  u v and  u'  — v' , where  u and  u'  are  on  the  left,  v and  v'  are 

°n  _ 6 ,rlght'  Let  u - no  • • • — vk  = v'  be  a shortest  path,  and  let  u0  = u, 
uk-u.  All  vertices  Vj  he  on  the  right,  by  (b).  The  left  vertex  Ul  = (u0vlUk)  must 
be  a common  neighbor  of  u0  and  vu  since  the  distance  d(u0,  m)  = 2.  (We  cannot  have 
ui  - uo,  because  that  would  imply  the  existence  of  a shortest  path  from  v to  v'  going 
through  the  left  vertex  u.)  Therefore  m has  rank  1;  and  so  do  v2,  ...,  by  the 

Commentationes  Mathematical  Universitatis  Carolina: 
iz  (tatij,  317  325;  M.  Mulder,  Discrete  Math.  24  (1978),  197-204.1 

(d)  These  steps  visit  all  vertices  v of  rank  1 in  order  of  their  distance  d(v,  s)  from  s. 
it  such  a v has  a late  neighbor  u not  yet  seen,  the  rank  of  u must  be  1 or  2.  If  the 
rank  is  1,  u will  have  at  least  two  early  neighbors,  namely  v and  the  future  MATE  Cm). 
btep  18  bases  its  decision  on  an  arbitrary  early  neighbor  w of  u such  that  w / v.  Since 

1’  1 Tv2:  the  Vert!X  * = {SVW)  haS  rank  1 by  If  w has  rank  0,  then  x — v\  so  u has 
rank  1.  Otherwise  d(x,  s)  < d(v,  a),  and  the  rank  of  w was  correctly  determined  when 

x was  visited.  If  w has  rank  1,  u lies  on  a shortest  path  from  v to  w;  if  w has  rank  2 w 
lies  on  a shortest  path  from  u to  s.  In  both  cases  u and  w have  the  same  rank,  by  (c). 

(e)  The  algorithm  removes  all  edges  equivalent  tor  — s,  by  (a)  and  (d).  Their 
removal  clearly  disconnects  the  graph;  the  two  pieces  that  remain  are  convex  by  (b) 
so  they  are  connected  and  in  fact  they  are  median  graphs.  Step  17  records  all  of 
the  relevant  relations  between  the  two  pieces,  because  all  4-cycles  that  disappear  are 
examined  there.  By  induction  on  the  number  of  vertices,  each  piece  is  properly  labeled. 
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78.  Every  time  v appears  in  step  14,  it  loses  one  of  its  neighbors  Uj.  Each  of  these 
edges  v — Uj  corresponds  to  a different  coordinate  of  the  labels,  so  we  can  assume 
that  l(v)  has  the  form  alk  for  some  binary  string  a.  The  labels  for  ui,  U2,  ...,  Uk 
are  then  aOl"-1,  al01fc~2,  ...,  alfc-10.  By  taking  componentwise  medians,  we  can 
now  prove  that  all  2k  labels  of  the  form  aft  occur  for  vertices  in  the  graph,  since 
(( a/3)(a(3')(0  ...  0))  is  the  bit  string  a(/3  & /3'). 

79.  (a)  If  l(v)  — k,  exactly  v{k)  smaller  vertices  are  neighbors  of  v. 

(b)  At  most  [n/2j  Is  appear  in  bit  position  j,  for  0 < j < [lgn]. 

(c)  Suppose  exactly  k vertices  have  labels  beginning  with  0.  At  most  min(fc,  n — k) 
edges  correspond  to  that  bit  position,  and  at  most  f(k)  + f(n  — k)  other  edges  are 
present.  But 

f(n)  = max  (min(fc, n — k)  + f(k ) + f(n  — fc)), 

because  the  function  g(m,  n)  = f(m  + n)  — m — f(m)  — f(n)  satisfies  the  recurrence 

g(2m  + a,2 n + b)  = ab  + g(m  + a,  n)  + g(m,  n + b)  for  0 < a,  b < 1. 

It  follows  by  induction  that  g(m,m)  = g(m,m  + 1)  = 0,  and  that  g(m,n)  > 0 when 
m < n.  [Annals  of  the  New  York  Academy  of  Sciences  175  (1970),  170-186;  D.  E. 
Knuth,  Proc.  IFIP  Congress  1971  (1972),  24.] 

80.  (a)  (Solution  by  W.  Imrich.)  The  graph  with  vertex  labels  0000,  0001,  0010,  0011, 
0100,  0110,  0111,  1100,  1101,  1110,  1111  cannot  be  labeled  in  any  essentially  different 
way;  but  the  distance  from  0001  to  1101  is  4,  not  2. 

(b)  The  cycle  C2m  is  a partial  cube,  because  its  vertices  can  be  labeled  l(k)  = 
lfc0m_fc,  l(m  + k)  = oiTm-A:  for  0 < k < m.  But  the  bitwise  median  of  1(0),  l(m  — 1), 
and  l(m  + 1)  is  01m_20;  and  indeed  those  vertices  don’t  have  a median,  when  m > 2. 

81.  Yes.  A median  graph  is  an  induced  subgraph  of  a hypercube,  which  is  bipartite. 

82.  The  general  case  reduces  to  the  simple  case  where  G has  only  two  vertices  {0, 1}, 
because  we  can  operate  componentwise  on  the  median  labels,  and  because  d(u,  v)  is 
the  Hamming  distance  between  l(u)  and  l{v). 

In  the  simple  case,  the  stated  rule  sets  Uk  «—  Vk  except  when  Uk-i  = Vk-i  = 
Vk+ i ^ Vk,  and  it  is  readily  proved  optimum.  (Other  optimum  possibilities  do  exist, 
however;  for  example,  if  V0V1U2U3  = 0110,  we  could  set  U0U1U2U3  = 0000.) 

[This  problem  was  motivated  by  the  study  of  self-organizing  data  structures.  F.  R. 
K.  Chung,  R.  L.  Graham,  and  M.  E.  Saks,  in  Discrete  Algorithms  and  Complexity  (Aca- 
demic Press,  1987),  351-387,  have  proved  that  median  graphs  are  the  only  graphs  for 
which  Uk  can  always  be  chosen  optimally  as  a function  of  (no,  vi, . . . , Vfc+i),  regardless 
of  the  subsequent  values  (vk+ 2,  ■ ■ ■ , t>t).  They  have  also  characterized  all  cases  for  which 
a given  finite  amount  of  lookahead  will  suffice,  in  Combinatorica  9 (1989),  111-131.] 

83.  Consider  first  the  Boolean  (two- vertex)  case,  and  let  an  optimum  solution  be 
obtainable  by  the  recursive  rules  uo  t—  i’o  and  Uj  t—  ft+2-j(u,j-x,  Vj, . . . , vt)  for  1 < 
j < t,  where  each  /*,  is  a suitable  Boolean  function  of  k variables.  The  first  function 
ft+i(vo,  vi, . . . , vt)  actually  depends  on  its  “most  remote”  variable  vt,  because  we  must 
have  /2fc+i(0, 1, 1, 0, 1, 0, 1, ... , 0, 1,  Q,x)  = x when  p = 1 — e and  k > 2. 

One  suitable  function  ft+ 1 can  be  obtained  as  follows:  Let  /t+i(0,ui, . . . ,Vt ) = 0 
if  v\  = 0.  Otherwise  let  the  “runs”  of  the  input  sequence  be 

V0V1  . . .Vt  = 01afe0a'I_1  . . . la20ai  or  01afc0Ofc-1  . . . la30°2lai, 
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where  ak, . . . , ai  > 1,  and  let  a,  = 2 - djp  = max(0,  2 - ajp)  for  1 < j < k.  Then 

/*+i(0,  Vi, . ,.,vt)  = [ak  - (ak- 1 -(•••-  (a2  - (1  - axp))  •••))  = 0]. 

Also  let  ft+i  (1,  Vi, . ..,vt)  = /t+i(0,  Vi, . . .,vt),  so  that  ft+1  is  self-dual. 

With  a somewhat  delicate  proof  one  can  show  that  ft+i  is  also  monotone. 
Therefore,  by  Theorem  P,  we  can  apply  ft+1  componentwise  to  the  labels  of  an 
arbitrary  median  graph,  always  staying  within  the  graph. 

84.  There  are  81  such  functions,  each  of  which  can  be  represented  as  the  median  of  an 
odd  number  of  elements.  Seven  types  of  vertices  occur: 

Cases  Adjacent  to  Degree 

5 ( vwxyzzz } 1 

5 (z),  ( wxyzz ) 5 

20  (vwxyzzz),  ( vwxxyyzzz ) 4 

30  (xyz),  (wxyzz),  (vwxyyzz)  5 
10  (vwxxyyzzz) , (vwxyz)  7 

1 (vwxyyzz)  10 

10  (vwxxyyzzz)  3 

[Von  Neumann  and  Morgenstern  enumerated  these  seven  types  in  their  book  Theory 
of  Games  and  Economic  Behavior  (1944),  §52.5,  in  connection  with  the  study  of  an 
equivalent  problem  about  systems  of  winning  coalitions  that  they  called  simple  games. 
The  graph  for  six-variable  functions,  which  has  2646  vertices  of  30  types,  appears  in  the 
paper  by  Meyerowitz  cited  in  exercise  70.  Only  21  of  those  types  can  be  represented  as 
a simple  median-of-odd;  a vertex  like  ((abd) (ace) (bcf)) , for  example,  has  no  such  repre- 
sentation. Let  the  corresponding  graph  for  n variables  have  Mn  vertices;  P.  Erdos  and 
N.  Hindman,  in  Discrete  Math.  48  (1984),  61-65,  showed  that  lg Mn  is  asymptotic  to 
(ln/2j)-  D-  Kleitman,  in  J.  Combin.  Theory  1 (1966),  153-155,  showed  that  the  vertices 
for  distinct  projection  functions  like  x and  y are  always  furthest  apart  in  this  graph.] 

85.  Every  strong  component  must  consist  of  a single  vertex;  otherwise  two  coordinates 
would  always  be  equal,  or  always  complementary.  Thus  the  digraph  must  be  acyclic. 

Furthermore,  there  must  be  no  path  from  a vertex  to  its 
complement;  otherwise  a coordinate  would  be  constant. 

When  these  two  conditions  are  satisfied,  we  can  prove  that 
no  vertex  x is  redundant,  by  assigning  the  value  0 to  all  vertices 
that  precede  x or  x,  assigning  1 to  all  vertices  that  follow,  and 
giving  appropriate  values  to  all  other  vertices. 

(Consequently  we  obtain  a completely  different  way  to  rep- 
resent a median  graph.  For  example,  the  digraph  shown  corresponds  to  the  median 
graph  whose  labels  are  {0000,0001,0010,0011,0111,1010}.) 

86.  Yes.  By  Theorem  P,  any  monotone  self-dual  function  maps  elements  of  X into  X. 

87.  Here  the  topological  ordering  7654321  1234567  can  replace  (72);  we  get 


f= 

— j — 

' 1 

’ 

1 

da 

- 

da 

5 

1 

[3 

1 

— X 

(Consecutive  inverters  on  the  same  line  can,  of  course,  be  canceled  out.) 


Type  Typical  vertex 

1 (z) 

2 (vwxyzzz) 

3 (wxyzz) 

4 ( vwxxyyzzz ) 

5 (vwxyyzz) 

6 (vwxyz) 

7 (xyz) 
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88.  A given  value  of  d contributes  at  most  6 \t/d\  units  of  delay  (for  2\t/d~\  clusters). 

89.  Suppose  first  that  the  new  condition  is  * -»  j while  the  old  was  i!  — > j',  where 

i < j and  i < j'  and  there  are  no  complemented  literals.  The  new  module  changes 
xi  ■ ■ ■ xt  to  2/i  ...  yt,  where  yt  = Xi  A Xj,  yj  = Xi  V Xj,  and  yk  --  Xk  otherwise.  We 
certainly  have  < yy  when  IT  {i,j}  = 0.  And  there  is  no  problem  if  i = i1 , 

since  y */  = y*  < Xi  = xp  < xy  = yji . But  the  case  i = j ’ is  trickier:  Here  the  relations 
i —>  i and  i — > j imply  also  i'  — > j]  and  this  relation  has  been  enforced  by  previous 
modules,  because  modules  have  been  appended  in  order  of  decreasing  distance  d in  the 
topological  ordering  u\...u2t ■ Therefore  yp  = xp  < Xj  and  iji>  < Xjt  = x,,  hence 
y,/  < Xi  A Xj  = yi  = yy.  A similar  proof  works  when  j = i'  or  j = j1 . 

Finally,  with  complemented  literals,  the  construction  cleverly  reduces  the  general 
case  to  the  uncomplemented  case  by  inverting  and  un-inverting  the  bits. 

90.  When  t = 2,  1,1.  does  the  job.  The  general  case  follows  recursively  from  this 
building  block  by  reducing  t to  \t/ 2"j. 

[The  study  of  Cl-nets,  and  other  networks  of  greater  generality,  was  initiated  by 
E.  W.  Mayr  and  A.  Subramanian,  J.  Computer  and  System  Sci.  44  (1992),  302-323.] 

91.  The  answer  does  not  yet  seem  to  be  known  even  in  the  special  case  when  the 
median  graph  is  a free  tree  (with  t + 1 vertices),  or  in  the  monotone  case  when  it  is  a 
distributive  lattice  as  in  Corollary  F.  In  the  latter  case,  inverters  may  be  unnecessary. 

93.  Let  dx(u,v)  be  the  number  of  edges  on  a shortest  path  between  u and  v,  when 
the  path  lies  entirely  within  X.  Clearly  dx(u,v ) > da{u,v).  And  if  u = uo  — ui  — 

Uk  = v is  a shortest  path  in  G,  the  path  u = f(u o)  — f(u i) f(uk)  = v 

lies  in  X when  / is  a retraction  from  G to  A;  hence  dx(u,v)  < da(u,v). 

94.  If  / is  a retraction  of  the  f-cube  onto  X,  two  different  coordinate  positions  cannot 
always  be  equal  or  always  complementary  for  all  x £ X,  unless  they  are  constant. 
For  if,  say,  all  elements  of  X have  the  forms  00*  ...  * or  11*  . . . *,  there  would  be  no 
path  between  vertices  of  those  two  types,  contradicting  the  fact  that  X is  an  isometric 
subgraph  (hence  connected). 

Given  x,y,z  6 X , let  w = ( xyz ) be  their  median  in  the  t-cube.  Then  f(w)  £ 
[x  ■ . y]  n [x  . . z\  IT  [y  . . z\,  because  (for  example)  f(w)  lies  on  a shortest  path  from  x to  y 
in  X.  So  f(w)  = w,  and  we  have  proved  that  w £ X.  [This  result  and  its  considerably 
more  subtle  converse  are  due  to  H.  J.  Bandelt,  J.  Graph  Theory  8 (1984),  501-510.] 

95.  False  (although  the  author  was  hoping  otherwise);  the  network  at  the  ; 

right  takes  0001  i->  0000,  0010  i->  0011,  1101  i-»  0110,  but  nothing  i— ► 0010.  — » — 

t (The  set  of  all  possible  outputs  appears  to  have  no  easy  char-  , I 
— — ■ I acterization,  even  when  no  inverters  are  used.  For  example, 

j ' .j  ~ the  pure-comparator  network  at  the  left,  constructed  by  Tomas  Feder, 
takes  000000  i->  000000,  010101  ^ 010101,  and  101010  ^ 011001,  but 
nothing  010001.  See  also  exercises  5.3.4-50,  5.3.4-52.) 

96.  No.  If  / is  a threshold  function  based  on  real  parameters  w = (w\ , . . . , wn)  and  t, 
let  max{w  • x | f(x)  = 0}  = t — e.  Then  e > 0,  and  / is  defined  by  the  2n  inequalities 
w-x-t>  0 when  f(x)  = l,t-w-x-e>0  when  f(x)  = 0.  If  A is  any  M x N matrix 
of  integers  for  which  the  system  of  linear  inequalities  Av  > (0, . . . , 0)T  has  a real- valued 
solution  v = (vi, . . . , vn)t  with  vn  > 0,  there  also  is  such  a solution  in  integers.  (Proof 
by  induction  on  N .)  So  we  can  assume  that  w\,  . . . , wn,  t,  and  e are  integers. 

[A  closer  analysis  using  Hadamard’s  inequality  (see  Eq.  4.6.1  (25))  proves  in 
fact  that  integer  weights  of  magnitude  at  most  (n  + l)("+1)/2/2n  will  suffice;  see 
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S.  Muroga,  I.  Toda,  and  S.  Takasu,  J.  Franklin  Inst.  271  (1961),  376-418,  Theorem  16. 
Furthermore,  exercise  112  shows  that  weights  nearly  that  large  are  sometimes  needed.] 

97.  (lllllxix2),  (lllxix2),  (laqx2),  (0xix2),  (000xix2),  (00000x1x2). 

98.  We  may  assume  that  f(x i, . . . , xn)  = (y™1  . . . y“"),  with  positive  integer  weights  Wj 
and  with  w\  + ■ ■ ■ + wn  odd.  Let  <5  be  the  minimum  positive  value  of  the  2n  sums 
±u>i  ± • • • ± wn,  with  n independently  varying  signs.  Renumber  all  subscripts  so  that 

wi  H V wk  - wk+i wn  - S.  Then  wxyi  -\ b wnyn  > i(Wi  4 b w„)  <=> 

wi{yi  - |)  4 b wn(yn  - |)  > 0 4=>  w1(y1  - §)  4 b wn(yn  - f ) > - S/2  -t=> 

W\y\  4 b wnyn  > |( Wi  + ■ ■ ■ + wn  — (wi  + b wk  — Wk+i  — • ■ ■ — wn))  = wk+1  + 

b wn  4=b  wiyi  + b wkyk  - Wk+iijk+i  — • • • — w„yn  > 0. 

99.  We  have  [xi  + ■ ■ ■ + x2s_!  + s(Vl  + ■ ■ ■ + y2t_2)  > sf]  = H 1-  rua.-ij/sj  + 

yi-i bj/2t-2>t];  and  [(xi  4 bx2s_x)/sj  = [xi  4 bx2s_x  >s], 

(For  example,  ({ xyz)uv ) = (xyzu2 v2) , a quantity  that  we  also  know  is  equal  to 
(x{yuv)(zuv))  and  {{xuv) (yuv) (zuv))  by  Eqs.  (53)  and  (54).  Reference:  C.  C.  Elgot, 
FOCS  2 (1961),  238.) 

100.  True,  because  of  the  preceding  exercise  and  (45). 

101.  (a)  When  n = 7 they  are  x7Ax6,  x6Ax5,  x7AxsAx4,  x6Ax4Ax3,  x7Ax5  Ax3Ax2, 
xq  A x4  A x2  A Xi  , x7  A X5  A x3  A xi ; and  in  general  there  are  n prime  implicants,  forming 
a similar  pattern.  (We  have  either  xn  = xn_i  or  xn  = x„_i.  In  the  first  case, 
xn  AXn-i  is  obviously  a prime  implicant.  In  the  second  case,  Fn(x  1, . . . ,xn-i,xn-i)  = 
Fn- i(xi, . . . ,xn_i);  so  we  use  the  prime  implicants  of  the  latter,  and  insert.  Xn  when 
xn_i  does  not  appear.) 

(b)  The  shelling  pattern  (0000011,  0000110,  0001101,  0011010,  0110101,  1101010, 
1010101)  for  n = 7 works  for  all  n. 

(c)  Two  of  several  possibilities  for  n = 7 illustrate  the  general  case: 


( X6  ^ 

f xe  \ 

X7  X5 

x7  x5 

X6  xe  x4 

= Y 

X6  Xq  X4 

x7  x5  x7  x3 

X7  X5  X5  X3 

Xq  Xq  X4  Xq  X“2 

X6  Xq  X4  X4  X2 

\ Xj  Xq  X7  X3  X7  Xi  J 

\X7  X5  X5  X3  X3  Xi  / 

[The  Fibonacci  threshold  functions  were  introduced  by  S.  Muroga,  who  also  discovered 
the  optimality  result  in  exercise  105;  see  IEEE  Transactions  EC-14  (1965),  136-148.] 

102.  (a)  By  (1)  and  (2),  /(x o,xj, . . . ,xn)  is  the  complement  of  /(x0,x  1, . . . ,xn). 

(b)  If  / is  given  by  (75),  / is  [(w  + 1 - 2 t)x0  + wlXl  + ■ ■ ■ + wnxn  > w -b  1 - t], 

where  w = wi  4 b wn.  Conversely,  if  / is  a threshold  function,  so  is  f(x  1, . . . ,xn)  = 

/(l,xi,  • ■ • ,x„).  [E.  Goto  and  H.  Takahasi,  Proc.  IFIP  Congress  (1962),  747-752.] 

103.  [See  R.  C.  Minnick,  IRE  Transactions  EC-10  (1961),  6-16.]  We  want  to  minimize 

®i  4 bt«n  subject  to  the  constraints  Wj  > 0 for  1 < j < n and  (2ei  - l)wi  4 b 

{2en  — l)wn  > 1 for  each  prime  implicant  x'1  A-  • -Ax'".  For  example,  if  n = 6,  the  prime 
implicant  x2  A x5  A x6  would  lead  to  the  constraint  —wi  4-  w2  — w3  — u>4  4-  w5  + we  > 1. 
If  the  minimum  is  -boo,  the  given  function  is  not  a threshold  function.  (The  answer  to 
exercise  84  gives  one  of  the  simplest  examples  of  such  a case.)  Otherwise,  if  the  solution 
(w wn)  involves  only  integers,  it  minimizes  the  desired  size.  When  noninteger 
solutions  arise,  additional  constraints  must  be  added  until  the  best  solution  is  found, 
as  in  part  (c)  of  the  following  exercise. 
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104.  First  we  need  an  algorithm  to  generate  the  prime  implicants  x\'  A • • • A x®"  of  a 
given  majority  function  (x”1  . . . x™n),  when  w\  > • • • > wn  and  wi  A b wn  is  odd: 

Kl.  [Initialize.]  Set  t <—  0.  Then  for  j = n,  n — 1,  . . . , 1 (in  this  order),  set 
aj  «-  t,  t <-  t + Wj,  ej  <—  0.  Finally  set  t <—  (t  + l)/2,  si  «—  0,  and  l <—  0. 

K2.  [Enter  level  l.\  Set  l «—  l + 1,  ej  «—  1,  s/+i  <—  s;  + u>(. 

K3.  [Below  threshold?]  If  s/+i  < t,  return  to  K2. 


K4.  [Visit  a prime  implicant.]  Visit  the  exponents  (ei, . . . , e„). 

K5.  [Downsize.]  Set  e;  t—  0.  Then  if  + a;  > t,  set  si+i  e-  s/  and  go  to  K2. 

K6.  [Backtrack.]  Set  l t—  l — 1.  Terminate  if  l = 0;  otherwise  go  to  K5  if  e;  = 1; 
otherwise  repeat  this  step.  | 

(a)  {xix\x\x\x%x%x\°x}?)  (21  prime  implicants). 

(b)  The  optimum  weights  for  (xj6_2txf  x\x\x^)  are  W0W1W2W3W4  = 10000,  31111, 
21110,  32211,  11100,  23211,  12110,  13111,  01000,  for  0 < t < 8;  the  other  cases  are  dual. 

(c)  Here  the  optimum  weights  (rai,...,wi0)  are  (29,25, 19, 15, 12, 8, 8, 3,3, 0)/2; 
so  we  learn  that  xio  is  irrelevant,  and  we  must  deal  with  fractional  weights.  Constrain- 
ing ws  > 2 gives  integer  weights  (15,13,10,8,6,4,4,2,1,0),  which  must  be  optimum 
because  their  sum  exceeds  the  previous  sum  by  2.  (Only  two  of  the  175,428  self-dual 
threshold  functions  on  nine  variables  have  nonintegral  weights  minimizing  wi  + - ■ ■+wn-1 
the  other  one  is  (x^x^x^x^xsxlxyxlxg}.  The  largest  w\  in  a minimum  representa- 
tion occurs  in  (x^x^x^x^xpx^xfxlxg);  the  largest  w\  + ■ ■ ■ + wg  occurs  uniquely 
in  (x?4x22x38x47x§4x|°x78x|5x91),  which  is  also  an  example  of  the  largest  u>g.  See 
S.  Muroga,  T.  Tsuboi,  and  C.  R.  Baugh,  IEEE  Transactions  C-19  (1970),  818-825.) 


105.  When  n = 7,  the  inequalities  generated  in  exercise  103  are  W7+We  — W3  — W4—W3  — 

W2—W1  > 1,  —W7+We+W5—W4—W3—W2—Wl  > 1,  W7 —W6 +Ws +W4 — W3  ~W2  ~Wl  > 1, 
—W7  + we  — W$  + W4  + U>3  — W2  — Wi  > l,  u)7  — Ws  + W5  — W4  + u>3  + w2  — U>1  > 1, 
— W7  + u)6  — W5  + W4  — W3+W2  + W1  > l,  W7  —wq+ws  —W4+W3  — w2+wi  > 1.  Multiply 

them  respectively  by  1,  1,  2,  3,  5,  8,  5 to  get  w\  -I b w7  > 1 + 1 + 2 + 3 + 5 + 8 + 5. 

The  same  idea  works  for  all  n > 3. 


106.  (a)  (x2  x!" 


-2" 

■Xn  Vl 


1_2" 

2/2 


. . . ynz).  (By  exercise  99,  we  could  also  perform 


n medians-of-three:  ({ . . . ( xnynz ) . . . x2y2)x1yi).) 

(b)  If  (xj^xl]2  . . . x^yi'y”2  . . . ijnnzw ) solves  the  problem,  2n+1  - 1 basic  inequal- 
ities need  to  hold;  for  example,  when  n = 2 they  are  ui  + u2  — fi  + v2  — w > 1, 

Ui  + U2  — Vl  — V2  + W > 1,  U\  — U2  + Vl  — V2  — W > 1,  Kl  — U2  — 1>1  + V2  + W > 1, 
—Ui  + U2  + Vl  + V2  — W > 1,  —Ul  + U2  + Vl  — V2  + W > 1,  —Ui  — U2  + Vl  + V2  + W > 1. 

Add  them  all  up  to  get  ui  + u2  + b un  + Vi  + v2  + ■ ■ ■ + vn  + w > 2n+1  — 1. 


107.  / N(f)  E (/) 

/ N(f)  E (/) 

/ N(f)  E (/) 

/ N{f)  E (/) 

_L  0 (0,0) 

C 1 (0,1) 

V 1 (0,0) 

L 2 

(0,1) 

A 1 (1,1) 

R 2 (1,2) 

= 2 (1,1) 

D 3 

(1,2) 

3 1 (1,0) 

© 2 (1,1) 

R 2 (1,0) 

A 3 

(1,1) 

L 2 (2,1) 

V 3 (2,2) 

C 3 (2,1) 

T 4 

(2,2) 

Notice  that  ® and  = have  the  same  parameters  N(f)  and  E (/);  they  are 
Boolean  binary  operations  that  aren’t  threshold  functions. 

the  only 
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108.  If  £ (g)  — (so,  Si, . . . , sn),  the  value  of  g is  1 in  s0  cases  when  a;0  = 1 and  in  2"  —so 
cases  when  x0  = 0.  We  also  have  £(/0)  + £(/i)  = (sx, . . . , sn),  and 

i i 

S(/o)  = (xi,. . .,xn)g(0,x1,.  ..,xn) 

xi=0  xn=0 
1 1 

= £•••£  ((1,  • • • , 1)  — (®1>  ■ ■ ■ ,Xn))(l  — g(l,  Xl,  . . . ,Xn)) 

xi=0  xn=0 

= (2n_1  — s0, ... , 2n_1  — s0)  + £(/i). 

So  the  answers,  for  n > 0,  are  (a)  N(f0)  = 2n  - s0,  £(/o)  = ±(si  - s0  + 2n~\. . . , 
s„  - so  + 2n~1);  (b)  JV(/x)  = so,  E(/i)  = §(Sl  + s0  - 2™“*, . . . , sn  + s0  - 2"-1). 
[Equivalent  results  were  presented  by  E.  Goto  in  lectures  at  MIT  in  1963.] 

109.  (a)  ai  + • • • + a,k  > b\  + • • • + bk  if  and  only  if  k — ai  — — a*,  < k — bi  — ■ ■ ■ — bk. 

(b)  Let  a+  = (ai,ai+a2, . . . ,aH h an).  Then  the  vector  (ci, . . . ,c„)  obtained 

by  componentwise  minimization  of  a+and  /3+  is  (aA/3)+.  (Clearly  cj  = cj-1  + aj  or  bj.) 

(c)  Proceed  as  in  (b)  but  with  componentwise  maximization ; or  take  a A /3. 

(d)  True,  because  max  and  min  satisfy  these  distributive  laws.  (In  fact,  we  ob- 
tain a distributive  mixed-radix  majorization  lattice  in  a similar  way  from  the  set  of 
all  n-tuples  ffli  . . . an  with  0 < aj  < rrij  for  1 < j < n.  R.  P.  Stanley  has  observed 
that  Fig.  8 is  also  the  lattice  of  order  ideals  of  the  triangular  grid  shown  here.) 

(e)  a 1 covers  aO  and  a 1 0/3  covers  a01/3.  [This  characterization  is  due  to  R.  O. 
Winder,  IEEE  Trans.  EC-14  (1965),  315-325,  but  he  didn’t  prove  the  lattice  property. 
The  lattice  is  often  called  A/(n);  see  B.  Lindstrom,  Nordisk  Mat.  Tidskrift,  17  (1969), 
61-70;  R.  P.  Stanley,  SIAM  J.  Algebraic  and  Discrete  Methods  1 (1980),  177-  179.] 

(f)  Because  of  (e)  we  have  r(a)  = nai  + (n  - l)a2  H h an. 

(g)  The  point  is  that  0 (3  X 0a  if  and  only  if  f)  X a and  that  1/3  t 0 a if  and  only 
if  1/3  y 10...0V0a  = la'. 

(h)  That  is,  how  many  a\  . . . an  have  the  property  that  a\  . . . a*,  contains  no  more 
Is  than  0s?  The  answer  is  (Ln/2J);  see,  for  example,  exercise  2.2. 1-4  or  7.2.1.6-42(a). 

110.  (a)  If  x C y then  x -<  y,  hence  f(x)  < f(y);  QED. 

(b)  No;  a threshold  function  need  not  be  monotone  (see  (79)).  But  we  can  show 
that  / is  regular  if  we  also  require  wn  > 0:  For  if  f(x)  = 1 and  y covers  x we  then  have 
w ■ y > w ■ x. 

(c)  Whenever  f(x)  = 1 and  Xj  < Xj+ 1,  we  have  f(y)  = 1 when  y covers  x with 
xj  <4  Xj+i]  hence  Sj  > Sj+i.  (This  argument  holds  even  when  wn  < 0.) 

(d)  No;  consider,  for  example,  {xxx\x l),  which  equals  (xix2x3).  Counterexamples 
can  arise  even  when  the  weights  minimize  W\  + ■ ■ ■ + wn,  because  the  solution  to  the 
linear  program  in  exercise  103  is  not  always  unique.  One  such  case,  found  by  Muroga, 
Tsuboi,  and  Baugh,  is  (xlTx%xlxlxlx6Xyxjxg),  a function  that  is  actually  symmetric 
in  X4  and  x5.  But  if  Sj  > Sj+i  we  must  have  Wj  > Wj+ 1,  because  of  (c). 

111.  (a)  Find  an  optimum  self-dual  function  / pointwise  as  in  exercise  14;  in  case  of 
ties,  set  f(x  1, ...  ,xn)  = xi . Thus  f(x  1, . . . ,x„)  = [r* 1 . . . r^n  > y/r\  . . . rn  ],  except 
that  “>’  becomes  ‘>’  when  x\  = 0.  This  function  is  regular  when  tt  > • • • > r„  > 1. 

(b)  Let  g be  the  regular,  self-dual  function  constructed  in  (a).  If  / is  a given 
regular,  self-dual  function,  we  want  to  verify  that  f(x)  < g(x)  for  all  vectors  x:  this 
will  imply  that  f = g,  because  both  functions  are  self-dual. 
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Suppose  f(x)  = 1,  and  let  y -<  x be  minimal  such  that  f(y)  = 1.  If  we  have 
verified  that  g(y)  = 1,  then  indeed  g(x)  — 1,  as  desired.  [See  K.  Makino  and  T.  Kameda, 
SIAM  Journal  on  Discrete  Mathematics  14  (2001),  381-407.] 

For  example,  there  are  only  seven  self-dual  regular  Boolean  functions  when  n = 5, 
generated  by  the  following  minimal  elements  in  Fig.  8:  10000;  01111,  10001;  OHIO, 
10010;  01101,  10011,  10100;  01100;  01011,  11000;  00111.  So  an  optimum  coterie  can 
be  found  by  examining  only  a few  function  values. 

(c)  Suppose  l>Pl>--->Pr>§>  Pr+1  > • • • > pn  > 0.  Let  fk(x  1, . . . , xn)  be 
the  fcth  monotone,  self-dual  function  and  Fk (x'i . . . . , xn)  its  integer  multilinear  represen- 
tation. We  want  to  find  the  optimum  availability  G(pi, . . . ,p„)  = max*,  Fk(pi, . . ■ ,pn )■ 
If  Pi  < Pi,  •••,  Pn  < Pn,  we  have  Fk(p1, . . . ,pn)  < Fk(p'i, . . . ,p'n)  by  exercise  12(e); 
hence  G(pi,...,p„)  < G(pi, . . . ,p'n). 

Therefore  if  0 < r < n we  have 

G(pi,...,p„)  < G(pi,...,pr,i,...,i). 

And  the  latter  is  F(pi , . . . . pr . | .....  | ) , derived  from  these  larger  probabilities  as  in 
part  (a).  This  function  does  not  depend  on  ( xr+i , . . . , x„),  so  it  gives  the  optimum. 

If  r = 0 the  problem  seems  to  be  deeper.  We  have  G(pi, . . . ,pn)  < G(pi, . . . ,pi); 
so  we  can  conclude  that  the  optimum  coterie  is  / ( x i , . . . , xn ) — X\  in  this  case  if  we 
can  show  that  Fk(p,  ■ ■ . , p)  < p for  all  k whenever  p < I . In  general  Fk(p, . . . , p)  = 
YlmcmPm { 1 — p)"_m,  where  cm  is  the  number  of  vectors  x such  that  fk(x)  = 1 and 
ux  — m.  Exercise  70  tells  us  that  cm  +c„_m  = (^),  for  all  k.  And  the  Erdos-Ko-Rado 
theorem  (exercise  7.2.1.3-111)  tells  us  that  we  have  cm  < (^~i)  f°r  anY  intersecting 
family  of  m-sets  when  m < n/2.  The  result  follows. 

[See  Y.  Amir  and  A.  Wool,  Information  Processing  Letters  65  (1998),  223-228.] 

112.  (a)  The  leading  terms  are  respectively  0,  +xy,  —xy,  -He,  — xy,  +y,  —2 xy,  —xy, 
+xy,  +2 xy,  -y,  +xy,  -x,  +xy,  -xy,  1;  so  F(f)  = 1 when  / is  A,  L,  R,  V,  =,  C,  D,  T. 

(b)  The  coefficient  corresponding  to  exponents  01101,  say,  is  /o»*o*  in  the  notation 
of  answer  12;  it  is  a linear  combination  of  truth  table  entries,  always  lying  in  the  range 
f — 2fc — 1 1 < /o„o.  < [2fc_1]  when  there  are  k asterisks.  Thus  the  leading  coefficient  is 
positive  if  and  only  if  the  mixed-radix  number 

/*»...*,  ■■■,  ft 0...0,  /00...0 

. 2m+l,  2m-!-t-l,  ...,  21  -(- 1 , 2°  + l. 

is  positive,  where  the  /’ s are  arranged  in  reverse  order  of  Chase’s  sequence  and  the 
radix  2k  + \ corresponds  to  an  / with  k asterisks.  For  example,  when  m — 2 we  have 
F(f)  = 1 if  and  only  if  the  sum  18/..  + 6/0.  + 2/»0  + /00  = 18(/n  - fo\  ~ fio  + foo)  + 
6(/oi  - foo)  + 2(/io  - foo)  + foo  = 18/n  - 12/oi  - I6/10  + ll/oo  is  positive;  so  the 
threshold  function  can  be  written  (/n  foiflo  /oo  )• 

(In  this  particular  case  the  much  simpler  expression  (/n/11/01/10/00)  is  actually 
valid.  But  part  (c)  will  show  that  when  m is  large  we  can’t  do  a great  deal  better.) 

(c)  Suppose  F(f)  — {^2ava(fa  — L)  > 0],  where  the  sum  is  over  all  2m  binary 
strings  a of  length  m and  where  each  va  is  an  integer  weight.  Define 

wa  = ^2(-ir^v0  and  Ea  = ^(-l)"(Q^)//3-2m-1[a  = 00...0]; 

0 0 

thus,  for  example,  ui0i  = -v00  + v0i  - v10  + »n  and  Fn  = foo  - foi  - /10  + /n-  One 
can  show  that  FlkQi  = 2lf„k0i,  if  Fa  = 0 whenever  v{a)  > k > 0;  therefore  the  signs 
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of  the  transformed  truth  coefficients  Fa  determine  the  sign  of  the  leading  coefficient  in 
the  multilinear  representation.  Furthermore,  we  now  have  F(f)  = waFa  > 0], 

The  general  idea  of  the  proof  is  to  choose  test  functions  f from  which  we  can 
derive  properties  of  the  transformed  weights  wa.  For  example,  if  f(x = 
xi  © • • • © Xk,  we  find  Fa  = 0 for  all  a except  that  F^om-k  = ( — l)'£_12m_1 . The 
multilinear  representation  of  xi  ® • ■ ■ ® xk  has  leading  term  [(-2)'c~1]a;1  . . . xk\  hence 
we  can  conclude  that  wlkQm-k  > 0,  and  in  a similar  way  that  wa  > 0 for  all  a.  In 
general  if  m changes  tom+1  but  / does  not  depend  on  xm+1,  we  have  Fa0  = 2 Fa 
and  Fai  = 0. 

The  test  function  X2  © • • • ® xm  ® X1X2  • • . xm  proves  that 

m — 1 

w\m  > (2  — l)w0im-i  + w1k01m-i-k  + smaller  terms, 

k= 1 

where  the  smaller  terms  involve  only  wa  with  v(a)  < m - 2.  In  particular,  wn  > 
woi  + ww  + woo-  The  test  function  Xl  ® • • • ® im_,  ® xx  . . . xm-2  (xm-i  ® xm)  proves 

m — 3 

wim~2oi  > (2  — l)ra1m-210  + (wi|roi’n-3-fcin  + mifeni’o-3-tni ) + smaller  terms. 

k= o 

where  the  smaller  terms  this  time  have  v(a)  < m — 3.  In  particular,  iwioi  > who  + 
woio  + wool-  By  permuting  subscripts,  we  obtain  similar  inequalities  leading  to 

wa.  > (2l'(Q^)-1  - l)wa._1  for  0 < j < 2m, 

because  the  w’s  begin  to  grow  rapidly.  But  we  have  va  = S/3(-l)I/(/3“Q)W/3/n; 
hence  |nQ|  = wn...i/n  + 0(wn,,,x/n2).  [SIAM  J.  Discrete  Math.  7 (1994),  484-492. 
Important  generalizations  of  this  result  have  been  obtained  by  N.  Alon  and  V.  II.  Vu, 
J.  Combinatorial  Theory  A79  (1997),  133-160.] 

113.  The  stated  g3  is  52,3,6,8,9  because  the  stated  g2  is  52,3,4,5,8,9,10,11,12. 

For  the  more  difficult  function  5i,3,5,8,  let  gi  = [ux>6];  g2  = [122:  >3];  g3  = 
[vx  — 5gi  — 2g2  > 2]  = 52, 4, 5, 9, 10, 11, 12;  P4  = [2vx  - lbgx  - 9g3  > 1]  = 5i,3,s,8.  [See 
M.  A.  Fischler  and  M.  Tannenbaum,  IEEE  Transactions  C-17  (1968),  273-279.] 

114.  [4a-  + 2y  + 2 G {3, 6}]  = ( x/\y/\z ) V (xAyAz).  In  the  same  way,  any  Boolean 
function  of  n variables  is  a special  case  of  a symmetric  function  of  2n  — 1 variables. 
[See  W.  H.  Kautz,  IRE  Transactions  EC-10  (1961),  378.] 

115.  Both  sides  are  self-dual,  so  we  may  assume  that  ,r0  = 0.  Then 

Sj  [Xj  + • • • + Xj+m-i  > Xj+m  + • • • + Xj+2m-l]- 

If  Ki  + - • ■ + x2m  is  odd,  we  have  Sj  = Sj+m-  hence  si®'  • - + 82^1  = m and  the  result  is  1. 

But  if  xi  + h X2m  is  even,  the  difference  x3  -f p xj+m-i  - xj+m xJ+2m-i 

will  be  zero  for  at  least  one  j < m;  that  makes  s3  = sj+m  = 0,  so  we  will  have 

Si  + • ' ' + $2  m < m. 

116.  (a)  It’s  an  implicant  if  and  only  if  f(x)  = 1 whenever  j < vx  < n — k + j . It’s  a 
prime  implicant  if  and  only  if  we  also  have  f(x)  = 0 when  vx  = j- 1 or  vx  = n-k+j+1. 

^(b)  Consider  the  string  v = u0ui  . . . vn  such  that  f{x)  = vvx.  By  part  (a),  there 
are  (alhtC)  Prime  impiicants  when  v = 0al6+10c.  In  the  stated  case,  a = b = c = 3,  so 
there  are  1680  prime  implicants. 
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(c)  For  a general  symmetric  function,  we  add  together  the  prime  implicants  for 
each  run  of  Is  in  v.  Clearly  there  are  more  for  v = oa+1l6+1Oc_1  than  for  v = 0“l6+10c 
when  a < c - 1;  so  v contains  no  two  consecutive  Os  when  the  maximum  is  reached. 

Let  6(m,  n)  be  the  maximum  number  of  prime  implicants  possible  when  vm  = 1 
and  Vj  = 0 for  m < j < n.  Then  when  m < |n  we  have 

= (fm/21,  [m/2J,n-ro)  +i<r”>/21 
with  6(  — 2,n)  = 6(  — l,n)  = 0.  And  the  overall  maximum  is 

6(n)  = ( n ) + b{m  - 2,n)  + b(n2  - 2,n),  nj  = 

\n0,n1,n2J  ’ 1 

In  particular  we  have  6(9)  = 1698,  with  the  maximum  occurring  for  v = 1101111011. 

(d)  By  Stirling’s  approximation,  b(n)  = 3n+3/2/(2 7m)  + 0(3n/n2). 

(e)  In  this  case  the  appropriate  recurrence  for  m < \n/ 2]  is 

S(m.n)  = J + (t_i>0”_t  + 1)+^-2,»)) 

( [m/21,  [m/2J,n  - m)  + ( [m/2]  - 1 ) + 2,ii) 

and  b(n)  = 6([u/2]  - l,n)  maximizes  min(prime implicants(/), prime implicants(/)). 
We  have  (6(1),  6(2), . . . ) = (1,1,4,5,21,31,113,177,766,1271,4687,7999,34412,...); 
for  example,  6(9)  = 766  corresponds  to  So, 2,3,4,8(zi, . . . ,x9).  Asymptotically,  6(n)  = 

2(3n+3+(n  mod  2))/2^2?rnj  + 0(23n/2/ U2) . 

References:  Summaries,  Summer  Inst,  for  Symbolic  Logic  (Dept,  of  Math.,  Cor- 
nell Univ.,  1957),  211-212;  B.  Dunham  and  R.  Fridshal,  J.  Symbolic  Logic  24  (1959), 
17-19;  A.  P.  Vikulin,  Problemy  Kibernetiki  29  (1974),  151-166,  which  reports  on  work 
done  in  1960;  Y.  Igarashi,  Transactions  of  the  IEICE  of  Japan  E62  (1979),  389-394. 

117.  The  maximum  number  of  subcubes  of  the  n-cube,  with  none  contained  in  another, 
is  obtained  when  we  choose  all  subcubes  of  dimension  \n/3\ . (It  is  also  obtained  by 
choosing  all  subcubes  of  dimension  [(«  + 1)/3J;  for  example,  when  n = 2 we  can 
choose  either  {0*,  1*,  *0,  *1}  or  {00,01,10,11}.)  Hence  6*(n)  = (Ln73j)2n_Ln/3J  = 
3n+1/ y/ + 0(3n/n 3^2).  [See  the  paper  of  Vikulin  in  the  previous  answer,  pages  164- 
166;  A.  K.  Chandra  and  G.  Markowsky,  Discrete  Math.  24  (1978),  7-11;  N.  Metropolis 
and  G.  C.  Rota,  SIAM  J.  Applied  Math.  35  (1978),  689-694.] 

118.  Consider  two  functions  equivalent  if  we  can  obtain  one  from  the  other  by  comple- 
menting and/ or  permuting  variables,  but  not  complementing  the  function  value  itself. 
Such  functions  clearly  have  the  same  number  of  prime  implicants;  this  equivalence  rela- 
tion is  studied  further  in  answer  125  below.  A computer  program  based  on  exercise  30 
produces  the  following  results: 


TO 

Classes  Functions 

TO 

Classes  Functions 

TO 

Classes  Functions 

0 

1 

1 

5 

87 

17472 

10 

7 

632 

1 

5 

81 

6 

70 

12696 

11 

1 

96 

2 

18 

1324 

7 

43 

7408 

12 

2 

24 

3 

46 

6608 

8 

24 

3346 

13 

1 

16 

4 

87 

14536 

9 

10 

1296 

14 

0 

0 

n + j 
. 3 . ' 
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And  here  are  the  corresponding  statistics  for  functions  of  five  variables: 


m 

Classes  Functions 

m 

Classes 

Functions 

m 

Classes  Functions 

0 

1 

1 

11 

186447 

666555696 

22 

338 

608240 

1 

6 

243 

12 

165460 

590192224 

23 

130 

197440 

2 

37 

14516 

13 

129381 

459299440 

24 

71 

75720 

3 

244 

318520 

14 

91026 

319496560 

25 

37 

28800 

4 

1527 

3319580 

15 

57612 

199792832 

26 

15 

10560 

5 

6997 

19627904 

16 

33590 

113183894 

27 

6 

2880 

6 

23434 

73795768 

17 

17948 

58653984 

28 

4 

1040 

7 

57048 

190814016 

18 

8880 

27429320 

29 

2 

640 

8 

105207 

362973410 

19 

3986 

11597760 

30 

2 

48 

9 

152763 

538238660 

20 

1795 

4548568 

31 

2 

64 

10 

183441 

652555480 

21 

720 

1633472 

32 

1 

16 

119.  Several  authors  have  conjectured  that  b(n ) = 8(n);  M.  M.  Gadzhiev  has  proved 
that  equality  holds  for  n < 6 [Diskretnyi  Analiz  18  (1971),  3-24]. 

120.  (a)  Every  prime  implicant  is  a minterm,  since  no  adjacent  points  of  the  n-cube 
have  the  same  parity.  So  the  full  disjunctive  form  is  the  only  decent  DNF  in  this  case. 

(b)  Now  all  prime  implicants  consist  of  two  adjacent  points.  We  must  include  the 
14  subcubes  0J*06  J and  1J*16  J for  0 < j < 6,  in  order  to  cover  the  points  with  vx  = I 
and  vx  = 6.  The  other  (g)  + (^)  = 70  points  can  be  covered  by  35  well-chosen  prime 
implicants  (see,  for  example,  exercise  6.5-1,  or  the  “Christmas  tree  pattern”  in  Section 
7. 2. 1.6).  I hus  the  shortest  DNF  has  length  49.  [An  ingeniously  plausible  but  fallacious 
argument  that  70  prime  implicants  are  necessary  was  presented  by  S.  B.  Yablonsky  in 
Problemy  Kibernetiki  7 (1962),  229-230.] 

(c)  For  each  of  2n  1 choices  of  (®i, . . . , xn-i)  we  need  at  most  one  implicant  to 
account  for  the  behavior  of  the  function  with  respect  to  xn. 

[Asymptotically,  almost  all  Boolean  functions  of  n variables  have  a shortest  DNF 
with  0(2n/ (log  n log  log  n))  prime  implicants.  See  R.  G.  Nigmatullin,  Diskretnyi  Analiz 
10  (1967),  69-89;  V.  V.  Glagolev,  Problemy  Kibernetiki  19  (1967),  75-94;  A.  D. 
Korshunov,  Metody  Diskretnogo  Analiza  37  (1981),  9-41;  N.  Pippenger,  Random 
Structures  & Algorithms  22  (2003),  161-186.] 

121.  (a)  Let  x = xi  ...  xm  and  y — yi . . .yn.  Since  / is  a function  of  ( vx , vy),  there 
are  altogether  2(m+1)("+1)  possibilities. 

(b)  In  this  case  vx  < vx'  and  vy  < vy'  implies  f(x,y)  < f(x',y').  Every  such 

function  corresponds  to  a zigzag  path  from  a0  = (-  \ , n + \ ) to  am+n+2  = (m  + I , - \ ) , 
with  a,j  = Oj  — i + (1, 0)  or  a,j  = a,j-i  — (0, 1)  for  1 < j < m + n + 2;  we  have  f(x,y)  = 1 
if  and  only  if  the  point  ( vx,vy ) lies  above  the  path.  So  the  number  of  possibilities  is 
the  number  of  such  paths,  namely  . 

(c)  Complementing  x and  y changes  vx  to  m-  vx  and  vy  ton-  vy.  So  there  are 
no  such  functions  when  m and  n are  both  even;  otherwise  there  are  2(m+1h’1+1)/2. 

(d)  The  path  in  (b)  must  now  satisfy  aj+am+n+2-j  = (m,  n)  for  0 < j < m+n+2. 

Hence  there  are  ?/21)[m  odd  or  n odd]  such  functions.  For  example,  the 

following  ten  cases  arise  when  m = 3 and  n = 6: 
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122.  A function  of  this  kind  is  regular  with  the  x’s  to  the  left  of  the  y’s  if  and  only  if 
the  zigzag  path  does  not  contain  two  points  (x,  y)  and  ( x + 2 ,y)  with  0 < y < n;  it  is 
regular  with  the  y’s  left  of  the  x’s  if  and  only  if  the  zigzag  path  does  not  contain  both 
( x,y  + 2)  and  ( x,y ) with  0 < x < m.  It  is  a threshold  function  if  and  only  if  there  is 
a straight  line  through  the  point  (m/2,n/2)  with  the  property  that  ( s,t ) is  above  the 
line  if  and  only  if  ( s,t ) is  above  the  path,  for  0 < s < m and  0 < t < n.  So  cases  5 
and  8,  illustrated  in  the  previous  answer,  fail  to  be  regular;  cases  1,  2,  3,  7,  9,  and  10 
are  threshold  functions.  The  regular  non-threshold  functions  that  remain  can  also  be 
expressed  as  follows:  ((xi  V X2  V x3)  A (xiX2X3?/i?/22/32/42/52/6))  V (xj  A X2  A X3)  (case  4); 
(00xix2x3j/i2/2l/3l/4l/52/6}  v ((X1X2X3)  A (1  lx ix2x3 2/1  r/2 l/3l/4 2/51/6})  (case  6). 

123.  Self-dual  regular  functions  are  relatively  easy  to  list,  for  small  n,  but  the  numbers 
grow  rapidly:  When  n = 9 there  are  319,124  of  them,  found  by  Muroga,  Tsuboi,  and 
Baugh  in  1967,  and  when  n = 10  there  are  1,214,554,343  (see  exercise  7.1.4-75).  The 
corresponding  numbers  for  n < 6 appear  in  Table  5,  because  all  such  functions  are 
threshold  functions  when  n < 9;  there  are  135  when  n = 7,  and  2470  when  n = 8. 

The  threshold  condition  can  be  tested  quickly  for  any  such  function  by  improving 
on  the  method  of  exercise  103,  because  constraints  are  needed  only  for  the  minimal 
vectors  x (with  respect  to  majorization)  such  that  /(x)  = 1. 

The  number  0n  of  n- variable  threshold  functions  is  known  to  satisfy  lg  9n  = 
n~  — 0(n2/logn);  see  Yu.  A.  Zuev,  Matematicheskie  Voprosy  Kibernetiki  5 (1994),  5-61. 

124.  The  222  equivalence  classes  listed  in  Table  5 include  24  classes  of  size  2n+1n!  = 
768;  so  there  are  24  x 768  = 18432  answers  to  this  problem.  One  of  them  is  the  function 
{w  A (x  V (y  A z)))  © 2. 

125.  0;  x;  xAy;  xAyAz ; xA(yVz);  xA(y®z).  (These  functions  are  xA  f(y,z),  where 
/ runs  through  the  equivalence  classes  of  two-variable  functions  under  permutation 
and/or  complementation  of  variables  but  not  of  the  function  values.  In  general,  let 
f — g mean  that  / is  equivalent  to  g in  that  weaker  sense,  but  write  f = g if  they  are 
equivalent  in  the  sense  of  Table  5.  Then  xAf  = xAg  if  and  only  if  f — g,  assuming  that 
/ and  g are  independent  of  the  variable  x.  For  it’s  easy  to  see  that  (x  A /)  ~ (x  V g ) 
is  impossible.  And  if  (x  A /)  ~ (x  A g),  we  can  prove  that  f — g by  showing  that, 
if  cr  is  a signed  permutation  of  {xo,...,x„}  and  if  x = xi  ...xn,  then  the  identity 
x0  A /(x)  = (x0cr)  A g(xa)  implies  /(x)  = g(xar),  where  r interchanges  xo  -f-t  x0a. 
Consequently  the  bottom  line  of  Table  5 enumerates  equivalence  classes  under  ~,  but 
with  n increased  by  1;  there  are,  for  example,  402  such  classes  of  4- variable  functions.) 

126.  (a)  The  function  is  canalizing  if  and  only  if  it  has  a prime  implicant  with  at  most 
one  literal,  or  a prime  clause  with  at  most  one  literal. 

(b)  The  function  is  canalizing  if  and  only  if  at  least  one  of  the  components  of 
E(/)  is  equal  to  0,  2n_1,  N(f),  or  N(f)  — 2n_1.  [See  I.  Shmulevich,  H.  Lahdesmaki, 
and  K.  Egiazarian,  IEEE  Signal  Processing  Letters  11  (2004),  289-292,  Proposition  6.] 

(c)  If,  say,  V(/)  = 2/1...  yn  with  yj  = 0,  then  /(x)  = 0 whenever  Xj  = 1. 
Therefore  / is  canalizing  if  and  only  if  we  don’t  have  V(/)  = V(/)  = 1 ...  1 and  A (/)  = 
A (/)  = 0 . . . 0.  With  this  test  one  can  prove  that  many  functions  are  noncanalizing 
when  their  value  is  known  at  only  a few  points. 

127.  (a)  Since  a self-dual  function  f(x i,...,x„)  is  true  at  exactly  2n~1  points,  it  is 
canalizing  with  respect  to  the  variable  Xj  if  and  only  if  /(x  1, . . . ,xn)  = Xj. 

(b)  A definite  Horn  function  is  clearly  canalizing  if  (i)  it  contains  any  clause  with 
a single  literal,  or  (ii)  some  literal  occurs  in  every  clause.  Otherwise  it  is  not  canalizing. 
For  we  have  /(0, . . . , 0)  = /( 1, . . . , 1)  = 1,  because  (i)  is  false;  and  if  Xj  is  any  variable, 
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there  is  a clause  Co  not  containing  x j and  a clause  Ci  not  containing  Xj,  because  (ii)  is 
false.  By  choosing  appropriate  values  of  the  other  variables,  we  can  make  Co  ACi  false 
when  Xj  = 0 and  also  when  Xj  = 1. 

128.  For  example,  (aq  A • • • A x„)  V (x\  A • • • A xn). 

129.  T,k=i(-1)k+1(nk)22ri ~k+k+1  - 2 (n  - 1)  - 4(n  mod  2)  = n22”'1+2  + 0(n2 22"^). 
[See  W.  Just,  I.  Shmulevich,  and  J.  Konvalina,  Physica  D197  (2004),  211-221.] 

130.  (a)  If  there  are  <x„  functions  of  n or  fewer  variables,  but  bn  functions  of  exactly  n 
variables,  we  have  an  = J2k  {k)bk-  Therefore  bn  = J2k{~^)n  k (’k)ak-  (This  rule,  noted 
by  C.  E.  Shannon  in  Trans.  Amer.  Inst.  Electrical  Engineers  57  (1938),  713-723,  §4, 
applies  to  all  rows  of  Table  3,  except  for  the  case  of  symmetric  functions.)  In  particular, 
the  answer  sought  here  is  168  — 4-20  + 6-  6 — 4-3  + 2 = 114. 

(b)  If  there  are  a'n  essentially  distinct  functions  of  n or  fewer  variables,  and  b'n  of 
exactly  n variables,  we  have  a'n  = Yhk=o  ^4-  Hence  b'n  = a'n  — a'n_x , and  the  answer  in 
this  case  is  30  — 10  = 20. 

131.  Let  there  be  h(n)  Horn  functions  and  k(n)  Krom  functions.  Clearly  lg  h{n)  > 
(|n/2j)  and  lg  k{n)  > (j)-  V.  B.  Alekseyev  [Diskretnaih  Matematika  1 (1989),  129-136] 
has  proved  that  lgh(n)  = (Ln"2J)(l  + 0(n-1/4  logn)).  B.  Bollobas,  G.  Brightwell,  and 
I.  Leader  [Israel  J.  Math.  133  (2003),  45-60]  have  proved  that  \gk(n)  ~ |n2. 

132.  (a)  The  hint  is  true  because  J2y  s{y)s(y®z)  = J2WtX)y{-l)f(w)+w'y+f(x)+x'(y+x)  = 
2"  Y2w,x(~ l)H“)+Hx)+3; 'z[x  = w}.  Now  suppose  that  f(x)  = g(x)  for  2n_1  + k values 
of  x;  then  f(x)  = g(x)  ® 1 for  2n~1  — k values  of  x.  But  if  |fc|  < 2n/2~1  for  all  affine  g, 
we  would  have  |s(y)|  < 2n/2  for  all  y,  contradicting  the  hint  when  z = 0. 

(b)  Given  y = yx...yn,  there  are  exactly  2n/2((yiy2  + 2/3 2Aa  + ■ ■ ■ + yn-\yn  + 
1 + h(yi,y3, . . . , j/n-i))  mod  2)  solutions  to  f(x)  = x ■ y mod  2 when  x2k  = t/2fc-i  for 
1 < k < n/2,  and  there  are  2n/,2_1  solutions  for  each  of  the  other  2n^2  — 1 values 
of  (x2,  X4, . . . , xn).  So  there  are  2n_1  ± 2”/2-1  solutions  altogether.  (This  argument 
proves,  in  fact,  that  (^g(xx , x3 1 . . . > X2n — 1)  - (-^-2,  X4 , . . . , x2n ) + h(^x 2 , r4 , . . . ,X2n))  mod  2 
is  bent  whenever  g(x  1,2:3,  • • • ,x2n-i)  is  a permutation  of  all  2n^2-h\t  vectors.) 

(c)  The  argument  in  part  (a)  proves  that  f(x)  is  bent  if  and  only  if  s(y)  = 
2Tl/2 ( — i)s(s/)  for  some  Boolean  function  g(y).  This  function  g,  the  Fourier/Hadamard 
transform  of  /,  is  also  bent,  because  Y/y(-l)9iv)+w  y = 2~n/2  Ex,!/(“1)/<x)+X'!/+TO'!/  = 
2+2  ^2X(— l)flxi [x  = w]  = 2n^2(  — l)!!™)  for  all  w.  The  hint  now  tells  us  that  we  have 

( — i)3<y)+9fj/©z)  = 0 for  a][  nonzero  z,  and  the  same  holds  for  /. 

Conversely,  assume  that  f(x)  satisfies  the  stated  condition.  Then  we  have 

x,t  t X 

for  all  y. 

(d)  By  exercise  11,  the  term  xx...xr  is  present  if  and  only  if  the  equation 

f(x  1, . . . ,av,  0, . . . ,0)  = 1 has  an  odd  number  of  solutions,  and  an  equivalent  condition 
is  l)!!11’---’*’-'0’ ■••’°))  mod  4 = 2.  We’ve  seen  in  part  (c)  that  this  sum  is 

2~n  £ s(y)(-ir^+  -+x^  = 2r-n  £ «(0j  • • • , 0,  yr+i,  - - • , Vn). 

x1,...,xr,y  2/r-fl  >•••)!/«. 

If  r = n,  the  latter  sum  is  ±2n^2;  otherwise  it  contains  an  even  number  of  summands, 
each  of  which  is  +2r-n/,2.  So  the  result  is  a multiple  of  4. 


£(-D 


/(x)  + /(x®i) 


2n 


s(y)2  = y^(_  i'jf(x)+x-y+f(*®t)+(x®t)-y 
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[Bent  functions  were  introduced  by  O.  S.  Rothaus  in  1966;  his  privately  circulated 
paper  was  eventually  published  in  J.  Combinatorial  Theory  A20  (1976),  300-305.  J.  F. 
Dillon,  Congressus  Numerantium  14  (1975),  237-249,  discovered  additional  families 
of  bent  functions,  and  many  other  examples  have  subsequently  been  found  when 
n > 8 and  n is  even.  Bent  functions  don’t  exist  when  n is  odd,  but  a function  like 
g(x i, . . . , £n-i)  © xnAh(xi, . . . ,xn- i)  has  distance  2n_1  - 2(n~1)/2  from  all  affine  func- 
tions when  g and  g © h are  bent.  A better  construction  for  the  case  n = 15  was  found  by 
N.  J.  Patterson  and  D.  H.  Wiedemann,  IEEE  Transactions  IT-29  (1983),  354-356,  IT- 
36  (1990),  443,  achieving  distance  214  — 108.  S.  Kavut  and  M.  Diker  Yiicel,  Information 
and  Computation  208  (2010),  341-350,  have  achieved  distance  28  — 14  when  n = 9.] 

133.  Let  pk  = l/(22"“'‘+  1),  so  that  pk  = 22"“7(22"“''+  1).  [Ph.D.  thesis  (MIT,  1994).] 

SECTION  7.1.2 

1.  ((liVl4)Al2)s(xiVl3). 

2.  (a)  (w  ffi  (x  A y ))  © (( x © y)  A z);  (b)  (w  A (x  V y))  A (( x Ai/)V  z). 

3.  [Doklady  Akademii  Nauk  SSSR  115  (1957),  247-248.]  Construct  a k x n matrix 
whose  rows  are  the  vectors  x where  f(x)  = 1.  By  permuting  and/or  complementing 
variables,  we  may  assume  that  the  top  row  is  1 ...  1 and  that  the  columns  are  sorted. 
Suppose  there  are  l distinct  columns.  Then  / = g A h,  where  g is  the  AND  of  the 
expressions  (xj- 1 = Xj)  over  all  1 < j < n such  that  column  j — 1 equals  column  j, 
and  h is  the  OR  of  k minterms  of  length  l,  using  one  variable  from  each  group  of  equal 
columns.  For  example,  if  n = 8 and  if  / is  1 at  the  k = 3 points  11111111,  00001111, 
00110111,  then  l = 4 and  f(x)  equals  (xi  = X2)  A ( X3  = £4)  A ( x§  = £7)  A (£7  = £S)  A 
((£1  A £3  A £5  A £6)  V (£1  A£3  A £5  A £6)  V (£1  A £3  A £5  A £6)).  The  length  of  this  formula 
in  general  is  2 n + (k  — 2)1  — 1,  and  we  have  l < 2fc_1. 

Notice  that,  if  k is  large,  we  get  shorter  formulas  by  writing  f(x)  as  a disjunction 
fi{x)  V • • • V }r(x),  where  each  fj  has  at  most  \k/r]  Is.  Thus 

L(f)  < min(r  — 1 + (2n  + \k/r  — 2]  2^/r-1l  )r). 

r>  1 

4.  The  first  inequality  is  obvious,  because  a binary  tree  of  depth  d has  at  most 
1 + 2 + • • • + 2d_1  = 2d  — 1 internal  nodes. 

The  hint  follows  because  we  can  find  a minimal  subtree  of  size  > [r/3j.  Its  size  s 
is  at  most  1 + 2([r/3j  — 1).  Therefore  we  can  write  / = ( g ? f\:  /o),  where  g is  a 
subformula  of  size  s;  /o  and  / 1 are  the  formulas  of  size  r — s — 1 obtained  when  that 
subformula  is  replaced  by  0 and  1,  respectively. 

Let  d(r)  = max{  D{f)  \ L(f)  = r }.  Since  the  mux  function  has  depth  2,  and  since 
max(s,r  — s — 1)  < [^],  we  have  d(r)  < 2 + d([^]  — 1)  for  r > 3,  and  the  result 
follows  by  induction  on  r.  [Hawaii  Int.  Conf.  System  Sci.  4 (1971),  525-527.  B.  Doerr 
improved  a to  — 1/ lg( x/3  — 1)  in  Math.  Sem.  Univ.  Kiel  Tech.  Report  99-28  (1999).] 

5.  Let  g0  — 0,  gi  = xi,  and  g3  = xj  A (£3-1  V g,- 2)  for  j > 2.  Then  Fn  = gnV  gn- 1, 
with  cost  2n  — 2 and  depth  n.  [These  functions  gj  also  play  a prominent  role  in  binary 
addition;  see  exercises  42  and  44  for  ways  to  compute  them  with  depth  O(logn).] 

6.  True:  Consider  the  cases  y = 0 and  y = 1. 

7.  £5  = £1  V£4,  £6  = £2  A£5 , £7  = £i  V£3,  £8  = £6 ©£7.  (The  original  chain  computes 
the  “random”  function  (6);  see  exercise  1.  The  new  chain  computes  the  normalization 
of  that  function,  namely  its  complement.) 
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8.  The  desired  truth  table  consists  of  blocks  of  2n  k Os  alternating  with  blocks  of 

2n~k  Is,  as  in  (7).  Therefore,  if  we  multiply  by  22"  + 1 we  get  Xk  + (xk  <S  2n~k), 

which  is  all  Is. 

9.  When  finding  L(f)  = 00  in  step  L6,  we  can  store  g and  h in  a record  associated 
with  /.  Then  a recursive  procedure  will  be  able  to  construct  a minimum-length  formula 
for  / from  the  respective  formulas  for  g and  h. 

10.  In  step  L3,  use  k — 1 1 instead  of  fc  = 1 — 1 — j.  Also  change  L to  D everywhere. 

11.  The  only  subtle  point  is  that  j should  decrease  in  step  U3;  then  we’ll  never  have 
4>(g)  & 4>(h)  / 0 when  j = 0,  so  all  cases  of  cost  r — 1 will  be  discovered  before  we  begin 
to  look  at  list  r — 1. 

Ul.  [Initialize.]  Set  U( 0)  4—  <^>(0)  4—  0 and  U(f)  4—  00  for  1 < / < 22"-1.  Then 
set  U(xk)  4—  4>{xk)  4—  0 and  put  Xk  into  list  0,  as  in  step  LI.  Also  set 
U(xj  o Xk)  4—  1,  set  4>(xj  o xk)  to  its  unique  footprint  vector  (which  contains 
exactly  one  1),  and  put  Xj  o Xk  into  list  1,  for  1 < j < k < n and  all  five 
normal  operators  o.  Finally  set  c 4—  22  _1  — 5(![)  — n — 1. 

U2.  [Loop  on  r.]  Do  step  U3  for  r — 2,  3,  . . . , while  c > 0. 

U3.  [Loop  on  j and  k.\  Do  step  U4  for  j = [(r  - 1)/2J,  [(r  - 1)/2J  - 1,  . . . , and 
k = r — 1 — j,  while  j > 0. 

U4.  [Loop  on  g and  h .]  Do  step  U5  for  all  g in  list  j and  all  h in  list  fc;  if  j = fc, 
restrict  h to  functions  that  follow  g in  list  fc. 

U5.  [Loop  on  /.]  If  4>(g)  k,  4>(h)  / 0,  set  u 4-  r-  1 and  v 4-  4>{g)  & cj>(h);  otherwise 
set  u «—  r and  v <—  4>(g)  \ 4>{h).  Then  do  step  U6  for  f = gkh,  f = gkh, 
f = gkh,  f = g \ h,  and  / = g ® h. 

U6.  [Update  U(f)  and  4>(f).\  If  U(f)  = 00,  set  c «-  c - 1,  </>(/)  t-  v,  U(f)  t-  u, 
and  put  / into  list  u.  Otherwise  if  U(f)  > u,  move  / from  list  U(f)  to  list  u 
and  set  rj>(f)  <—  v,  U(f)  i—  u.  Otherwise  if  U(f)  = u,  set  <f>(f)  «—  0(/)  | v.  | 

12.  X4  = X\  ® X2,  X5  = x3  A X4,  Xq  — X2  A *4,  17  = i5  V Xq . 

13.  /5  = 01010101  (x3);  f4  = 01110111  ( x2  V x3);  f3  = 01110101  ((xl  A x2)  Vi3); 
/2  = 00110101  (xx?  X3--  x2)',  fi  = 00010111  ((xjxaxs)). 

14.  For  1 < j < n,  flrst  compute  t t-  (3  ® (g  » 2n~j))  & Xj,  t •(-  t © (t  < 2n~j), 
where  Xj  is  the  truth  table  (11);  then  for  1 < fc  < n and  fc  / j , the  desired  truth  table 
corresponding  to  Xj  <—  Xj  oit  is  g ® (t  & ((xj  o Xk)  © Xj)). 

(The  5 n(n  — 1)  masks  (xj  o xk)  © Xj  are  independent  of  g and  can  be  computed 
in  advance.  The  same  idea  applies  if  we  allow  more  general  computations  of  the  form 
Xj(i)  4-  xk(i ) °i  x;(i),  with  5 n2(n  - 1)  masks  ( xk  o xi)  © Xj.) 

15.  Remarkably  asymmetrical  ways  to  compute  symmetrical  functions: 


(a)  xi  4—  Xi  © x2, 
x3  4-  x3  © X4, 
xi  4—  Xi  © x3 , 
x2  4—  x2  © x4, 
x3  4—  x3  V x2, 
x3  4-13  Aii. 


(b)  xi 

<r~ 

Xl 

© 

x2 

X2 

<- 

X2 

A 

Xi, 

X3 

<— 

x3 

© 

x4 

X4 

4— 

X4 

A 

Xl, 

X2 

4— 

X2 

A 

x3, 

X2 

4- 

X2 

© 

Xl 

x2 

4- 

x2 

A 

x4. 

(c)  Xi  4—  *1  © x2, 
X2  4-  X2  © X3, 
x2  4—  X2  V Xl, 
Xl  4—  Xl  © x4, 
Xl  4-liAl3, 

x2  4—  x2  A xi, 
x2  4-  X2  © X4. 


7.1.2 


ANSWERS  TO  EXERCISES  563 


16.  A computation  that  uses  only  ® and  complementation  produces  nothing  but 
affine  functions  (see  exercise  7.1.1-132).  Suppose  f(x)  = /( xi,...,xn)  is  a non-affine 
function  computable  in  minimum  memory.  Then  /(x)  has  the  form  g(Ax  + c)  where 
sKib,  2/2,  - • • , Vn)  = 9(2/1  A 2/2, 2/2, , • • ,2/n),  for  some  nonsingular  n x n matrix  A of  Os 
and  Is,  where  x and  c are  column  vectors  and  the  vector  operations  are  performed 
modulo  2;  in  this  formula  the  matrix  A and  vector  c account  for  all  operations  Xi  <— 
Xi  © Xj  and/or  permutations  and  complementations  of  coordinates  that  occur  after  the 
most  recent  non-affine  operation  that  was  performed.  We  will  exploit  the  fact  that 
9(0,  0,1/3,  • • ■ ,2/n)  = 9(1,  0,1/3,  • • . ,2/n). 

Let  a and  0 be  the  first  two  rows  of  A;  also  let  a and  6 be  the  first  two  elements 
of  c.  Then  if  Ax  + c = y (modulo  2)  we  have  2/1  = 2/2  = 0 if  and  only  if  a ■ x = a and 
0 ■ x = b.  Exactly  2n~2  vectors  x satisfy  this  condition,  and  for  all  such  vectors  we 
have  f(x)  = f(x  © w),  where  Aw  = (1,0,...,  0)T. 

Given  a,  0,  a,  b,  and  w,  with  a # (0, . . . , 0),  0 / (0, . . . , 0),  a # 0,  and  a ■ w = 1 
(modulo  2),  there  are  22"-2"  2 functions  f with  the  property  that  /(x)  = f(x  © w) 
whenever  a ■ x mod  2 = a and  0 ■ x mod  2 = 6.  Therefore  the  total  number  of  functions 
computable  in  minimum  memory  is  at  most  2"+1  (for  affine  functions)  plus 


(2"  - l)(2n  - 2)22(2n-1)(22" 


< 2 


•“+3,1+1 


17.  Let  f(xi,...,xn)  = g(xi, . . . , x„_i)  © (/i(xi, . . . , xn-i)  A xn)  as  in  7.1.1-(i6). 
Representing  h in  CNF,  form  the  clauses  one  by  one  in  xo  and  AND  them  into  xn, 
obtaining  hAxn.  Representing  g as  a sum  (mod  2)  of  conjunctions,  form  the  successive 
conjunctions  in  x0  and  XOR  them  into  x„  when  ready. 

(It  appears  to  be  impossible  to  evaluate  all  functions  inside  of  n + 1 registers  if  we 
disallow  the  non-canalizing  operators  © and  =.  But  n + 2 registers  clearly  do  suffice, 
even  if  we  restrict  ourselves  to  the  single  operator  A.) 

18.  As  mentioned  in  answer  14,  we  should  extend  the  text’s  definition  of  minimum- 
memory  computation  to  allow  also  steps  like  xj(i)  «-  o{  xI{i),  with  k(i)  / j(i)  and 
f(*)  / j(i),  because  that  will  give  better  results  for  certain  functions  that  depend  on 
only  four  of  the  five  variables.  Then  we  find  Cm(f)  = (0, 1, . . . , 13, 14)  for  respectively 
(2,  2,  5,  20,  93,  389,  1960,  10459,  47604,  135990,  198092,  123590,  21540,  472,  0)  classes 
of  functions  . . . leaving  75,908  classes  (and  575,963,136  functions)  for  which  Cm(f)  = 00 
because  they  cannot  be  evaluated  at  all  in  minimum  memory.  The  most  interesting 
function  of  that  kind  is  probably 


(xi  A x2)  V (X2  A x3)  V (x3  A x4)  V (x4  A x5)  V (x5  A xi), 

which  has  C(f)  = 7 but  Cm(f)  = 00.  Another  interesting  case  is  (((xi  V x2)  © x3)  V 
((x2  V x4)  Ax5))  A ((xi  =x2)  Vx3  V x4),  for  which  C(f)  = 8 and  Cm(f)  = 13.  One  way 
to  evaluate  that  function  in  eight  steps  is  x6  = xi  V x2,  x7  = Xi  V x4,  x8  = x2  © x7, 

Xg  = X3  © X6,  X10  = X4  © Xg,  XU  = X5  V X9,  X12  = X8  A X10,  Xl3  = Xu  A Xl2. 

19.  If  not,  the  left  and  right  subtrees  of  the  root  must  overlap,  since  case  (i)  fails. 
Each  variable  must  occur  at  least  once  as  a leaf,  by  hypothesis.  At  least  two  variables 
must  occur  at  least  twice  as  leaves,  since  case  (ii)  fails.  But  we  can’t  have  n + 2 leaves 
with  r < n + 1 internal  nodes,  unless  the  subtrees  fail  to  overlap. 

20.  Now  Algorithm  L (with  ‘/  = g©  /i’  omitted  in  step  L5)  shows  that  some  formulas 
must  have  length  15;  and  even  the  footprint  method  of  exercise  11  does  no  better 
than  14.  To  get  truly  minimum  chains,  the  25  special  chains  for  r = 6 in  the  text  must 
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be  supplemented  by  five  others  that  can  no  longer  be  ruled  out,  namely 


and  when  r = (7, 8, 9)  we  must  also  consider  respectively  (653, 12387, 225660)  additional 
potential  chains  that  are  not  special  cases  of  the  top-down  and  bottom-up  constructions. 
Here  are  the  resulting  statistics,  for  comparison  with  Table  1: 


CcU) 

Class 

es 

- Func- 
tions 

Uc(f) 

Class 

es 

- Func- 
tions 

Lc(f) 

Class- 

es 

- Func- 
tions 

Dc(f) 

Class- 

es 

Func- 

tions 

0 

2 

10 

0 

2 

10 

0 

2 

10 

0 

2 

10 

1 

1 

48 

1 

1 

48 

1 

1 

48 

1 

1 

48 

2 

2 

256 

2 

2 

256 

2 

2 

256 

2 

7 

684 

3 

7 

940 

3 

7 

940 

3 

7 

940 

3 

59 

17064 

4 

9 

2336 

4 

9 

2336 

4 

7 

2048 

4 

151 

47634 

5 

24 

6464 

5 

21 

6112 

5 

20 

5248 

5 

2 

96 

6 

30 

10616 

6 

28 

9664 

6 

23 

8672 

6 

0 

0 

7 

61 

18984 

7 

45 

15128 

7 

37 

11768 

7 

0 

0 

8 

45 

17680 

8 

40 

14296 

8 

27 

10592 

8 

0 

0 

9 

37 

7882 

9 

23 

8568 

9 

33 

11536 

9 

0 

0 

10 

4 

320 

10 

28 

5920 

10 

16 

5472 

10 

0 

0 

11 

0 

0 

11 

6 

1504 

11 

30 

6304 

11 

0 

0 

12 

0 

0 

12 

5 

576 

12 

3 

960 

12 

0 

0 

13 

0 

0 

13 

3 

144 

13 

8 

1472 

13 

0 

0 

14 

0 

0 

14 

2 

34 

14 

2 

96 

14 

0 

0 

15 

0 

0 

15 

0 

0 

15 

4 

114 

15 

0 

0 

The  two  function  classes  of  depth  5 are  represented  by  S2, 4(2:1, x2, £3, £4)  and  x\  ® 
S 2(2:2, 2:3, £4);  and  those  two  functions,  together  with  S2{x i,x2,x3,x4)  and  the  parity 
function  Si,3(xi,  x2,  x3,  x4)  = x\  © x2  © £3  © x4,  have  length  15.  Also  f7c(S2,4)  = 
Uc(Si.t)  = 14.  The  four  classes  of  cost  10  are  represented  by  Si,4(xi,  x2,  x3,  x4), 
S2,4(:ri,  *2, £3,2:4),  (2:4?  x4  © x2  © x3:  (xix2x3)),  and  [(2:12:2X30:4)2  € {0, 1, 4,  7, 10,13}]. 
(The  third  of  these,  incidentally,  is  equivalent  to  (20),  “Harvard’s  hardest  case.”) 

21.  (The  authors  stated  that  their  table  entries  “should  be  regarded  only  as  the  most 
economical  operators  known  to  the  present  writers.”)  The  minimum  cost  of  their 
hardest  function  (20)  is  still  unknown,  but  David  Stevenson  has  shown  that  V(f ) < 17: 

g = AND(NAND(to,  x),  NAND(u),  x)); 

/ = OR(AND(NOT(9),  NAND(w,  2),  NAND(j/,  z)), 

AND(NOT(NOT(g)),  NAND(y,  z),  NAND (y,  2))). 

Although  they  failed  to  find  this  particular  construction,  the  Harvard  researchers  did 
remarkably  well,  in  some  cases  beating  the  footprint  heuristic  by  as  many  as  6 grids! 

22.  i/(xix2X3X4X5)  = 3 if  and  only  if  i'(xix2X3X4)  6 {2,3}  and  u(x  1X2X3X4X5)  is  odd. 
Similarly,  S2(xi,  x2,  x3,  x4,  X5)  = S3(xi,x2,x3,x4,x5)  incorporates  Si,2(xi,  x2,  x3,  x4): 

S2  = 
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23.  We  need  only  consider  the  32  normal  cases,  as  in  Fig.  9,  since  the  complement  of 
a symmetric  function  is  symmetric.  Then  we  can  use  reflection,  like  Sll2(x)  = S3,4(x), 
possibly  together  with  complementation,  like  S2, 3A,s(x)  = So,i(x)  = S4, 5(s),  to  deduce 
most  of  the  remaining  cases.  Of  course  Si,  Si,3>5,  and  Si, 2, 3,4, 5 trivially  have  cost  4. 
That  leaves  only  Si, 2,3, 4(2:1,  x2,  x3,  x4,  x5)  = (aq  ©x2)  V (x2  ® *3)  V (x3  ® x4)  V (x4  ©*5), 
which  is  discussed  for  general  n in  exercise  79. 

24.  As  noted  in  the  text,  this  conjecture  holds  for  n < 5. 

25.  It  is  2 -n— 1,  the  number  of  nontrivial  normal  functions.  (In  any  normal  chain 

of  length  r that  doesn’t  include  all  of  these  functions,  Xj  o Xk  will  be  a new  function 
for  some  j and  k in  the  range  1 < j,  k < n + r and  some  normal  binary  operator  o;  so 
we  can  compute  a new  function  with  every  new  step,  until  we’ve  got  them  all.) 

26.  False.  For  example,  if  g = S1,3(x1,x2,x3)  and  h = S2,3(x i,x2,x3),  then  C(gh)  = 5 
is  the  cost  of  a full  adder;  but  / = S2t3(xo,xi,x2,x3)  has  cost  6 by  Fig.  9. 

27.  Yes:  The  operations  lx2  <r-  x2  © x4 , aq  4-  aq  ® x3,  11  <- n Ai2,  *1  Mi  ® x3, 
x2  «—  x2  ® x3  transform  (xi,x2,x3)  into  (21,20,2:3). 

28.  Let  v'  = v"  = v®(x  ®y);  v!  = ((v  ® y)  C (x  ® y))  ® u,  u"  = ((u  ® y)  V (a;  ® y))  ® u. 
Thus  we  can  set  uo  = 0,  vq  — aq,  Uj  = ((vj-i(&x2  j+i)V  (x2j  ©aqj+i))®  ua-i  if  j is  odd, 
ui  = (ivj- 1 ® 2:2j+i)C  (x2j  ®x2j+i))®Uj-i  if  j is  even,  and  Vj  = Vj-i  ® {x2j  ®x2j+1), 
obtaining  (ujVj)2  = (a;i  + - • -+2;2j+i)  mod  4 for  1 < j < |_rr/2j . Set  2:n+i  = 0 if  n is  even. 
Thus  [(xi  + • • • + xn ) mod  4 = 0]  = M[n/2j  A v^n/2^  is  computed  in  [5n/2j  — 2 steps. 

This  construction  is  due  to  L.  J.  Stockmeyer,  who  proved  that  it  is  nearly  optimal. 
In  fact,  the  result  of  exercise  80  together  with  Figs.  9 and  10  shows  that  it  is  at  most 
one  step  longer  than  a best  possible  chain,  for  all  n > 5. 

Incidentally,  the  analogous  formula  u"  = (( v ® y)  A (a;  ® y))  ® u yields  (u"'v')2  = 
((uv)2  + x — y)  mod 4.  The  simpler-looking  function  ((uv)2  +x  + y)  mod4  costs  6,  not  5. 

29.  To  get  an  upper  bound,  assume  that  each  full  adder  or  half  adder  increases  the 
depth  by  3.  If  there  are  aid  bits  of  weight  2j  and  depth  3d,  we  schedule  at  most  |ajd/ 3] 
subsequent  bits  of  weights  {23 ,23+1}  and  depth  3 (d  +1).  It  follows  by  induction  that 
ajd  < (^)3  dn  + 4.  Hence  ajd  < 5 when  d > log3/,2  n,  and  the  overall  depth  is  at  most 
3 log3/2  n + 3.  (Curiously,  the  actual  depth  turns  out  to  be  exactly  100  when  n = 107.) 

30.  As  usual,  let  un  denote  the  sideways  addition  of  the  bits  in  the  binary  represen- 
tation of  n itself.  Then  s(n)  = 5 n — 2 vn  — 3[lgnJ  — 3. 

31.  After  sideways  addition  in  s(n)  < 5 n steps,  an  arbitrary  function  of  (2^  nj , . . . , z0) 
can  be  evaluated  in  ~ 2n/\gn  steps  at  most,  by  Theorem  L.  [See  O.  B.  Lupanov, 
Doklady  Akademii  Nauk  SSSR  140  (1961),  322-325.] 

32.  Bootstrap:  First  prove  by  induction  on  n that  t(n)  < 2n+1 . 

33.  False,  on  a technicality:  If,  say,  N = yjn,  at  least  n steps  are  needed.  A correct 
asymptotic  formula  N + 0(\/N ) + 0(n)  can,  however,  be  proved  by  first  noting  that 
the  text’s  method  gives  N + 0(\/N  ) when  N > 2n_1;  otherwise,  if  [lg  A^J  =n  — k — l, 
we  can  use  0(n)  operations  to  AND  the  quantity  x4  A • • • A xk  to  the  other  variables 
Xk+i,  ■ ■ ■ , xn,  then  proceed  with  n reduced  by  k. 

(One  consequence  is  that  we  can  compute  the  symmetric  functions  {Si,  S2, , Sn } 
with  cost  s(n)  + n + 0(\/n  ) = 6n  + Ofi/n ) and  depth  O(logn).) 

34.  Say  that  an  extended  priority  encoder  has  n + 1 = 2m  inputs  0:02:1  . . . xn  and 
m + 1 outputs  yoyi  . . . ym,  where  yo  = 2:0  V oq  V • ■ ■ V xn.  If  Q'm  and  Q ]],  are  extended 
encoders  for  x'0  . . . x'n  and  x”,  . . . x'r[ , then  Qm+ 1 works  for  x'0  . . . x'nx'd  ...  2:"  if  we  define 


i<mi  r> 
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Vo  = y'o  V y'o,  Vi  = 3/0 , 3/2  = 3/i?  J/":  2/i,  • • • , Vm+ 1 = 2/i?  2/m:  y™-  If  Pm  is  an  ordinary 
priority  encoder  for  x\  . . . x'n,  we  get  Pm+ 1 for  x[  . . . x'nx'd  . . . x„  in  a similar  way. 

Starting  with  m = 2 and  t/2  = 13  V (aq  A £2),  1/1  = 12  V 2:3,  3/0  = io  V ii  V i/i, 
this  construction  yields  Pm  and  Qm  of  costs  pm  and  q„, , where  p2  = 3,  q2  = 5,  and 
Pm+i  = 3m  + pm  + qm,  qm+ 1 = 3m  + 1 + 2<jm  for  m > 2.  Consequently  pm  = qm  - m 
and  = 15 . 2m“2  - 3m  - 4 w 3.75n. 

35.  If  n = 2m,  compute  aq  Aaq,  . . . , x„-i  A*n,  then  recursively  form  x\  A - • •A1211-2A 
*2fc+i  A • • • A*n  for  1 < k < m,  and  finish  in  n more  steps.  If  n = 2m  — 1,  use  this  chain 
for  n + 1 elements;  three  steps  can  be  eliminated  by  setting  xn+x  «—  1.  [I.  Wegener, 
The  Complexity  of  Boolean  Functions  (1987),  exercise  3.25.  The  same  idea  can  be  used 
with  any  associative  and  commutative  operator  in  place  of  A.] 

36.  Recursively  construct  Pn(x  1, ...,£„)  and  Qn(xi, . . . , xn)  as  follows,  where  P„  has 
D(yj)  < [lgn]  for  1 < j < n and  Qn  has  D(yj)  < [lgn]  + [j  /n]:  The  case  n = 1 is 
trivial;  otherwise  Pn  is  obtained  from  Q'r  (aq , . . . , xr)  and  P'J  (xT+1 , . . . , xn ),  where  r = 
[n/2]  and  s = \n/ 2J , by  setting  yj  = p'  for  1 < j < r,  yj  = y'r/\y"-T  for  r < j < n.  And 
Qn  is  obtained  from  either  P'r{x  1 Ax2, . . . , xn-i  Ai„)  or  P'r{x\  A12, . . . , xn-2Axn-i,  xn) 
by  setting  yi  = Xl,  y2j  = 3/',  y2j+1  = y ' A x2j+1  for  1 < j < s,  and  y2s  = y's,  yn  = y'r. 

These  calculations  can  be  performed  in  minimum  memory , setting  x^^  <—  A 
a at  step  i for  some  indices  j(i)  < k(i).  Thus  we  can  illustrate  the  construction 
with  diagrams  analogous  to  the  diagrams  for  sorting  networks.  For  example, 


Ps 


(delay  0) 
(delay  1) 
(delay  2) 
(delay  2)  . 
(delay  3)  ’ 
(delay  3) 
(delay  3) 
(delay  3) 


Q 8 — 


(delay  0) 

-| — (delay  1) 
— (delay  2) 
-| — (delay  2) 
■4—  (delay  3) 
-j — (delay  3) 
■4—  (delay  4) 
• (delay  3) 


The  costs  pn  and  qn  satisfy  p„  = |n/2j  + 9|-n/2]  +P[„/2j , qn  = 2[n/2j  - 1 +Pfn/2] 
when  n > 1;  for  example,  (pi, . . . ,pj)  = (qi, . . . , q7)  = (0, 1, 2, 4,  5,  7, 9).  Setting  pn  = 
4 n — pn  and  qn  = 3n  — qn  leads  to  simpler  formulas,  which  prove  that  pn  < 4n  and 
qn  < 3 Tl.  qn  — P\n/ 2]  “h  even];  P4n  ~ P2n  4“  Pn  T I5  P4n+1  = p2n  4“  Pn+1  1,  P4n+2  = 
p2n+i  +Pn+ 1,  p4n+3  = P4n+2  + 2.  In  particular,  l+p2m  = 7^171+5  is  a Fibonacci  number. 

[See  JACM  27  (1980),  831-834.  Slightly  better  chains  are  obtained  if  we  replace 
<?2n+i  by  (Q2n  and  t/2n+i  = V2n  A X2n+\)  when  n is  a power  of  2,  if  we  replace  P5  and 
Pe  by  Qs  and  Q6,  and  if  we  then  replace  (Pg , Pio , Pn , Pn)  by  (Qg,  Qw,  Qn , Qn)] 
Notice  that  this  construction  works  in  general  if  we  replace  ‘A’  by  any  associative 
operator.  In  particular,  the  sequence  of  prefixes  x\  ® • • • ® Xk  for  1 < k < n defines  the 
conversion  from  Gray  binary  code  to  radix-2  integers,  Eq.  7.2.1.1  (io). 


37.  The  case  m = 15,  n = 16  is  illustrated  at  the  right. 

(a)  Let  Xi..j  denote  the  original  value  of  Xi  A ■ • ■ A Xj.  Whenever  the 
algorithm  sets  Xk  <—  Xj  A Xk,  one  can  show  that  the  previous  value  of  Xk 
was  Xj+i..k ■ After  step  SI,  Xk  is  Xf( k)+i..k  where  f(k)  = k & (k  — 1)  for 
1 < k < m and  /(m)  = 0.  After  step  S2,  Xk  is  X\..k  for  1 < k < m. 

(b)  The  cost  of  SI  is  m — 1,  the  cost  of  S2  is  m — 1 — [lgm],  and 
the  cost  of  S3  is  n — m.  The  final  delay  of  Xk  is  [lg  k\  + v k — 1 for 
1 < k < m,  and  it  is  [lgm]  + k — m for  m < k < n.  So  the  maximum 
delay  for  {a?i , . . . , Xrn-i}  turns  out  to  be  g(m)  = m - 1 for  m < 4, 
g(m)  = [lg  mj  + [lg  y J for  m > 4.  We  have  c(m,  n)  = m + n — 2 — [lg  m] , 
d(m,  n)  = ma x(g(m),  [lgm]  + n — m).  Hence  c(m,  n)  + d(m,  n)  = 2n  — 2 
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whenever  n > m + g(m)  — [lg  m] . 

(c)  A table  of  values  reveals  that  d(n)  = [lgn]  for  n < 8,  and  d(n)  = flg(n  — 
LlgnJ  + 3)J  + [lg  | (n  — flgnj  + 3)J  — 1 for  n > 8.  Stating  this  another  way,  we  have 
d(n)  > d(n  — 1)  and  n > 2 if  and  only  if  n = 2k  + k — 3 or  2k  + 2k~ 1 + k — 3 for  some 
k > 1.  The  minimum  with  minimal  cost  occurs  for  m — n when  n < 8;  otherwise  it 
occurs  for  m = n - [lg  § (n  - [lg  nj  + 3)J  + 2 - [n  = 2k  + k - 3 for  some  k], 

(d)  Set  m <r~  m(n,d),  where  m(n,d(n))  is  defined  in  the  previous  sentence  and 
m(n,d)  = m(n  — l,d  — 1)  when  d > d(n).  [See  J.  Algorithms  7 (1986),  185-201.] 

38.  (a)  From  top  to  bottom,  fk(xi, . . . , x„)  is  an  elementary  symmetric  function  also 
called  the  threshold  function  S>k(x i, . . . ,xn).  (See  exercise  5.3.4-28,  Eq.  7.1.1-(go).) 

(b)  After  calculating  {Si,  ...,S„}  in  w6n  steps  as  in  answer  33,  we  can  apply  the 
method  of  exercise  37  to  finish  in  2 n further  steps. 

But  it  is  more  interesting  to  design  a Boolean  chain  specifically  for  the  computation 
of  the  2m  + 1 threshold  functions  gk(xi, . . . ,xm ) = [(xi  . . .xm)2  > k]  for  0 < k < 2m . 

Since  [(x'x")2  > (y'y")2]  = [(x')a  > (w')2  + l]  V ([(x')2  > (y'h]  A [(*")2  > (»")a]),  a 

divide-and-conquer  construction  analogous  to  a binary  decoder  solves  this  problem 
with  a cost  at  most  2 t(m). 

Furthermore,  if  2m  1 < n < 2m,  the  cost  u(n)  of  computing  {(/i, . . . , gn}  by  this 
method  turns  out  to  be  2 n + 0(y/n),  and  it  is  quite  reasonable  when  n is  small: 

n = 1234567  8 9 10  11  12  13  14  15  16  17  18  19  20 

u(n)  = 0 1 2 4 7 7 8 12  15  17  19  19  20  21  22  27  32  34  36  36 

Starting  with  sideways  addition,  we  can  sort  n Boolean  values  in  s(n)  + u(n)  « 7 n 
steps.  A sorting  network,  which  costs  2S(n),  is  better  when  n = 4 but  loses  when 
n > 8.  [See  5.3.4-(n);  D.  E.  Muller  and  F.  P.  Preparata,  JACM  22  (1975),  195-201.] 

39.  [IEEE  Transactions  C-29  (1980),  737-738.]  The  identity 


Mr+S(xi,  • . . , xr,  av-i-i, . . . , :rr-|-s;  yo , . . . , ?/2r+s  — i)  — Mr{x i, . . . , xr\  r/Q, . . . , y2r  — i\ 

where  y'  = Vl~o 1 (dk  A y2“j+k)  and  dk  is  the  fcth  output  of  an  s-t.o-2s  decoder  applied 
to  (xr+i, . . . ,xr+s),  shows  that  C(Mr+s)  < C(Mr)  + 2r+s  +2r(2s  - 1)  + t(s),  where  t(s) 
is  the  cost  (30)  of  the  decoder.  The  depth  is  D(Mr+s)  = ma x(Dx(Mr+s),  Dy(Mr+s)), 
where  Dx  and  Dy  denote  the  maximum  depth  of  the  x and  y variables;  we  have 
Dx(Mr+s)  < max(Dx(Mr),  1 + s + figs]  + Dy(Mr))  and  Dy(Mr+s)  < 1 + s + Dy(Mr). 

Taking  r = [m/2]  and  s = [m/2]  yields  C(Mm ) < 2m+1  + 0(2m/2),  Dy{Mm ) < 
m + 1 + fig m] , and  Dx(Mm ) < Dy(Mm)  + flgm]. 

40.  We  can,  for  example,  let  fnk(x)  = V"=i  ~k(h(x)  A rj+k-i{x)),  where 


li  (*) 


Xj , if  j mod  k = 0, 

Xj/\lj+i(x),  if  j mod  k 7!  0, 


for  1 < j < n — (n  mod  fc); 


ri(x) 


1, 

Xj  A Tj-i(x), 


if  j mod  k = 0, 
if  j mod  k / 0, 


for  k < j < n. 


The  cost  is  4n  - 3/t  — 3[|J  - + 2 — (n  mod  k ). 

A recursive  solution  is  preferable  when  n is  small  or  k is  small:  Observe  that 
Xn-kj- 1 A • • • A Xk  A 

/(2n-2fe)(n  — fc)(^l)  ■ ■ • ) * * * , ^n)i  for  k Tl  < 2k 

fnk\X)  = . . 

J[(n+k)/2jk[Xl,  . . . ,X|_(n+fc)/ 2j)  V 

. f[(n+k—  l)/2jfe(^L(n  — 0/2J+l>  * * * 5 Xn), 


for  n > 2k. 
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The  cost  of  this  solution  can  be  shown  to  equal  n - 1 + UgjJ  when  k < n < 2k, 

and  it  lies  asymptotically  between  (m  + ak  - l)n  + 0(km)  and  (m  + 2-2/ak)n  + 0(km) 
as  n -A  oo,  where  m = [lg  k\  and  1 < ak  = (k  + 1) /2m  < 2. 

A marriage  of  these  methods  is  better  yet;  the  optimum  cost  is  unknown. 

41.  Let  c(m)  be  the  cost  of  computing  both  ( x)2  + (y)2  and  ( x)2  + (y)2  + 1 by  the 
conditional-sum  method  when  x and  y have  n = 2m  bits,  and  let  c'(m)  be  the  cost  of 
the  simpler  problem  of  computing  just  (01)2  + (y)2.  Then  c(m  + 1)  = 2c(m)  + 6 • 2m  + 2. 
c (m  + 1)  = c(m)  + c (ra)  + 3 • 2m  + 1.  (Bit  zn  of  the  sum  costs  1;  but  bits  zk  for 
n < k < 2n  + 1 cost  3,  because  they  have  the  form  c?  ak : bk  where  c is  a carry  bit.)  If 
we  start  with  n = 1 and  c(0)  = 3,  c'(0)  = 2,  the  solution  is  c(m)  = (3m  + 5)2m  - 2, 
c (m)  = (3m  + 2)2m  - m.  But  improved  constructions  for  the  case  n = 2 allow  us 
to  start  with  c(l)  = 11  and  c'(  1)  = 7;  then  the  solution  is  c(m)  = (3m  + \)2m  — 2. 
c'(m)  = (3m  + |)2m  - m + 1.  In  either  case  the  depth  is  2m  + 1.  [See  J.  Sklansky,  IRE 
Transactions  EC-9  (1960),  226-231.] 


42. 


(a)  Since  ( xkykck ) — uk  V ( vk  A ck),  we  can  use  (26)  and  induction. 

(b)  Notice  that  Uk+1  = uk  and  Vk+1  = vk;  use  induction  on  j — i.  [See  A.  Wein- 

7 T 1,  TD  T7l  rr 1 ,•  ^ ^ „ > _ . L 


berger  and  J.  L.  Smith,  IRE  Transactions  EC-5  (1956) 

Rung,  IEEE  Transactions  C-31  (1982),  260-264.] 

(c)  First,  for  l = 1,  2,  . . . , m — 1,  and  for  1 < k < n,  compute  Vk  for  all  multiples  i 
of  h(l)  in  the  range  k,  > i > kl+1,  where  = h(l)[(k  - l)/h(l)\  denotes  the  largest 
™Q<!tlplQQ0f  is  less  than  k-  For  example,  when  l = 2 and  k = 99,  we  compute 

Vse  , V8S  = Vg6  A Via6,  Vso9  = Vg989  A Vgo8 , ■ • • , VH  = V79i  A V^2;  this  is  a prefix 
computation  using  the  values  V9969,  V8986,  Va808,  Vj42  that  were  computed  when  l = 2. 
Using  the  method  of  exercise  36,  step  l adds  at  most  l levels  to  the  depth  and  it 
requires  a total  of  (Pl  + p2  + • . . +p2,)n/2'  = 0(2'n)  gates. 

Then,  again  for  l = 1,  2,  . . . , m - 1,  and  for  1 < k < n,  compute  Utk  for  i = kl+1 
using  the  “unrolled”  formula 


, 65-73;  R.  P.  Brent  and  H.  T. 


<1  =tf,V  \/  (Vf+wAUj+W). 

kl>i>kl  + 1 

For  example,  the  unrolled  formula  when  l = 3 and  k = 99  is 

Ull  = u!i  V (V999  A u!l)  V (U899  A do8)  V (Us9?  A U86)  V (U799  A UH). 

Every  such  U?  is  a union  of  at  most  2l  terms,  so  it  can  be  computed  with  depth  < l 
m addition  to  the  depth  of  each  term.  The  total  cost  of  this  phase  for  1 < k < n is 
(0  + 2 + 4 + • • • + (2l —2))n/2l  = 0(2ln). 

JThe  overall  cost  to  compute  all  necessary  U’s  and  U’s  is  therefore  0(2ln)  = 

0(2mn).  ^Furthermore  the  quantities  V0k  aren’t  actually  needed,  so  we  save  the 
cost  of  Er=r  h(l)P2‘  gates.)  For  example,  when  m = (2, 3, 4,  5)  we  obtain  Boolean 
chains  for  the  addition  of  (2, 8, 64, 1024)-bit  numbers,  respectively,  with  overall  depths 
(3,7,11,16)  and  costs  (7,64,1254,48470). 

[This  construction  is  due  to  V.  M.  Khrapchenko,  Problemy  Kibernetiki  19  (1967), 
107-122,  who  also  showed  how  to  combine  it  with  other  methods  so  that  the  overall 
cost  will  be  O(n)  while  still  achieving  depth  lg  n + 0(  Vlog  n ).  However,  his  combined 
method  is  purely  of  theoretical  interest,  because  it  requires  n > 264  before  the  depth 
becomes  less  than  21gn.  Another  way  to  achieve  small  depth  using  the  recurrences 
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in  (b)  can  be  based  on  the  Fibonacci  numbers:  The  Fibonacci  method  computes  the 
carries  with  depth  log^  n + 0(1)  « 1.44  lgn  and  cost  0(n  log  n).  For  example,  it  yields 
chains  for  binary  addition  with  the  following  characteristics: 


n = 

4 

8 

16 

32 

64 

128 

256 

512 

1024 

depth 

6 

7 

9 

10 

12 

13 

15 

16 

18 

cost 

24 

71 

186 

467 

1125 

2648 

6102 

13775 

30861 

See  D.  E.  Knuth,  The  Stanford  GraphBase  (1994),  276-279. 

Charles  Babbage  found  an  ingenious  mechanical  solution  to  the  analogous  problem 
for  addition  in  radix  10,  claiming  that  his  design  would  be  able  to  add  numbers  of 
arbitrary  precision  in  constant  time;  for  this  to  work  he  would  have  needed  idealized, 
rigid  components  with  vanishing  clearances.  See  H.  P.  Babbage,  Babbage’s  Calculating 
Engines  (1889),  334-335.  Curiously,  an  equivalent  idea  works  fine  with  physical  tran- 
sistors, although  it  cannot  be  expressed  in  terms  of  Boolean  chains;  see  P.  M.  Fenwick, 
Comp.  J.  30  (1987),  77-79.] 

43.  (a)  Let  A = B = Q = { 0,1}  and  q0  = 0.  Define  c(q,  a)  = d(q , a)  = q A a. 

(b)  The  key  idea  is  to  construct  the  functions  di(q) . . . dn-i(q),  where  d\(q)  = 
d(q,ai)  and  dj(q)  = d(dj-i(q),a,j).  In  other  words,  di  = d(a i)  and  dj  = dj- 1 o d(°i), 
where  is  the  function  that  takes  q h+  d(q,  a)  and  where  o denotes  composition  of 
functions.  Each  function  dj  can  be  encoded  in  binary  notation,  and  o is  an  associative 
operation  on  these  binary  representations.  Hence  the  functions  did2...dn-i  are  the 
prefixes  d(°i),  d(°i)od(°2),  . . . , d(a i)o-  • -od(a™-i);  and  q\q2  . . . qn  = 9odi(<?o)  • • • dn-i(g0). 

(c)  Represent  a function  f(q)  by  its  truth  table  /o/i-  Then  the  composition  fofi  o 
gogi  is  hohi,  where  the  functions  ho  = /o?  <?i:  go  and  h\  = /i?  gp.  go  are  muxes  that  can 
each  be  computed  with  cost  3 and  depth  2.  (The  combined  cost  C(hohi)  is  only  5,  but 
we  are  trying  to  keep  the  depth  small.)  The  truth  table  for  d(°)  is  aO.  Using  exercise  36, 
we  can  therefore  compute  the  truth  tables  diodnd2od2i  • • • d(n-i)od(n-i)i  with  cost 
< 6pn_i  < 24n  and  depth  < 2[lg(n  — 1)~|;  then  6i  = ai,  and  bj  = qj  Adj  = d(J_1)0  A a3 
for  j > 1.  (These  cost  estimates  are  quite  conservative;  substantial  simplifications 
arise  because  of  the  Os  in  the  initial  truth  tables  of  d(“j)  and  because  many  of  the 
intermediate  values  dji  are  never  used.  For  example,  when  n = 5 the  actual  cost  is 
only  10,  not  6p„_i  + (n—  1)  = 28;  the  actual  depth  is  4,  not  2[4g(n—  1)]  +1  = 5.  Notice 
that  the  straightforward  chain  bj  = aj  A bj- 1 for  1 < j < n also  solves  problem  (a); 
it  wins  on  cost,  but  has  depth  n — 1.) 

44.  The  inputs  may  be  regarded  as  the  string  xoyo  x\yi  . . . xn-\yn-\  whose  elements 
belong  to  the  four-letter  alphabet  A = {00,01, 10, 11};  there  are  two  states  Q = {0, 1}, 
representing  a possible  carry  bit,  with  q0  = 0;  the  output  alphabet  is  B = {0, 1};  and 
we  have  c(q,xy)  = q © x ® y,  d(q,xy)  = ( qxy ).  In  this  case,  therefore,  the  finite  state 
transducer  is  essentially  described  by  a full  adder. 

Only  three  of  the  four  possible  functions  of  q occur  when  we  compose  the  mappings 
d(xy).  We  can  encode  them  as  wV(gAv).  The  initial  functions  d(xvi  have  u — xAy,  v = 
x®y\  and  the  composition  (uv)o(u'v')  is  u"v" , where  u"  = u V(»'Ab)  and  v"  = vAv'. 

When  n = 4,  for  example,  the  chain  has  the  following  form,  using  the  notation  of 
exercise  42:  U£+1  = xk  A yk,  Vkk+1  = xk  ® yk,  for  0 < k < 4;  Ul  = Ul  V (V?  A Ul), 
Ul  = Ut  V (U34  A Ui),  V24  = V23  A V4;  Ul  = Ul  V (V23  A U02),  U%  = U$  V (U24  A I/02); 
20  = Uo1,  = Ul  © V]2,  z2  — Uq  0 U23,  23  = Ul  © U34,  24  = U$ . The  total  cost  is  20; 
the  maximum  depth,  6,  occurs  in  the  computation  of  23. 
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In  general  the  cost  will  be  2nT3pn  in  the  notation  of  exercise  36,  because  we  need 
2 n gates  for  the  initial  u’s  and  v’s,  then  3 pn  gates  for  the  prefix  computation;  the  n — 1 
additional  gates  needed  to  form  Zj  for  0 < j < n are  compensated  by  the  fact  that  we 
need  not  compute  VJ  for  1 < j < n.  Therefore  the  total  cost  is  14  • 2m  - 3 Fm+b  + 3, 
superior  to  the  conditional-sum  method  (which  however  has  depth  2m  + 1,  not  2m + 2): 

n = 2 4 8 16  32  64  128  256  512  1024 

cost  of  conditional-sum  chain  7 25  74  197  492  1179  2746  6265  14072  31223 

cost  of  Ladner-Fischer  chain  7 20  52  125  286  632  1363  2888  6040  12509 

[George  Boole  introduced  his  Algebra  in  order  to  show  that  logic  can  be  understood 
in  terms  of  arithmetic.  Eventually  logic  became  so  well  understood,  the  situation 
was  reversed.  People  like  Shannon  and  Zuse  began  in  the  1930s  to  design  circuits  for 
arithmetic  in  terms  of  logic,  and  since  then  many  approaches  to  the  problem  of  parallel 
addition  have  been  discovered.  The  first  Boolean  chains  of  cost  0(n ) and  depth  0(log  n) 
were  devised  by  Yu.  P.  Ofman,  Doklady  Akademii  Nauk  SSSR  145  (1962),  48-51.  His 
chains  were  similar  to  the  construction  above,  but  the  depth  was  approximately  4m.] 

45.  That  argument  would  indeed  be  simpler,  but  it  wouldn’t  be  strong  enough  to  prove 
the  desired  result.  (Many  chains  with  steps  of  fan-out  0 inflate  the  simpler  estimate.) 
The  text’s  permutation-enhanced  proof  technique  was  introduced  by  J.  E.  Savage  in 
his  book  The  Complexity  of  Computing  (New  York:  Wiley,  1976),  Theorem  3.4.1. 

46.  When  r = 2n/n  + C>(l)  we  have  ln(22r+1(n+r-l)2r/(r  - 1)!)  = r In  r + (1+  In  4)r  + 
O(n)  = (2  /n)(n  In  2 — In  n + 1 + In  4)  + O(n).  So  a (n)  < (n/(4e))-2n/n+°(n/logn), 
which  approaches  zero  quite  rapidly  indeed  when  n > 4e. 

(In  fact,  (32)  gives  a(ll)  < 7.6  x 107,  a(12)  < 4.2  x 10“6,  a(13)  < 1.2  x 1(T38.) 

47.  Restrict  permutations  to  the  (r  — m)\  cases  where  iir  = i for  1 < i < n and 
(n+r+l-k)n  is  the  kth  output.  Then  we  get  (r  - m)\c(m,n,r)  < 22r+1(u+r-l)2r  in 
place  of  (32).  Hence,  as  in  exercise  46,  almost  all  such  functions  have  cost  exceeding 
2 nm/(n  + lgm)  when  m = 0(2n/n2). 

48.  (a)  Not  surprisingly,  this  lower  bound  on  C(n)  is  rather  crude  when  n is  small: 

n = 1 2 3 4 5 6 7 8 9 10  11  12  13  14  15  16 

r(n)  = 1 1 2 3 5 9 16  29  54  99  184  343  639  1196  2246  4229 

(b)  The  bootstrap  method  (see  Concrete  Mathematics  §9.4)  yields 

r(n)  = ~ fl  + !g,w-2-Vln2  + 0/l£gn^ 
n \ n \ n2  J J 

49.  The  number  of  normal  Boolean  functions  that  can  be  represented  by  a formula  of 
length  < r is  at  most  5 n ’ gr.  where  gr  is  the  number  of  oriented  binary  trees  with 
r internal  nodes.  Set  r = 2"/lg n - 2'>+2/(lg n)2  in  this  formula  and  divide  by  22"-1 
to  get  an  upper  bound  on  the  fraction  of  functions  with  L(f)  < r.  The  result  rapidly 
approaches  zero,  by  exercise  2.3.4.4-7,  because  it  is  0((5q/16)2"/18")  where  a « 2.483. 

[J.  Riordan  and  C.  E.  Shannon  obtained  a similar  lower  bound  for  series-parallel 
switching  networks  in  J.  Math,  and  Physics  21  (1942),  83-93;  such  networks  are  equiva- 
lent to  formulas  in  which  only  canalizing  operators  are  used.  R.  E.  Krichevsky  obtained 
more  general  results  in  Problemy  Kibernetiki  2 (1959),  123-138,  and  O.  B.  Lupanov 
gave  an  asymptotically  matching  upper  bound  in  Prob.  Kibernetiki  3 (1960),  61-80.] 

50.  (a)  Using  subcube  notation  as  in  exercise  7.1.1-30,  the  prime  implicants  are 

00001*,  (0001*1),  0100*1,  0111*1,  1010*1,  101*11,  00*011,  00*101,  (01*111),  11*101, 
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(0*1101),  (1*0101),  1*1011,  0*0*11,  *00101,  (*01011),  (*11101),  where  the  parenthe- 
sized subcubes  are  omitted  in  a shortest  DNF.  (b)  Similarly,  the  prime  clauses  and  a 
shortest  CNF  are  given  by  00111*,  01010*,  10110*,  0110**,  00*00*,  11*00*,  11*11*, 
(0*100*),  (1*00**),  1*0*1*,  (1****0),  *0000*,  (*1100*),  *1***0,  **1**0,  ***1*0,  and 
(****00).  (Thus  the  CNF  is  (xi  VX2VX3VX4VX5)  A (xiV x2V X3V X4V X5)  A ■ • ■ A (x4Vxg).) 


51.  / = ([x5x6  € {01}]  A [(xiX2X3X4)2  G {1,3, 4, 7, 9, 10, 13, 15}])  V ([x5X6  € {10, 11}]  A 
[X1X2X3X4  = 0000])  V ([X5X6  G {11}]  A [(xix2X3X4)2  G {1, 2, 4,  5,  7, 10, 11, 14}]). 


52.  The  small-n  results  are  quite  different  from  those  that  work  asymptotically: 


n k l (38) 

5 2 2 39 

6 2 2 67 
721  109 


n k l (38) 

8 3 2 175 

9 3 2 279 

10  4 4 471 


n k l (38) 

11  4 4 803 

12  4 3 1329 

13  5 6 2355 


n k l (38) 

14  5 5 4045 

15  5 5 7141 

16  5 4 12431 


(These  upper  bounds  are  quite  weak  when  n is  small.  For  example,  we  know  that 
C(n)  = (0,1,4,7,12)  when  n = (1,2, 3, 4, 5);  and  Eq.  7.1.1  -(16)  gives  C(n+  1)  < 
2 C(n)  + 2,  so  that  C(6)  < 26,  C( 7)  < 54,  etc.) 

53.  First  note  that  2 k/l  < n — 31gn,  hence  m,  <n  — 31gn+l  and  2mi  = 0(2n/n3). 
Also  l = 0(n ) and  t(n  — k)  = 0{2n/n2).  So  (38)  reduces  to  l ■ 2n~k  + 0(2n/n 2)  — 
2"/(n  — 31gn)  + 0(2n/n2). 


54.  The  greedy-foot.print  heuristic  gives  a chain  of  length  14: 


x5  = Xl  © x3, 

X6  = X2  © X3, 
X7  = Xl  A X2, 
Xg  = Xl  A Xg, 
x9  — X4  A X5, 


X10  = x4  A xs, 

Xll  = X4  © X5, 
X12  = x6  A Xn, 
fl  = X13  = X?  A X12, 
h = *14  = *6  A X10, 


/3  = X15  = X8  A X9, 
/4  = Xl6  = X4  A X8, 
fb  = Xl7  = X7  A X9, 
fs  = *19  = x6  A X10 


The  minterm-first  method  corresponds  to  a chain  of  length  22,  after  we  remove  steps 
that  are  never  used: 


X5 

= Xl 

A 

x2, 

X13 

= *5 

A 

*10, 

*20 

= Xg  A Xu, 

Xg 

= Xl 

A 

x2, 

X14 

= x5 

A 

Xn, 

/«  = 

= X21 

= X15 

V 

*18 

X7 

= Xl 

A 

X2, 

*15 

= x6 

A 

*9, 

fl  = 

= *22 

= *13 

V 

*21 

x8 

= Xl 

A 

X2, 

*16 

= X6 

A 

*11, 

h - 

= X23 

= *12 

V 

*20 

Xg 

= X3 

A 

x4 , 

X17 

= *7 

A 

x9, 

*24 

= *14 

V 

*16 

X10 

= X3 

A 

X4, 

Xis 

= X7 

A 

Xu, 

h - 

= X25 

= *24 

V 

*19 

Xll 

= x3 

A 

X4, 

fb  = *19 

= *8 

A 

X9, 

U-- 

= x26 

= *17 

V 

*20 

X12 

= X5 

A 

x9, 

(The  distributive  law  could  replace  the  computation  of  X14,  Xi6,  and  X24  by  two  steps.) 

Incidentally,  the  three  functions  in  the  answer  to  exercise  51  can  be  computed  in 
only  ten  steps: 

X5  = X2  V X4,  /3  = x9  = X6  © Xg,  X12  = x2  © X3, 

X6  = Xl  A X5 , X10  = Xl  © Xg,  Xi3  = X10  A X12, 

*7  = x2  A X4,  /2  = Xll  = x9  V X10,  fl  = Xl4  = x4  © X13. 

Xg  = X3  A X7, 


55.  The  optimum  two-level  DNF  and  CNF  representations  in  answer  50  cost  53  and  43, 
respectively.  Formula  (37)  costs  29,  when  optimized  as  in  exercise  54.  The  alternative 
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m exercise  51  costs  only  17.  But  the  catalog  of  optimum  five- variable  chains  suggests 


*7  = *1  A X2, 
*8  = *3  ® XT, 
X9  = X2  A Xg, 
*10  = Xl  ® Xg, 


*11  = *5  A Xio, 
*12  = *5  V *10, 

*13  = *4  A *n, 
*14  = *8  A *12, 


*15  = *13  © *14, 

*16  = *5  A *10, 
*17  — *3  A *16, 


*18  = *4  A *17, 
*19  = *6  A *15, 
*20  = *18  V *19, 


for  this  six-variable  function.  Is  there  a better  way? 

56.  If  we  care  about  at  most  two  values,  the  function  can  be  either  constant  or  x3  or  Xj . 

57.  The  truth  tables  for  *5  through  *17,  in  hexadecimal  notation,  are  respectively  Of  f 0, 
2222,  33cc,  OdOd,  7777,  5d5d,  3ecl,  6b94,  4914,  4804,  060b,  2020,  7007.  So  we  get 

1010 n-f,  1011  ^n,  1100 1101  i-a  J,  1110 mie+j. 


[David  Stevenson,  believing  that  it  would  be  better  to  have  a solution  in  which  nondigits 
never  masquerade  as  digits,  has  come  up  with  another  chain  of  length  13,  namely 


*5  = *1  A *4, 
*6  — *3  A *5, 
*7  = *2  © *5, 
*8  = *3  © *7, 
*9  = *3  V *5, 


d = *10  = *6  A *g, 
a = *11  = *3  A *7, 
b = *12  = *2  A *8, 
/ = *13  = *2  A *9, 
e = *14  = *4  V *n, 


c = *15  = *2  A *6, 
*16  = *9  © *10, 
g = *17  = *1  © *16, 


for  which  a,  . . . , g have  the  truth  tables  b7f3,  f9fc,  dfcf , b6f3,  a2a2,  8fcf , 3ec0,  and 

1010 H+JI,  1011  lioo^'j,  1101 1 -a*|,  1110^  r,  miH^r 

Is  there  a 13-step  chain  for  (44)  that  produces  16  distinct  outputs?] 

58.  The  truth  tables  of  all  cost-7  functions  with  exactly  eight  Is  in  their  truth  tables 
are  equivalent  to  either  0779,  169b,  or  179a.  Combining  these  in  all  possible  ways 
yields  9656  solutions  that  are  distinct  under  permutation  and/or  complementation  of 
{*1,  *2,  *3,  *4}  as  well  as  under  permutation  and/or  complementation  of  {fi,  f2,  /3,  f^}. 

59.  The  greedy-footprint  heuristic  produces  the  following  17-step  chain: 


*5  = *2  © *3, 

*6  = *1  © *4, 
*7  = *1  © *3, 
*8  = *4  V *5, 

*9  = *6  A *8, 

*10  = *7  V *9, 


*11  = *2  V *7, 
*12  = *2  A *6, 

*13  = *3  A *4, 
*14  = *4  A *5, 
*15  = *5  A *10, 
*16  = *2  A *13, 


*17  = *6  A *8, 
fl  = *18  = *11  © *17, 

/2  = *19  = *10  A *14, 
/3  = *20  = *9  © *16, 
fi  = *21  = *12  © *15- 


The  initial  functions  all  have  large  footprints,  so  we  can’t  achieve  C(f i/2/3/4)  = 28; 
but  a slightly  more  difficult  S-box  probably  does  exist. 

60.  One  way  is  U\  = *1  © j/i,  m2  = *2  © 2/2,  v\  = y2  © Mi,  v2  = y\  © u2,  z\  = v\  A u2, 
z2  = v2  A tii  • 

61.  The  following  17-gate  solution  by  David  Stevenson  generalizes  to  8m  © 1 gates 
for  addition  mod  2m  +1:  uo  = *0  A j/o,  uo  = *0  © yo,  u 1 = *1  A 3/1 , tq  = x\  © y\ , 
fi  = «1  A Mo,  *2  = Vi  © Mo,  C2  = Ml  V ti;  M2  = *2  A y2,  t3  = *2  V y2,  f4  = t3  V c2; 
t5  = t2\/Vo,  te  = tsAt4,  tj  = t6\/u2-,  t8  = t7  Amo,  Zo  = f7©M0,  Zl  = t2©t8;  Z2  = t4©t7. 
(Notice  that  (*2*i*o)2  + (2/22/12/0)2  = ( u2t4t2vo)2  — 4[*  = y = 4].  Gilbert  Lee  has  found 
another  17-step  solution  if  the  inputs  are  represented  by  000,  001,  Oil,  101,  and  111.) 
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62.  There  are  ( 2„rf)  22  c such  functions,  at  most  (2r"d)  t,(n,  r)  of  which  have  cost  < r. 
So  we  can  argue  as  in  exercise  46  to  conclude  from  (32)  that  the  fraction  with  cost 
< r = [2"c/nJ  is  at  most  22r+1-2"c(n  + r — l)2r/(r  — 1)!  = 2-rlgn+°(r). 

63.  [Problemy  Kibernetiki  21  (1969),  215-226.]  Put  the  truth  table  in  a 2kx2n~k  array 
as  in  Lupanov’s  method,  and  suppose  there  are  Cj  cares  in  column  j,  for  0 < j < 2n~k . 
Break  that  column  into  \cj/m\  subcolumns  that  each  have  rri  cares,  plus  a possibly 
empty  subcolumn  at  the  bottom  that  contains  fewer  than  m of  them.  The  hint  tells  us 
that  at  most  2m+k  column  vectors  suffice  to  match  the  Os  and  Is  of  every  subcolumn 
that  has  a specified  top  row  i0  and  bottom  row  i\.  With  0(m2m+3k)  operations 
we  can  therefore  construct  0(2m+3k)  functions  gt(x i,...,xk)  from  the  minterms  of 
(ar , • • • , Xk },  so  that  every  subcolumn  matches  some  type  t.  And  for  every  type  t we  can 
construct  functions  ht(xk+i,  ■ ■ ■ ,xn)  from  the  minterms  of  {i/t+i, . . . specifying 
the  columns  that  match  f;  the  cost  is  at  most  ]Cj(Lcj/TOJ  + 1)  < 2 nc/m  + 2n~k . 
Finally,  / = \J t(gt  A ht ) requires  0(2m+3k)  additional  steps.  Choosing  k = [2  lg nj  and 
m = \n  — 91gn]  makes  the  total  cost  at  most  (2nc/n)(l  + 9n_1  lgn  + 0(n-1)). 

Of  course  we  need  to  prove  the  hint,  which  is  due  to  E.  I.  Nechiporuk  [Doklady 
Akad.  Nauk  SSSR  163  (1965),  40-42].  In  fact,  2m(l  + [A:  In 2])  vectors  suffice  (see  S.  K. 
Stein,  J.  Combinatorial  Theory  A16  (1974),  391-397):  If  we  choose  q = 2r"'\k  In  2] 
vectors  at  random,  not  necessarily  distinct,  the  expected  number  of  untouched  subcubes 
is  (m)2m(l  ~ 2 m)g  < (£t)2me~92  "*  < 2m.  (An  explicit  construction  would  be  nicer.) 

For  extensive  generalizations  — tolerating  a percentage  of  errors  and  specifying  the 
density  of  Is  — see  N.  Pippenger,  Mathematical  Systems  Theory  10  (1977),  129-167. 

64.  It’s  exactly  the  game  of  tic-tac-toe,  if  we  number  the  cells  |jjl  as  in  an  ancient  Chi- 
nese magic  square.  [Berlekamp,  Conway,  and  Guy  use  this  numbering  scheme  to  present 
a complete  analysis  of  tic-tac-toe  in  their  book  Winning  Ways  3 (2003),  732-736.] 

65.  One  solution  is  to  replace  the  “defending”  moves  dj  by  “attacking”  moves  aj  and 
“counterattacking”  moves  c3 , and  to  include  them  only  for  corner  cells  j £ {1,3, 9,  7}. 
Let  j ■ k = (jk)  mod  10;  then 

j ■ 1 j ' 2 j ■ 3 

j ■ 4 j ■ 5 j ■ 6 

j ■ 7 j ■ 8 j ■ 9 

gives  us  another  way  to  look  at  the  tic-tac-toe  diagram,  when  j is  a corner,  because 
j _L  10.  The  precise  definition  of  a}  and  c,  is  then 

aj  = mj  A (( Xj-3  A /3(j.8)(j.g)  A (Oj.4©Oj.6))  V ( Xj.7  A P(j-6)(j-9)  A {Oj-2®Oj.s)) 

V (rrij.g  A ((my8  A Xj.2  A (oy3©oy6))  V (mj.6  A x3.4  A (oy7®oy8))))); 

Cj  =Z  dj  A (Xj.6  A Oj.7)  A (Xj.8  A Oj.3)  A dj.9; 

here  dj  = rrij  A 0(j-2)(j-3)  A 0(j-4)(j-f)  lakes  the  place  of  (51).  We  also  define 

u = (aq  ® x3)  © (xr  © x9 ), 

V = (01  © 03)  © (07  © Og),  Zj  = 

t = ni2  A me  A m8  A m4  A (»Vti), 

in  order  to  cover  a few  more  exceptional  cases.  Finally  the  sequence  of  rank-ordered 
moves  d3did3dgd7d2dedsd4m3mim3mgm7m2memsm4  in  (53)  is  replaced  by  the  se- 
quence aia3aga7Cic3CgC7Z3ziz3zgZ7Z2zeZ8Z4;  and  we  replace  (d,AJ])V(m3-Aro])  in  (55) 
by  iaj  A a')  V (cjAc'j)  V ( ZjAz'j ) when  j is  a corner  cell,  otherwise  simply  by  (zj  Az'). 


{mj  A t , if  j = 5, 
mjAdj.g,  if  j € {1,3,9, 7}, 
mj,  if  j 6 {2, 6, 8, 4), 
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(Notice  that  this  machine  is  required  to  move  correctly  from  all  legal  positions, 
even  when  those  positions  couldn’t  arise  after  the  machine  had  made  X s earlier  moves. 
We  essentially  allow  humans  to  play  the  game  until  they  ask  the  machine  for  advice. 
Otherwise  great  simplifications  would  be  possible.  For  example,  if  X always  goes  first, 
it  could  grab  the  center  cell  and  eliminate  a huge  number  of  future  possibilities;  fewer 
than  8x6x4x2  = 384  games  could  arise.  Even  if  O goes  first,  there  are  fewer  than 
9x7x5x3  = 945  possible  scenarios  against  a fixed  strategy.  In  fact,  the  actual 
number  of  different  games  with  the  strategy  defined  here  turns  out  to  be  76  + 457,  of 
which  72  + 328  are  won  by  the  machine  and  the  rest  belong  to  the  cat.) 

66.  The  Boolean  chain  in  the  previous  answer  fulfills  its  mission  of  making  correct 
moves  from  all  4520  legal  positions,  where  correctness  was  essentially  defined  to  mean 
that  the  worst-case  final  outcome  is  maximized.  But  a truly  great  tic-tac-toe  player 
would  do  things  differently.  For  example,  from  position  the  machine  takes  the  center, 
z)||;,  and  O probably  draws  by  playing  in  a corner.  But  moving  to  zjj:  or  xjzjz  would 
give  O only  two  chances  to  avoid  defeat.  [See  Martin  Gardner,  HexaSexagons  and 
Other  Mathematical  Diversions,  Chapter  4.] 

Furthermore  the  best  move  from  a position  like  Sjx|z  is  to  |ji)=  instead  of  winning 
immediately;  then  if  the  reply  is  -gfx| - , move  to  y|*k.  That  way  you  still  win,  but  without 
humiliating  your  opponent  so  badly. 

Finally,  even  the  concept  of  a single  “best  move”  is  flawed,  because  a good  player 
will  choose  different  moves  in  different  games  (as  Babbage  observed). 

It  might  be  thought  that  programing  a digital  computer  to  play  ticktacktoe, 
or  designing  special  circuits  for  a ticktacktoe  machine, 
would  be  simple.  This  is  true  unless  your  aim  is  to  construct  a master  robot 
that  will  win  the  maximum  number  of  games  against  inexperienced  players. 

MARTIN  GARDNER,  The  Scientific  American  Book  of 
Mathematical  Puzzles  & Diversions  (1959) 

67.  The  best  solution  known  so  far,  due  to  David  Stevenson  in  2010,  uses  a total  of 
818  gates  (472  AND,  327  OR,  13  NOR,  6 ANDN);  see 

http : //www-cs-f acuity . Stanford . edu/~knuth/818-gate-solut ion 

for  the  details.  After  taking  care  of  moves  such  as  Wj  and  bj,  and  cleverly  optimizing 
don’t-cares,  Stevenson  essentially  ORs  together  about  200  special  positions  (such  as 
ikjt)  that^make  c = 1,  about  200  others  (such  as  Jfr)  that  make  s = 1,  and  about  50 
(such  as  ;j6k)  that  make  m = 1;  then  he  saves  gates  by  finding  common  subexpressions 
among  the  ANDs  that  define  special  positions,  and  by  using  the  distributive  law,  etc. 

[This  exercise  was  inspired  by  a discussion  in  John  Wakerly’s  book  Digital  Design 
(Prentice-Hall,  3rd  edition,  2000),  §6.2.7.  Incidentally,  Babbage  planned  to  choose 
among  k possible  moves  by  looking  at  N mod  k,  where  N was  the  number  of  games  won 
so  far;  he  didn’t  realize  that  successive  moves  would  tend  to  be  highly  correlated  until  N 
changed.  Much  better  would  have  been  to  let  N be  the  number  of  moves  made  so  far.] 

68.  No.  That  method  yields  a “uniform”  chain  with  a comprehensible  structure,  but 
its  cost  is  Q(n2n).  A circuit  with  approximately  2 n/n  gates,  constructed  by  Theorem  L, 
exists  but  is  more  difficult  to  fabricate.  (Incidentally,  C(7t5)  = 10.) 

69.  (a)  One  can,  for  example,  verify  this  result  by  trying  all  64  cases. 

(b)  If  x m lies  in  the  same  row  or  column  as  xt , and  also  in  the  same  row  or  column 
as  Xj,  we  have  am  = aioi  = ttno  = 0,  so  the  pairs  are  good.  Otherwise  there  are 
essentially  three  different  possibilities,  all  bad:  If  ( i,j,m ) = (1,2,4)  then  aioi  = 0, 
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®ioo  = £52:9  © x6xa,  aon  = x9 ; if  ( i,j,m ) = (1,2,6)  then  a0io  = X4X9,  a0n  = x7, 
Qioo  = xsxg,  Q101  = x8;  if  ( i,j,m ) = (1,5,9)  then  am  = 1,  (*110  = 0,  a0io  = 2:32:7. 

70.  (a)  xxA((a:5Aa;9)©(x6Aa:g))  ©*2A((a:6Ax7)®(2:4Aa;9))  © x3  A((x4Axs)®(x5Ax7)). 

(b)  xiA((i5Aa;9)V(a;6Ax8))  V x2  A((x6Ax7)V  (x4Axg))  V x3  A((x4Ax8)  V(x5Ax7)). 

(c)  Let  yi  = x\  Axs  Axg,  y2  = xiAxeAx8,  y3  = x2Ax&Ax7,  1/4  = x2AXiAxg,  y3  = 
x3Ax4Ax8,  ye  = x3Ax5Ax7.  The  function  f(yi,.. . ,y6)  = [2/1  + y2  + y3  > y4  + y3  + ye] 
can  be  evaluated  in  15  further  steps  with  two  full  adders  and  a comparator;  but  there  is 
a 14-step  solution:  Let  z\  = (2/1  ©2/2)  ©2/3,  z2  = (2/1  ©2/2)  V (2/1  ©2/3),  z3  = (2/4  ©2/5)  ©2/6, 
z*  = (l/4  © 3/5)  V (y4  © ye).  Then  / = ( z\  © ( z2  A(24©(ziVz3))))A  (z3Wz4).  Furthermore 
2A2/22/3  = 111  <=>  2/42/53/6  = 111;  so  there  are  don’t-cares,  leading  to  an  11-step  solution: 
/ = ((21AZ3)  V24)  A z2.  The  total  cost  is  12  + 11  = 23. 

(The  author  knows  of  no  way  by  which  a computer  could  discover  such  an  efficient 
chain  in  a reasonable  amount  of  time,  given  only  the  truth  table  of  /.  But  perhaps  an 
even  better  chain  exists.) 

71.  (a)  P(p)  = 1 - 12p2  + 24p3  + 12p4  - 96p5  + 144p6  - 96p7  + 24p®,  which  is  11  + 
§e2  — 3e4  — 24e6  + 24e8  when  p = | + e. 

(b)  There  are  N = 2'1  3 sets  of  eight  values  (fo,  ■ ■ ■ , f7),  each  of  which  yields  good 
pairs  with  probability  P(p).  So  the  answer  is  1 - P(p)N- 

(c)  The  probability  is  (^)P(p)r(l  — P(p))N~r  that  exactly  r sets  succeed;  and  in 
such  a case  t trials  will  find  good  pairs  with  probability  ( r/N )L  The  answer  is  therefore 
1 - £f=0  (?)  WU  - P(p))N~r(r/NY  = 1 - P(Py  + 0(t2/N). 

(d)  £"o  OPipVd-Pip^-E^/NY  = (l-P(py)/(l-P(p))+0(t3/N). 

72.  The  probability  in  exercise  71(a)  becomes  P(p)  + (72 p3  - 264p4  + 432p5  - 336p6  + 
96p7 ) r + (60 p2  - 240p3  + 456p4  - 432p5  + 144p6 ) r 2 + ( -48p2  + 144p3  - 2 16p4  + 96p5 ) r3  + 
(— 36p2  + 24p3  + 12p4)r4  + (48p2  - 24p3)r5  - 12pV.  If  p = q = (1  - r)/2,  this  is 
(11  + 48r  + 36r2  - 144r3  - 30r4  + 336r5  - 348r6  + 144r7  - 21r®)/32;  for  example,  it’s 
7739/8192  k,  0.94  when  r = 1/2. 

73.  Consider  the  Horn  clauses  1A2=>3,  lA3=i-4,  ...,  lA(n  - l)=>n,  lAn=>2,  and 
iAj=>  1 for  1 < i < j < n.  Suppose  \Z\  > 1 in  a decomposition,  and  let  1 be  minimum 
such  that  Xi  € Z.  Also  let  j be  minimum  such  that  j > i and  Xj  G Z.  We  cannot  have 
i > 1,  since  iAj=S>l  in  that  case.  Thus  i = 1,  and  Xj  € Z for  2 < j < n. 

74.  Suppose  we  know  that  no  nontrivial  decomposition  exists  with  x\  G Z or  • • • or 
xi- 1 € Z\  initially  i = 1.  We  hope  to  rule  out  Xi  G Z too,  by  choosing  j and  m 
cleverly.  The  Horn  clauses  iAj=>m  reduce  to  Krom  clauses  j=>m  when  i is  asserted. 
So  we  essentially  want  to  use  Tarjan’s  depth-first  search  for  strong  components,  in  a 
digraph  with  arcs  j=>m  that  may  or  may  not  exist. 

When  exploring  from  vertex  j,  first  try  m = 1,  . . . , m = * - 1;  if  any  such 
implication  iAj^rn  succeeds,  we  can  eliminate  j and  all  its  predecessors  from  the 
digraph  for  i.  Otherwise,  test  if  j=>m  for  any  such  eliminated  vertex  m.  Otherwise 
test  unexplored  vertices  m.  Otherwise  try  vertices  m that  have  already  been  seen, 
favoring  those  near  the  root  of  the  depth-first  tree. 

In  the  example  f(x)  = (det  X)  mod  2,  we  would  successively  find  lA2y^3,  1 A2=^4, 
1 A4=>3,  1 A3=>5,  1 A5=>6,  1A6=>7,  1A7=>8,  lA8=t>9,  1A9=>2  (now  i 2);  2A3y^l, 
2A3=>4,  2A4y^l,  2A4t^>5,  2A4=>6,  2A6=>1  (now  3,  4,  and  6 are  eliminated  from 
the  digraph  for  2),  2A5=>1  (and  5 is  eliminated),  2A7^1,  2A7=>3  (7  is  eliminated), 
2A8=>1,  2A9=i>l  (now  i «—  3);  3A4t^1,  3A4=^2,  3A5=>1,  etc. 
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75.  This  function  is  1 at  only  two  points,  which  are  complementary.  So  it  is  inde- 
composable; yet  the  pairs  (58)  are  never  bad  when  n > 3.  Every  partition  (y,  Z)  will 
therefore  be  a candidate  for  decomposition. 

Similarly,  if  / is  decomposable  with  respect  to  (Y,Z),  the  indecomposable  func- 
tion f(x)  ® So,n(x)  will  act  essentially  like  / in  the  tests.  (A  method  to  deal  with 
approximately  decomposable  functions  should  probably  be  provided  in  a general-purpose 
decomposability  tester.) 

76.  (a)  Let  a;  = [i  > l]  for  0 < l < 2m.  The  cost  is  < 2f(m),  as  observed  in  answer 
38(b);  and  in  fact,  the  cost  can  be  reduced  to  2m+1  — 2m  — 2 with  0(m)  depth. 
Furthermore  the  function  [i<j}  = (ii  A ji)  V ((ix  = ji)  A [i2  . . . im  < j2  . . . jm ])  can  be 
evaluated  with  4m— 3 gates.  After  computing  x(By,  each  z;  costs  2m+1  + l = 0(n/ log  n). 

(b)  Here  the  cost  is  at  most  C(g0)  + b C(g2m)  < (2m  + l)(22m/(2m  - O(m))) 

by  Theorem  L,  because  each  gi  is  a function  of  2m  inputs. 

(c)  If  i < j we  have  z;  = x for  l < i and  z;  = y for  l > i;  hence  fi(x ) = c0  ® • • • © a 
and  fj  (y)  = Cj+i  ® • • • ® c2m . If  i > j we  have  zi  = y for  l < i and  z;  = x for  l > i; 
hence  fj(y ) = Co  ® • • • ® e,  and  fi(x ) = ci+1  ® • • • ® c2m. 

(d)  The  functions  bi  = [j  < 1}  can  be  computed  for  0 < l < 2m  in  0( 2m)  steps,  as 
in  (a).  So  we  can  compute  F from  (co,...,c2m)  with  0{ 2m)  further  gates.  Step  (b) 
therefore  dominates  the  cost,  for  large  m. 

(e)  do  = 1,  ai  = i,  a2  = 0;  60  = 0,  bi  = j,  b2  = 1;  d = [i  < j]  = i V j\  mi  = ai  ® d, 
zio  = x0  ® (mi  A (*0  ® yo)),  zn  = xi  ® (mj  A (x\  ® y\)),  for  l = 0,  1,  2;  co  = zoi; 
Ci  = zio  A Zn;  c2  = z2o  V z2i;  c\  = ci  A (d  = ai),  c " — a A (d  = bi),  for  l = 0,  1,  2;  and 
finally  F = (c'0  ® ci  ® c'2)  V (c'o  ® c"  ® c2). 

The  net  cost  (29  after  obvious  simplifications)  is,  of  course,  outrageous  in  such 
a small  example.  But  one  wonders  if  a state-of-the-art  automatic  optimizer  would  be 
able  to  reduce  this  chain  to  just  5 gates. 

[This  result  is  a special  case  of  more  general  theorems  in  Matematicheskie  Zametki 
15  (1974),  937-944;  London  Math.  Soc.  Lecture  Note  Series  169  (1992),  165-173.] 

77.  Given  a shortest  such  chain  for  f„  or  fn,  let  Ut  = {i  \ l = j(i)  or  l = k(i)}  be  the 

“uses”  of  xi,  and  let  ui  = \Ui\.  Let  f,  = 1 if  Xi  = Xj^  otherwise  t.i  = 0.  We  will 

show  that  there’s  a chain  of  length  < r — 4 that  computes  either  /n_i  or  f„-i,  by  using 
the  following  idea:  If  variable  xm  is  set  to  0 or  1,  for  any  to,  we  can  obtain  a chain 
for  fn- 1 or  fn—i  by  deleting  all  steps  of  [7m  and  modifying  other  steps  appropriately. 
Furthermore,  if  Xi  — Xj(i)  0 2T(i)  and  if  either  Xj (j)  or  Xk(i)  is  known  to  equal  t,  when 
xm  has  been  set  to  0 or  1,  then  we  can  also  delete  the  steps  Ui.  (Throughout  this 
argument,  the  letter  to  will  stand  for  an  index  in  the  range  1 < m < n.) 

Case  1:  um  = 1 for  some  m.  This  case  cannot  occur  in  a shortest  chain.  For  if  the 
only  use  of  xm  is  Xi  = xm,  eliminating  this  step  would  change  /n  <->  /„;  and  otherwise 
we  could  set  the  values  of  xiL  . . . , xm-i , xm+i,  ■ ■ . , xn  to  make  Xi  independent  of  xm, 
contradicting  xn+r  = fn  or  fn . Thus  every  variable  must  be  used  at  least  twice. 

Case  2:  xi  = xm  for  some  l and  m,  where  um  > 1.  Then  x , = xi  oit  for  some  i 
and  k,  and  we  can  set  xm  «—  L to  make  Xi  independent  of  Xk-  Eliminating  steps  Um, 
Ui,  and  Ui  then  removes  at  least  4 steps,  except  when  ui  = u,  = 1 and  um  = 2 and 
Xj  = xm  o Xi ; but  in  that  case  we  can  also  eliminate  Uj. 

Case  3:  um  > 3 for  some  to,  and  not  Case  2.  If  i,j,k  e Um  and  i < j < k,  set 
xm  ■<—  tk  and  remove  steps  i,  j,  k,  Uk- 

Case  u\  = u2  = • • • = un  = 2,  and  not  Case  2.  We  may  assume  that  the  first 
step  is  X\  = xi  o x2,  and  that  xi  = x 1 o Xk  for  some  k < l. 
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Case  4-1:  k > 0.  Then  k > 1.  If  Uk  = 1,  set  x\  «—  t;  and  remove  steps  1,  k,  l,  Ui. 

Otherwise  set  x2  <—  ti;  this  forces  Xk  = it,  and  we  can  remove  steps  1,  k,  l,  Uk. 

Case  4-2:  xi  = x\  o xm-  Then  we  must  have  m = 2;  for  if  in  > 2 we  could 
set  X2  <—  ti,  xm  t—  ti,  and  make  xT  independent  of  x\.  Hence  we  may  assume  that 
x\  = ii  Ai2,  12  = xi  V X2-  Setting  x\  +-  0 allows  us  to  remove  Uo  and  U\\  setting 

x\  t—  1 allows  us  to  remove  Uo  and  U2-  Thus  we’re  done  unless  u\  = U2  — 1. 

If  xp  = xi,  set  x\  t—  0 and  remove  1,  2,  p,  t/p;  if  xq  = x2,  set  xi  <—  1 and  remove  1, 
2,  g,  Ug.  Otherwise  a;p  = x\  o a;u  and  xq  = i2oi»,  where  and  xv  do  not  depend  on 
xi  or  X2-  But  that’s  impossible;  it  would  allow  us  to  set  x3,  . . . , x„  to  make  xu  = tp, 
then  X2  i — 1 to  make  xr  independent  of  x\. 

[Problemy  Kibernetiki  23  (1970),  83-101;  28  (1974),  4.  With  similar  proofs, 
Red’kin  showed  that  the  shortest  AND-OR-NOT  chains  for  the  functions  ix1...x„  < 
Vi  ■ ■ ■ Vn  and  ‘xi  ...xn  = yi  ■ ■ ■ yp  have  lengths  5n  — 3 and  5 n — 1,  respectively.] 

78.  [SICOMP  6 (1977),  427-430.]  Say  that  yk  is  active  if  k E S.  We  may  assume  that 
the  chain  is  normal  and  that  ||S||  > 1;  the  proof  is  like  Red’kin’s  in  answer  77: 

Case  1:  Some  active  yk  is  used  more  than  once.  Setting  yk  0 saves  at  least  two 
steps  and  yields  a chain  for  a function  with  ||S||  — 1 active  values. 

Case  2:  Some  active  yk  appears  only  in  an  AND  gate.  Setting  yk  <—  0 eliminates 
at  least  two  steps,  unless  this  AND  is  the  final  step.  But  it  can’t  be  the  final  step, 
because  yk  = 0 makes  the  result  independent  of  every  other  active  yj . 

Case  3:  Like  Case  2 but  with  an  OR  or  NOTBUT  or  BUTNOT  gate.  Setting  yk  <—  c 
for  some  appropriate  constant  c has  the  desired  effect. 

Case  4-  Like  Case  2 but  with  XOR.  The  gate  can’t  be  final,  since  the  result  should 
be  independent  of  yk  when  (*1  . . . xm)2  addresses  a different  active  value  yj.  So  we  can 
eliminate  two  steps  by  setting  yk  to  the  function  defined  by  the  other  input  to  XOR. 

79.  (a)  Suppose  the  cost  is  r < 2n  - 2;  then  n > 1.  If  each  variable  is  used  exactly 
once,  two  leaves  must  be  mates.  Therefore  some  variable  is  used  at  least  twice.  Pruning 
it  away  produces  a chain  of  cost  < r - 2 on  n - 1 variables,  having  no  mates. 

(Incidentally,  the  cost  is  at  least  2n  — 1 if  every  variable  is  used  at  least  twice, 
because  at  least  2n  uses  of  variables  must  be  connected  together  in  the  chain.) 

(b)  Notice  that  So,n  = /\u_v(u  = v)  whenever  the  edges  u — v form  a free  tree 
on  {xi, . . . , xn}.  So  there  are  many  ways  to  achieve  cost  2 n — 3. 

Any  chain  of  cost  r < 2n  — 3 must  have  n > 2 and  must  contain  mates  u and  v.  By 
renaming  and  possibly  complementing  intermediate  results,  we  can  assume  that  u = 1, 
v — 2,  and  that  f(x  1, . . . ,x„)  = g(x  1 o h(x 3, . . . ,xn),X2, . . . ,xn),  where  o is  A or  ®. 

Case  1:  o is  AND.  We  must  have  h(0,  ...,0)  = h(  1, . . . , 1)  = 1,  for  otherwise 
f(x  i,x2,  y,...,y)  wouldn’t  depend  on  x\.  Therefore  f(x  1, . . . ,xn)  = h(x  3, . . . ,xn)  A 
g{x i,X2,  • ■ • , xn)  can  be  computed  by  a chain  of  the  same  cost  in  which  1 and  2 are 
mates  and  in  which  the  path  between  them  has  gotten  shorter. 

Case  2:  o is  XOR.  Then  / = fa  V/i,  where  f0(x  1,. . ,,xn)  = (xi=h(x3,-- . , xn))  A 
g(  0,x2,...,xn)  and  /i(xx,. . ,,xn)  = (Xl  ® h(x  3,...,xn))  A g(l,x2,. . -,xn).  But  / = 
So,n  has  only  two  prime  implicants;  so  there  are  only  four  possibilities: 

Case  2a:  f0  = f . Then  we  can  replace  xi  ® h by  0,  to  get  a chain  of  cost  < r — 2 
for  the  function  g(0,x2,  ...,xn)  = S0,n-i(x2,  ■ . -,xn). 

Case  2b:  fi  = /.  Similar  to  Case  2a. 

Case  2c:  fo(x)  = xi  A • ■ • A xn  and  fi(x)  = x\  A • • • A xn.  In  this  case  we  must 
have  g( 0,x2, . . . ,x„)  = x2  A • • • A xn  and  g(l,x2,  ■ . . ,xn)  — x2  A ■ • • A xn-  Replacing  h 
by  1 therefore  yields  a chain  that  computes  / in  < r steps. 
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Case  2d:  fo(x)  = X\  A • • • A xn  and  fi(x)  = xi  A • • • A xn.  Similar  to  Case  2c. 

Applying  these  reductions  repeatedly  will  lead  to  a contradiction.  Similarly,  one 
can  show  that  C(SoSn)  = 2n  — 2.  [Theoretical  Computer  Science  1 (1976),  289-295.] 

80.  (a)  Without  loss  of  generality,  a0  = 0 and  the  chain  is  normal.  Define  Ui  and  ui 
as  in  answer  77.  We  may  assume  by  symmetry  that  u\  = max(ui, . . . , un). 

We  must  have  rti  > 2.  For  if  ui  = 1,  we  could  assume  further  that  x„+i  = xi  0x2; 
hence  two  of  the  three  functions  SQ(0, 0,  x3, . . . , xn)  = Sa»,  Sa(0, 1,  x3, . . . , xn)  = Sta ,, 
(1, 1 , X3 , . . . , xn ) = S"a  would  be  equal.  But  then  Sa  would  be  a parity  function,  or 
Siai  would  be  constant. 

Therefore  setting  x\  = 0 allows  us  to  eliminate  the  gates  of  Ui,  giving  a chain  for 
SQ/  with  at  least  2 fewer  gates.  It  follows  that  C(Sa)  > C(Sa>)  + 2.  Similarly,  setting 
X\  — 1 proves  that  C(Sa)  > C(S>a)  + 2. 

Three  cases  arise  when  we  explore  the  situation  further: 

Case  1:  u\  > 3.  Setting  x\  = 0 proves  that  C(Sa)  > C(Sai)  + 3. 

Case  2:  U\  = {i,j}  and  operator  Oj  is  canalizing  (namely,  AND,  BUTNOT,  NOT- 
BUT,  or  OR).  Setting  x\  to  an  appropriate  constant  forces  the  value  of  Xj  and  allows 
us  to  eliminate  U\  U Uj\  notice  that  i Uj  in  an  optimum  chain.  So  either  C(Sa)  > 
C(Sa.)  + 3 or  C(Sa)  > C{S,a)  + 3. 

Case  3:  U\  = {?’,/}  and  ot  = Oj  = 0.  We  may  assume  that  Xi  = x\  © X2  and 
Xj  = xi  0 xk.  If  Uj  = 1 and  xi  = Xj  0 xp,  we  can  restructure  the  chain  by  letting 
Xj  = Xk®xp,  xi  = xi($)Xj ; therefore  we  can  assume  that  either  Uj  ^ lorii  = XjOxp  for 
some  canalizing  operator  o.  If  U2  = { i,j '},  we  can  assume  similarly  that  Xji  = x2®xy 
and  that  either  Uji  = 1 or  Xf  = Xji  o'  xpt  for  some  canalizing  operator  o'.  Furthermore 
we  can  assume  by  symmetry  that  Xj  does  not  depend  on  Xji. 

If  Xk  does  not  depend  on  Xi,  let  /( 2:3, . . . , xn ) = Xk\  otherwise  let  /( X3, . . . , x„ ) be 
the  value  of  xk  when  xt  = 1.  By  setting  x:  = f(x3,  ■ ■ ■ ,xn)  and  x2  = f(x3,  ■ ■ - ,xn), 
or  vice  versa,  we  make  xt  and  Xj  constant,  and  we  obtain  a chain  for  the  nonconstant 
function  Siai.  We  can,  in  fact,  ensure  that  xi  is  constant  in  the  case  Uj  = 1.  We  claim 
that  at  least  five  gates  of  this  chain  (including  xt  and  Xj)  can  be  eliminated;  hence 
C(Sa)  > C(Siai)  + 5.  The  claim  is  clearly  true  if  | Ui  U C/y  | > 3. 

We  must  have  \UiUUj  \ > 1.  Otherwise  we’d  have  p = i,  and  xk  would  not  depend 
on  Xi,  so  Sa  would  be  independent  of  x\  with  our  choice  of  x2.  Therefore  | Ui  IJ  U3  \ = 2. 

Case  Sa:  Uj  = {/}.  Then  xt  is  constant;  we  can  eliminate  Xi,  Xj,  and  UiUUj  U Ui. 
If  the  latter  set  contains  only  two  elements,  then  xq  = Xi  o xi  is  also  constant  and  we 
eliminate  Uq.  Since  Sia'  isn’t  constant,  we  won’t  eliminate  the  output  gate. 

Case  3b:  Ui  C Uj,  \Uj\  = 2.  Then  xq  = Xi  o Xj  for  some  q\  we  can  eliminate  Xi, 
Xj,  and  Uj  U Uq.  The  claim  has  been  proved. 

(b)  By  induction,  C(Sk)  >2 n + min(fc,n  - k)  - 3 - [re  = 2k],  for  0 < k < re; 
C(S>k)  > 2n  + min(fe,re  + 1 — k)  — 4,  for  1 < k < n.  The  easy  cases  are  C(So)  = 
C(Sn)  = C(S>i)  = C(S>n)  = n — 1;  C(S>o)  = 0.  (According  to  Figs.  9 and  10,  these 
bounds  are  optimum  for  k = [re/2]  when  re  < 5.  All  known  results  are  consistent  with 
the  conjecture  that  C(Sk)  = C(S>k)  for  k > re/2.) 

Reference:  Mathematical  Systems  Theory  10  (1977),  323-336. 

81.  If  some  variable  is  used  more  than  once,  we  can  set  it  to  a constant,  decreasing  re 
by  1 and  decreasing  c by  > 2.  Otherwise  the  first  operation  must  involve  x\,  because 
2/i  = x\  is  the  only  output  that  doesn’t  need  computation;  making  Xi  constant  decreases 
re  by  1,  c by  > 1,  and  d by  > 1.  [J.  Algorithms  7 (1986),  185-201.] 

82.  (62)  is  false. 
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(63)  reads,  “For  all  numbers  m there’s  a number  n such  that  m < n + 1”;  it  is 
true  because  we  can  take  m = n. 

(64)  fails  when  n = 0 or  n = 1,  because  the  numbers  in  these  formulas  are  required 
to  be  nonnegative  integers. 

(65)  says  that,  if  b exceeds  a by  2 or  more,  there’s  a number  ab  between  them.  Of 
course  it’s  true,  because  we  can  let  ab  = a + 1. 

(66)  was  explained  in  the  text,  and  it  too  is  true.  Notice  that  ‘A’  takes  precedence 
over  ‘V’  and  ‘=’  takes  precedence  over  '<=>’,  just  as  * + ’ takes  precedence  over  “>’  and 
“<’  over  ‘A’  in  (65);  these  conventions  reduce  the  need  for  parentheses  in  sentences  of  L. 

(67)  says  that,  if  A contains  at  least  one  element  n,  it  must  contain  a minimum 
element  m (an  element  that’s  less  than  or  equal  to  all  of  its  elements).  True. 

(68)  is  similar,  but  m is  now  a maximum  element.  Again  true,  because  all  sets  are 
assumed  to  be  finite. 

(6g)  asks  for  a set  P with  the  property  that  [OeP]  = [3  ^ P],  [1  G P]  = [4  £ P], 
...,  [999  eP]  = [1002  gP],  [1000  6 P]  # [1003  £ P],  [1001  6 P]  / [1004  gP],  etc.  It’s 
true  if  (and  only  if)  P = {x  \ a:  mod  6 E {1,2,3}  and  0 < x < 1000}. 

Finally,  the  subformula  Vro  (n  E C 77  n + 1 6 C)  in  (70)  is  another  way  of  saying 
that  C = 0,  because  C is  finite.  Hence  the  parenthesized  formula  after  V.4  VI?  is  a 
tricky  way  to  say  that  A = 0 and  5^0.  (Stockmeyer  and  Meyer  used  this  trick  to 
abbreviate  statements  in  L that  involve  long  subformulas  more  than  once.)  Statement 
(70)  is  true  because  an  empty  set  doesn’t  equal  a nonempty  set. 

83.  We  can  assume  that  the  chain  is  normal.  Let  the  canalizing  steps  be  yi,  . . . , yp. 
Then  yk  = ak  o /?&  and  / = ap+i,  where  ak  and  /?*,  are  ®’s  of  some  subsets  of 
{aq, . . . , xn,y  1, . . . , yk- 1};  at  most  n+k  — 2 ®’s  are  needed  to  compute  them,  combining 
common  terms  first.  Hence  C(f)  < p + J2kii(n  + k ~ 2)  = (P+  l)(n  + p/2)  - 1. 

84.  Argue  as  in  the  previous  answer,  with  V or  A in  place  of  ©.  [N.  Alon  and  R.  B. 
Boppana,  Combinatorica  7 (1987),  15  -16.] 

85.  (a)  A simple  computer  program  shows  that  13744  are  legitimate  and  19024  aren’t. 
(An  illegitimate  family  of  this  kind  has  at  least  8 members;  one  such  is  {00,  Of , 33,  55, 
ff,  15,3f,77}.  Indeed,  if  the  functions  aq  V x2  (3f),x2\/x3  (77),  and  (xiVi2)Ax3  (15) 
are  present  in  a legitimate  family  L,  then  x2  U 15  = 33  | 15  = 37  must  also  be  in  L .) 

(b)  The  projection  and  constant  functions  are  obviously  present.  Define  A*  = 
f]{B  \ B D A and  B 6 -4},  or  A*  = 00  if  no  such  set  B exists.  Then  we  have 
\a)  n \B]  = \AnB]  and  \A]  U [B]  = [(A  U £)*]. 

(c)  Abbreviate  the  formulas  as  i(Ci,V  Vi=„+i  <5«,  xt  C £,  V V,!=n+i  and  argue 
by  induction:  If  step  l is  an  AND  step,  xi  = Xj  nit  C xj  AitC  (xj  V V/=n+i  Si)  A 
(xk  VV.U+1  Si)  =I|V  V i=n+i  Si-,  X,  = XjAxk  c (xj  V Vte^+1  e»)  A ( xk  V Vi=^+i  e.)  = 
(xj  A xk)  V V/=n+ 1 e*i  and  A xk  = xi\/  ei . Argue  similarly  if  step  l is  an  OR  step. 

86.  (a)  If  5 is  an  r-family  contained  in  the  (r  + l)-family  S',  clearly  A (S)  C A (S'). 

(b)  By  the  pigeonhole  principle,  A (S)  contains  elements  u and  v of  each  part, 
whenever  S is  an  r-family.  And  if  A(S)  = {«,«},  we  certainly  have  u — v. 

(c)  The  result  is  obvious  when  r = 1.  There  are  at  most  r — 1 edges  containing 
any  given  vertex  u,  by  the  “strong”  property.  And  if  u — v,  the  edges  disjoint  from 
{w,  u}  are  strongly  (r  — l)-closed;  so  there  are  at  most  (r  — 2)2  of  them,  by  induction. 
Thus  there  are  at  most  1 + 2(r  - 2)  + (r  - 2)2  edges  altogether. 
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(d)  Yes,  by  exercise  85(b),  if  r > 1,  because  strongly  r-closed  graphs  are  closed 
under  intersection.  All  graphs  with  < 1 edges  are  strongly  r-closed  when  r > 1,  because 
they  have  no  r-families  containing  distinct  edges. 

(e)  There  are  Q)  triangles  Xij  A Xik  A Xjk,  only  n — 2 of  which  are  contained  in  any 
term  xuv  of  /.  Hence  the  minterms  for  at  most  (r  — l)2(n  — 2)  triangles  are  contained 
in  /,  and  the  others  must  be  contained  in  one  of  the  functions  a = Xi  ® (arq*)  A xkn\). 
Such  a term  has  the  form 

T = (\G]  n [HI)  © ([Gl  A\H])  = (|"G]  A r#l)  ATGTTFT, 

where  G and  H are  strongly  r-closed;  we  will  prove  that  T contains  at  most  2(r  — l)3 
triangles. 

A triangle  Xij  A x^  A Xjk  in  T must  involve  some  variable  (say  xtJ)  of  |"G]  and 
some  variable  (say  Xik)  of  \H] , but  no  variable  of  \G  n H] . There  are  at  most  (r  - l)2 
choices  for  ij : and  then  there  are  at  most  2(r  — 1)  choices  for  k,  since  H has  at  most 
r — 1 edges  touching  i and  at  most  r — 1 edges  touching  j. 

(f)  There  are  2n_1  complete  bigraphs  obtained  by  coloring  1 red,  coloring  other 
vertices  either  red  or  blue,  and  letting  u — v if  and  only  if  u and  v have  opposite  colors. 
By  the  first  formula  in  exercise  85(c),  the  minterms  B for  every  such  graph  must  be 
contained  in  one  of  the  terms  T = Si  — Xi  © (xj(i}  V xk(i))  = f(GU  H)*]  A [GU  H]. 
(For  example,  if  n = 4 and  vertices  (2,3,4)  are  (red,  blue,  blue),  then  B = xi2  A x\3  A 
X14  A X23  A x24  A X34.)  A minterm  B is  contained  in  T if  and  only  if,  in  the  coloring 
for  B,  some  edge  of  (GUH)*  has  vertices  of  opposite  colors,  but  all  edges  of  G U II  are 
monochromatic.  We  will  prove  that  T includes  at  most  2 n~rr2  such  B. 

Let  G be  any  graph,  and  T = [G*]  A [~G"|.  The  following  (inefficient)  algorithm 
can  be  used  to  find  G*:  If  there’s  an  r-family  S with  |A(S)|  < 2,  stop  with  G*  = 00. 
Otherwise,  if  A(S)  = {u,«}  and  u—/—v,  add  the  edge  u — v to  G and  repeat. 

At  most  2n~r  bipartite  minterms  B have  monochromatic  { Uj , Vj } for  1 < j < r 
when  |A(S)|  < 2.  And  when  A (S)  = {w,  ti}  there  are  2n_r_1  with  monochromatic 
{uj,Vj}  and  bichromatic  {«, u}.  So  we  want  to  show  that  the  algorithm  for  G takes 
fewer  than  2r2  iterations  when  G is  strongly  r-closed. 

For  k > 1,  let  Uk  — 'i’k  be  the  first  new  edge  added  to  G that  is  disjoint  from 
{ Uj,Vj } for  1 < j < k.  At  most  r such  edges  exist,  by  “strongness”;  and  each  of  them 
is  followed  by  at  most  2r  — 3 new  edges  that  touch  Uj  or  Vj . So  the  total  number  of 
steps  to  find  G*  is  at  most  r(2r  — 2)  + 1 < 2r2. 

(g)  Exercise  84  tells  us  that  q < (£)  + (p+  1)  (£)  • Thus  we  have  either  2(r  - l)3p  > 
(3)  — (r  — l)2(n  — 2)  or  (J)  + (p  + l)^)  > 2 r~1/r2 . Both  lower  bounds  for  p are 

^(6fe)3(1  + °(1°S?))  Wh6n  r=  i '^  ( 186624(lg  n)4  ) 1 

[Noga  Alon  and  Ravi  B.  Boppana,  Combinatorica  7 (1987),  1-22,  proceeded  in  this 
way  to  prove,  among  other  things,  the  lower  bound  f2(n/logn)s  for  the  number  of  A’s 
in  any  monotone  chain  that  decides  whether  or  not  G has  a clique  of  fixed  size  s > 3.] 

87.  The  entries  of  X3  are  at  most  n2  when  X is  a 0-1  matrix.  A Boolean  chain 
with  0(n’s7(logn)2)  gates  can  implement  Strassen’s  matrix  multiplication  algorithm 
4.6.4-(36),  on  integers  modulo  2L1sn2J+1. 

88.  There  are  1,422,564  such  functions,  in  716  classes  with  respect  to  permutation  of 
variables.  Algorithm  L and  the  other  methods  of  this  section  extend  readily  to  ternary 
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operations,  and  we  obtain  the  following  results  for  optimum  median-only  computation: 
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S.  Amarel,  G.  E.  Cooke,  and  R.  O.  Winder  [IEEE  Trans.  EC-13  (1964),  4-13,  Fig.  5b] 
conjectured  that  the  9-operation  formula 

(x\X2X3X4X5XeXj)  = (xi  {(X2X3X5)  (x2X4Xe)  (X3X4X7))  ((X2X5X6)  (X3X5X7)  {X4X6X7))) 

is  the  best  way  to  compute  medians-of-7  via  medians-of-3.  But  the  “magic”  formula 
(*1  (x2  (x3X4X5)  (x3X6X7))  (x4  (x2X6X7}  ( X3X5  (X5X6X7)))) 
needs  only  8 operations;  and  in  fact  the  shortest  chain  needs  just  seven  steps: 

(xiX2X3X4X5X6X7)  = (X1(X2(X5X6X7}(X3(X5X6X7)X4))(X5(X2X3X4)(X6(X2X3X4)X7))). 

The  interesting  function  f(x i,...,x7)  = (xiAx2Ax4)  V (x2Ax3Ax5)  V (x3Ax4Ax6)  V 
(x4/\x$/\X7)  V (xsAxgAxi)  V (x6Ax7Ax2)  V (X7AX1AX3),  whose  prime  implicants  corre- 
spond to  the  projective  plane  with  7 points,  is  the  toughest  of  all:  Its  minimum  length 
L(f)  = 11  and  minimum  depth  D(f)  = 4 are  achieved  by  the  remarkable  formula 

((xiX4(x4*5X6))(x3X6(xi  (X2X3X7)  (x2XsXe)))  (X2X7  (Xl  {X3X2X4)  {X3X3X7)))) . 

And  the  following  even  more  astonishing  chain  computes  it  optimally: 

Xg  = (X1X2X3),  Xg  = (xix4x6),  X10  = (xiX5X8),  Xu  = (x2X7Xg), 

X12  = (X3X9X10),  Xl3  = (X4X5X12),  Xi4  = (X6X11X12),  X15  = (x7Xl3Xl4). 


SECTION  7.1.3 

1.  These  operations  interchange  the  bits  of  x and  y in  positions  where  m is  1.  (In 
particular,  if  m = —1,  the  step  ly  <—  y 0 (x  & m)’  becomes  just  ‘y  <—  y ® x’,  and  the 
three  assignments  will  swap  x y without  needing  an  auxiliary  register.  H.  S.  Warren, 
Jr.,  has  located  this  trick  in  vintage-1961  IBM  programming  course  notes.) 

2.  All  three  hold  when  x and  y are  nonnegative,  or  if  we  regard  x and  y as  “unsigned 

2-adic  integers”  in  which  0<l<2<  -<—  3<-2<  -1.  But  if  negative  integers 

are  less  than  nonnegative  integers,  (i)  fails  if  and  only  if  x < 0 and  y < 0;  (ii)  and  (iii) 
fail  if  and  only  if  x ® y < 0,  namely,  if  and  only  if  x < 0 and  y > 0 or  x > 0 and  y < 0. 

3.  Note  that  x — y = (x  (By)  — 2(x  & y)  (see  exercise  93).  By  removing  bits  common 
to  x and  y at  the  left,  we  may  assume  that  xn_i  = 1 and  yn-\  = 0.  Then  2(x  & y)  < 
2((x  © y)  - 2n~1)  = (x  © y)  - (x  © y)M  - 1. 

4.  xCN  = x + 1 = xs,  by  (16).  Hence  xNC  = xNCSP  = x™cnp  = xnnp  = xp 
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5.  (a)  Disproof:  Let  x = ( . . .x2XiXo)2-  Then  bit  l of  x <g  k is  xi-k[l  > k j.  So  bit 
l of  the  left-hand  side  is  xi-k-j[l>  k][l  — k>j ],  while  bit  l of  the  right-hand  side  is 
xi-j-k[l>j  + k].  These  expressions  agree  if  j > 0 or  k < 0.  But  if  j < 0 < k,  they 
differ  when  l = max(0,  j + k)  and  xi-j-k  = 1. 

(We  do,  however,  have  (x  j)  <S  k C x <g  (j  + k)  in  all  cases.) 

(b)  Proof:  Bit  l in  all  three  formulas  is  xi+j[l  > —j]  A yi-k[l  > k]. 

6.  Since  x <C  y > 0 if  and  only  if  x > 0,  we  must  have  x > 0 if  and  only  if  y > 0. 
Obviously  x — y is  always  a solution.  The  solutions  with  x > y are  (a)  x — — 1 and 
y — —2,  or  2V  > x > y > 0;  (b)  x = 2 and  y = 1,  or  2~x  > —y  > —x  > 0. 

7.  Set  x' <—  (x  + p>o)  ® R o,  where  is  the  constant  in  (47).  Then  x'  = ( . . . x'2x'1x'0)2, 
since  (x'(Bp.o)  — Ro  = ( • • • x'3X2x'1x'0)2  — ( ■ ■ ■ 1010)2  = ( • • • 0x'20x'o)2  — ( ■ ■ ■ a^Oz'iO^  = x. 

[This  is  Hack  128  in  HAKMEM;  see  answer  20  below.  An  alternative  formula, 
x «—  (po  — x)®fio,  has  also  been  suggested  by  D.  P.  Agrawal,  IEEE  Trans.  C-29  (1980), 
1032-1035.  The  results  are  correct  modulo  2n  for  all  n,  but  overflow  or  underflow  can 
occur.  For  example,  two’s  complement  binary  numbers  in  an  n-bit  register  range  from 
— 2n_1  to  2n_1  — 1,  inclusive,  but  negabinary  numbers  range  from  — |(2n  — 1)  to 
i(2n  — 1)  when  n is  even.  In  general  the  formula  x <—  (x  + fi)  © //.  converts  from 
binary  notation  to  the  general  number  system  with  binary  basis  (2n(— l)m")  discussed 
in  exercise  4.1  30(c),  when  y = ( . . . ra2mirao)2.] 

8.  First,  x®y  £ (S® y) U (x®T).  Second,  suppose  that  0 < k < x®?/,  and  let  x®y  = 
(ala')2,  k = (a0aw)2,  where  a,  a',  and  a"  are  strings  of  0s  and  Is  with  |a'|  = |a"|. 
Assume  by  symmetry  that  x — (/31/3')2  and  y = (707^2,  where  |a'|  = \fi'\  — I7' | . Then 
k®y  = (/307")2  is  less  than  x.  Hence  k®y  G S,  and  k = (fc®  y)  ® y € Sffi y.  [See  R.  P. 
Sprague,  Tohoku  Math.  J.  41  (1936),  438-444;  P.  M.  Grundy,  Eureka  2 (1939),  6-8.] 

9.  The  Sprague-Grundy  theorem  in  the  previous  exercise  shows  that  two  piles  of  x 
and  y sticks  are  equivalent  in  play  to  a single  pile  of  x @ y sticks.  (There  is  a nonnegative 
integer  k < x ® y if  and  only  if  there  either  is  a nonnegative  i < x with  iQ  y < x T y or 
a nonnegative  j < y with  x ® j < x ® y.)  So  the  k piles  are  equivalent  to  a single  pile 
of  size  01  ® ■ ■ ■ ® a/t.  [See  C.  L.  Bouton,  Annals  of  Math.  (2)  3 (1901  1902),  35-39.] 

10.  For  clarity  and  brevity  we  shall  write  simply  xy  for  x ® y and  x + y for  x ® y.  in 
parts  (i)  through  (iv)  of  this  answer  only. 

(i)  Clearly  Oy  = 0 and  x + y — y + x and  xy  = yx.  Also  \y  — y,  by  induction  on  y. 

(ii)  If  x / x'  and  y / y then  xy  + xy'  + x' y Ax' y / 0,  because  the  definition  of  xy 
says  that  xy'  + x'y  + x y / xy  when  0 < x < x and  0 < y'  < y.  In  particular,  if  x / 0 
and  j/  / 0 then  xy  7^  0.  Another  consequence  is  that,  if  x = mex(S)  and  y = mex(T) 
for  arbitrary  finite  sets  S and  T,  we  have  xy  = rnex{xj  + iy  + ij  \ i € S,  j G T}. 

(iii)  Consequently,  by  induction  on  the  (ordinary)  sum  of  x , y,  and  z,  (x  + y)z  is 

mex{(i  + y)z  + ( x + y)z  + (x'  + y)z  , (x  + y)z  + (x  + y')z  + (x  + y')z 

| 0 < x < x,  0 < y < y,  0 < z < 2}, 

which  is  mex{xz'  + x z + x z'  + yz,  xz  + yz'  + y' z + y' z'}  = xz  + yz.  In  particular, 
there’s  a cancellation  law:  If  xz  — yz  then  (x  + y)z  = 0,  so  x = y or  2 = 0. 

(iv)  By  a similar  induction,  ( xy)z  = mex{(xy)z'  + (xy1  + x'y  + x'y'){z  + z')}  = 
m ex{(xy)z'  + ( xy')z  + (xy’)z'  + • • • } = m ex{x(yz')  + x(y'  z)  + x(y'  z')  + • • • } = 
mex{(i  + x')(yz'  + y'z  + y z')  + x' (yz)}  = x(yz). 

(v)  If  0 < x,  y < 22"  we  shall  prove  that  x ® y < 22”,  22"  <S>  y = 2 2"y,  and 
22”  ® 22"  = |22".  By  the  distributive  law  (iii)  it  suffices  to  consider  the  case  x = 2“ 
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and  y = 2b  for  0 < a,  b < 2n.  Let  a = 2P  + a'  and  b - 2q  + b' , where  0 < a'  < 2P  and 

0 < b'  < 2q\  then  x = 22P  ® 2“  and  y = 22<l  ® 2b' , by  induction  on  n. 

Ifp  < n — 1 and  q < n — 1 we’ve  already  proved  that  x®p  < 22"  1 . If  p < q = n — 1, 
then  x ® 26'  < 22“ , hence  x ® y < 22n.  And  if  p = g = n - 1,  we  have  x ® y = 

22P  ® 22P  ® 2“'  ® 2(>'  = (f 22P)  ® z , where  2 < 22P.  Thus  x ® p < 22"  in  all  cases. 

By  the  cancellation  law,  the  nonnegative  integers  less  than  22”  form  a subfield. 
Hence  in  the  formula 


22"  ®>  y = mex{22'V  © x {y  © y')  | 0 < x < 22”,  0 < y < y} 

we  can  choose  x'  for  each  y to  exclude  all  numbers  between  2 2"p'  and  22”(p'  + 1)  — 1; 
but  22"p  is  never  excluded. 

Finally  in  22"  ® 22"  = mex{22,,(x'  © y)  © (x1  ® y)  \ 0 < x',y'  < 22"},  choosing 
x'  = y'  will  exclude  all  numbers  up  to  and  including  22"  — 1,  since  x ® x = y ® y 
implies  that  (x  © y)  ® (x  © y)  = 0,  hence  x = y.  Choosing  x'  = y'  © 1 excludes  numbers 
from  22  to  §22”  — 1,  since  (x®x)©x  = (y®y)®y  implies  that  x = y or  x = yffil,  and 
since  the  most  significant  bit  of  x <g>  x is  the  same  as  that  of  x.  This  same  observation 
shows  that  |22"  is  not  excluded.  QED. 

Consider,  for  example,  the  subfield  {0, 1, . . . , 15}.  By  the  distributive  law  we  can 
reduce  x®y  to  a sum  of  x®  1,  x<g>  2,  x®4,  and/or  x®  8.  We  have  2®2  = 3,  2®4  = 8, 
4 ® 4 = 6;  and  multiplication  by  8 can  be  done  by  multiplying  first  by  2 and  then  by  4 
or  vice  versa,  because  8 = 2 ® 4.  Thus  2 ® 8 = 12,  4 ® 8 = 11,  8 ® 8 = 13. 

In  general,  for  n > 0,  let  n = 2m  + r where  0 < r < 2m.  There  is  a 2m+1  x 2m+1 
matrix  Qn  such  that  multiplication  by  2n  is  equivalent  to  applying  Qn  to  blocks  of 
2m+1  bits  and  working  mod  2.  For  example,  Q i = (“) , and  ( . . . X4X3x2XiXo)2  ® 21  = 
( • • -2/42/3 2/2 ?/i2/o)2,  where  y0  = xi,  yx  = xi  ©x0,  y2  = x3,  y3  = x3  ©x2,  t/4  = x5,  etc. 
The  matrices  are  formed  recursively  as  follows:  Let  Q0  = Ro  = (1)  and 


Q 


2 m+r  — 


I Rm 
I 0 


Qr  0 

0 Qr 


Rm  + 1 — 


Rl 

o 


Q 2”l+1  —1 5 


where  Qr  is  replicated  enough  times  to  make  2m+1  rows  and  columns.  For  example, 


/ 1 ° 1 1 \ 
0 110 
10  0 0 
\0  1 0 0/ 


Q3  = Q2 


/ 1 1 0 1 \ 

1 

0 

<5 

10  11 

V 0 Qi) 

110  0 

1 

\1  0 0 0 / 

= r2. 


If  register  x holds  any  64-bit  number,  and  if  0 < j < 7,  the  MMIX  instruction  MX0R  y.qj.x 
will  compute  y = x ® 22 , given  the  hexadecimal  matrix  constants 


q0  = 8040201008040201, 
qt  = c08030200c080302 , 
q2  = b06080400b060804 , 


q3  = d0b0c0800d0b0c08, 
q4  = 8d4b2c 1880402010, 
q5  = c68d342cc0803020, 


q6  = b9678d4bb0608040, 
q7  = deb9c68dd0b0c080. 


[J.  H.  Conway,  On  Numbers  and  Games  (1976),  Chapter  6,  shows  that  these  definitions 
actually  yield  an  algebraically  closed  field  over  the  ordinal  numbers.] 

11.  Let  m = 2a‘  + • ■ ■ + 2ai  with  a,  > ••■  > m > 0 and  n = 26‘  + f-  2bl  with 

bt  > ■ ■ ■ > bi  > 0.  Then  m ® n = mn  if  and  only  if  (as  \ ■ ■ ■ \ ai)  & (bt  \ ■ ■ ■ \ foi)  = 0. 

12.  If  x = 22  a + 6 where  0 < u,  b < 22” , let  x'  = x ® (x  © a).  Then 

x = ((22"  ® a)  © 6)  ® ((22"  ® a)  © a © b)  = (22"  _1  ® a ® o)  © (b  ® (a  © b))  < 21" . 
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To  nim-divide  by  x we  can  therefore  nim-divide  by  x'  and  multiply  by  x®a.  [This  algo- 
rithm is  due  to  H.  W.  Lenstra,  .Jr.;  see  Seminaire  de  Theorie  des  Nombres  (Universite 
de  Bordeaux,  1977-1978),  expose  11,  exercise  5.] 

13.  If  020-  • -©a*,  = ai0a3©-  ■ -0((fc  — 2)®<n,)  = 0,  every  move  breaks  this  condition; 
we  can’t  have  (a  ® x)  0 (b  ® y)  = (a®  a:')  0 (6®  y')  when  a / b unless  ( x,y ) = ( x',y '). 

Conversely,  if  a2  0 • • • 0 a*,  ^ 0 we  can  reduce  some  aj  with  j > 2 to  make  this 
sum  zero;  then  or  can  be  set  to  a3  0 • • • © ((k  - 2)  ® ak).  If  a2  0 • • • © ak  = 0 and 
«i  ^ “3  © • • • 0 ((k  - 2)  ® ak),  we  simply  reduce  ai  if  it  is  too  large.  Otherwise  there’s  a 
j — 3 such  that  equality  will  occur  if  (j  — 2)  ® a,  is  replaced  by  an  appropriate  smaller 
value  (( j - 2)  ® a' ) 0 ((i  - 2)  ® (aj  0 o'  )),  for  some  2 < i < j and  0 < a'  < a3,  because 
of  the  definition  of  nim  multiplication;  hence  both  of  the  desired  equalities  are  achieved 
by  setting  aj  <—  a'  and  Oi  «—  Oj  © aj  0 a'j . [This  game  was  introduced  in  Winning  Ways 
by  Berlekamp,  Conway,  and  Guy,  at  the  end  of  Chapter  14.] 

14.  (a)  Each  y = (...y2yiyo)2  = xT  determines  x = (...x2xix0)2  uniquely,  since 
x0  = y0  0 t and  [y/2J  = [x/2jT*o. 

(b)  When  k > 0,  it  is  a branching  function  with  labels  taap  = a for  |/3|  = k — 1, 
and  ta  = 0 for  |a|  < k.  But  when  k < 0,  the  mapping  is  not  a permutation;  in  fact,  it 
sends  2~k  different  2-adic  integers  into  0,  when  k < 0. 

[The  case  k — 1 is  particularly  interesting:  Then  xT  takes  nonnegative  integers 
into  nonnegative  integers  of  even  parity,  negative  integers  into  nonnegative  integers  of 
odd  parity,  and  -1/3  i-A  -1.  Furthermore  [.xT/2J  is  “Gray  binary  code,”  7.2.1. l-(g).] 

(c)  If  p(x  0 y)  = k we  have  T(x)  = T(y)  and  x = y + 2k  (modulo  2k+1).  Hence 
p(xT  © yT)  = p(x  ®y®T(x)®T(y))  = k.  Conversely,  if  p(xT  ®yT)  = k whenever  y = 
x + 2k,  we  obtain  a suitable  bit  labeling  by  letting  ta  = (xT^>  |a|)  mod  2 when  x=(aR)2- 

(d)  This  statement  follows  immediately  from  (a)  and  (c).  For  if  we  always  have 
p(x  0 y)  = p(xv®  yu)  — p(xv®yv),  then  p(x®y)  = p(xu®yu)  = p(xuv®yuv).  And 
if  xTU  = x for  all  x,  p(xu © yu)  = p(x  0 y)  is  equivalent  to  p(x  0 y)  = p(xT®yT). 

We  can  also  construct  the  labelings  explicitly:  If  W = UV,  note  that  when  a,  b,  c e 
{0, 1}  we  have  Wa  = UaVa,,  Wab  = UabVa,v,  and  Wabc  = UabcVa,Vc,,  where  o'  = a©u, 
b’  = i0aa,c'  = c®uab,  and  so  on;  hence  «i  = #®ii,«i„  = tta0v,  wab  = uab  ®va>b>, 
etc.  The  labeling  T inverse  to  U is  obtained  by  swapping  left  and  right  subtrees  of  all 
nodes  labeled  1;  thus  t = u,  ta,  = ua,  taibi  = uab,  etc. 

(e)  The  explicit  constructions  in  (d)  demonstrate  that  the  balance  condition  is 
preserved  by  compositions  and  inverses,  because  {O',  T}  = {0, 1}  at  each  level. 

Notes:  Hendrik  Lenstra  observes  that  branching  functions  can  profitably  be  viewed 
as  the  isometries  (distance-preserving  permutations)  of  the  2-adic  integers,  when  we 
use  the  formula  l/2p<-x®y)  to  define  the  “distance”  between  2-adic  integers  x and  y. 
Moreover,  the  branching  functions  mod  2d  turn  out  to  be  the  Sylow  2-subgroup  of  the 
group  of  all  permutations  of  {0, 1, . . . , 2d  - 1},  namely  the  unique  (up  to  isomorphism) 
subgroup  that  has  maximum  power-of-2  order  among  all  subgroups  of  that  group.  They 
also  are  equivalent  to  the  automorphisms  of  the  complete  binary  tree  with  2d  leaves. 

15.  Equivalently,  (x  + 2a)  © b = (x  0 b)  + 2a;  so  we  might  as  well  find  all  b and  c such 
that  (x  © b)  + c = (x  + c)  © b.  Setting  x = 0 and  x = — c implies  that  b + c = b © c and 
b— c = f>©(— c);  hence  fe&c  = 6&(— c)  = 0 by  (89),  and  we  have  b < 2pc . This  condition  is 
also  sufficient.  Thus  0 < b < 2pa+1  is  necessary  and  sufficient  for  the  original  problem. 

16.  (a)  If  p(x  0 y)  = k we  have  x = y+  2k  (modulo  2fc+1);  hence  x + a = y + a + 2k 
and  p((x  + a)  © (y  + a))  = k.  And  p((x  © b)  © (y  © 6))  is  obviously  k. 
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(b)  The  hinted  labeling,  call  it  P(c),  has  Is  on  the  path  corresponding  to  c,  and 
Os  elsewhere;  thus  it  is  balanced.  The  general  animating  function  can  be  written 


p(co)-“ir(c,)-“2...p(Cm^)-™ 

VI7  Cm,  , 


where  c,  = &i  ® • • • © bj ; 


so  it  is  balanced  if  and  only  if  cm  = 0. 

[Incidentally,  the  set  S = (P(0)}  U {P(k)  © P(k  + 2e)  | k > 0 and  2C  > k)  provides 
an  interesting  basis  for  all  possible  balanced  labelings:  A labeling  is  balanced  if  and 
only  if  it  is  © {g  | g G Q}  for  some  Q C S.  This  exclusive-or  operation  is  well  defined 
even  though  Q might  be  infinite,  because  only  finitely  many  Is  appear  at  each  node.] 

(c)  The  function  P(c ) in  (b)  has  this  form,  because  zp(c)  = x ® [x  ® c] . Its 
inverse,  xsM  = ((x  © c)  + 1)  © c,  is  x © [a;  © c]  = xp(s).  Furthermore  we  have 
xP(c)P(d)  = a:p(c)0| xp(c')®d]  = a;© [a:©c]©Lxffids(c)] , because  [x©y]  = \xTQ>yT~\  for  any 
branching  function  xT . Similarly  a ;p(c)p(d)p(<0  = x®  [a©c]  © |_:r©ds(c)]  © [s©es(d)s(c)] , 
etc.  After  discarding  equal  terms  we  obtain  the  desired  form.  The  resulting  numbers 
Pj  are  unique  because  they  are  the  only  values  of  x at  which  the  function  changes  sign. 

(d)  We  have,  for  example,  x © [x  © a]  © \x  © 6]  © [x  © c]  = xp(a')p(6')p(c')  where 
a'  = a,  6'  = 6p(“'),  and  c'  = cp(“')p«<'). 

[The  theory  of  animating  functions  was  developed  by  J.  H.  Conway  in  Chapter  13 
of  his  book  On  Numbers  and  Games  (1976),  inspired  by  previous  work  of  C.  P.  Welter 
in  Indagationes  Math.  14  (1952),  304-314;  16  (1954),  194-200.] 

17.  (Solution  by  M.  Slanina.)  Such  equations  are  decidable  even  if  we  also  allow  opera- 
tions such  as  a :&y,  x,  x<l,  jc»l,  2px , and  2Xx , and  even  if  we  allow  Boolean  combina- 
tions of  statements  and  quantifications  over  integer  variables,  by  translating  them  into 
formulas  of  second-order  monadic  logic  with  one  successor  (SIS).  Each  2-adic  variable 
x = (..  .X2X\xo)2  corresponds  to  an  SIS  set  variable  X,  where  j G X means  Xj  = 1: 


z = x becomes 
z = x &y  becomes 
z = 2px  becomes 
z = x + y becomes 


Vt(f  G Z &t£X)-, 

Vf(t  eZ&(teXAte  Y))- 

Vt(t  e Z <=>  (te  X A Vs(s  <t=>  s <£  X))); 

3CVt(o^c  a (tez<$  (tax)  © (teY)  © (tec)) 
A (t+i  ((taX)(taY)(taC)))). 


An  identity  such  as  i & (— x)  = 2px  is  equivalent  to  the  translation  of 


VXVFVZ((integer(X)  A 0 — x + y A z = x & y)  =>  z = 2px), 

where  integer(X)  stands  for  3tVs(s  > t =>  (s  G X » 1 £ X)).  We  can  also  include 
2-adic  constants  if  they  are,  say,  ratios  of  integers;  for  example,  z = p,0  is  equivalent  to 
the  formula  0 G Z A Vt(t  G7#(  + lj(Z).  But  of  course  we  cannot  include  arbitrary 
(uncomputable)  constants. 

J.  R.  Biichi  proved  that  all  formulas  of  SIS  are  decidable,  in  Logic,  Methodology, 
and  Philosophy  of  Science:  Proceedings  (Stanford,  1960),  1-11.  If  we  restrict  attention 
to  equations,  one  can  show  in  fact  that  exponential  time  suffices. 

On  the  other  hand  M.  Hamburg  has  shown  that  the  problem  would  be  unsolvable 
if  px,  Xx,  or  l<i  were  added  to  the  repertoire;  multiplication  could  then  be  encoded. 

Incidentally,  many  nontrivial  identities  exist,  even  if  we  use  only  the  operations 
x © y and  x + 1 . For  example,  C.  P.  Welter  noticed  in  1952  that 

((x  ®(y+  1))  + 1)  © (x  + 1)  = ((((a;  + 1)  © y)  + 1)  © x)  + 1. 


id 
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18.  Of  course  row  x is  entirely  blank  when  i is  a multiple  of  64.  The  fine  details 
of  this  image  are  apparently  “chaotic”  and  complex,  but  there  is  a fairly  easy  way  to 
understand  what  happens  near  the  points  where  the  straight  lines  x = 64 y/j  intersect 
the  hyperbolas  xy  = 2 nfc,  for  integers  j,k  > 1 that  aren’t  too  large. 

Indeed,  when  x and  y are  integers,  the  value  of  x2y  S>  11  is  odd  if  and  only  if 
x2y/212  mod  1 > Thus,  if  x = 64 \J~j  + 8 and  xy  = 2 11  (fc  + e)  we  have 


x2y 

212 


mod  1 


7128  \Tj8+82 
V 4096 


)ymod  1 = K mod  1 = ({k  + e)S~ik)  mod  L 


and  this  quantity  has  a known  relation  to  | when,  say,  5 is  close  to  a small  integer. 
[See  C.  A.  Pickover  and  A.  Lakhtakia,  J.  Recreational  Math.  21  (1989),  166-169.] 

19.  (a)  When  n = 1,  f(A,B,C)  has  the  same  value  under  all  arrangements  except 
when  do  / oi,  bo  ^ bi,  and  Co  / ci;  and  then  it  cannot  exceed  1.  For  larger  values  of  n 
we  argue  by  induction,  assuming  that  n = 3 in  order  to  avoid  cumbersome  notation.  Let 
Ao  = (do,  di,  q.2 , 03),  A\  = (d4,  ds,  d6,  d7 ) , . . . , Ci  = (C4,  C5,  cq,  C7).  Then  f(A,  B,  C)  = 
H2jBk(Bi=of(ABBki  C‘)  < T,jS)kS)i=of(ABBk:Ct)  by  induction.  Thus  we  can  assume 
that  do  > di  > d2  > d3,  04  > ds  > d6  > d7,  . . . , C4  > cs  > eg  > C7.  We  can  also 
sort  the  subvectors  A'0  = (o0,  di,  d4,  o5),  A)  = (o2,  a3,  d6,  a7),  . . . , Cj  = (c2,  c3,  c6,  c7) 
in  a similar  way.  Finally,  we  can  sort  A'i  = (ao,  di , ae,  07),  A"  = (a2,  d3,  u4,  a5),  ..., 
C"  = (c2,  C3,  C4,  C5),  because  in  each  term  cijbkCi  the  number  of  subscripts  {j,  k,  1}  with 
leading  bits  01,  10,  and  11  must  satisfy  s0i  = si0  = sn  (modulo  2).  And  these  three 
sorting  operations  leave  A,  B,  C fully  sorted,  by  exercise  5.3.4-48.  (Exactly  three  sorts 
on  subvectors  of  length  2n_1  are  needed,  for  all  n > 2.) 

(b)  Suppose  A = A* , B = B* , and  C = C* . Then  we  have  aj  = Ylt=o  1 a*[i  ^ 
where  aj  = a j — aj+i  > 0 and  we  set  a2r>  = 0;  similar  formulas  hold  for  bk  and  c/.  Let 
A(p)  denote  the  vector  (ap(0), . . . , dp(2r._1))  when  p is  a permutation  of  {0, 1, ....  2n  — 1}. 
Then  by  part  (a)  we  have 


f(A(P),  B(q),  C(r))  = £j©fc®i=o  E t,u,v  atPulv(p(j)  < t][q(k)  < u][r(l)  < n] 

< Ej®fc®i=o  E t,u,v  atPulvlj  < t][k  < u][l  <v}  = f(A,  B,  C). 

[This  proof  is  due  to  Hardy,  Littlewood,  and  Polya,  Inequalities  (1934),  §10.3.] 

(c)  The  same  proof  technique  extends  to  any  number  of  vectors.  [R.  E.  A.  C. 
Paley,  Proc.  London  Math.  Soc.  (2)  34  (1932),  265-279,  Theorem  15.] 

20.  The  given  steps  compute  the  least  integer  y greater  than  x such  that  uy  = vx. 
They’re  useful  for  generating  all  combinations  of  n objects,  taken  m at  a time  (that  is, 
all  m-element  subsets  of  an  n-element  set,  with  elements  represented  by  1 bits). 

[This  tidbit  is  Hack  175  in  HAKMEM,  Massachusetts  Institute  of  Technology 
Artificial  Intelligence  Laboratory  Memo  No.  239  (29  February  1972).] 

21.  Set  t <—  y + 1,  t®  y,  tif-  f&y,  if-  v — (u&  —v)/(u  +1).  If  y = 2m  — 1 is 
the  first  m-combination,  these  eight  operations  set  x to  zero.  (The  fact  that  x = f(y) 
does  not  seem  to  yield  any  shorter  scheme.) 

22.  Sideways  addition  avoids  the  division:  SUBUt,x,l;  ANDNu,x,t;  SADDk,t,x; 
ADDU  v,x,u;  X0Rt,v,x;  ADDUk,k,2;  SRUt,t,k;  ADDUy,v,t.  But  we  can  actually 
save  a step  by  judiciously  using  the  constant  mone  = —1:  SUBU  t,x,l;  X0R  u,t,x; 
ADDU  y,x,u;  SADDk,t,y;  ANDN  y,y,u;  SLUt,mone,k;  0RN  y,y,t. 

23.  (a)  (0... 01...  1)2  = 2m  - 1 and  (0101... 01)2  = (22m  - l)/3. 
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(b)  This  solution  uses  the  2-adic  constant  p,0  = ( . . .010101)2  = -1/3: 


t <—  x © fl0, 


(t~  1)* 


v + 1,  y <—  w + 


v & w 

Vu  + 1 


If  x = (22m  - l)/3,  the  operations  produce  a strange  result  because  u = 22m+I  - 1. 

(c)  XORt,x,mO;  SUBUu.t.l;  X0Ru,t,u;  0Rv,x,u;  SADD  y,u,mO;  ADDUw.v.l; 
ANDNt,v,w;  SRUy,t,y;  ADDU  y,w,y.  [This  exercise  was  inspired  by  Jorg  Arndt.] 

24.  It’s  expedient  to  “prime  the  pump”  by  initializing  the  array  to  the  state  that  it 
should  have  after  all  multiples  of  3,  5,  7,  and  11  have  been  sieved  out.  We  can  combine 
3 with  11  and  5 with  7,  as  suggested  by  E.  Wada: 


LOC  Data_Segment 

qbase  GREG  8 ;N  IS  3584  ;n  GREG  N ;one  GREG  1 
Q OCTA  #816dl29a64b4cb6e 
LOC  Q+N/16 
qtop  GREG  <3 

Init  OCTA  #9249249249249249  | #4008010020040080 
OCTA  #8421084210842108 1 #0408102040810204 
t IS  $255  ;x33  IS  $0  ;x35  IS  $1  ;j  IS  $4 
LOC  #100 


Q o (little-endian) 

End  of  the  Q table 

Multiples  of  3 or  11  in  [129 . . 255] 

Multiples  of  5 or  7 


Main  LDOU  x33,Init;  LD0U  x35,Init+8 

LDA  j, qbase, 8;  SUB  j,j,qtop  Prepare  to  set  Q\. 

1H  NOR  t,x33,x33;  ANDN  t,t,x35;  ST0U  t,qtop,j  Initialize  64  sieve  bits. 

SLU  t,x33,2;  SRU  x33,x33,31;  OR  x33,x33,t  Prepare  for  the  next  64  values. 
SLU  t ,x35 ,6 ; SRU  x35,x35,29;  OR  x35,x35,t 

ADD  j,j,8;  PBN  j , IB  Repeat  until  reaching  qtop.  | 

Then  we  cast  out  nonprimes  p2 , p2  + 2p,  . . . , for  p = 13,  17,  ... , until  p2  > N: 


p IS  $0  ;pp  IS  $1  ;m  IS  $2  ;mm  IS  $3  ;q  IS  $4 
LDOU  q, qbase, 0;  LDA  pp, qbase, 8 
SET  p,  13;  NEG  m,  13*13, n;  SRU  q,q,6 
SR  m,m,l 

SR  mm,m,3;  LDOU  s, qtop, mm;  AND  t,m,#3f; 
SLU  t,one,t;  ANDN  s,s,t;  ST0U  s, qtop, mm 
ADD  m,m,p ; PBN  m,2B 
SRU  q,q, 1 ; PBNZ  q,3F 
LDOU  q,pp,0;  INCL  pp,8 
OR  p,p,#7f;  PBNZ  q,3F 
ADD  p,p,2;  JMP  2B 
SRU  q,q, 1 

ADD  p,p,2 ; PBEV  q,2B 
MUL  m,p,p;  SUB  m,m,n;  PBN  m,lB 


1H 

2H 


2H 


2H 

3H 


;s  IS  $5 

Begin  with  p = 13. 

m <-  L (p2  ~ N)/Z\  ■ 

Zero  out  a bit. 

Advance  by  p bits. 

Move  to  next  potential  prime. 
Read  in  another  batch 
of  potential  primes. 

Skip  past  128  nonprimes. 

Set  p <—  p + 2 until  p is  prime. 
Repeat  until  p2  > N.  | 


The  running  time,  1172p  + 5166i>,  is  of  course  much  less  than  the  time  needed  for  steps 
P1-P8  of  Program  1.3.2'P,  namely  10037//  + 64 1 543/;  (improved  to  1 0096//  + 21535  lt> 
in  exercise  1.3.2'-  14).  [See  P.  Pritchard,  Science  of  Computer  Programming  9 (1987), 
17-35,  for  several  instructive  variations.  In  practice,  a program  like  this  one  tends 
to  slow  down  dramatically  when  the  sieve  is  too  big  for  the  computer’s  cache.  Better 
results  are  obtained  by  working  with  a segmented  sieve,  which  contains  bits  for  numbers 
between  N0  + kS  and  N0  + (k  + 1 )S,  as  suggested  by  L.  J.  Lander  and  T.  R.  Parkin, 
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Math.  Comp.  21  (1967),  483-488;  C.  Bays  and  R.  H.  Hudson,  BIT  17  (1977),  121  -127. 
Here  N0  can  be  quite  large,  but  <5  is  limited  by  the  cache  size;  calculations  are  done 
separately  for  k = 0,  1,  ....  Segmented  sieves  have  become  highly  developed;  see,  for 
example,  T.  R.  Nicely,  Math.  Comp.  68  (1999),  1311-1315,  and  the  references  cited 
there.  The  author  used  such  a program  in  2006  to  discover  an  unusually  large  gap  of 
length  1370  between  418032645936712127  and  the  next  larger  prime.] 

25.  (1  + 1 + 25  + 1 + 1 + 25  + 1 + 1 = 56)  mm;  the  worm  never  sees  pages  2-500  of 
Volume  1 or  1-499  of  Volume  4.  (Unless  the  books  have  been  placed  in  little-endian 
fashion  on  the  bookshelf;  then  the  answer  would  be  106  mm.)  This  classic  brain-teaser 
can  be  found  in  Sam  Loyd’s  Cyclopedia  (New  York:  1914),  pages  327  and  383. 

26.  We  could  multiply  by  #aa...ab  instead  of  dividing  by  12  (see  exercise  1.3.1-17); 
but  multiplication  is  slow  too.  Or  we  could  deal  with  a “flat”  sequence  of  12000000  x 5 
consecutive  bits  (=  7.5  megabytes),  ignoring  the  boundaries  between  words.  Another 
possibility  is  to  use  a scheme  that  is  neither  big-endian  nor  little-endian  but  transposed: 
Put  item  k into  octabyte  8(fcmod  220),  where  it  is  shifted  left  by  5 Lfc /220 J . Since 
k < 12000000,  the  amount  of  shift  is  always  less  than  60.  The  MMIX  code  to  put  item  k 
into  register  $1  is  AND  $0,k,  [#fffff]  ; SLU  $0,$0,3;  LD0U  $l,base,$0;  SRU  $0,k,20; 
4ADDU  $0,$0,$0;  SRU$1,$1,$0;  AND  $1  ,$1  ,#lf . 

[This  solution  uses  8 large  megabytes  (223  bytes).  Any  convenient  scheme  for  con- 
verting item  numbers  to  octabyte  addresses  and  shift  amounts  will  work,  as  long  as  the 
same  method  is  used  consistently.  Of  course,  just  ‘LDBU  $l,base,k'  would  be  faster.] 

27.  (a)  ((x— 1)  © x)  + x.  [This  exercise  is  based  on  an  idea  of  Luther  Woodrum,  who 
noticed  that  ((x— 1)  |x)  + 1 = (x  & — x)  + x.] 

(b)  (y  + x)  | y,  where  y = (x— 1)  © x. 

(c,  d,  e)  ((z©x)+x)  & z,  ((z©x)+x)  © z,  and  ((z©x)  + x)  & z,  where  z = x— 1. 

(f)  x©(a);  alternatively,  t©(t+l),  where  t = x|  (x— 1).  [The  number  (0oo01alli’)2 
looks  simpler,  but  it  apparently  requires  five  operations:  ((t  + 1)  & t)  — 1.] 

These  constructions  all  give  sensible  results  in  the  exceptional  cases  when  x = —2b . 

28.  A 1 bit  indicates  x’s  rightmost  0 (for  example,  (101011)2  i-t  (000100)2);  —1  i->  0. 

29.  pk  = Pk+ i © (/ifc+ i <S  2fc)  [see  STOC  6 (1974),  125].  This  relation  holds  also  for 
the  constants  pd.k  of  (48),  when  0 < k < d,  if  we  start  with  pd,d  = 22<i  — 1.  (There  is, 
however,  no  easy  way  to  go  from  pk  to  Pk+i,  unless  we  use  the  “zip”  operation;  see  (77).) 

30.  Append  'CSZ  rho,x,64’  to  (50),  thereby  adding  \v  to  its  execution  time;  or  replace 
the  last  two  lines  by  SRU  t,y,rho;  SLU  t,t,2;  SRU  t , [#300020104]  ,t ; AND 

ADD  rho,rho,t,  saving  lv.  For  (51),  we  simply  need  to  make  sure  that  rhotab[0]  = 8. 

31.  In  the  first  place,  his  code  loops  forever  when  x = 0.  But  even  after  that  bug  is 
patched,  his  assumption  that  x is  a random  integer  is  highly  questionable.  In  many 
applications  when  we  want  to  compute  px  for  a nonzero  64-bit  number  x,  a more 
reasonable  assumption  would  be  that  each  of  the  outcomes  {0,1,...,  63}  is  equally 
likely.  The  average  and  standard  deviation  then  become  31.5  and  ~ 18.5. 

32.  'NEGU  y,x;  AND  y,x,y;  MULU  y ,debruijn,y ; SRU  y,y,58;  LDB  rho, decode, y’  has 
estimated  cost  p + 14w,  although  multiplication  by  a power  of  2 might  well  be  faster 
than  a typical  multiplication.  Add  lv  for  the  correction  in  answer  30. 

33.  In  fact,  an  exhaustive  calculation  shows  that  exactly  94727  suitable  constants  a 
yield  a “perfect  hash  function”  for  this  problem,  90970  of  which  also  identify  the  power- 
of-two  cases  y = 22;  90918  of  those  also  distinguish  the  case  y = 0.  The  multiplier 
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#208b2430c8c82129  is  uniquely  best,  in  the  sense  that  it  doesn’t  need  to  refer  to  table 
entries  above  decode  [32400]  when  y is  known  to  be  a valid  input. 

34.  Identity  (a)  fails  when  x = 5,  y = 6;  but  (b)  is  true,  also  when  xy  = 0.  Proof  of  (c): 
If  x / V and  px  = py  = k we  have  x = alOk  and  y = /310*;  hence  x®y  = (a©/3)00fe  = 
(a:  — 1)®  (y- 1).  If  px  > py  = k we  have  (a:©?/)  mod  2fc+2  yf  ((a:-l)®(j/-l))  mod2fc+2. 

35.  Let  f{x)  = x ® 3a;.  Clearly  /( 2x)  = 2 f(x),  and  /(4a;  + 1)  = 4f{x)  + 2.  We  also 
have  /(4a;  — 1)  = 4f(x)  + 2,  by  exercise  34(c).  The  hinted  identity  follows. 

Given  n,  set  u <—  n /§>  1,  u «—  u + n,  t f-  u © u,  n+  v & t,  and  n~  <r-  u & t. 
Clearly  u = [n/2j  and  v = [3n/ 2J,  so  n+  — n = v — u = n.  And  this  is  Reitwiesner’s 
representation,  because  n+  \ n~  has  no  consecutive  Is.  [H.  Prodinger,  Integers  0 (2000), 
A08:l-A08:14.  Incidentally  we  also  have  f(—x)  = f(x).] 

36.  (i)  The  commands  x <-  a:®(a;<l),  x <-  a:®(x«2),  x <-  a;®(a;<4),  x x©(a;«8), 
i(-i®(a;«  16),  x <—  x © (x  -C  32)  change  x to  a;®,  (ii)  xk  = x & ~(a;  + 1). 

(See  exercises  66  and  70  for  applications  of  a;®;  see  also  exercises  128  and  209.) 

37.  Insert  ‘CSZ  y,x,half’  after  the  FL0TU  in  (55),  where  half  = #3f eOOOOOOOOOOOOO ; 
note  that  (55)  says  ‘SR’  (not  ‘SRU’).  No  change  is  needed  to  (56),  if  lamtab[0\  = -1. 

38.  ‘ SRU  t , x , 1 ; 0Ry,x,t;  SRUt,y,2;  0Ry,y,t;  SRUt,y,4;  ORy.y.t;  ...; 
SRU  t , y , 32 ; ORy.y.t;  SRUt.y.l;  SUBU  y,y,t’  takes  14v. 

39.  (Solution  by  H.  S.  Warren,  Jr.)  Let  cr(x)  denote  the  result  of  smearing  x to  the 
right,  as  in  the  first  line  of  (57).  Compute  x & cr((x  » 1)  & x). 

40.  Suppose  \x  = \y  = k.  If  x = y = 0,  (58)  certainly  holds,  regardless  of  how  we 
define  A0.  Otherwise  x = (la/2  and  y = (1/3)2,  for  some  binary  strings  a and  /3  with 
M = l/^l  = and  x (B  y < 2k  < x &i  y.  On  the  other  hand  if  \x  < \ y = k,  we  have 
x®y  >2k  > x&cy.  And  H.  S.  Warren,  Jr.,  notes  that  Aa:  < \y  if  and  only  if  x < y&ix. 

41-  (a)  En=i(p™)zn  = XXi  ^7(1  - *k)  = z/(  1 -z)~  EZo  z27(1  + 227-  The 

Dirichlet  generating  function  is  simpler:  J2^=i{Pn)/nz  = £(z)/(2z  - 1). 

(b)  1 (An)*"  = £r=x  ^27(1  - *)•  " 

(c)  S^Li(I/n)zn  = Xifelo  227((1  ~ 2)(1  + z2k))  = J2kLo  z2k Pk{z),  where  pk(z)  = 

(I  + 2 3 b z2k~1)/(  1 - z2,,  + 1).  (The  “magic  masks”  of  (47)  correspond  to  fik( 2).) 

[See  Automatic  Sequences  by  J.-P.  Allouche  and  J.  Shallit  (2003),  Chapter  3,  for 
further  information  about  the  functions  p and  u,  which  they  denote  by  1/2  and  .s'2 .] 

42.  ei2ei  1 + (e2  + 2)2<’2  1 + ■••  + (er  + 2r  — 2)2er~1,  by  induction  on  r.  [D.  E.  Knuth, 
Proc.  IFIP  Congress  (1971),  1,  19-27.  The  fractal  aspects  of  this  sum  are  illustrated 
in  Figs.  3.1  and  3.2  of  the  book  by  Allouche  and  Shallit.]  Consider  also  S'n  ( 1 ) where 

71—1 

Sn{z)  = = (IT*)61  +Z(l+Z)e2  +.--+*r-1(l+z)er- 

fc-0 

43.  The  straightforward  implementation  of  (63),  ‘SET  nu,0;  SET  y.x;  BZ  y.Done; 
1H  ADD  nu.nu.l;  SUBUt.y.l;  ANDy.y.t;  PBNZ  y,lB’  costs  (5  + 4vx)v;  it  beats  the 
implementation  of  (62)  when  vx  < 4,  ties  when  ux  = 4,  and  loses  when  vx  > 4. 

But  we  can  save  4v  from  the  implementation  of  (62)  if  we  replace  the  final 
multiplication-and-shift  by  ‘y  <-  y + (y  » 8),  y 4-  y + (y  > 16),  y +-  y + (y  » 32), 
v t—  y & #ff ’.  [Of  course,  MMIX’s  single  instruction  ‘SADD  nu,x,0’  is  much  better.] 
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44.  Let  this  sum  be  i/2>  x.  If  we  can  solve  the  problem  for  2d-bit.  numbers,  we  can 
solve  it  for  2d+1-bit  numbers,  because  j/2)  (22dx-\-x')  = v^x  + v^x'  + 2dvx.  Therefore 
a solution  analogous  to  (62)  suggests  itself,  on  a 64-bit  machine: 

Set  z <r-  (x  3>  1)  & /ro  and  y <—  x — z. 

Set  z «-  ((2  + (z  » 2))  & Hi)  + (( y & fix)  » 1)  and  y <-  (y  & fii)  + ((v  > 2)  & Hi)- 

Set  z +-  ((z  + (z  » 4))  & /i2)  + ((?/  & ji2)  » 2)  and  y «-  (j/  + (y  » 4))  & /x2. 

Finally  f(2)  +-  (((Az)  mod  264)  » 56)  + (({(By)  mod  264)  » 56)  « 3), 
where  A = (1111 1111)256  and  B = (0  1 23  4 5 67)256. 

But  on  MMIX,  which  has  sideways  addition  built  in,  there’s  a better  solution  by  J.  Dallos: 

SADD  nu2,x,m5  SADD  t,x,m3  2ADDU  nu2,nu2,t  SADD  t,x,m0 

SADD  t,x,m4  2ADDU  nu2  ,nu2  ,t  SADD  t,x,ml  2ADDU  nu2 , nu2 , t | 

2ADDU  nu2 ,nu2 ,t  SADD  t,x,m2  2ADDU  nu2 , nu2 , t 

[In  general,  u^x  = J2k  2ku(x&c  p,k).  See  Dr.  Dobb’s  Journal  8,4  (April  1983),  24-37.] 

45.  Let  d = (x  — y)  & (y  — x);  test  if  y / 0.  [Rokicki  found  that  this  idea,  which  is 
called  colex  ordering,  can  be  used  with  node  addresses  to  near-randomize  binary  search 
trees  or  Cartesian  trees  as  if  they  were  treaps,  without  needing  an  additional  random 
“priority  key”  in  each  node.  See  U.S.  Patent  6347318  (12  February  2002).] 

46.  SADDt,x,m;  NXORy.x.m;  CS0Dx,t,y;  the  mask  m is  * (l«i I l«j ) . (In  general, 

these  instructions  complement  the  bits  specified  by  m if  those  bits  have  odd  parity.) 

47.  y <—  (x  3>  S)  & 9,  z <—  (x  k,  9)  -C  5,  x <—  (x  & m)  \ y \ z,  where  rn  = 9 \ (6  5). 

48.  Given  <5,  there  are  ss  = E[j=o  ^\.(n+j)/s\  + \ different  (5-swaps,  including  the  identity 
permutation.  (See  exercise  4.5.3-32.)  Summing  over  5 gives  l+JZ^-T/fsa  — 1)  altogether. 

49.  (a)  The  set  S = {ai(5i-| ham(5m  | (ai  } C { — 1,0, +1}}  for  displacements 

Si,  . . . , Sm  must  contain  {n—  1,  n — 3,  ...,1  — n},  because  the  fcth  bit  must  be  exchanged 
with  the  (n  + 1 — fc)th  bit  for  1 < k < n.  Hence  |S|  > n.  And  S contains  at  most  3m 
numbers,  at  most  2 • 3m  - 1 of  which  are  odd. 

(b)  Clearly  s(mn)  < s(m)  + s(n),  because  we  can  reverse  m fields  of  n bits  each. 
Thus  s(3m)  < m and  s(2  • 3m)  < m + 1.  Furthermore  the  reversal  of  3m  bits  uses 
only  (5-swaps  with  even  values  of  (5;  the  corresponding  ((5/2)-swaps  prove  that  we  have 
s(( 3m  ± l)/2)  < m.  These  upper  bounds  match  the  lower  bounds  of  (a)  when  m > 1. 

(c)  The  string  aaf39il>zaj  with  |a|  = \fi\  = |0|  = |i/’|  = \ui\  = n can  be  changed  to 
LuzipOflaa  with  a (3n+  l)-swap  followed  by  an  (n  + l)-swap.  Then  s(n)  further  swaps 
reverse  all.  Hence  s(32)  < s(6)  + 2 = 4,  and  s(64)  < 5.  Again,  equality  holds  by  (a). 

Incidentally,  s(63)  = 4 because  s( 7)  = s(9)  = 2.  The  lower  bound  in  (a)  turns  out 
to  be  the  exact  value  of  s(n)  for  1 < n < 22,  except  that  s(16)  = 4. 

50.  Express  n = ( tm  • • • fito)3  in  balanced  ternary  notation.  Let  nj  = (tm  . . . tj) 3 and 
Sj  = 2nj  + tj- j,  so  that  rij-i  — Sj  = nj  and  25j  — nj- 1 = nj  + tj- 1 for  1 < j < rn. 
Let  E0  = {0}  and  Ej+ 1 = Ej  U {tj  — x \ x £ Ej}  for  0 < j < m.  (Thus,  for  example, 
Ex  = {0,  to}  and  E2  = {0,to,fi,fi  — to } •)  Notice  that  e 6 Ej  implies  |e|  < j. 

Assume  by  induction  on  j that  (5-swaps  for  S = (5i , . . . , Sj  have  changed  the  n-bit 
word  oti  . . . a3j  to  a3j  . . . oq , where  each  subword  ak  has  length  nj -\~Ek  for  some  sk  € Ej. 
If  nJ+i  > j,  a (5J+i -swap  within  each  subword  will  preserve  this  assumption.  Otherwise 
each  subword  ak  has  |a*,|  < nj  + j < 3nj+i  + 1 + j < 4j  + 1 < 4m.  Therefore  2fc-swaps 
for  [lg4mj  > k > 0 will  reverse  them  all.  (Note  that  a 2fc-swap  on  a subword  of  size  t, 
where  2k  < t < 2fc+1,  reduces  it  to  three  subwords  of  sizes  t — 2k,  2fc+1  — t,  t — 2k .) 
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51.  (a)  If  c - (cd_!  . . . c0)2,  we  must  have  8d-i  = cd_ x/rd,d_x.  But  for  0 < fc  < d - 1 
we  can  take  = ck/id:k  © dfc,  where  9k  is  any  mask  C fidtk. 

(b)  Let  0(d,  c)  be  the  set  of  all  such  mask  sequences.  Clearly  0(1,  c)  = {c}.  When 
d > 1 we  will  have,  recursively, 


0(d,  c)  = {(d0, . . . , 0d_2,  Qd_, , dd_2, . . . , do)  | 9k  = dLi  t ^'-r,  dfc  = d{_x  t d{'_x}, 


by  “zipping  together’’  two  sequences  (0'a, . . . , ^_3,  9'd_2,6'd_3,  ...,6'0)e  0(d-  1,  c')  and 
(d0  , . . . , dd_ 3, dd_2, d(j'_3, . . . , do)  £ 0(d  — 1, c")  for  some  appropriate  d0,  do,  c',  and  c". 

( When  c is  odd,  the  bigraph  corresponding  to  (75)  has  only  one  cycle;  so  (d0,d0, 
c ,c  ) is  either  (/rdio,0,  (c/2],  |_c/2J)  or  ( 0,fJ-d,o , (c/2j,  (c/2]).  But  when  c is  even,  the 
bigraph  has  2d  1 double  bonds;  so  d0  = d0  is  any  mask  C pd0,  and  c'  = c"  = c/2. 
[Incidentally,  lg|0(d,c)|  = 2d-x(d-  1)  - Z{(2d“fc  - l)(2fc-x-  |2fc-x  - c mod  2*  |).] 

In  both  cases  we  can  therefore  let  dd_2  = • ■ • = d0  = 0 and  omit  the  second  half 
of  (71)  entirely.  Of  course  in  case  (b)  we  would  do  the  cyclic  shift  directly,  instead  of 
using  (71)  at  all.  But  exercise  58  proves  that  many  other  useful  permutations,  such  as 
selective  reversal  followed  by  cyclic  shift,  can  also  be  handled  by  (71)  with  8k  = 0 for 
all  fc.  The  inverses  of  those  permutations  can  be  handled  with  9k  = 0 for  0 < A:  < d-  1. 

52.  The  following  solutions  make  9j  = 0 whenever  possible.  We  shall  express  the 
9 masks  in  terms  of  the  p’s,  for  example  by  writing  p6, 5 & p 0 instead  of  stating  the 
requested  hexadecimal  form  ^55555555:  the  p form  is  shorter  and  more  instructive. 

(a)  9k  = p6iA:  & p5  and  8k  = p6>fc  & (pfc+1  © for  0 < k < 5;  d5  = d4.  (Here 

p-i  = 0.  To  get  the  “other”  perfect  shuffle,  (x3ix63  • ■■  ^1*33^02:32)2,  let  d0  = p6,o kfh.) 

(b)  d0  = d3  = d0  = p6,o  & p3;  di  = d4  = di  = p6,i  & p4;  d2  = d5  = d2  = p6,2  & P5; 
d3  = d4  = 0.  [See  J.  Lenfant,  IEEE  Trans.  C-27  (1978),  637-647,  for  a general  theory.] 

(c)  do  — P6,0  & p4;  di  = P6,i  & P5;  d2  ==  d4  = P6,2  & p4;  d3  = ds  = P6,3  & ps; 
do  = M 6,0  & P2;  di  = p6il  & p3;  d2  = do  © d2;  d3  = di  © d3;  d4  = 0. 

(d)  d*,  = p6,fc  & p5— fc  for  0 < k < 5;  dfc  = 9k  for  0 < fc  < 2;  d3  = d4  = 0. 


53.  We  can  write  ip  as  a product  of  d-t  transpositions,  (uiuj) . . . (ud-tvd-t)  (see 
exercise  5. 2. 2-2).  The  permutation  induced  by  a single  transposition  ( uv ) on  the  index 
digits,  when  u < v,  corresponds  to  a (2”  - 2u)-swap  with  mask  pd,„  & p„.  We  should 
do  such  a swap  for  (uxui)  first,  . . . , ( ud-tvd-t ) last. 

In  particular,  the  perfect  shuffle  in  a 2d-bit  register  corresponds  to  the  case  where 
^ = (01 ...  (d  — 1))  is  a one-cycle;  so  it  can  be  achieved  by  doing  such  ( 2V  — 2“)-swaps 
for  (u,v)  = (0, 1),  . . . , (0,  d — 1).  For  example,  when  d = 3 the  two-step  procedure  is 
12345678  13245768  e->  15263748.  [Guy  Steele  suggests  an  alternative  (d  — l)-step 

procedure:  We  can  do  a 2fc-swap  with  mask  pd,fe+1  &pfc  for  d-  1 > k > 0.  When  d = 3 
his  method  takes  12345678  i->-  12563478  15263748.] 

The  matrix  transposition  in  exercise  52(b)  corresponds  to  d = 6 and  ( u , v)  = (0, 3), 
(f,4),  (2,5).  These  operations  are  the  7-swap,  14-swap,  and  28-swap  steps  for  8x8 
matrix  transposition  illustrated  in  the  text;  they  can  be  done  in  any  order. 

For  exercise  52(c),  use  d = 6 and  (u,u)  = (0,2),  (1,3),  (0,4),  (1,5).  Exercise  52(d) 
is  as  easy  as  52(b),  with  (u,v)  = (0,5),  (1,4),  (2,3). 


54.  Transposition  amounts  to  reversing  the  bits  of  the  minor  diagonals.  Successive 
elements  of  those  diagonals  are  m — 1 apart  in  the  register.  Simultaneous  reversal  of 
all  diagonals  corresponds  to  simultaneous  reversal  of  subwords  of  sizes  1,  . . . , m,  which 
can  be  done  with  2 -swaps  for  0 < k < [lgm]  (because  such  transposition  is  easy 
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when  m is  a power  of  2,  as  illustrated  in  the  text).  Here’s  the  procedure  for  m = 7: 


Given 

00  01  02  03  04  05  06 
10  11  12  13  14  15  16 
20  21  22  23  24  25  26 
30  31  32  33  34  35  36 
40  41  42  43  44  45  46 
50  51  52  53  54  55  56 
60  61  62  63  64  65  66 


6-swap 

00  10  02  12  04  14  06 

01  11  03  13  05  15  25 

20  30  22  32  24  16  26 

21  31  23  33  43  35  45 

40  50  42  34  44  36  46 

41  51  61  53  63  55  65 
60  52  62  54  64  56  66 


12-swap 

00  10  20  30  04  14  24 

01  11  21  31  05  15  25 

02  12  22  32  06  16  26 

03  13  23  33  43  53  63 

40  50  60  34  44  54  64 

41  51  61  35  45  55  65 

42  52  62  36  46  56  66 


24 

-swap 

00 

10 

20 

30 

40 

50 

60 

01 

11 

21 

31 

41 

51 

61 

02 

12 

22 

32 

42 

52 

62 

03 

13 

23 

33 

43 

53 

63 

04 

14 

24 

34 

44 

54 

64 

05 

15 

25 

35 

45 

55 

65 

06 

16 

26 

36 

46 

56 

66 

55.  Given  x and  y,  first  set  x 4-  x \ (x<S^2k)  and  y 4-  y \ (y<^2k)  for  2d  < k < 3d.  Then 
set  x 4—  (22d+fc  — 2fc)-swap  of  x with  mask  P2d+k&fik  and  y 4—  (22d+fc  — 2d+fc)-swap  of  y 
with  mask  p2d+k^jld+k  for  0 < k < d.  Finally  set  2 4-  xSzy,  then  either  z <-  z\  (z> 2k) 
or  z 4—  z © (z  2k ) for  2d  < k < 3d,  and  z 4—  z & (2n2  — 1).  [The  idea  is  to  form  two 
n x n x n arrays  x = (xooo  • • • *(n-i)(n-i)(n-i))2  and  y = (yooo  ■ ■ • 3/(n-i)(n-i)(n-i))2 
with  Xijk  = a,jk  and  yijk  = bjk,  then  transpose  coordinates  so  that  Xijk  = djt  and 
Vijk  = bik',  now  x&y  does  all  n3  bitwise  multiplications  at  once.  This  method  is  due  to 
V.  R.  Pratt  and  L.  J.  Stockmeyer,  J.  Computer  and  System  Sci.  12  (1976),  210-213.] 


56.  Use  (71)  with  60  = 60  = 0 , 6i  = #0010201122113231 , 02  = #00080e0400080c06 . 
63  = #00000092008100a2,  84  = #0000000000000f 16,  05  = #0000000003199c26 , 64  = 
#00000c9f 0000901a,  §3  = #003a00b50015002b , d2  = #000103080c0d0f0c , and  84  = 
#0020032033233333 . 


57.  The  two  choices  for  each  cycle  when  d > 1 have  complementary  settings.  So  we 
can  choose  a setting  in  which  at  least  half  of  the  crossbars  are  inactive,  except  in  the 
middle  column.  (See  exercise  5.3.4-55  for  more  about  permutation  networks.) 

58.  (a)  Every  different  setting  of  the  crossbars  gives  a different  permutation,  because 
there  is  exactly  one  path  from  input  line  i to  output  line  j for  all  0 < i,j  < N.  (A  net- 
work with  that  property  is  called  a “banyan.”)  The  unique  such  path  carries  input  i 
on  line  k)  = ((i  k)  <S  k)  + ( j mod  2k)  after  k swapping  steps  have  been  made. 

(b)  We  have  l(iip,  i,  k)  = l(jp,j,  k)  if  and  only  if  i mod  2k  = j mod  2k  and  iipi$>k  = 

jtp  ~S>  k\  so  (*)  is  necessary.  And  it  is  also  sufficient,  because  a mapping  that  sat- 
isfies (*)  can  always  be  routed  in  such  a way  that  jip  appears  on  line  l = l(jip,j,k) 
after  k steps:  If  k > 1,  jip  will  appear  on  line  k — 1),  which  is  one  of  the  inputs 

to  l.  Condition  (*)  says  that  we  can  route  it  to  l without  conflict,  even  if  l is  l(itp,i,k). 

[In  IEEE  Transactions  C-24  (1975),  1145-1155,  Duncan  Lawrie  proved  that  condi- 
tion (*)  is  necessary  and  sufficient  for  an  arbitrary  mapping  of  the  set  {0, 1, . . . , N — 1} 
into  itself,  when  the  crossbar  modules  are  allowed  to  be  general  2x2  mapping  modules 
as  in  exercise  75.  Furthermore  the  mapping  ip  might  be  only  partially  specified,  with 
jip  = * (“wild  card”  or  “don’t-care”)  for  some  values  of  j.  The  proof  that  appears  in 
the  previous  paragraph  actually  demonstrates  Lawrie’s  more  general  theorem.] 

(c)  i mod  2k  = j mod  2k  if  and  only  if  k < p(i  ® j);  i » k — j k if  and  only  if 
k > A (i  © j);  and  iip  = jip  if  and  only  if  i = j,  when  ip  is  a permutation. 

(d)  A (iip  © jip)  > p(i  © j)  for  all  i / j if  and  only  if  A (trip  © jrip ) > p(ir  © jr)  = 
p(i  © j)  for  all  i ^ j,  because  r is  a permutation.  [Note  that  the  notation  can  be 
confusing:  Bit  jnp  appears  in  bit  position  j if  permutation  p is  applied  first,  then  r. 
The  Sylow  group  T includes  many  interesting  and  important  permutations,  including 
bit  reversal  and  cyclic  shifts.  It  corresponds  to  settings  of  the  Omega  network  where 
crossbars  of  length  22  that  are  congruent  mod  2]  + 1 all  switch  or  all  pass,  as  a unit.] 


7.1.3 


ANSWERS  TO  EXERCISES  593 


(e)  Since  l(j,j,k ) = j for  0 < k < d,  a permutation  of  Q fixes  j if  and  only  if 
each  of  its  swaps  fixes  j.  Thus  the  swaps  performed  by  (p  and  by  r/>  operate  on  disjoint 
elements.  The  union  of  these  swaps  gives  ip-if). 

(f)  Any  setting  of  the  crossbars  corresponds  to  a permutation  that  makes  Batcher’s 
comparator  modules  do  the  equivalent  switching. 

59.  It  is  2Md(a'b\  where  Md(a,  b)  is  the  number  of  crossbars  that  have  both  endpoints 
in  [a  . . b].  To  count  them,  let  k = A (a®fe),  o'  = a mod  2k , and  b’  = b mod  2k\  notice  that 
b-  a,  = 2k  -\-b'  — a , and  Md(a,  b)  = Mk+\{a  ,2k  +b').  Counting  the  crossbars  in  the  top 
half  and  bottom  half,  plus  those  that  jump  between  halves,  gives  Mk+i(a' ,2k  + b')  = 
Mk(a' , 2k  - 1)  + Mk(0,b')  + (( b ' + 1)  — a').  Finally,  we  have  Mk(0,b')  = S(b'  + 1);  and 
Mk(a  , 2k  — 1)  = Mk{ 0, 2k  — 1 — o')  = S(2k  — o')  = k.2k~ 1 — ka'  + S(a'),  where  S(n)  is 
evaluated  in  exercise  42. 

60.  A cycle  of  length  21  corresponds  to  a pattern  u0  ■<—  uo  <4  ► iti  <4  u2  4-  u2  -(4 

■ ■ ■ -n-  V21-1  — > «2/-i  <4  U2 1,  where  u2i  = wo  and  lu  <—  v’  or  ‘v  -4  o’  means  that  the 
permutation  sends  u to  v,  lx  «4  y ’ means  that  x = y © 1. 

We  can  generate  a random  permutation  as  follows:  Given  u0,  there  are  2n  choices 
for  vo,  then  2n  — 1 choices  for  ui  only  one  of  which  causes  u2  = u0,  then  2n  — 2 choices 
for  V2,  then  2n  — 3 choices  for  U3  only  one  of  which  closes  a cycle,  etc. 

Consequently  the  generating  function  is  G(z)  = YIJ=1  \nn- Vj+\  • The  expected 
number  of  cycles,  k,  is  G'(l)  = H2n  - \Hn  = \ In n + In  2 + §7  + 0(n_1).  The  mean 
of  2k  is 

G(2)  = (2nn!)2/(2n)!  = VW+  0(n~1/2y, 
and  the  variance  is  G(4)  - G(2)2  = (n  + 1 - G(2))G(2)  = y/nn3/2  + 0(n). 

62.  The  crossbar  settings  in  P(2d)  can  be  stored  in  (2d-l)2d_1  = Nd—^N  bits.  To  get 
the  inverse  permutation  proceed  from  right  to  left.  [See  P.  Heckel  and  R.  Schroeppel, 
Electronic  Design  28,8  (12  April  1980),  148-152.  Note  that  any  way  to  represent  an 
arbitrary  permutation  requires  at  least  lg  N\  > Nd  - N/ In  2 bits  of  memory;  so  this 
representation  is  nearly  optimum,  spacewise.] 

63.  (i)  x = y.  (ii)  Either  2 is  even  or  x ® y < 2max(°’C-i)/2).  (When  2 is  odd  we  have 
(a : Jy)  »2  = (y  » [2/2])  t (x  » [z/2\),  even  when  2 < 0.)  (iii)  This  identity  holds  for 
all  w,  x,  y,  and  2 (and  also  with  any  other  bitwise  Boolean  operator  in  place  of  &). 

64.  (((2  & fi0)  + (2'  | /Zo))  & Mo)  | (((2  & mo)  + (2'  | mo))  & Mo).  (See  (86).) 

65.  xu(x2)  + v(x2)  = xu(x)2  + v(x)2 . 

66.  (a)  v(x)  = (u(a:)/(l  + x<i))  modi";  it’s  the  unique  polynomial  of  degree  less  than  n 
such  that  (l  + a;i)u(x)  = u(x)  (modulo  xn).  (Equivalently,  v is  the  unique  n-bit  integer 
such  that  (v  © (v  <C  5))  mod  2n  = u.) 

(b)  We  may  as  well  assume  that  n = 64m,  and  that  u = (um_i  ...uiu0)264, 
v = (vm-i  ■ ■ ■ v1v0)264.  Set  c <—  0;  then,  using  exercise  36,  set  <-  u?  ® (— c)  and 
c «—  Vj  » 63  for  j = 0,  1,  . . . , m — 1. 

(c)  Set  c <—  vo  <—  uq;  then  Vj  e -Uj®c  and  c <—  Vj,  for  j = 1,  2,  . . . , m — 1. 

(d)  Start  with  c •<—  0 and  do  the  following  for  j = 0,  1,  m — 1:  Set  t «—  Uj, 

t t ® (t  « 3),  t <r-  t ® (t  « 6),  t <-  t ® {t  « 12),  t <r-  t © (t  « 24),  48), 

Vj  t-  t © c,  c <-  (t  » 61)  X # 9249249249249249 . 

(e)  Start  with  v <-  u.  Then,  for  j = 1,  2,  . . . , m - 1,  set  v,  <r-  v,  ® (vj- x « 3)  and 
(if  j <m  - 1)  vj+ 1 t-  vj+ 1 ® (vj-!  » 61). 
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67.  Let  n = 21  — 1 and  m = n — 2d.  If  |n  < k < n we  have  x2k  = xm+t  + x*  (modulo 
a;n+2:m  + l),  where  t = 2k  — n is  odd.  Consequently,  if  v = (vn-i  . . . viVo)2,  the  number 

w = u © (((m  » d)  ® (u  » 2d)  © (m  » 3d)  ® • • • ) & - 2l~d ) 

turns  out  to  equal  ( vn-2  ■ ■ ■ V3Viv„-i  . . . 0200)2-  For  example,  when  l = 4 and  d = 2, 

the  square  of  u6x 6 H b ui*  + uo  modulo  (x7  + x3  + 1)  is  u6*5  + Ms*3  + (ue  © u4)x1  + 

(us  © U3)x&  + (u6  © 114  © M2)*4  + uix2  + Mo.  To  compute  v,  we  therefore  do  a perfect 
shuffle,  v = \_w/2l\  { (u)mod2,).  The  number  w can  be  calculated  by  methods  like 
those  of  the  previous  exercise.  [See  R.  P.  Brent,  S.  Larvala,  and  P.  Zimmermann, 
Math.  Comp.  72  (2003),  1443-1452;  74  (2005),  1001-1002.] 

68.  SRU  t ,x .delta;  PUT  rM, theta;  MUXx,t,x. 

69.  Notice  that  the  procedure  might  fail  if  we  attempt  to  do  the  2d-1-shift,  first  instead 
of  last.  The  key  to  proving  that  a small-shift-first  strategy  works  correctly  is  to  watch 
the  spaces  between  selected  bits;  we  will  prove  that  the  lengths  of  these  spaces  are 
multiples  of  2k+1  after  the  2fc-shift. 

Consider  the  infinite  string  \k  = • ■ • lt402*  l^O2*"  1<2021 1* 1 02,°  lto , which  represents 
the  situation  where  ti  > 0 items  need  to  move  2kl  places  to  the  right.  A 2fc-shift  with 
any  mask  of  the  form  6k  = ...0'4*2,c+  l‘30t2 ’ l*1 0io  leaves  us  with  the  situation 
represented  by  the  string  Xk+ 1 = . . . lT202t  f 1 lTl  O2^1 1T° , where  exactly  Ti  = t2i  + 
t2(+i  items  need  to  move  right  2 k+1l  places.  So  the  claim  holds  by  induction  on  k. 

70.  Let  ipk  = 6k  © {dk  <SC  1),  so  that  6k  = ip®  'n  the  notation  of  exercise  36.  If  we  take 
*2fc+1  = 02fc l2fe  in  the  previous  answer,  we  have  ipo  = \ and  ipk+ 1 = {ipk  & 8k)  2>  2fe. 
Therefore  we  can  proceed  as  follows: 

Set  ip  <—  x,  k t—  0,  and  repeat  the  following  steps  while  ip  / 0:  Set  x 4-  ip,  then 
x <—  x © (x  2l)  for  0 < l < d,  then  9k  <—*,?/'■<—  (ip  & x)  S>  2k , and  k <—  k + 1. 

The  computation  ends  with  k = Ai/x+1;  the  remaining  masks  6k,  ■ ■ ■ , #d-i,  if  any, 
are  zero  and  those  steps  can  be  omitted  from  (80).  “Minimal”  masks,  for  which  *2fe+1  = 
02's+1  ;n  answer  69,  are  obtained  if  the  operations  ‘ 6k  «—  *,  ip  «—  (ip  &:  x)  3>  2k'  are 
replaced  by  ‘1/)  <—  (ip  & x)  2k , 6k  t—  * & (*  + ip)'  in  the  loop  above. 

[See  compress  in  H.  S.  Warren,  Jr.,  Hacker’s  Delight  (Addison  Wesley,  2002),  §7  4; 
also  G.  L.  Steele  Jr.,  U.S.  Patent  6715066  (30  March  2004).  The  BESM-6  computer,  de- 
signed in  1965,  implemented  compress  under  the  name  «c6opKa»  (“gather”  or  “pack”). 
Its  «pa36opKa»  command  (“scatter”  or  “unpack”)  went  the  other  way.] 

71.  Start  with  x <—  y.  Do  a ( — 2fc)-shift  of  x with  mask  6k,  for  k = d—  1,  . . . , 1,0,  using 
the  masks  of  exercise  70.  Finally  set  z «—  x (or  z <—  *&x>  if  you  want  a “clean”  result). 

72.  Assume  that  the  leftmost  mask  bit,  xiv-i,  is  zero,  since  it  is  immaterial.  Then  the 
result  (z( ■ ■ ■ Zi^Zov)2  of  any  gather-flip  corresponds  to  a permutation  with  Oip  < 
••■  < ktp  > •■•  > (N  — l)<p,  where  k = vx-  For  example,  if  TV  = 8 and  x = (00101100)2, 
the  result  is  (zo£i 24^6 Z725 Z3 z2)2-  So  </?  is  a cyclic  shift  of  a bitonic.  permutation,  and 
tp  € fl  by  exercises  58(d)  and  58(f). 

Moreover,  the  masks  90,  6 1,  . . . , 6d-\  for  the  1-swap,  2-swap,  . . . , 2d-1-swap  can 
be  computed  as  follows:  The  permutation  ip  = ip~  satisfies  jip  = (N-l-j)xj  + Sj , 
where  Sj  =Xt-i  + '''  + Xi+Xo  counts  the  Is  following  mask  bit  Xj-  Let  ipo  = ip 
and  6k  = ([ipk/2k \ mod  2)  & gk,  where  ipk+i  is  the  2fc-swap  of  ipk  with  mask  6k ■ (In 
our  example,  s7  . . . siso  = 33221000  and  (OX7)  ■ ■ • (6xi)(7xo)  = 01030067;  hence  ip0  = 
(7 ip) . . . (hp)(0ip)  = 33221000  + 01030067  = 34251067.  Then  60  = (10011001)2  & go  = 
(00010001)2;  ipi  = 34521076;  6>i  = (10010010)2  & pi  = (00010011)2;  ip2  = 32547610; 
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62  = (00111100)2  &R 2 = (00001100)2.  In  general  jrpk  = j (modulo  2k).)  Represent 
each  permutation  ipk  as  a set  of  d bit  vectors,  namely  as  the  “bit  slices”  ipk  mod  2, 
LV>fc/2j  mod  2,  etc.  Then  0(d2)  bitwise  operations  suffice  for  this  computation. 

The  scatter-flip  operation,  which  undoes  the  effect  of  gather-flip,  is  obtained  via  the 
same  crossbar  network  but  from  right  to  left  (first  a 2d  1-swap,  ending  with  a 1-swap). 

[See  Journal  of  Signal  Processing  Systems  53  (2008),  145-169.] 

73.  (a)  Equivalently,  d sheep-and-goats  operations  must  be  able  to  transform  the  word 

x = (x(2d  — 1)7 r • • • 2?i7ra:o7r)2  into  (x2d-i  ■ ■ ■ £12:0)2,  for  any  permutation  n of  {0, 1, ... , 
2d  1 } - And  this  can  be  done  by  radix-2  sorting  (Algorithm  5.2.5R):  First  bring  the 
odd  numbered  bits  to  the  left,  then  bring  the  bits  j for  odd  [j  / 2J  left,  and  so  on. 
For  example,  when  d = 3 and  xn  = the  three  operations  yield 

successively  (2:32:12:72:52:02:22:62:4)2,  (2:32:72:22:62:12:52:02:4)2,  [gee 

Z.  Shi  and  R.  Lee,  Proc.  IEEE  Conf.  ASAP’00  (IEEE  CS  Press,  2000),  138-148.] 

(b)  With  gather-flip,  the  same  strategy  always  yields  . . .xg(i)Xg(0))2, 

where  g(k)  is  Gray  binary  code,  7.2. 1.1  (9).  For  instance,  the  example  of  (a)  is  now 
(ai5a:7a;ia:3a;oai2a:6a:4)2,  {xex2X3X7X5XiXoX4,)2,  (2:42:s2:72:62:22:32;i2:o)2. 

74.  If  \Y,c2i  - J2c2i+i  \ = 2A  > 0,  we  must  rob  A from  the  rich  half  and  give  it  to 
the  poor.  There  s a position  l in  the  poor  half  with  c;  = 0;  otherwise  that  half  would 
sum  to  at  least  2d_1.  A cyclic  1-shift  that  modifies  positions  l through  (l  + t)  mod  2d 
makes  cl+k  = C|+fc+i  for  0 < k < t,  c't+t  = ci+t+i  — <5,  c\+t+1  = 5,  and  c't+k  = cl+k 
for  all  other  k\  here  <5  can  be  any  desired  value  in  the  range  0 < <5  < ci+t+i.  (We’ve 
treated  all  subscripts  modulo  2d  in  these  formulas.)  So  we  can  use  the  smallest  even  t 
such  that  c;+i  + c;+3  + • • • + C(+t+ 1 = c;  + c;+2  + • • ■ + c/+i  + A + 5 for  some  <5  > 0. 

(The  1-shift  need  not  be  cyclic,  if  we  allow  ourselves  to  shift  left  instead  of  right. 
But  the  cyclic  property  may  be  needed  in  subsequent  steps.) 

75.  Equivalently,  given  indices  0 < *0  < n < ■ ■ ■ < is_x  < is  = 2d  and  0 = j0  < 
P <■■'  < 3a- i < j,  = 2d,  we  want  to  map  {x2d_1  . . . Xlxo)2  ^ (x(24_1)lp  . . .xlvxovh, 
where  j(p  = ir  for  jr  < j < jr+1  and  0 < r < s.  If  d = 1,  a mapping  module  does  this. 

When  d > 1,  we  can  set  the  left-hand  crossbars  so  that  they  route  input  ir  to  line 
ir®  ((ir  + r)  mod  2).  If  s is  even,  we  recursively  ask  one  of  the  networks  P(2d“1)  inside 
P(2d)  to  solve  the  problem  for  indices  [{i0,  i2, . . . , is}/2]  and  [{jo,h, ...  ,js}/2j , while 
the  other  solves  it  for  L(n,i3, . . . , t.-i,  2d}/2]  and  ■ ■ ■ ,3s}/ 2].  At  the  right  of 

P(2d),  one  can  now  check  that  when  jr  < j < jr+ 1,  the  mapping  module  for  lines  j 
and  j©  1 has  input  ir  on  line  j if  j = r (modulo  2),  otherwise  ir  is  on  line  j©l.  A similar 
proof  works  when  s is  odd.  For  example,  if  (i0,...,i5)  = (j0,  • • • , js)  = (0, 1, 3, 5,  7, 8), 
the  subproblems  have  i = j = (0, 1,  3, 4)  and  (0, 2, 4);  x2  . . . xq  i— > a?6a;7a;5a:4a:2S3a;ia;o  t-> 

' ' ' l_t  ^5a:7*5a;3a;ia;3a:ia;o  *7a:5a;5a:3XiXia;o. 

Notes:  This  network  is  a slight  improvement  over  a construction  by  Yu.  P.  Ofman, 
Trudy  Mosk.  Mat.  Obshchestv a 14  (1965),  186  199.  We  can  implement  the  correspond- 
ing network  by  substituting  a “(5-map”  for  a (5-swap;  instead  of  (69),  we  use  two  masks 
and  do  seven  operations  instead  of  six:  j/  <-  a:  ® (a: » <5) , a:  <-  a:  ® (2/  & 0)  ® ( (3/  & 0' ) < <5) . 
This  extension  of  (71)  therefore  takes  only  d additional  units  of  time. 

76.  When  a mapping  network  realizes  a permutation,  all  of  its  modules  must  act  as 
crossbars,  hence  G(n ) > lgn!.  Ofman  proved  that  G(n)  < 2.5nlgn,  and  remarked  in 
a footnote  that  the  constant  2.5  could  be  improved  (without  giving  any  details).  We 
have  seen  that  in  fact  G{n)  < 2nlgn.  Note  that  G(3)  = 3. 

77.  Represent  an  n-network  by  (x2n_1  . . .2:12:0)2,  where  xk  = [the  binary  representa- 
tion of  k is  a possible  configuration  of  0s  and  Is  when  the  network  has  been  applied  to 
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all  2n  sequences  of  Os  and  Is],  for  0 < k < 2n . Thus  the  empty  network  is  represented 
by  22"  — 1,  and  a sorting  network  for  n = 3 is  represented  by  (10001011)2-  In  general, 

x represents  a sorting  network  for  n elements  if  and  only  if  it  represents  an  n-network 
and  vx  ~ n + 1,  if  and  only  if  x = 2°  + 21  + 23  + 27  + • • • + 22"-1. 

If  x represents  a according  to  these  conventions,  the  representation  of  a [i:j]  is 
{x  © y)  \ {y  » (2n_t  - 2’1-5)),  where  y = x k k Hn-j- 

[See  V.  R.  Pratt,  M.  O.  Rabin,  and  L.  J.  Stockmeyer,  STOC  6 (1974),  122-126.] 

78.  If  k > lg (m  — 1)  the  test  is  valid,  because  we  always  have  x\  + X2  + • ■ ■ + xm  > 

xi  | X2  | • • • | xm , with  equality  if  and  only  if  the  sets  are  disjoint.  Moreover,  we  have 
(xi  + • • • + xm)  - (xr  | • • • | xm)  < (m  - l)(2n-fc~1  + • • • + 1)  < (to  - l)2n~k  < 2n . 

Conversely,  if  to  > 2k  + 2 and  n > 2k,  the  test  is  invalid.  We  might  have,  for 
example,  xi  + ---  + xm  = (2fe  + l)(2n-fc  - 2n-2fc~1)  + 2n-'t~1  = 2n  + (2n~k  - 2n~2k~1). 

But  if  n < 2k  the  test  is  still  valid  when  rn  = 2k  + 2.  because  our  proof  shows  that 
xi  + • • • + xm  — (xi  | | xm)  < (2fc  + l)(2n_fc  — 1)  < 2"  in  that  case. 

79.  x,  = (x  — 1)  k x-  (And  the  formula  x , = ((x  — b — 1)  k a)  + b corresponds  to  (85).) 
These  recipes  for  x'  and  x,  are  part  of  Jorg  Arndt’s  “bit  wizardry”  routines  (2001); 
their  origin  is  unknown. 

80.  Perhaps  the  nicest  way  is  to  start  with  x t—  \ — 1 as  a signed  number;  then  while 
x > 0,  set  x t—  x k x,  visit  x,  and  set  x <—  2x  — x-  (The  operation  2x  — x can  in  fact 
be  performed  with  a single  MMIX  instruction,  ‘2ADDU  x,x,minuschi’.) 

But  that  trick  fails  if  x is  so  large  as  to  be  already  “negative.”  A slightly  slower 
but  more  general  method  starts  with  x <—  x and  does  the  following  while  x / 0:  Set 
t <—  x k —x,  visit  x — and  set  x <—  x — t. 

81.  ((2  k x)  ~ iz'  & x))  & X-  (One  way  to  verify  this  formula  is  to  use  (18).) 

82.  Yes,  by  letting  z — z in  (86):  w \ (z  k x),  where  w = ((2  k x)  + (z  \ x))  & X- 

83.  (The  following  iteration  propagates  bits  of  y to  the  right,  in  the  gaps  of  a scattered 
accumulator  t.  Auxiliary  variables  u and  v respectively  mark  the  left  and  right  of  each 
gap;  they  double  in  size  until  being  wiped  out  by  w.)  Set  t «—  2 k x,  u t—  (x  1)  k x, 
v ((x  « 1)  + 1)  k x,  w <—  3 {u  k v),  u «—  3 u' , v t—  3v,  and  k t—  1.  Then,  while 
u / 0,  do  the  following  steps:  t <—  t | ((t  k)  k u '),  k <—  k <1,  u t—  u kw,  v <—  v kw, 
w t—  ((v  k (it  !§>  1)  k u)  <g  (k  + 1))  — (( u & (»  « 1)  & S)  » k ),  u!  <—  (u  k v)  2>  k, 

+ ((v  & «)<!),««-»  + u . Finally  return  the  answer  ((t  1)  k x)  | [z  k x). 

84.  2r-x  = w-  (2  & x)>  where  w = (((2  k x)  ^ 1)  + x)  & X appears  in  answer  82; 
2 —r  x is  the  quantity  t computed  (with  more  difficulty)  in  the  answer  to  exercise  83. 

85.  (a)  If  x = L0C(a[i,  j,  fc])  is  the  drum  location  corresponding  to  interleaved  bits  as 
stated,  then  L0C(a[i  + 1,  j,  k ])  = x ® ((x  ® (( x k x)  — x))  & x)  and  L0C(o[i  — 1,  j,  k ])  = 
x © ((x  © ((x  k x)  — 1))  k x),  where  x = (llllljs,  by  (84)  and  answer  79.  The  formulas 
for  L0C(a[i,  j ± 1,  k ])  and  L0C(a[i,  j,  k ± 1])  are  similar,  with  masks  2x  and  4y- 

(b)  For  random  access,  let’s  hope  there  is  room  for  a table  of  length  32  giving 
/[(* 4*3*21110)2]  = (2413*2*1*0)8.  Then  L0C(a[*,  j,  k})  = (((/[fc]  < 1)  + f[j])  < 1)  + /[*]. 
(On  a vintage  machine,  bitwise  computation  of  / would  be  much  worse  than  table 
lookup,  because  register  operations  used  to  be  as  slow  as  fetches  from  memory.) 

(c)  Let  p be  the  location  of  the  page  currently  in  fast  memory,  and  let  z = —128. 
When  accessing  location  x,  if  x k 2 / p it  is  necessary  to  read  128  words  from  drum 
location  x k 2 (after  saving  the  current  data  to  drum  location  p if  it  has  changed); 
then  set  p <—  x k z.  [See  J.  Royal  Stat.  Soc.  B-16  (1954),  53-55.  This  scheme  of  array 
allocation  for  external  storage  was  devised  independently  by  E.  W.  Dijkstra,  circa  1960, 


7.1.3 


ANSWERS  TO  EXERCISES  597 


who  called  it  the  “zip-fastener”  method.  It  has  often  been  rediscovered,  for  example 
in  1966  by  G.  M.  Morton  and  later  by  developers  of  quadtrees;  see  Hanan  Samet, 
Applications  of  Spatial  Data  Structures  (Addison- Wesley,  1990).  See  also  R.  Raman 
and  D.  S.  Wise,  IEEE  Trans.  C57  (2008),  567-573,  for  a contemporary  perspective. 
Georg  Cantor  had  considered  interleaving  the  digits  of  decimal  fractions  in  Crelle  84 
(1878),  242-258,  §7;  but  he  observed  that  this  idea  does  not  lead  to  an  easy  one-to-one 
correspondence  between  the  unit  interval  [0 . . I]  and  the  unit  square  [0 . . 1]  X [0 . . 1] .] 

86.  If  ( p',q',r ')  rightmost  bits  and  (p",q",r")  other  bits  of  (*,j,  k)  are  in  the  part  of 
the  address  that  does  not  affect  the  page  number,  the  total  number  of  page  faults  is 
2((2P  p — l)2q+r  + (2g~q  — l)2p+r  + (2r~r  — 1)2P+Q).  Hence  we  want  to  minimize  2 ~p'  + 
2~g  _|_2”r  over  nonnegative  integers  (p1  ,q' ,r' ,p" ,q" ,r")  withp'+p"  < p,  q'  + q"  < q , 
r +r  < r,  p + q + r +p"  + q"  -\-r"  = s.  Since  2“  + 26  > 2a_1  -t-2^1  when  a and  b are 
integers  with  a > b + 1,  the  minimum  (for  all  s)  occurs  when  we  select  bits  from  right  to 
left  cyclically  until  running  out.  For  example,  when  ( p,q,r ) = (2,6,3)  the  addressing 
function  would  be  (jsj-ijsl^jfci  ji*ifcojo*o)2-  In  particular,  Tocher’s  scheme  is  optimal. 

[But  such  a mapping  is  not  necessarily  best  when  the  page  size  isn’t  a power  of  2. 
For  example,  consider  a 16  x 16  matrix;  the  addressing  function  (j3*3*2*i*oj2jijo)2  is 
better  than  ( j 3 * 3 j 2 * 2 j 1 * 1 jo  * 0 ) 2 for  all  page  sizes  from  17  to  62,  except  for  size  32  when 
they  are  equally  good.] 

87.  Set  x x&~((a:  & "@@@@@@@@")  1);  each  byte  (0,7  ... £*0)2  is  thereby  changed  to 

(a7a6(a5Aa6)a4  . . .<*0)2-  The  same  transformation  works  also  on  30  additional  letters 
in  the  Latin- 1 supplement  to  ASCII  (for  example,  ae  ha  JE);  but  there’s  one  glitch,  yHfi. 

[Don  Woods  used  this  trick  in  his  original  program  for  the  game  of  Adventure 
(1976),  uppercasing  the  user’s  input  words  before  looking  them  up  in  a dictionary.] 

88.  Set  z «—  (x  © y)  & h,  then  z <—  ((x  \ h)  - (y  & /*))  0 z. 

89.  ft—  x\y,  t <—  (t3>  1),  2 4-  (x&iy&Lfio)  | (f &/*o)-  [From  the  “nasty”  test  program 

for  H.  G.  Dietz  and  R.  J.  Fisher’s  SWARC  compiler  (1998),  optimized  by  T.  Dahlheimer.] 

90.  Insert  ‘z  •<—  z | ((x  0 y)  & I)’  either  before  or  after  ‘z  (x  & y)  + z\  (The  ordering 
makes  no  difference,  because  x+y  = x®y  (modulo  4)  when  x+y  is  odd.  Therefore  MMIX 
can  round  to  odd  at  no  additional  cost,  using  M0R.  Rounding  to  even  in  the  ambiguous 
cases  is  more  difficult,  and  with  fixed  point  arithmetic  it  is  not  advantageous.) 

91.  If  |[x,  y]  denotes  the  average  as  in  (88),  the  desired  result  is  obtained  by  repeating 
the  following  operations  seven  times,  then  concluding  with  z <—  \[x,y\  once  more: 

2 «-  ^\x,y\,  t^ak.h,  m <—  (f  « 1)  — (t  » 7), 

x t-  (mkz)  | (m&x),  y 4-  (m  & z)  \ (m  & y),  a<r-a^l. 

Although  rounding  errors  accumulate  through  eight  levels,  the  resulting  absolute  error 
never  exceeds  807/255.  Moreover,  it  is  ~ 1.13  if  we  average  over  all  2563  cases,  and 
it  is  less  than  2 with  probability  m 94.2%.  If  we  round  to  odd  as  in  exercise  90,  the 
maximum  and  average  error  are  reduced  to  616/255  and  « 0.58;  the  probability  of  error 
< 2 rises  to  ~ 99.9%.  Therefore  the  following  MMIX  code  uses  such  unbiased  rounding: 


x GREG  ;y  GREG  ;z  GREG 

X0R  t,x,y  M0R  m, ffhi, alf 

alf  GREG  ;m  GREG  ;t  IS  $255 

M0R  z,rodd,t  PUT  rM,m 

repeat  seven  times:  < 

AND  t ,x,y  MUX  x,z,x 

rodd  GREG  #4020100804020101 

ADDU  z,z,t  MUX  y,y,z 

ffhi  GREG  -1«56 

SLU  alf, alf ,1 
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but  omit  the  final  SLU,  then  repeat  the  first  four  instructions  again.  The  total  time  for 
eight  a-blends  (66v)  is  less  than  the  cost  of  eight  multiplications. 

92.  We  get  Zj  = \{xj  +yj)/ 2]  for  each  j.  (This  fact,  noticed  by  H.  S.  Warren,  Jr., 
follows  from  the  identity  x + y — (( x | y)  <C  1)  — (x  ® y).  See  also  the  next  exercise.) 

93.  x — y = (x  © y)  — ((5  & y)  -C  1).  (“Borrows”  instead  of  “carries.”) 

94.  ( x — l)j  = ( Xj  — 1 — bj)  mod  256,  where  bj  is  the  “borrow”  from  fields  to  the  right. 
So  tj  is  nonzero  if  and  only  if  ( Xj  . . .10)236  < (1 . . . 1)256  = (256J+1  — l)/255.  (The 
answers  to  the  stated  questions  are  therefore  “yes”  and  “no.”) 

In  general  if  the  constant  l is  allowed  to  have  any  value  (lj  . . . /Ho)256,  opera- 
tion (90)  makes  tj  yf  0 if  and  only  if  (xj  . . . xo)256  < ( lj  ■ ■ ■ ^0)256  and  Xj  < 128. 

95.  Use  (90):  Test  if  h & ( t(x  © ((x  8)  + (x  -C  56)))  | t(x  © ((x  16)  + (x  -C  48)))  | 

f(a:©  {{x^>  24)  + (a:  <g40)))  \ t(i$((i»  32)  + (x<C32))))  = 0,  where  t(x)  = (x  — l)8zx. 
(These  28  steps  reduce  to  20  if  cyclic  shift  is  available,  or  to  11  with  MXOR  and  BDIF.) 

96.  Suppose  0 < x,y  < 256,  Xh  = [_£c /128J , xi  = x mod  128,  yn  = |_j//128J  , yi  = 
y mod  128.  Then  [x<y]  = {xhyh[xi  <yi])\  see  exercise  7.1.1-106.  And  [xi<yi]  = 
[yi  + 127  — xi  > 128].  Hence  [x  < y]  = \_(xyz) /\2&\ , where  z = (x  & 127)  + (y  & 127). 

It  follows  that  t = h&z(xyz)  has  the  desired  properties,  when  2 = (£&  h)  + (y&  h). 
This  formula  can  also  be  written  t = h & ~{xyz),  where  z = ~((ai  & h)  + (y  & h))  = 
(x  | h)  - (y  & h)  by  (18). 

To  get  a similar  test  function  for  [xj  <yj]  = 1 — [yj  < Xj\,  we  just  interchange  x y 
and  take  the  complement:  t h & ~(xyz)  = h & ( xyz ),  where  2 = (x  &ih)  + (y  & h). 

97.  Set  x'  4—  s®  "*****♦**",  y x(By,  t <—  h&c(x  \ ((x|  h)  — l))  & (y1  \ ((y1  \ h)  — Z)), 
m (t<0)  — (ti§>7),  t t—  t&c(x'  \((x'  \h,)  — l)),  z <—  (m&;"********")\(m&zy).  (20  steps.) 

98.  Set  u 4—  x(By,  z 4—  (x&zh)+(y8ih),  t 4—  h&(x(B(u\ (a;©z))),  v 4—  ((£<§C1)  — (t2>7))&u, 
z 4-  x Q)  v,  w 4-  y © v.  [This  14-step  procedure  invokes  answer  96  to  compute  t = 
h & {xyz),  using  the  footprint  method  of  Section  7.1.2  to  evaluate  the  median  in  only 
three  steps  when  x © y is  known.  Of  course  the  MMIX  solution  is  much  quicker,  if 
available:  BDIFt,x,y;  ADDUz,y,t;  SUBU  w,x,t.] 

99.  In  this  potpourri,  each  of  the  eight  bytes  appears  to  be  solving  a different  kind 

of  problem;  we  must  recast  the  conditions  so  that  they  fit  into  a common  framework: 
So  = [zo  © ’ ! ’ <0],  /1  = [xx  © >0],  St.  = [x2  < ’A’  - 1],  /3  = [2:3  > ’z*],  /4  = 

[3:4  > ’a’  — 1],  /5  = [x5  © ’0’  < 9],  So  = [*6  © 255  > 86],  St  = [®7  © < 3].  Aha!  We 

can  use  the  formulas  in  answer  96,  adjusting  d to  switch  between  < and  > as  needed: 
a = (’?’  (255)  ’0’000’*’  ’ ! ’)256  = #3f f f 300000002a21 ; b = h = #7f7f7f7f7f7f 7f7f  ; 
c = h & ~(3(86)9(  ’a’  -l)>z’(’A’  - 1)00)256  = #7c29761f  053f  7f7f  (the  hardest  one); 
d = #8000800000800080;  and  e = h = #8080808080808080. 

100.  We  want  Uj  = Xj+yj+Cj  — 10cj+i  and  Vj  = Xj—yj—bj+10bj+i,  where  c,  and  bj  are 
the  “carry”  and  “borrow”  into  digit  position  j.  Set  u'  4-  (x  + y + (6  . . . 66)i6)  mod  264 
and  v'  4-  [x  — y)  mod  264.  Then  we  find  «'  = Xj  + yj  + Cj  + 6 — 16cj+i  and  u'  = 
xi  ~ yj  ~ bj  + 166j+i  for  0 < j < 16,  by  induction  on  j.  Hence  u'  and  v'  have  the 
same  pattern  of  carries  and  borrows  as  if  we  were  working  in  radix  10,  and  we  have 
u = u — 6(ci6  . . • c2ci)i6,  v = v — 6(fei6  . . . fc2bi)i6-  The  following  computation  schemes 
therefore  provide  the  desired  results  (10  operations  for  addition,  9 for  subtraction): 

y 4-  y + (6  . . . 66)16,  u 4-  x + y',  v 4-  x — y, 

t 4-  { xyu  ) & (8  . . . 88)16,  t 4-  { xyv ')  & (8  . . . 88)16, 

u 4-  u — t + (t  2);  v 4—  v — t + (t  2). 
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101.  For  subtraction,  set  z 4-  x-y;  for  addition,  set  z -f-  x + y + #e8c4c4f cl8,  where 
this  constant  is  built  from  256  - 24  = #e8,  256  - 60  = #c4,  and  65536  - 1000  = 

fcl8.  Borrows  and  carries  will  occur  between  fields  as  if  mixed-radix  subtraction  or 
addition  were  being  performed.  The  remaining  task  is  to  correct  for  cases  in  which 
borrows  occurred  or  carries  did  not;  we  can  do  this  easily  by  inspecting  individual 
digits,  because  the  radices  are  less  than  half  of  the  field  sizes:  Set  t <—  z&#  8080808000, 
t 4—  (t  1)  — (t  7)  — ((t  3>  15)  & 1),  z 4—  z — (t  & *e8c4c4f  cl8).  [See  Stephen  Soule, 
CACM  18  (1975),  344-346.  We’re  lucky  that  the  ‘c’  in  ‘fcl8’  is  even.] 

102.  (a)  We  assume  that  x = (xi5  . . . x0)16  and  y = ( y15  . . . y0)  is,  with  0 < x3 , y3  < 5; 
the  goal  is  to  compute  u = (uis  . . . wo)i6  and  v = (u15  . . . r>o)i6,  with  components 
uj  = (xj  + Uj)  mod  5 and  v3  = ( x3  — y3)  mod  5.  Here’s  how: 


u i + |z,  - y + 51, 

t 4—  (u  + 3/)  & h,  t 4—  (v  + 31)  & h, 

u 4-  u - {(t  - (t  » 3))  & 51);  V <r-  v - {(t  - (t  > 3))  & 51). 

Here  l = (1 . . . 1)16  = (264  — 1)/15,  h = 81.  (Addition  in  7 operations,  subtraction  in  8.) 
(b)  Now  x = ( X20  ■ ■ ■ *o)g,  etc.,  and  we  must  be  more  careful  to  confine  carries: 


t <—  x + h, 

z i — (t  & h)  T (y  & /i), 
t ■<—  (y  | z)  & t & h, 

Mf-i  + i/-(t+(i»  2)); 


z<-(x\h)~  (y&ch), 
t 1—  (y  | z)  & x & h, 
v 1-  * - y + t + (t  » 2). 


Here  h = (4 . . . 4)g  = (265  — 4)/7.  (Addition  in  11  operations,  subtraction  in  10.) 

Similar  procedures  work,  of  course,  for  other  moduli.  In  fact  we  can  do  multibyte 
arithmetic  on  the  coordinates  of  toruses  in  general,  with  different  moduli  in  each 
component  (see  7.2.1.3-(66)). 

103.  Let  h and  l be  the  constants  in  (87)  and  (88).  Addition  is  easy:  u 4-  x\{(xk,h)+y). 
For  subtraction,  take  away  1 and  add  x3&z(l-y3  ):  t <-  (x&d)»l,  v i-  t\  (f+(a;&(j/©/))). 

104.  Yes,  in  19:  Let  a = (((1901  < 4)  + 1)  < 5)  + 1,  6 = (((2099  « 4)  + 12)  < 5)  + 28. 
Set  m (x  » 5)  & #f  (the  month),  cl—  #10  & ~((x  | ( x 1))  3>  5)  (the  leap  year 
correction),  u 1—  b + #3  & ((#3bbeecc  + c)  3>  (m  + m))  (the  max-day  adjustment),  and 
t 1—  ((x  © a © (x  — a))  | (x  © u © (it  — x)))  & *1000220  (the  test  for  unwanted  carries). 

105.  Exercise  98  explains  how  to  compute  bytewise  min  and  max;  a simple  modification 
will  compute  min  in  some  byte  positions  and  max  in  others.  Thus  we  can  “sort  by 
perfect  shuffles’  as  in  Section  5.3.4,  Fig.  57,  if  we  can  permute  bytes  between  x and  y 
appropriately.  And  such  permutation  is  easy,  by  exercise  1.  [Of  course  there  are  much 
simpler  and  faster  ways  to  sort  16  bytes.  But  see  S.  Albers  and  T.  Hagerup,  Inf.  and 
Computation  136  (1997),  25-51,  and  M.  Thorup,  J.  Algorithms  42  (2002),  205-230, 
for  asymptotic  implications  of  this  approach.] 

106.  The  n bits  are  regarded  as  g fields  of  g bits  each.  First  the  nonzero  fields  are 
detected  (H),  and  we  form  a word  y that  has  (yg- 1 . . .1/0)2  in  each  5-bit  field,  where 
Vi  — [field  j of  x is  nonzero].  Then  we  compare  each  field  with  the  constants  29-1, 
. . . , 2 (^2),  and  form  a mask  m that  identifies  the  most  significant  nonzero  field  of  x. 
After  putting  g copies  of  that  field  into  z,  we  test  z as  we  tested  y ( t3 ).  Finally  an  appro- 
priate sideways  addition  of  t2  and  t3  (5-bit-wise)  yields  A.  (Try  the  case  5 = 4,  n = 16.) 

To  compute  2X  without  shifting  left,  replace  ‘t2  « 1’  by  % + 12\  and  replace  the 
final  line  by  w <—  (((a  • (t3  © (t3  5)))  mod  2n)  ^$>  (n  — g))  ■ l;  then  w & m is  2Xx . 
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107. 

h 

GREG 

#8000800080008000 

SLU 

q,t , 16 

OR 

t,t,y 

ms 

GREG 

#00ff0f Of 33335555 

ADDU 

t,t,q 

AND 

t ,t  ,h 

1H 

SRU 

q,x,32 

SLU 

q,t ,32 

5H 

SLU 

q,t,15 

ZSNZ 

lam,q,32 

ADDU 

t,t,q 

ADDU 

t,t,q 

ADD 

t , lam, 16 

3H 

ANDN 

y,t,ms 

SLU 

q,t  ,30 

SRU 

q,x,t 

4H 

XOR 

t,t,y 

ADDU 

t,t,q 

CSNZ 

lam,q,t 

OR 

q.y.h 

6H 

SRU 

q,t  ,60 

2H 

SRU 

t ,x , lam 

SUBU 

t,q,t 

ADDU 

lam, lam, q 

The  total  time  is  22u  (and  no  mems).  [There’s  also  a mem-less  version  of  (56), 
costing  only  16u,  if  its  last  line  is  replaced  by  ADD  t,lam,4;  SRU  y,x,t;  CSNZ  lam,y,t; 
SRU  y,x,lam;  SLUt.y.l;  SRU  t , [#f  f f f aa50]  , t ; AND  t , t , 3 ; ADD  lam,  lam,  t.] 

108.  For  example,  let  e be  minimum  so  that  n < 2e  ■ 22'  . If  n is  a multiple  of  2e,  we 
can  use  2e  fields  of  size  n/2e,  with  e reductions  in  step  Bl;  otherwise  we  can  use  2e 
fields  of  size  2^gn^_e_1,  with  e + 1 reductions  in  step  Bl.  In  either  case  there  are  e 
iterations  in  steps  B2  and  B5,  so  the  total  running  time  is  O(e)  = O(loglogn). 

109.  Start  with  x 4-  x k -x  and  apply  Algorithm  B.  (Step  B4  of  that  algorithm  can 

be  slightly  simplified  in  this  special  case,  using  a constant  l instead  of  x 0 y.) 

110.  Let  s = 2d  where  d = 2e  — e.  We  will  use  s-bit  fields  in  n-bit  words. 

Kl.  [Stretch  imods.]  Set  y 4-  x k (s  - 1).  Then  set  t 4-  ykfij  and  y 4-  y©t© 
(t<S2J(s-l))  for  e > j > 0.  Finally  set  y 4-  (y<s)-y.  [If  a:  = (x2=_i . . . x0)2 
we  now  have  y = (y2<=- 1 • • • 3/0)2*,  where  y,  = ( 2s  - 1 )xj[j  < d].[ 

K2.  [Set  up  minterms.j  Set  y 4-  j/©(a2«_i  . . . a0)2»,  where  a3  = for  0 < j < d 
and  aj  = 2s  — 1 for  d < j < 2e . 

K3.  [Compress.]  Set  y 4-  y k (y  » 2J s)  for  e > j > 0.  [Now  y = 1<  (x  mod  s). 
This  is  the  key  point  that  makes  the  algorithm  work.] 

K4.  [Finish.]  Set  y 4-  y | (y  < 2Js)  for  0 < j < e.  Finally  set  y 4-  y k (/j2«,j  © 

— ((x  > j)  & 1))  for  d < j < 2e.  | 

111.  The  n bits  are  divided  into  fields  of  s bits  each,  although  the  leftmost  field  might 
be  shorter.  First  y is  set  to  flag  the  all-1  fields.  Then  t = ( . . . Mo)2»  contains  candidate 
bits  for  q,  including  “false  drops”  for  certain  patterns  01fc  with  s < k < r.  We  always 
have  vtj  < 1,  and  tj  7^  0 implies  tj-i  = 0.  The  bits  of  u and  v subdivide  t into  two 
parts  so  that  we  can  safely  compute  m = (t  > 1)  | (f  > 2)  | • • • | (t  » r),  before  making 
a final  test  to  eliminate  the  false  drops. 

112.  Notice  that  if  q = x & (x  1)  & • • • & (x  (r  — 1))  &;  ~(x  <?C  r)  then  we  have 

x & x + q = x & (x  < 1)  & • • • & (x  < (r  — 1)). 

If  we  can  solve  the  stated  problem  in  0(1)  steps,  we  can  also  extract  the  most 
significant  bit  of  an  r-bit  number  in  0(1)  steps:  Apply  the  case  n = 2r  to  the  number 
2n  — 1 — x.  Conversely,  a solution  to  the  extraction  problem  can  be  shown  to  yield  a solu- 
tion to  the  T 0 problem.  Exercise  110  therefore  implies  a solution  in  O(loglogr)  steps. 

113.  Let  0'  = 0,  x'0  = xo,  and  construct  x',  = X;  for  1 < i < r as  follows:  If 

Xi  = a Of  b and  o;  ^ {+,  — , <g},  let  i’  = (i  — 1)'  + 1 and  x',  = a1  o*  6',  where  a1  = x'  , 

if  a = Xj  and  a1  = a if  a = a.  If  xt  = a « c,  let  i1  = (i  - 1)'  + 2 and  (x'i,_1,x'i,)  = 

(a'&([2n~c]  - l),x',_1  <c).  If  Xi  = a + b,  let  i'  = (i-  l)'  + 6 and  let  (x'(i_1),+1, . . . ,x',) 
compute  (( a ' & h)  + ( b ' & h))  ® (( a ' © b')_k  h),  where  h = 2n_1.  And  if  xt  = a - b,  do 
the  similar  computation  ((o'  | h)  — ( b ' k h))  © ((o'  = b')  k h).  Clearly  r'  < 6 r. 
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114.  Simply  let  Xt  = Xj(i)  o{  Xk(i)  when  xt  = Xj (i)  o t xk(i),  Xi  = Ci  o t Xk w when 
Xi  = C{  Oi  xk(i),  and  Xi  = Xj(i)  Oj  C*  when  Xi  = Xj (*)  Oj  a , where  C*  = Ci  when  a is  a 
shift  amount,  otherwise  Ci  = (cj  . . .01)2"  = (2mn  — l)cj/(2n  — 1).  This  construction  is 
possible  thanks  to  the  fact  that  variable-length  shifts  are  prohibited. 

[Notice  that  if  m = 2d , we  can  use  this  idea  to  simulate  2d  instances  of  f(x,yi); 
then  O(d)  further  operations  allow  “quantification.”] 

115.  (a)  z 4—  (x  1)  & (x  <C  2),  y 4—  x & (2:  + z).  [This  problem  was  posed  to  the 
author  by  Vaughan  Pratt  in  1977.] 

(b)  First  find  i|  (-  (1  < 1)  & i and  xr  4—  x & (x  1),  the  left  and  right  ends 
of  x’s  blocks;  and  set  x'r  4-  xr  & (xr  - 1).  Then  ze  4-  x|.  & (x'r  - (x,  & p0))  and 
za  <—  x'r  X.  (x'T  — (xi  X no))  are  the  right  ends  that  are  followed  by  a left  end  in  even  or 
odd  position,  respectively.  The  answer  is  y 4-  x & (a;  + (ze  & £0)  + (za  & Mo));  it  can  be 
simplified  to  y 4-  x & (x  + (ze  ® (xj.  & Mo)))- 

(c)  This  case  is  impossible,  by  Corollary  I. 

116.  The  language  L is  well  defined,  by  Lemma  A (except  that  the  presence  or  absence 
of  the  empty  string  is  irrelevant).  A language  is  regular  if  and  only  if  it  can  be  defined  by 
a finite-state  automaton,  and  a 2-adic  integer  is  rational  if  and  only  if  it  can  be  defined 
by  a finite-state  automaton  that  ignores  its  inputs.  The  identity  function  corresponds 
to  the  language  L = 1(0U  1)*,  and  a simple  construction  will  define  an  automaton  that 
corresponds  to  the  sum,  difference,  or  Boolean  combination  of  the  numbers  defined  by 
any  two  given  automata  acting  on  the  sequence  X0X1X2  ....  Hence  L is  regular. 

In  exercise  115,  L is  (a)  11*(000*1(0U  1)*  UO’);  (b)  11* (00(00)*1(0  U 1)*  U0*). 

117.  Incidentally,  the  stated  language  L corresponds  to  an  inverse  Gray  binary  code: 
It  defines  a function  with  the  property  that  f(2x)  = ~/(2x  + 1),  and  g(f( 2x))  = 
g(f(2x  + 1))  = x,  where  g(x)  = x © (x  » 1)  (see  Eq.  7.2.1.1-(g)). 

118.  If  x = (xn-i  ■ • • £12:0)2  and  0 < a,j  < for  0 < j < n,  we  have  ajxj  = 

E;=o(“2  - (5  & 2J)).  Take  aj  = L2J“1J  to  get  x » 1. 

Conversely,  the  following  argument  by  M.  S.  Paterson  proves  that  monus  must  be 
used  at  least  n — 1 times:  Consider  any  chain  for  f(x)  that  uses  addition,  subtraction, 
bitwise  Booleans,  and  k occurrences  of  the  “underflow”  operation  y<z  = (2n  — 1 ){y<z]. 
If  k < n— 1 there  must  be  two  n-bit  numbers  x'  and  x"  such  that  x'  mod  2 = x"  mod  2 = 
0 and  such  that  all  k of  the  <’s  yield  the  same  result  for  both  x'  and  x" . Then 
f(x  ) mod  2-'  = f(x")  mod  2J  when  j = p(x'  © x").  So  f(x)  is  not  the  function  r>l. 

119.  z 4—  x © y,  f 4—  2P  & 2 & (z  - 1).  (See  (90).) 

120.  Generalizing  Corollary  W,  these  are  the  functions  such  that  f(x i,...,xm)  = 
f(yi,  ■ ■ ■ ,Vm ) (modulo  2k)  whenever  x3  = y3  (modulo  2k)  for  1 < j < m,  for  0 < k < n. 
The  least  significant  bit  is  a binary  function  of  m variables,  so  it  has  22"1  possibilities. 
The  next-to-least  is  a binary  function  of  2m  variables,  namely  the  bits  of  (xi  mod  4, 

. . . , xm  mod  4),  so  it  has  222m;  and  so  on.  Thus  the  answer  is  22m+22T>lH — i-2nm 

121.  (a)  If  / has  a period  of  length  pq,  where  q > 1 is  odd,  its  p-fold  iteration  has  a 
period  of  length  <7,  say  y0  1-4  Vl  t->  • ■ • yq  = y0  where  yj+1  = f[p](yj)  and  yx  / y0.  But 
then,  by  Corollary  W,  we  must  have  y0  mod  2n_1  t->-  y3  mod  2n_1  i-A  • • • i->  yq  mod  2n_1 
in  the  corresponding  (n  - l)-bit  chain.  Consequently  y-i  = y0  (modulo  2n-1),  by 
induction  on  n.  Hence  t/i  = yo  © 2"-1,  and  j/2  = yo,  etc.,  a contradiction. 

(b)  xj  = xo  + £0,  X2  = Xo  (p  — 1),  £3  = xi  | X2;  a period  of  length  p starts  with 
the  value  x0  = (1  + 2P  + 22p  -| ) mod  2n. 
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122.  Subtraction  is  analogous  to  addition;  Boolean  operations  are  even  simpler;  and 
constants  have  only  one  bit  pattern.  The  only  remaining  case  is  xr  = Xj  c,  where  we 
have  Sr  = Sj  + c;  the  shift  goes  left  when  c < 0.  Then  Vpqr  = V(p+C)(9+C)j,  and 

xr  & [2P  - 29J  = {(Xj  & |2P+C  - 29+cJ)  »c)  & (2n  - 1). 

Hence  |Ap,jr|  < |^f(P+c)(9+c)j|  < Bj  = Br  by  induction. 

123.  If  x = (xg— i . . .0:0)2,  note  first  that  t = 29~1(x0  . . . x9_i)2s  in  (104);  hence  y = 
(xo...xs_i)2  as  claimed.  Theorem  P now  implies  that  [|lggj  broadword  steps  are 
needed  to  multiply  by  a9+i  and  by  a9_i.  At  least  one  of  those  multiplications  must 
require  lggj  or  more  steps. 

124.  Initially  t <—  0,  xo  = x,  Uo  = {1, 2, . . . , 2™_1 },  and  V <—  0.  When  advancing 
t <-  t + 1,  if  the  current  instruction  is  n <—  Xj  ± rk  we  simply  define  xt  = Xy  ± xk>  and 
i'  <—  f.  The  cases  Xi  Xj  o xk  and  xt  <—  c are  similar. 

If  the  current  instruction  branches  when  r,  < Xj,  define  xt  = xt~i  and  let  Vj  = 
{x  £ Ut-i  | Xi 1 < Xj*},  Vo  = Ut-i  \ V\.  Let  Ut  be  the  larger  of  V0  and  Vi ; branch  if 
Ut  = Vi.  Notice  that  \Ut\  > \Ut-\\/2  in  this  case. 

If  the  current  instruction  is  Xi  <—  rj^$>rk,  let  W = {x  € Ut- 1 | x&L2lgn+s  — 2s  J / 0 
for  some  s € Sfc/},  and  note  that  \W\  < |Sfc/|  lgn  < 2t~1+e+f . Let  Vc  = {x  G ?7t_i  \1T  | 
= c}  for  |c|  < n,  and  Vn  = Ut- 1 \ W \ U|c|<n  ^c-  Lemma  B tells  us  that  at  most 
Bk'  + 1 < 22*  1 _1  + 1 of  the  sets  Vc  are  nonempty.  Let  Ut  be  the  largest;  and  if  it  is  Vc . 
define  xt  = xy  c,  i!  t.  In  this  case  \Ut\  > (|t/t_i|  — 2t“1+e+-f)/(22t~1-1  + 1). 

Similarly  for  r;  e-  M[xj  mod  2m]  or  M[xj  mod  2m]  e-  r,,  let  W = {x  e Ut- 1 | 

x & [2m+s  - 2SJ  ± 0 for  some  s £ Sy},  and  Vz  = {x  £ Ut-i  \ W \ xy  mod  2m  = z}, 

for  0 < z < 2m.  By  Lemma  B,  at  most  By  < 22'  1 1 of  the  sets  Vz  are  nonempty;  let 
Ut  = Vz  be  the  largest.  To  write  r;  in  M[z],  define  xt  = xt_i,  z"  <-  i';  to  read  n from 
M[z],  set  i'  <—  t and  put  xt  = xz»  if  z"  is  defined,  otherwise  let  xt  be  the  precomputed 
constant  M[z\.  In  both  cases  \Ut\  > (|I7i_i|  — 2t~1m)/22<  1-1  is  sufficiently  large. 

If  t < / we  cannot  be  sure  that  x\  = px.  The  reason  is  that  the  set  W = 

{x  £ Ut  | x & [2lsn+<!  — 2SJ  / 0 for  some  s £ Sp}  has  size  |IT|  < ISpIlgn  < 2t+e+f , 

and  \Ut\W\  > 22'+/-2‘+i-2  '+<=+/  > 22‘-i  > \{xv  & |2ls"  - lj  | x0  € Ut\W}\.  Two 
elements  of  Ut  \ W cannot  have  the  same  value  of  px  = xy  & [2lgn  — lj. 

[The  same  lower  bound  applies  even  if  we  allow  the  RAM  to  make  arbitrary 
22  -way  branches  based  on  the  contents  of  (ri, . . . , ri)  at  time  t.] 

125.  Start  as  in  answer  124,  but  with  U0  = [0..29).  Simplifying  that  argument  by 
eliminating  the  sets  W will  yield  sets  such  that  \Ut\  > 29/max(2m,2 nf\  for  example, 
at  most  2n  different  shift  instructions  can  occur. 

Suppose  we  can  stop  at  time  t with  t < [\g(h  + 1)J.  The  proof  of  Theorem  P 
yields  p and  q with  xR  & [2P  - 29J  independent  of  x & [2P+S  - 29+sJ . Hence  the  hinted 
extension  of  Lemma  B shows  that  xR  takes  on  at  most  224-1  < 2(fe_1)/2  different  values, 
for  every  setting  of  the  other  bits  {x  & [2P+S  - 29+sJ  | s £ St}.  Consequently  ri  = xf 
can  be  the  correct  value  of  xR  for  at  most  2(h_1^2+9_''  values  of  x.  But  2<-h~'1^2+9~h 
is  less  than  | C/t | , by  (106). 

126.  M.  S.  Paterson  has  proposed  a related  (but  different)  conjecture:  For  every  2-adic 
chain  with  k addition-subtraction  operations,  there  is  a (possibly  huge)  integer  x with 
i^x  = k + 1 such  that  the  chain  does  not  calculate  2Xx . 

127.  Johan  Hastad  [Advances  in  Computing  Research  5 (1989),  143-170]  has  shown 
that  every  polynomial-size  circuit  that  computes  the  parity  function  from  the  inputs 
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{xi, . . . , xn,  xi, . . . , xn}  with  AND  and  OR  gates  of  unlimited  fanin  must  have  depth 
O(logn/loglogn). 

128.  (Note  also  that  the  suffix  parity  function  xq)  is  considered  in  exercises  36  and  66.) 
130.  If  the  answer  is  “no,”  the  analogous  question  with  variable  a suggests  itself. 


131.  This  program  does 

a typical  “breadth-first  search,”  keeping  LINK(q)  = r.  Regis- 

ter  u is  the  vertex  currently  being  examined;  v is  one  of  its  successors. 

OH  LD0U  r,q,link 

1 

r <-  LINK(q) . 

ST0U  v,q,link 

l^l-IQI 

LINK(q)  4-  v. 

SET  u,r 

1 

u 4 — r. 

ST0U  r,v,link 

l-R|-|Q| 

LINK(v)  4-  r. 

1H  LD0U  a,u,arcs 

\R\ 

a < — ARCS  (u)  . 

SET  q,v 

l^l-IQI 

q 4—  v. 

BZ  a,4F 

|R| 

Is  S[u]  = 0? 

3H  PBNZ  a,2B 

s 

Loop  on  a. 

2H  LD0U  v,a,tip 

S 

v 4-  TIP(a) . 

4H  LD0U  u,u,link 

\R\ 

u 4-  LINK(u) . 

LD0U  a, a, next 

S 

a <-  NEXT (a) . 

CMPU  t ,u,r 

\R\ 

Is  u / r? 

LD0U  t.v.link 

s 

t 4-  LINK(v) . 

PBNZ  t,lB 

\R\ 

If  so,  continue. 

PBNZ  t ,3F 

s 

Is  v £ R? 

1 

132.  (a)  We  always  have  t(U)  C &zu<£u  Su  = cr(U).  And  equality  holds  if  and  only  if 
2“  C p{u)  for  all  u € U and  u'  £ U. 

(b)  We’ve  proved  that  t(U)  C ct(C7);  hence  T CU.  And  if  t £ T we  have  2*  C pu 
for  all  ueU.  Therefore  a{T)  C r(T). 

(c)  Parts  (a)  and  (b)  prove  that  the  elements  of  Cn  represent  the  cliques. 

(d)  If  m C r then  u&tpk  C v&ipk  and  u&cSk  C v&t.5k\  so  we  can  work  entirely  with 
maximal  entries.  The  following  algorithm  uses  cache-friendly  sequential  (rather  than 
linked)  allocation,  in  a manner  analogous  to  radix  exchange  sort  (Algorithm  5.2.2R). 

We  assume  that  w\  . . . ws  is  a workspace  of  s unsigned  words,  bounded  by  wo  = 0 
and  ws+ 1 = 2"  — 1.  The  elements  of  appear  initially  in  positions  w\  . . .wm,  and 
our  goal  is  to  replace  them  by  the  elements  of  C+. 

Ml.  [Initialize.]  Terminate  if  pk  = 2n  — 1.  Otherwise  set  v 4—  2fe,  i 4—  1,  j 4—  m. 

M2.  [Partition  on  «.]  While  w,  & v - 0,  set  i 4-  i + 1.  While  Wjhv  ^ 0,  set 
3 3 ~ 1-  Then  if  i > j,  go  to  M3;  otherwise  swap  Wi  4A  Wj , set  i 4—  i + 1, 

j <—  j — 1,  and  repeat  this  step. 

M3.  [Split  Wi  . . . Wm .]  Set  l <—  j , p <—  s + 1.  While  i < m,  do  subroutine  Q with 
u = Wi  and  set  i <—  i + 1. 

M4.  [Combine  maximal  elements.]  Set  me-/.  While  p < s,  set  m ■<-  m + 1, 
wm  <-  wp , and  p <—  p + 1.  | 

Subroutine  Q uses  global  variables  j,  k,  l,  p,  and  v.  It  essentially  replaces  the  word  u 
by  u'  = uk  pk  and  u"  — u & <5*,  retaining  them  if  they  are  still  maximal.  If  so,  v!  goes 
into  the  upper  workspace  wp...w„  but  u"  stays  below. 

Ql.  [Examine  u' .]  Set  w £-  u & pk  and  q «-  s.  If  w = u,  go  to  Q4. 

Q2.  [Is  it  comparable?]  If  q < p,  go  to  Q3.  Otherwise  if  w & wq  = w,  go  to  Q7. 
Otherwise  if  w&wq  = wq,  go  to  Q4.  Otherwise  set  q «—  q — 1 and  repeat  Q2. 

Q3.  [Tentatively  accept  u\]  Set  p <—  p—  1 and  wp  <—  w.  Memory  overflow  occurs 
if  p < m + 1 . Otherwise  go  to  Q7. 

Q4.  [Prepare  for  loop.]  Set  r <-  p and  wp_i  «-  0. 

Q5.  [Remove  nonmaximals.]  While  w \ wq  ^ w,  set  q <-  q - 1.  While  w | wr  = w, 
set  r r + 1.  Then  if  q < r,  go  to  Q6;  otherwise  set  wq  «—  wr,  wr  e-  0, 
9 t—  q — 1,  rf-  r+1,  and  repeat  this  step. 
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Q6.  [Reset  p.]  Set  wq  <—  w and  p t—  q.  Terminate  the  subroutine  if  w = it. 

Q7.  [Examine  u" .]  Set  w t—  u & v.  If  w = wq  for  some  q in  the  range  1 < q < j, 
do  nothing.  Otherwise  set  l <—  l + 1 and  wi  <—  w.  | 

In  practice  this  algorithm  performs  reasonably  well;  for  example,  when  it  is  applied  to 
the  8 x 8 queen  graph  (exercise  7-129),  it  finds  the  310  maximal  cliques  after  306,513 
mems  of  computation,  using  397  words  of  workspace.  It  finds  the  10188  maximal 
independent  sets  of  that  same  graph  after  about  310  megamems,  using  15090  words; 
there  are  respectively  (728, 6912,  2456, 92)  such  sets  of  sizes  (5, 6,  7, 8),  including  the  92 
famous  solutions  to  the  eight  queens  problem. 

Reference:  N.  Jardine  and  R.  Sibson,  Mathematical  Taxonomy  (Wiley,  1971),  Ap- 
pendix 5.  Many  other  algorithms  for  listing  maximal  cliques  have  also  been  published. 
See,  for  example,  W.  Knodel,  Computing  3 (1968),  239-240,  4 (1969),  75;  C.  Bron 
and  J.  Kerbosch,  CACM  16  (1973),  575-577;  S.  Tsukiyama,  M.  Ide,  H.  Ariyoshi,  and 
I.  Shirakawa,  SICOMP  6 (1977),  505-517;  E.  Loukakis,  Computers  and  Math,  with 
Appl.  9 (1983),  583-589;  D.  S.  Johnson,  M.  Yannakakis,  and  C.  H.  Papadimitriou,  Inf. 
Proc.  Letters  27  (1988),  119-123.  See  also  exercise  5-23. 

133.  (a)  An  independent  set  is  a clique  of  G;  so  complement  G.  (b)  A vertex  cover  is 
the  complement  of  an  independent  set;  so  complement  G,  then  complement  the  outputs. 

134.  a eA  00,  b i y 01,  c i 11  is  the  first  mapping  of  class  II. 

135.  The  unary  operators  are  simple:  -.( xixr ) = xrxf,  o(xtxr)  = xrxr;  a(xixr)  = xtxi. 
And  xixr  <4-  yvyr  = (z;  A zr)(zi  V zr),  where  zi  = ( xi  = yi)  and  zr  = (xr  = yr )• 

136.  (a)  Classes  II,  III,  IVa,  and  IVC  all  have  the  optimum  cost  4.  Curiously  the 
functions  z;  = xi\Zyi\Z(xr/\yr),  zr  = xr\/yr  work  for  the  mapping  (a,  b,  c)  i-a  (00, 01,11) 
of  class  II  as  well  as  for  the  mapping  (a,  b,  c)  i-a  (00, 01, 1*)  of  class  IVC.  [This  operation 
is  equivalent  to  saturating  addition,  when  a = 0,  b = 1,  and  c stands  for  “more  than  1.”] 

(b)  The  symmetry  between  a,  b,  and  c implies  that  we  need  only  try  classes  I, 
IVa,  and  Va;  and  those  classes  turn  out  to  cost  6,  7,  and  8.  One  winner  for  class  I,  with 
(a,  b,  c)  HA  (00, 01, 10),  is  zi  = vr  A ui,  zr  =■  vi  A ur,  where  ui  = xi  © yi,  ur  = xr  © yr, 
vl  = yr  ®ui , and  vr  = yi  (Bur.  [See  exercise  7.1.2-60,  which  gives  the  same  answer  but 
with  zi  O zr.  The  reason  is  that  we  have  (x  + y + z)  mod  3 = 0 in  this  problem  but 
(x  + y — z)  mod  3 = 0 in  that  one;  and  zi  zr  is  equivalent  to  negation.  The  binary 
operation  z = x o y in  this  case  can  also  be  characterized  by  the  fact  that  the  elements 
{x,  y,  z)  are  all  the  same  or  all  different;  thus  it  is  familiar  to  people  who  play  the  game 
of  SET.  It  is  the  only  binary  operation  on  n-element  sets  that  has  n!  automorphisms 
and  differs  from  the  trivial  examples  x o y = x or  x o y = y.\ 

(c)  Cost  3 is  achieved  only  with  class  I:  Let  (a,  b,  c)  i-a  (00,01,10)  and  z;  = 
(xi  V xr)  A yi , zr  ~ xr  A yr. 

137.  In  fact,  z = (x  + 1)  & y when  (a,b,  c)  H > (00,01, 10).  [It’s  a contrived  example.] 

138.  The  simplest  case  known  to  the  author  requires  the  calculation  of  two  binary 
operations,  such  as 

/ a b b \ / a b a \ 

(abb)  and  (aba); 

\ c a a J \ c a c / 

each  has  cost  2 in  class  Va,  but  the  costs  are  (3,2)  and  (2,3)  in  classes  I and  II. 

139.  The  calculation  of  z2  is  essentially  equivalent  to  exercise  136(b);  so  the  natural 
representation  (m)  wins.  Fortunately  this  representation  also  is  good  for  zi,  with 
zi;  = xi  A yi,  zir  = xr  A yr. 
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140.  With  representation  (m),  first  use  full  binary  adders  to  compute  (0100)2  = 

xi  + yi  + zi  and  (6160)2  = xr  + yr  + zr  in  5 + 5 = 10  steps.  Now  the  “greedy  footprint” 
method  shows  how  to  compute  the  four  desired  functions  of  (ai,  ao,  bi,  60)  in  eight 
further  steps.  u\  — Hi  A 60,  ur  — do  A 615  t\  — ai  0 60,  62  — do  0 61,  to  = di  0 62, 

t-4  = ao  0 ti,  vi  = t3  A ti,  vr  = 64  A 62-  [Is  this  method  optimum?] 

141.  Suppose  we’ve  computed  bits  a = o0ai  . . . a2m-i  and  6 = 606i  . . . b2m_i  such  that 

fls  = [s  = 1 or  s = 2 or  s is  a sum  of  distinct  Ulam  numbers  < m in  exactly  one  way], 
6S  = [s  is  a sum  of  distinct  Ulam  numbers  < m in  more  than  one  way], 

for  some  integer  m = U„  > 2.  For  example,  when  m = n = 2 we  have  a = 0111  and 

6 = 0000.  Then  {s  | s < m and  as  = 1}  = {Ui, . . . , Un}\  and  Un+ 1 = min{s  | s > m 

and  as  = 1}.  (Notice  that  a3  = 1 when  s = f7n_i  + Un.)  The  following  simple  bitwise 
operations  preserve  these  conditions:  n •<—  n + 1,  m «—  Un,  and 

\djn  . . . U2m  — 1 ,6 m • • ' 62 m—  1 ) dm  • • • d2m  — 1 0 do  . . . dm — 1 ) & 6,n  . . . 62m—  1 , 

( dm  • • * d2m  — 1 HO  • ■ - dm  — l ) j bm  • • • b2m  — l ) , 

where  as  = 6S  = 0 for  2Un-i  < s < 2 Un  on  the  right  side  of  this  assignment. 

[See  M.  C.  Wunderlich,  BIT  11  (1971),  217-224;  Computers  in  Number  Theory 
(1971),  249-257.  These  mysterious  numbers,  which  were  first  defined  by  S.  Ulam  in 
SIAM  Review  6 (1964),  348,  have  baffled  number  theorists  for  many  years.  The  ratio 
Un/n  appears  to  converge  to  a constant,  « 13.52;  for  example,  t/20oooooo  = 270371127 
and  U40000000  — 540752349.  Furthermore,  D.  W.  Wilson  has  observed  empirically  that 
the  numbers  form  quasi-periodic  “clusters”  whose  centers  differ  by  multiples  of  another 
constant,  « 21.6016.  Calculations  by  Jud  McCranie  and  the  author  for  Un  < 640000000 
indicate  that  the  largest  gap  U„  — Un-i  may  occur  between  U24576523  = 332250401  and 
U2 4576524  = 332251032;  the  smallest  gap  Un  — Un-\  = 1 apparently  occurs  only  when 
U„  € {2,3,4,48}.  Certain  small  gaps  like  6,  11,  14,  and  16  have  never  been  observed.] 

142.  Algorithm  E in  that  exercise  performs  the  following  operations  on  subcubes: 
(i)  Count  the  *s  in  a given  subcube  c.  (ii)  Given  c and  c',  test  if  c C c'.  (iii)  Given 
c and  c',  compute  c U c'  (if  it  exists).  Operation  (i)  is  simple  with  sideways  addition; 
let’s  see  which  of  the  nine  classes  of  two-bit  encodings  (119),  (123),  (124)  works  best 
for  (ii)  and  (iii).  Suppose  a = 0,  6 = 1,  c = *;  the  symmetry  between  0 and  1 means 
that  we  need  only  examine  classes  I,  III,  IV„,  IVC,  Va,  and  Vc. 

For  the  asterisks-and-bits  mapping  (0,1,*)  1-4  (00,01,10),  which  belongs  to 
class  I,  the  truth  table  for  c 2 c is  010*100*110*****  in  each  component.  (For  example, 
0 C * and  * 2 1-  The  *s  in  this  truth  table  are  don’t-cares  for  the  unused  codes  11.) 
The  methods  of  Section  7.1.2  tell  us  that  the  cheapest  such  functions  have  cost  3; 
for  example,  c C c if  and  only  if  ((6  0 b’)  \ d)  & a = 0.  Furthermore  the  consensus 
c U c'  = c"  exists  if  and  only  if  vz  = 1,  where  z = (l06')&  ~(a  © a').  And  in  that 
case,  d"  = (a  0 6 0 6')  & ~(a  0 a'),  b"  = (6  | 6')  & z.  [The  asterisk  and  bit  codes  were 
used  for  this  purpose  by  M.  A.  Breuer  in  Proc.  ACM  Nat.  Conf.  23  (1968),  241-250.] 

But  class  III  works  out  better,  with  (0, 1,  *)  1-4  (01, 10, 00).  Then  c C c if  and  only 
if  (cj&c])  | (cr&c{)  = 0;  cUc'  = c"  exists  if  and  only  if  vz  = 1 where  z = x&iy,  x = c,  | c[, 
y = cr  | c{;  and  c"  = x © 2,  c " = y © 2.  We  save  two  operations  for  each  consensus, 
with  respect  to  class  I,  compensating  for  an  extra  step  when  counting  asterisks. 

Classes  IVa,  Va,  and  Vc  turn  out  to  be  far  inferior.  Class  IVC  has  some  merit, 
but  class  III  is  best. 
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143.  f(x)  = ((a;&mi)<C17)  | ((x2>17)&toi)  | ((^&m2)<S15)  | ((a;2>15)&m2)  | ((i&m3)c 
10)  | ((x  > 10)  & m3)  | (( x &m4)«6)|((i»6)&  m4),  where  m1  = #7f  7f7f  7f  7f  7f , 
m2  = #fefefefefefe,  m3  = #3f 3f 3f 3f 3f 3f 3f , m4  = #f cf cf cf cf cf cf c . [See,  for 
example,  Chess  Skill  in  Man  and  Machine,  edited  by  Peter  W.  Frey  (1977),  page  59. 
Five  steps  suffice  to  compute  f(x)  on  MHIX  (four  MOR  operations  and  one  OR),  since 
f(x)  = q-  x ■ q | q - x ■ q with  q = #40a05028140a0502  and  q = #2010884422110804 .] 

144.  Node  j ® (k  -C  1),  where  k = j & —j. 

145.  It  names  the  ancestor  of  the  leaf  node  j | 1 at  height  h. 

146.  By  (136)  we  want  to  show  that  A (j  & — i)  = pi  when  l — 2pl  < i < l < j < l + 2pl . 
The  desired  result  follows  from  (35)  because  —l<—i<—l  + 2pl . 

147.  (a)  7 rvj  = (3vj  = j,  avj  = 1 -C  pj,  and  rj  = A,  for  1 < j < n. 

(b)  Suppose  n = 2ei  H \-2et  where  ej  > • • • > et  > 0,  and  let  nk  = 2ei  H \-2ek 

for  0 < k <t.  Then  nvj  = j and  j3vj  = avj  = nk  for  nk- 1 < j < nk.  Also  rnk  = vnk  l 

for  1 < k < t,  where  vo  = A;  all  other  rj  = A. 

148.  Yes,  if7ri/i  = 010000,  ny2  = 010100,  ttxj  = 010101,  nx2  = 010110,  nx3  = 010111, 
/3x3  = 010111,  f3y2  = 010100,  /3x2  = 011000,  /3yi  = 010000,  and  /3x4  = 100000. 

149.  We  assume  that  CHILD(u)  = SIB(u)  = PARENT(u)  = A initially  for  all  vertices  v 
(including  v = A),  and  that  there  is  at  least  one  nonnull  vertex. 

51.  [Make  triply  linked  tree.]  For  each  of  the  n arcs  u — >v  (perhaps  v = A),  set 
SIB(u)  4—  CHILD(u),  CHILD(v)  <—  u,  PARENTGi)  <—  v.  (See  exercise  2. 3. 3-6.) 

52.  [Begin  first  traversal.]  Set  p -f—  CHILD(A) , nf-  0,  and  A0  < 1. 

53.  [Compute  /?  in  the  easy  case.]  Set  n «—  n + 1,  7rp  t—  n,  rn  t—  A,  and 

An  f-  1 + A(r!>l).  IfCHILD(p)  ^ A,  set  p <—  CHILD (p)  and  repeat  this  step; 

otherwise  set  (ip  «—  n. 

54.  [Compute  r,  bottom-up.]  Set  r/3p  <—  PARENT (p) . Then  if  SIB(p)  ^ A,  set 
p <—  SIB  (p)  and  return  to  S3;  otherwise  set  p PARENT  (p) . 

55.  [Compute  (3  in  the  hard  case.]  If  p / A,  set  h t—  A (n  & -7r p),  then  (3p  <- 
((n  h)  | 1)  h,  and  go  back  to  S4. 

56.  [Begin  second  traversal.]  Set  p <-  CHILD  (A) , A0  t—  An,  «—  (3 A t—  qA  <—  0. 

57.  [Compute  a,  top-down.]  Set  ap  a(PARENT(p) ) | ((3p  & ~/3p).  Then  if 
CHILD  (p)  / A,  set  p <r-  CHILD  (p)  and  repeat  this  step. 

58.  [Continue  to  traverse.]  If  SIB(p)  / A,  set  p <—  SIB(p)  and  go  to  S7. 
Otherwise  set  p <—  PARENT  (p) , and  repeat  step  S8  if  p / A.  | 

150.  We  may  assume  that  the  elements  Aj  are  distinct,  by  regarding  them  as  ordered 
pairs  ( Aj,j ).  The  hinted  binary  search  tree,  which  is  a special  case  of  the  “Cartesian 
trees”  introduced  by  Jean  Vuillemin  [CACM  23  (1980),  229-239],  has  the  property  that 
k(i,j)  is  the  nearest  common  ancestor  of  i and  j.  Indeed,  the  ancestors  of  any  given 
node  j are  precisely  the  nodes  k such  that  Ak  is  a right-to-left  minimum  of  Ai  . . . Aj 
or  Ak  is  a left-to-right  minimum  of  Aj  . . . An . 

The  algorithm  of  the  preceding  answer  does  the  desired  preprocessing,  except 
that  we  need  to  set  up  a triply  linked  tree  differently  on  the  nodes  (0, 1, . . . , n}.  Start 
as  before  with  CHILD(u)  = SIB(u)  = PARENT (v)  = 0 for  0 < v < n,  and  let  A = 0. 
Assume  that  Aq  < Aj  for  1 < j < n.  Set  t <—  0 and  do  the  following  steps  for  v = n, 
n — 1,  . . . , 1:  Set  u «-  0;  then  while  Av  < At  set  u <—  t and  t 4-  PARENT (t) . If  u / 0, 
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set  SIB(«)  <—  SIB(u),  SIB(u)  <—  0,  PARENT  (u)  <—  v,  CHILD  (v)  t—  u;  otherwise  simply 
set  SIB(v)  i-  CHILD  (t).  Also  set  CHILD  (t)  <-  v,  PARENT  (v)  < - t,  t 4- v. 

Continue  with  step  S2  after  the  tree  has  been  built.  The  running  time  is  0(n), 
because  the  operation  t <—  PARENT (f)  is  performed  at  most  once  for  each  node  t.  [This 
beautiful  way  to  reduce  the  range  minimum  query  problem  to  the  nearest  common 
ancestor  problem  was  discovered  by  H.  N.  Gabow,  J.  L.  Bentley,  and  R.  E.  Tarjan, 

STOC  16  (1984),  137-138,  who  also  suggested  the  following  exercise.] 

151.  For  node  v with  k children  u\,  ...,  Uk,  define  the  node  sequence  S(v)  = v if 
k = 0;  S(v)  = vS(ui)  if  k = 1;  and  S{v)  = S(ui)v  . . . vS(uk)  if  k > 1.  (Consequently 
v appears  exactly  max(fc—  1, 1)  times  in  S(v).)  If  there  are  k trees  in  the  forest,  rooted  at 
ui,  . . . , Uk,  write  down  the  node  sequence  S(u i)  A . . . A S(uk)  = Vi  . . . Vjv.  (The  length 
of  this  sequence  will  satisfy  n < N < 2 n.)  Let  Aj  be  the  depth  of  node  Vj,  for  1 < 
j < N,  where  A has  depth  0.  (For  example,  consider  the  forest  (141),  but  add  another 
child  K — >•  D and  an  isolated  node  L.  Then  V) . . . V15  = CFAGJDHDKABEIAL 
and  Ai  . . . A 15  = 231342323012301.)  The  nearest  common  ancestor  of  u and  v,  when 
u = Vi  and  v = Vj,  is  then  Vk^j)  in  the  range  minimum  query  problem.  [See  J.  Fischer 
and  V.  Heun,  Lecture  Notes  in  Comp.  Sci.  4009  (2006),  36-48.] 

152.  Step  VI  finds  the  level  above  which  ax  and  ay  have  bits  that  apply  to  both  of 
their  ancestors.  (See  exercise  148.)  Step  V2  increases  h,  if  necessary,  to  the  level  where 
they  have  a common  ancestor,  or  to  the  top  level  An  if  they  don’t  (namely  if  k — 0). 

If  fix  / f3z,  step  V4  finds  the  topmost  level  among  x's  ancestors  that  leads  to  level  h ; 
hence  it  knows  the  lowest  ancestor  x for  which  fix  = f3z  (or  x — A).  Finally  in  V5, 
preorder  tells  us  which  of  x or  y is  an  ancestor  of  the  other. 

153.  That  pointer  has  pj  bits,  so  it  ends  after  pi  + p2  + • • • + pj  = j — vj  bits  of  the 
packed  string,  by  (61).  [Here  j is  even.  Navigation  piles  were  introduced  in  Nordic 
Journal  of  Computing  10  (2003),  238-262.] 

154.  The  gray  lines  define  36o-36°-90°  triangles,  ten  of  which  make  a pentagon  with 
72°  angles  at  each  vertex.  These  pentagons  tile  the  hyperbolic  plane  in  such  a way 
that  five  of  them  meet  at  each  vertex. 

155.  Observe  first  that  0 < (a0)i/^  < + <j>~3  + </>~5  + • • • =1,  since  there  are  no 

consecutive  Is.  Observe  next  that  F-n<j>  = 4>~n  (modulo  1),  by  exercise  1.2.8-11.  Now 
add  Fkx  4>-\ \-Fkr4>.  For  example,  (40)  mod  1 = 0~5  + 0-2;  (—20)  mod  1 = 0-4+0~1. 

This  argument  also  proves  the  interesting  formula  |7V(a)0J  = —N(a0). 

156.  (a)  Start  with  y <—  0,  and  with  k large  enough  that  |*|  < Fk+i ■ If  x < 0,  set 

fc  ■< — (fc  — 1)  | 1,  and  while  x + Fk  > 0 set  k «-  k - 2;  then  set  y <-  y + (1  < k), 
x <-  x + Fk+ 1;  repeat.  Otherwise  if  x > 1,  set  k +-  k & -2,  and  while  x - Fk  < 0 set 
k <—  k — 2;  then  set  y 4—  y + (1  k),  x <r-  x — Fk+i\  repeat.  Otherwise  set  y <—  y + x 

and  terminate  with  y = (0)2- 

(b)  The  operations  xi  <—  01,  y\  «—  —a  1,  xk  yk-i  + afc,  J/fc  <—  xk- 1 — Xk 
compute  Xk  = N(ai  . . .ak)  and  yk  = N(ai  . . . a^O).  [Does  every  broadword  chain  for 
N(ai  . . . an)  require  Q(n)  steps?]  I- 

157.  The  laws  are  obvious  except  for  the  two  cases  involving  (a—).  For  those  we  have 
N((a—)0k)=  N((Mk)  + F—k—2  for  all  k > 0,  because  decrementation  never  “borrows” 
at  the  right.  (But  the  analogous  formula  77((a+)0*)=  Al(Q0fc)+  F-k-\  does  not  hold.) 

158.  Incrementation  satisfies  the  rules  (a00)+  = aOl,  (al0)+  = (a+)00,  (al)+  = 

(a+)0.  It  can  be  achieved  with  six  2-adic  operations  on  the  integer  x = (0)2  by  setting 
y 4-  x | (x  » 1),  2 t-  y & ~(r/  + 1),  x «-  (x  | z)  + 1. 
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Decrementation  of  a nonzero  codeword  is  more  difficult.  It  satisfies  (al02fe)  — = 
a0(10),c,  (al02fc+1)  — = a(01)fc+1;  hence  by  Corollary  I it  cannot  be  computed  by  a 
2-adic  chain.  Yet  six  operations  suffice,  if  we  allow  monus:  y 4—  x — 1,  z 4—  y & x, 
w 4—  z & p o,  x<— y — w + (w  — (z  — w)). 

159.  Besides  the  Fibonacci  number  system  (146)  and  the  negaFibonacci  number  sys- 
tem (147),  there’s  also  an  odd  Fibonacci  number  system : Every  positive  integer  x can 
be  written  uniquely  in  the  form 


x = Fix  + Fh  + • • • + Fib  , where  l\  >-?-  h »-  • • • »-  ls  > 0 and  la  is  odd. 


Given  a negaFibonacci  code  a,  the  following  20-step  2-adic  chain  converts  x = (a)2  to 
V = (0)2  to  z = (7)2,  where  0 is  the  odd  codeword  with  N(a)  = F(0)  and  7 is  the 
standard  codeword  with  F(0)  = F( 7O):  x+  4—  x & p0,  x~  4-  x © x+-  d 4—  x+  - x~ ; 
t 4-  d | x~,  t 4-  t & ~(t  < 1);  y 4-  (d  & Ao)  © t ® ((t  & x~)  » 1);  2 4-  (y  + 1)  » 1; 
w 4—  z 0 (4/io);  (4-irfe  ~(u>+l);  z(-z©(t&(2®  ((w+1)  S>  1))). 

Corresponding  negaFibonacci  and  odd  representations  satisfy  the  remarkable  law 

Fk-i+m  + • ■ • + Fkr+m  = (— l)m(F;1_m  + • • ■ + Fia-m),  for  all  integers  m. 


For  example,  if  N(q)  < 0 the  steps  above  will  convert  x = (a0)2  to  y = (/3)2,  where 
F((0^>  2)0)  = —N(a).  Furthermore  /3  is  the  odd  code  for  negaFibonacci  a if  and  only 
if  aR  is  the  odd  code  for  negaFibonacci  0R,  when  |a|  = \0\  is  odd  and  N(a)  > 0. 

No  finite  2-adic  chain  will  go  the  other  way,  by  Corollary  I,  because  the  Fibonacci 
code  10fe  corresponds  to  negaFibonacci  10fc+1  when  k is  odd,  (10)fe/2l  when  k is  even. 
But  if  7 is  a standard  Fibonacci  codeword  we  can  compute  y = (/3)2  from  z = (7)2  by 
setting  y 4-  z < 1,  t 4-  2/  & (y  - 1)  & £0,  y *-  y - t + [t/0]((t  - 1)  & p0).  And  then 
the  method  above  will  compute  aR  from  (3R . The  overall  running  time  for  conversion 
to  negaFibonacci  form  will  then  be  of  order  log  |7|,  for  two  string  reversals. 

160.  The  text’s  rules  are  actually  incomplete:  They  should  also  define  the  orientation 
of  each  neighbor.  Let  us  stipulate  that  asn  — or,  aen  = a;  (a0)wn  = aO,  (al)MO  = al; 
(a00)ns  = aOO,  (al0)„«,  = alO,  (al)„e  = al;  (a0)oo  = aO,  (al01)oo  = alOl, 
(alOOl)oo  = alOOl,  (aOOOl)  ow  = aOOOl.  Then  a case  analysis  proves  that  all  cells 
within  d steps  of  the  starting  cell  have  a consistent  labeling  and  orientation,  by  induc- 
tion on  the  graph  distance  d.  (Note  the  identity  a+  = ((aO)  — ) S>  1.)  Furthermore  the 
labeling  remains  consistent  when  we  attach  y coordinates  and  move  when  necessary 
from  one  strip  to  another  via  the  (5-rules  of  (153). 


161.  Yes,  it  is  bipartite,  because  all  of  its  edges  are  defined  by  the  set  of  boundary 
lines.  (The  hyperbolic  cylinder  cannot  be  bicolored;  but  two  adjacent  strips  can. 

162.  It’s  convenient  to  view  the  hyperbolic  plane  through  another  lens, 
by  mapping  its  points  to  the  upper  halfplane  Of z > 0.  Then  the  “straight 
lines”  become  semicircles  centered  on  the  x-axis,  together  with  vertical 
halflines  as  a limiting  case.  In  this  representation,  the  edges  \z  — 1|  = y/2, 

\z\  = r,  and  5Rz  = 0 define  a 36°-45°-90°  triangle  if  r2  = tf>  + \[$.  Every 
triangle  ABC  has  three  neighbors  CBA' , ACB' , and  BAC' , obtained 

by  “reflecting”  two  of  its  edges  about  the  third,  where  the  reflection  of  ' A' 

\z-c'\  = r'  about  \z  - c|  = r is  \z  - c - |(xi  + *2)|  = ||xi  - x2\,  Xj  = r2/(c'  ±r'  - c). 

The  mapping  zh>  (2-  zo)/(z  — zo ) takes  the  upper  halfplane  into  the  unit  circle; 
when  zo  = |(\/^—  l/<f))(l  + 5 x^i)  the  central  pentagon  will  be  symmetric.  Repeated 
reflections  of  the  initial  triangle,  using  breadth-first  search  until  reaching  triangles  that 
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are  invisible,  will  lead  to  Fig.  14.  To  get  just  the  pentagons  (without  the  gray  lines), 
one  can  begin  with  just  the  central  cell  and  perform  reflections  about  its  edges,  etc. 

163.  (This  figure  can  be  drawn  as  in  exercise  162,  starting  with  vertices  that  project  to 
the  three  points  ir,  iru,  and  ireo2 , where  r2  = |(1  + V2)(4  — \/2  — \/6)  and  u j = e2nl^3 . 

Using  a notation  devised  by  L.  Schlafli  in  1852,  it  can  be  described  sis  the  infinite  tiling 
with  parameters  {3,8},  meaning  that  eight  triangles  meet  at  every  vertex;  see  Schlafli’s 
Gesammelte  Mathematische  Abhandlungen  1 (1950),  212.  Similarly,  the  pentagrid  and 
the  tiling  of  exercise  154  have  Schlafli  symbols  {5,4}  and  {5,5},  respectively.) 

164.  The  original  definition  requires  more  computation,  even  though  it  can  be  factored: 

custer' (X)  = X & ~(UN  & Y & Fs),  Y = Xw  & X & XE. 

But  the  main  reason  for  preferring  (157)  is  that  it  produces  a thinner,  kingwise  con- 
nected border.  The  rookwise  connected  border  that  results  from  the  1957  definition  is 
less  attractive,  because  it’s  noticeably  darker  when  the  border  travels  diagonally  than 
when  it  travels  horizontally  or  vertically.  (Try  some  experiments  and  you’ll  see.) 

165.  The  first  image  X*1^  is  the  “outer”  border  of  the  original  black  pixels.  Fingerprint- 
like whorls  are  formed  thereafter.  For  example,  starting  with  Fig.  15(a)  we  get 

A A A ■■  1 

in  a 120  x 120  bitmap,  eventually  alternating  endlessly  between  two  bizarre  patterns. 

(Does  every  nonempty  M x N bitmap  lead  to  such  a 2-cycle?) 

166.  IfX  = custer(X),  the  sum  of  the  elements  of  X+(X^1)  + (X<S1)  + (X3>1)-|-(XV1) 
is  at  most  4 MN  + 2 M + 2 N,  since  it  is  at  most  4 in  each  cell  of  the  rectangle  and  at 
most  1 in  the  adjacent  cells.  This  sum  is  also  five  times  the  number  of  black  pixels. 

Hence  f(M,N)  < | MN  + | M + |AL  Conversely  we  get  f(M,N)  > | MN  — | by 
letting  the  pixel  in  row  i and  column  j be  black  unless  ( i + 2 j)  mod  5 = 2.  (This 
problem  is  equivalent  to  finding  a minimum  dominating  set  of  the  M x N grid.) 

167.  (a)  With  17  steps  we  can  construct  a half  adder  and  three  full  adders  (see  7.1.2- 

(23))  s0  that  (2122)2  = Inw  + a:w  + £sw,  (2324)2  = £n  + Xs , (2525)2  = Ine  + xE  + Xsej  re 

and  (2723)2  = 22  + 24  + 26.  Then  / = Si (21, 23, 25, 27)  A (x  V 2s),  where  the  symmetric 
function  Si  needs  seven  operations  by  Fig.  9 in  Section  7.1.2.  [This  solution  is  based 
on  ideas  of  W.  F.  Mann  and  D.  Sleator.] 

(b)  Given  = XjA , x = xjA  and  x+  = XjA , compute  a <—  x~  &:  x+  (=  23), 
b <—  x~  ®x+  (=  24),  c t—  x(Bb,  d <—  c3>  1 (=  26),  c t—  c<l  (=  22),  e t—  c®d,  c <—  c&:d, 

/t-6&e,  / t— /|c(=  27),  e <r-  b ® e (=  2g),  c <-  x k,  b,  c -f-  c | 0,  b <-  c -C  1 (=  25), 
c <—  c 1 (=  21),  6&c,  c *r-  fe  | c,  a&/,  f <—  a|/,  / t—  d \ f,  c f-  b \ c, 

f <-  / ®C  (=  Si(2i,23,25,27)),  e <-  e\  x,  f 4-  f ice. 

[For  excellent  summaries  of  the  joys  and  passions  of  Life,  including  a proof  that 
any  Turing  machine  can  be  simulated,  see  Martin  Gardner,  Wheels,  Life  and  Other 
Mathematical  Amusements  (1983),  Chapters  20-22;  E.  R.  Berlekamp,  J.  H.  Conway, 
and  R.  K.  Guy,  Winning  Ways  4 (A.  K.  Peters,  2004),  Chapter  25.] 
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At  last  I've  got  what  I wanted  — an  apparently  unpredictable  law  of  genetics. 

■ . . Overpopulation,  like  underpopulation,  tends  to  kill. 
A healthy  society  is  neither  too  dense  nor  too  sparse. 
JOHN  H.  CONWAY,  letter  to  Martin  Gardner  (March  1970) 
168.  The  following  algorithm,  which  uses  four  n-bit  registers  x~,  x,  x+,  and  y,  works 
properly  even  when  M = 1 or  N = 1.  It  needs  only  about  two  reads  and  two’  writes 
per  raster  word  to  transform  X(t)  to  X((+1)  in  (158): 

Cl.  [Loop  on  fc.]  Do  step  C2  for  k = 1,2,...,  N' ■ then  go  to  C5. 

C2.  [Loop  on  j.\  Set  x -f-  A(M_1)k,  x+  «-  Aok , and  AMk  x+ . Then  perform 
steps  C3  and  C4  for  j = 0,  1,  M - 1. 


C3.  [Move  down.]  Set  x <—  x,  x <—  x+ , and  x+  <-  A(j+1)k.  (Now  x = Ajk,  and 
x holds  the  former  value  of  Compute  the  bitwise  function  values 

y 4-  f(x~  » l,x~,x~  < l,x»  l,x,x«  l,x+  > l,x+,x+  « 1). 

C4.  [Update  Ajk.]  Set  x~  <-  AHk_t)  & -2,  y «-  y k (2"-1  - 1),  t— 

x + (y»  (n  - 2)),  Ajk  A-  y + (x~  « (n  - 2)). 

C5.  [Wrap  around.]  For  0 < j < M,  set  x A-  AjN,  k -2n~1-d,  AjN,  x + 
{Aji  3>d),  and  Aj  1 •*—  Aji  + (x  d ),  where  d = 1 + (N  — 1)  mod  ( n — 2).  | 

[An  M x N torus  is  equivalent  to  an  (M  — 1)  x (N  — 1)  array  surrounded  by  zeros, 
in  many  cases  like  (157)  and  (159)  and  even  (161).  For  exercise  173  we  can  clean  an 
( M ~ 2)  x (N  — 2)  array  that  is  bordered  by  two  rows  and  columns  of  zeros.  But  Life 
images  (exercise  167)  can  grow  without  bound;  they  can’t  safely  be  confined  to  a torus.] 


169.  It  quickly  morphs  into  a rabbit,  which  proceeds  to  explode.  Beginning  at  time 
278,  all  activity  stabilizes  to  a two-cycle  formed  from  a set  of  traffic  lights  and  three 
additional  blinkers,  together  with  three  still  lifes  (tub,  boat,  and  bee  hive). 


170.  If  M > 2 and  N > 2,  the  first  step  blanks  out  the  top  row  and  the  rightmost 
column.  Then  if  M > 3 and  N > 3,  the  next  step  blanks  out  the  bottom  row  and  the 
leftmost  column.  So  in  general  we’re  left  after  t = min (M,N)  - 1 steps  with  a single 
row  or  column  of  black  pixels:  The  first  ft/ 2]  rows,  the  last  [f/2]  columns,  the  last 
[f/2j  rows,  and  the  first  [£/2j  columns  have  been  set  to  zero.  The  automaton  will  stop 
after  making  two  more  (nonproductive)  cycles. 

171.  Without  (160):  xi  «-  xSE  k xN,  x2  «-  xN  k xSE,  x3  «-  xE  k xu  x4  +-  xNE  k x2, 
X5  t-  X3  | X4,  X6  <-  XW  k XS,  X7  X!  k xNE,  x8  <-  x7  k xNW,  xg  <-  xB  \ xsw, 
X10  xs  k Xg,  Xu  <-  x10  I xe,  x12  <-  xs  k xn,  x13  < - x2  k xE,  x14  <-  x13  k xw, 
x15  <-  xNkxNE,  x16  <—  xswkxw,  x17  <-  xls  \ x16,  xi8  t-  xNE&xsw,  xw  4—  x17kx18, 

X20  <—  XE  | Xse  1 X21  <—  x20  I XS,  X22  <—  XNW  k X21,  x23  X22  k Xig , X24  <—  X\2  | Xi4, 

9 *23  | *24.  With  (160),  set  X4  <—  %E  k xN  and  leave  everything  else  the  same. 


172.  The  statement  isn  t quite  true;  consider  the  following  examples: 


The  ‘I  and  ‘H’  at  the  left  show  that  pixels  are  sometimes  left  intact  where  paths  join, 
and  that  rotating  by  90°  can  make  a difference.  The  next  two  examples  illustrate 
a quirky  influence  of  left-right  reflection.  The  diamond  example  demonstrates  that 
very  thick  images  can  be  unthinnable;  none  of  its  black  pixels  can  be  removed  without 
changing  the  number  of  holes.  The  final  examples,  one  of  which  was  inspired  by  the 
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answer  to  exercise  166,  were  processed  first  without  (160),  in  which  case  they  are 
unchanged  by  the  transformation.  But  with  (160)  they’re  thinned  dramatically. 

173.  (a)  The  hint  is  readily  verified.  Notice  that  if  X and  Y are  closed,  X & Y is  closed; 
if  X and  Y are  open,  X | Y is  open.  Thus  XD  is  closed  and  XL  is  open;  XDD  = XD 
and  XLL  = XL.  (In  fact  we  have  XL  = ~ ( ~ A') D , because  the  definitions  are  dual, 
obtained  by  swapping  black  with  white.)  Now  XDL  C XD , so  XDLD  C XDD  = XD . 
And  dually,  XL  C XLDL . We  conclude  that  there’s  no  reason  to  launder  a clean 
picture:  XDLDL  = ( XDLD)L  C XDL  C (XD)LDL  = XDLDL . 

(b)  We  have  XD  = (X  \ Xw  | XNW  | ATN)&(X  | AiN  | XNE  | XE)&(X  | ATE  | A:se  | ATS)& 
(A|XS  | XSw  | Aw).  Furthermore,  in  analogy  with  answer  167(b),  this  function  can  be 
computed  from  x~ , x,  and  x+  in  ten  broadword  steps:  / «—  x | (j:>1)  | ((x~  | (a:~3>l))& 
( x+  | ( x+  3>  1))),  /■(—/&(/  1).  [This  answer  incorporates  ideas  of  D.  R.  Fuchs.] 

To  get  XL,  just  interchange  | and  &.  [For  further  discussion,  see  C.  Van  Wyk 
and  D.  E.  Knuth,  Report  STAN-CS- 79-707  (Stanford  Univ.,  1979),  15-36.] 

174.  Three-dimensional  digital  topology  has  been  studied  by  R.  Malgouyres,  Theoret- 
ical Computer  Science  186  (1997),  1-41. 

175.  There  are  25  in  the  outline,  2 + 3 in  the  eyes,  1 + 1 in  the  ears,  4 in  the  nose,  and 
1 in  the  smile,  totalling  37.  (All  white  pixels  are  connected  kingwise  to  the  background.) 

176.  (a)  If  v isn’t  isolated,  there  are  eight  easy  cases  to  consider,  depending  on  what 
kind  of  neighbor  v has  in  G. 

(b)  There’s  a vertex  of  G'  adjacent  to  each  vertex  of  (Nu  U Nv)  \ G' . (Four  cases.) 

(c)  Yes.  In  fact,  by  definition  (161),  we  always  have  |S'(t/)|  > 2. 

(d)  Let  N'v,  = {»|«'£  Nv}.  If  v'  is  the  east  neighbor  of  v! , call  it  u'E,  either 
u € G or  u's  G G;  this  element  is  equal-or-adjacent  to  every  vertex  of  N'u,  U N'v,.  A 
similar  argument  applies  when  v'  = u'N.  If  v'  = «nE,  there’s  no  problem  if  u G G. 
Otherwise  u'w  G G,  u's  G G,  and  either  u'N  G G or  u'E  G G;  hence  N'u,  CN'vi  is  connected 
in  G.  Finally  if  v'  = UsE,  the  proof  is  easy  if  its  E G ; otherwise  it'  € G and  v'  G G. 

(e)  Given  a nontrivial  component  C of  G,  with  v G C and  v'  G S( u),  let  C'  be  the 
component  of  G'  that  contains  v' . This  component  C'  is  well  defined,  by  (a)  and  (b). 
Given  a component  C'  of  G',  with  v'  G G'  and  v G S'  (v1),  let  C be  the  component  of 
G that  contains  v.  This  component  C is  nontrivial  and  well  defined,  by  (c)  and  (d). 
Finally,  the  correspondence  C 44  C'  is  one-to-one. 

177.  Now  the  vertices  of  G are  the  white  pixels,  adjacent  when  they  are  rook- neighbors. 

So  we  define  N^j)  = Arguments  like  those  of  answer  176, 

but  simpler,  establish  a one-to-one  correspondence  between  the  nontrivial  components 
of  G and  the  components  of  G' . 

178.  Observe  that  in  adjacent  rows  of  X* , two  pixels  of  the  same  value  are  kingwise 
neighbors  only  if  they  are  rookwise  connected. 

179.  The  pixels  x\...xn  of  each  row  can  be  “runlength  encoded”  as  a sequence  of 
integers  0 = c0  < ci  < • • • < C2m+i  = N + 2 so  that  Xj  = 0 for  j G [co  . . ci)  U [c2  . . C3)  U 
• • • U [c2m  • • C2m+i)  and  Xj  = 1 for  j G [ci  . . C2)  U • • • U [c2m-i  • • C2m).  (The  number  of 
runs  per  row  tends  to  be  reasonably  small  in  most  images.  Notice  that  the  background 
condition  xo  = xjv+i  = 0 is  implicitly  assumed.) 

The  algorithm  below  uses  a modified  encoding  with  a,  = 2c,  — ( j mod  2)  for 
0 < j < 2m+ 1.  For  example,  the  second  row  of  the  Cheshire  cat  has  (ci,  C2,  C3,  C4,  C5)  = 
(5, 8,  23, 25, 32);  we  will  use  (ai,  02,  £13,  04,  as)  = (9, 16, 45, 50, 63)  instead.  The  reason  is 
that  white  runs  of  adjacent  rows  are  rookwise  adjacent  if  and  only  if  the  corresponding 
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intervals  [a,-  . . Oj+i)  and  [bk  ■ • bk+i)  overlap,  and  exactly  the  same  condition  charac- 
terizes when  black  runs  of  adjacent  rows  are  kingwise  adjacent.  Thus  the  modified 
encoding  nicely  unifies  both  cases  (see  exercise  178). 

We  construct  a triply  linked  tree  of  current  components,  where  each  node  has 
several  fields:  CHILD,  SIB,  and  PARENT  (tree  links);  DORMANT  (a  circular  list,  via  SIB 
links,  of  all  former  children  that  aren’t  connected  to  the  current  row);  HEIR  (a  node 
that  has  absorbed  this  one);  ROW  and  COL  (location  of  the  first  pixel);  and  AREA  (the 
total  number  of  pixels  in  the  component). 

The  algorithm  traverses  the  tree  in  double  order  (see  exercise  2.3.1-18),  using 
pairs  of  pointers  (P,  P'),  where  P'  = P when  P is  traversed  the  first  time,  P'  = PARENT  (P) 
when  P is  traversed  the  second  time.  The  successor  of  (P,P')  is  (Q,  Q')  = next(P.P'), 
determined  as  follows:  If  P = P'  and  CHILD  (P)  / A,  then  Q 4-  Q'  4-  CHILD (P);  otherwise 
Q ■«—  P and  Q'  4—  PARENT© . If  P / P'  and  SIB(P)  ^ A,  then  Q 4—  Q'  4—  SIB(P); 
otherwise  Q 4-  PARENT  (P)  and  Q'  4-  PARENT (Q) . 

When  there  are  m black  runs,  the  tree  will  have  m-\- 1 nodes,  not  counting  nodes 
that  are  dormant  or  have  been  absorbed.  Moreover,  the  primed  pointers  Pj,  . . . , p'2m+1 
of  the  double  traversal  (Pi, Pi),  . (P2m+i,P2m+i)  are  precisely  the  components  of 
the  current  row,  in  left-to-right  order.  For  example,  in  (163)  we  have  m = 5;  and 
(Pi,...,Pli)  point  respectively  to©,  ©.  (£),  ©,  ©,  © ©.  0,  ©,  0,  ©. 

11.  [Initialize.]  Set  t 4-  1,  ROOT  4-  L0C(N0DE(0) ) , CHILD(ROOT)  4-  SIB(ROOT)  4- 
PARENT(ROOT)  4-  DORMANT  (ROOT)  4-  HEIR(ROOT)  4-  A;  also  ROW(ROOT)  4- 
COL(ROOT)  4-  0,  AREA  (ROOT)  4-  N + 2,  s 4-  0,  a0  4-  b0  4-  0,  ax  4-  2N  + 3. 

12.  [Input  a new  row.]  Terminate  if  s > M.  Otherwise  set  bk  4-  ak  for  k = 1,2, 

• ■ • , until  bk  = 2N +3;  then  set  bk+i  4—  bk  as  a “stopper.”  Set  s 4—  s+1.  If  s > 
M,  set  ai  4—  2N  + 3;  otherwise  let  01,  . . . , a2m+ 1 be  the  modified  runlength 
encoding  of  row  s as  discussed  above.  (This  encoding  can  be  obtained  with 
the  help  of  the  p function;  see  (43).)  Set  j 4-  k 4-  1 and  P 4-  P'  4-  ROOT. 

13.  [Gobble  up  short  fe’s.]  If  f>fc+1  > aj:  go  to  19.  Otherwise  set  (Q,  Q')  4- 
next(P,P'),  (R,  R')  4—  next(Q,  Q'),  and  do  a four-way  branch  to  (14,15,16,17) 
according  as  2[Q^Q']  + [R/R']  = (0, 1,2,3). 

14.  [Case  0.]  (Now  Q = Q'  is  a child  of  P',  and  R = R'  is  the  first  child  of  Q'.  Node  Q 
will  remain  a child  of  P\  but  it  will  be  preceded  by  any  children  of  R.)  Absorb 
R intoP'  (see  below).  Set  CHILD (Q)  4-  SIB(R)  and  Q' 4- CHILD(R) . If  q'  / A, 
set  R 4-  Q',  and  while  R ^ A set  PARENT (R)  4-  P',  R'  4-  R,  R 4-  SIB(R);  then 
SIB(R)  4 Q,  Q 4—  Q\  Set  CHILD(P)  4-Q  if  P = P',  SIB(P)  4—  Q if  P ^P'.  Go  to  18. 

15.  [Case  1.]  (Now  component  Q = R is  surrounded  by  P'  = R'.)  If  P = p',  set 
CHILD (P ) 4-  SIB(Q) ; otherwise  set  SIB(P)  4-  SIB(q).  Set  R 4-  DORMANT (R' ) . 
Then  if  R = A,  set  DORMANT (R( ) 4-  SIB(Q)  4-  Q;  otherwise  SIB(Q)  4-  SIB(R) 
and  SIB(R)  4 — Q.  Go  to  18. 

16.  [Case  2.]  (Now  Q'  is  the  parent  of  both  P'  and  R.  Either  P = P'  is  childless,  or 
P is  the  last  child  of  P'.)  Absorb  R into  P'  (see  below).  Set  SIB(P')  4-  SIB(R) 
and  R 4-  CHILD (R) . If  P = P',  set  CHILD(P)  4-  R;  otherwise  SIB(P)  4-  R. 
While  R / A,  set  PARENT (R)  4-  P'  and  R 4-  SIB(R).  Go  to  18. 

17.  [Case  3.]  (Node  P'  = Q is  the  last  child  of  Q'  = R,  which  is  a child  of  R'.) 
Absorb  P into  R (see  below).  If  P = P',  set  P 4—  R.  Otherwise  set  4— 
CHILD (P'),  and  while  P'  ^ A set  PARENT (P')  4-  R',  P'  4-  SIB(P');  also 
set  SIB(P)  4-  SIB(Q')  and  SIB(Q')  4-  CHILD© . If  Q = CHILD (R) , set 
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CHILD (E)  4-  A.  Otherwise  set  R 4-  CHILD (R) , then  R 4-  SIB(R)  until 
SIB(R)  = Q,  then  SIB(R)  4—  A.  Finally  set  P'  4—  R'. 

18.  [Advance  k.]  Set  k 4—  k + 2 and  return  to  step  13. 

19.  [Update  the  area.]  Set  AREA(P')  4—  AREA(P')  + \a,j/ 2]  — [ttj-i/2] . Then  go 
back  to  12  if  aj  = 2 N + 3. 

110.  [Gobble  up  short  a.]  Ifaj+i  > bk,  go  to  111.  Otherwise  set  Q 4—  L0C(N0DE(t) ) 
and  (4-1+1.  Set  PARENT (Q)  4-  P',  DORMANT(Q)  4-  HEIR(Q)  4-  A;  also 
ROW(Q)  4- s,  COL(Q)  4-  \a,j/2\ , AREA(Q)  4-  \aj+1/2\  - \a3 /2\.  IfP  = P',  set 
SIB(Q)  4—  CHILD (P)  and  CHILD (P)  4—  Q;  otherwise  set  SIB CQ)  4—  SIB(P)  and 
SIB(P)  4—  Q.  Finally  set  P 4—  Q,  j 4—  j + 2,  and  return  to  13. 

111.  [Move  on.]  Set  j 4—  j + 1,  k 4—  k + 1,  (P,P')  4—  next(P,P'),  and  go  to  13.  | 

To  “absorb  P into  Q’’  means  to  do  the  following  things:  If  (ROW(P) , COL(P) ) is  less 
than  (ROW(Q) , COL(Q)  ),  set  (ROW(Q) , COL(Q) ) 4-  (ROW(P)  , COL(P)  ).  Set  AREA(Q)  4- 
AREA(P)  + AREA(Q)  . If  DORMANT  (Q)  = A,  set  DORMANT (Q)  4-  DORMANT  (P)  ; otherwise  if 
DORMANT  (P)  jt  A,  swap  SIB(DORMANT(P)  ) 4-4  SIB  (DORMANT  (Q)  ) . Finally,  set  HEIR  (P)  4- 
Q.  (The  HEIR  links  could  be  used  on  a second  pass  to  identify  the  final  component  of 
each  pixel.  Notice  that  the  PARENT  links  of  dormant  nodes  are  not  kept  up  to  date.) 

[A  similar  algorithm  was  given  by  R.  K.  Lutz  in  Comp.  J.  23  (1980),  262-269.] 

180.  Let  F(x,  y)  = x2  — y2  + 13  and  Q(x,  y)  = F(x  — \,y  — §)  =x2  — y2— x + y + 13. 
Apply  Algorithm  T to  digitize  the  hyperbola  from  (£,  y)  = (—6, 7)  to  (£',  y')  = (0,  -s/l3) ; 
hence  x = —6,  y = 7,  x'  = 0,  y = 4.  The  resulting  edges  are  (—6,7)  — (—5,7)  — 

(—5,  6)  — (—4, 6)  — (—4,  5)  — (—3, 5)  — (—3, 4) (0, 4).  Then  apply  it  again 

with  £ = 0,  y = \/l3,  £'  = 6,  rj  = 7,  x = 0,  y = 4,  x'  = 6,  y'  = 7;  the  same  edges  are 
found  (in  reverse  order),  but  with  negated  x coordinates. 

181.  Subdivide  at  points  (^, 77)  where  Fx(£,i 7)  = 0 or  Fy(£,y)  = 0,  namely  at  the  real 
roots  of  {Q(  — (by  + d)/(2a),y  + |)  = 0,  £ = —(by  + d)/(2a)  — 1}  or  the  real  roots  of 
{Qit  + b ~(bZ  + e)/(2c))  = V ~ ~(bt,  + e)/(2c)  - §},  if  they  exist. 

182.  By  induction  on  \x'  — x\  + \y'  — y\.  Consider,  for  example,  the  case  x > x' 

and  y < y . We  know  from  (iii)  that  (£,??)  lies  in  the  box  x — | <C<a:+|  and 
y — | < V < y + b and  from  (ii)  that  the  curve  travels  inonotonically  as  it  moves  from 
(£,  y)  to  (^',  y').  It  must  therefore  exit  the  box  at  the  edge  (x—  — — (a:—  y+ 1) 

ov  (x  - ^,y  + b — (x  + | , y + \ ) . The  latter  holds  if  and  only  if  F(x  — | , y + | ) < 0, 
because  the  curve  can’t  intersect  that  edge  twice  when  x'  < x.  And  F(x  — |,y+|)is 
the  value  Q(x,  y + 1)  that  is  tested  in  step  T4,  because  of  the  initialization  in  step  Tl. 
(We  assume  that  the  curve  doesn’t  go  exactly  through  (x  — \,y  + §),  by  implicitly 
adding  a tiny  positive  amount  to  the  function  F behind  the  scenes.) 

183.  Consider,  for  example,  the  ellipse  defined  by  F(x  — |,y  — |)  = Q(x,y)  = 13x2  + 
7 xy  + y2  — 2 = 0;  this  ellipse  is  a cigar-shaped  curve  that  extends  roughly  between 
(—2,  5)  and  (1,  —6).  Suppose  we  want  to  digitize  its  upper  right  boundary.  Hypotheses 
(i)-(iv)  of  Algorithm  T hold  with 


x = 1,  y = —6,  x'  = —2,  y'  = 5.  Step  Tl  sets  Q 4—  <3(1,  —5)  = 1,  which  causes  step  T4 
to  move  left  (L);  in  fact,  the  resulting  path  is  L3U11,  while  the  correct  digitization 
according  to  (164)  is  U3LU4LU3LU.  Failure  occurred  because  Q(x,y)  = 0 has  two 
roots  on  the  edge  (1,  —5)  — (2,  —5),  namely  ((35  ± —y/29)/26,  —5),  causing  Q(l,  —5) 
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to  have  the  same  sign  as  Q( 2,  —5).  (One  of  those  roots  is  on  the  boundary  we  are  not 
trying  to  draw,  but  it’s  still  there.)  Similar  failure  occurs  with  the  parabola  defined 
by  Q(x,y)  = 9x2  +6xy  + y2  - y = 0,  £ = -5/12,  r,  = -1/4,  £'  = -5/2,  r,'  = -19/2, 
x = 0,  y = 0,  x'  = -2,  j/'  = 9.  Hyperbolas  can  fail  too  (consider  6a:2  + 5 xy  + y2  = 1). 

Algorithms  for  discrete  geometry  are  notoriously  delicate;  unusual  cases  tend  to 
drive  them  berserk.  Algorithm  T works  properly  for  portions  of  any  ellipse  or  parabola 
whose  maximum  curvature  is  less  than  2.  The  maximum  curvature  of  an  ellipse  with 
semiaxes  a > /3  is  a//32;  the  cigar-shaped  example  has  maximum  curvature  « 42.5. 
The  maximum  curvature  of  the  parabola  y = ax 2 is  a/2\  the  anomalous  parabola  above 
has  maximum  curvature  ss  5.27.  “Reasonable”  conics  don’t  make  such  sharp  turns. 

To  make  Algorithm  T work  correctly  without  hypothesis  (v),  we  need  to  slow  it 
down  a bit,  by  changing  the  tests  ‘Q  < 0’  to  ‘Q  < 0 or  X’,  where  X is  a test  on  the 
sign  of  a derivative.  Namely,  X is  respectively  ‘S  > c\  ' R > o’,  ‘R  < -o’,  ‘S  < -c’  in 
steps  T2,  T3,  T4,  T5. 

184.  Let  Q'(x,y)  = -1  - Q(x,y).  The  key  point  is  that  Q(x,y ) < 0 if  and  only  if 
Q (xiV)  zf  0.  (Curiously  the  algorithm  makes  the  same  decisions,  backwards,  although 
it  probes  the  values  of  Q'  and  Q in  different  places.) 

185.  Find  a positive  integer  h so  that  d = (r/  — ?/ )h  and  e = { £/  — /)/:  are  integers  and 
d + e is  even.  Then  carry  out  Algorithm  T with  x = [£  + jJ,  V = [v+  5J,  x'  = [£'  + A J , 
V = W + \\,  and  Q(x,y)  = d(x  - ±)  + e(y  - ±)  + /,  where 

/ = L(r/  £ — £ v)h\  — [d  > 0 and  (t/£  — £,'ri)h  is  an  integer]. 

(The  ‘d  > 0’  term  ensures  that  the  opposite  straight  line,  from  (£',  7 /)  back  to  (£,  77),  will 
have  precisely  the  same  edges;  see  exercise  183.)  Steps  T1  and  T6  -T9  become  much 
simpler  than  they  were  in  the  general  case,  because  R = d and  5 = e are  constant. 

(F.  G.  Stockton  [CACM  6 (1963),  161,  450]  and  J.  E.  Bresenham  [IBM  Systems 
Journal  4 (1965),  25—30]  gave  similar  algorithms,  but  with  diagonal  edges  permitted.) 

186.  (a)  B(e)  = z0  + 2e(Zl  - z0)  + 0(e2);  B(  1 - e)  = z2  - 2 e(z2  - Zl)  + 0(e2). 

(b)  Every  point  of  S(zo,zi,Z2)  is  a convex  combination  of  z0,  z 1,  and  z2- 

(c)  Obviously  true,  since  (1  - t)2  + 2(1  - t)t  + t2  = 1. 

(d)  The  collinear  condition  follows  from  (b).  Otherwise,  by  (c),  we  need  only 
consider  the  case  zo  = 0 and  — 2zi  = 1,  where  Zi  = ®i  + iy\  and  7/1  ^ 0.  In  that 
case  all  points  lie  on  the  parabola  4x  = (y/yi)2  + 4yx\/y\. 

(e)  Note  that  B(u0)  = (l-u)2 z0  + 2u(l-u)((l-6)z0+8z1)+u2 B(6)  for  0 < u < 1. 
[S.  N.  Bernshtein  introduced  Bn(z0,  zlt . . . , z„\  t)  = Q)(l  -t )n~ktkzk  in 

Soobshcheniia  Khar’kovskoe  matematicheskoe  obshchestvo  (2)  13  (1912),  1-2.] 

187.  We  can  assume  that  zo  = (xo,yo),  z 1 = (x\,yi),  and  z2  = (x2,j/2),  where  the 
coordinates  are  (say)  fixed-point  numbers  represented  as  16-bit  integers  divided  by  32. 

If  zo,  zi,  and  z2  are  collinear,  use  the  method  of  exercise  185  to  draw  a straight 
line  from  zo  to  z2.  (If  zi  doesn’t  lie  between  zo  and  z2,  the  other  edges  will  cancel  out, 
because  edges  are  implicitly  XORed  by  a filling  algorithm.)  This  case  occurs  if  and 
only  if  D = xoyi  + x\% y2  + x2yo  — x\y0  — x2yi  — Xo y2  = 0. 

Otherwise  the  points  (x,y)  of  5(z0,zi,z2)  satisfy  F(x,y)  = 0,  where 

F(x,  y)  = ((x  - x0  )(y2  - 2Vl  + y0)  - (y  - y0)(x2  - 2Xl  + Xo)f 

- 4D((xi  - x0 )(y  - 7/o ) - (7/1  - y0)(x  - x0)) 
and  D is  defined  above.  We  multiply  by  324  to  obtain  integer  coefficients;  then  negate 
this  formula  and  subtract  1,  if  D < 0,  to  satisfy  condition  (iv)  of  Algorithm  T and  the 
reverse-order  condition.  (See  exercise  184.) 
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The  monotonicity  condition  (ii)  holds  if  and  only  if  ( xi  — xq)(x2  — Xi)  > 0 and 
(2/i  — yo)(y2  — y i)  > 0.  If  necessary,  we  can  use  the  recurrence  of  exercise  186(e) 
to  break  S(zo,  z\,  z-i)  into  at  most  three  monotonic  subsquines;  for  example,  setting 
9 = (xo  — xi)/(xo  — 2xi  + X2)  will  achieve  monotonicity  in  x.  (A  slight  rounding  error 
may  occur  during  this  fixed  point  arithmetic,  but  the  recurrence  can  be  performed  in 
such  a way  that  the  subsquines  are  definitely  monotonic.) 

Notes:  When  zo , z 1 , and  22  are  near  each  other,  a simpler  and  faster  method  based 
on  exercise  186(e)  with  9 = I is  adequate  for  most  practical  purposes,  if  one  doesn’t 
care  about  making  the  exactly  correct  choice  between  local  edge  sequences  like  “up- 
then-left”  versus  “left-then-up.”  In  the  late  1980s,  Sampo  Kaasila  chose  to  use  squines 
as  the  basic  method  of  shape  specification  in  the  TrueType  font  format,  because  they 
can  be  digitized  so  rapidly.  The  METAFONT  system  achieves  greater  flexibility  with 
cubic  Bezier  splines  [see  D.  E.  Knuth,  METRFONT:  The  Program  (Addison-Wesley, 
1986)],  but  at  the  cost  of  extra  processing  time.  A fairly  fast  “six-register  algorithm” 
for  the  resulting  cubic  curves  was,  however,  developed  subsequently  by  John  Hobby 
[ACM  Trans,  on  Graphics  9 (1990),  262-277].  Vaughan  Pratt  introduced  conic  splines, 
which  are  sort  of  midway  between  squines  and  Bezier  cubics,  in  Computer  Graphics 
19,3  (July  1985),  151-159.  Conic  spline  segments  can  be  elliptical  and  hyperbolic  as 
well  as  parabolic,  hence  they  require  fewer  intermediate  points  and  control  points  than 
squines;  furthermore,  they  can  be  handled  by  Algorithm  T. 

188.  The  following  big-endian  program  assumes  that  n < 74880. 


LOC 

Data_Segment 

LD0 

k,  Initk 

BITMAP 

LOC 

®+M*N/8 

OH 

SET 

s,N/64 

base 

GREG 

1H 

SET 

a,h 

A trick  (see  below) 

GRAYMAP 

LOC 

<3+M*N/64 

SET 

r,8 

GTAB 

BYTE 

255,252,249,246,243 

2H 

LD0U 

t ,base ,k 

BYTE 

240,236,233,230,227 

M0R 

u,cl  ,t 

BYTE 

224,221,217,214,211 

SUBU 

t ,t  ,u 

(Nypwise  sums) 

BYTE 

208,204,201,198,194 

M0R 

u,c2,t 

BYTE 

191,188,184,181,178 

AND 

t ,t ,mul 

BYTE 

174,171,167,164,160 

ADDU 

t ,t  ,u 

(Nybblewise  sums) 

BYTE 

157,153,150,146,142 

M0R 

u, c3  ,t 

BYTE 

139,135,131,128,124 

AND 

t , t , mu2 

BYTE 

120,116,112,108,104 

ADDU 

t,t,u 

(Bytewise  sums) 

BYTE 

100,96,92,88,84 

ADDU 

a,a,t 

BYTE 

79,75,70,66,61 

INCL 

k,N/8 

Move  to  next  row. 

BYTE 

56,52,46,41,36 

SUB 

r ,r , 1 

BYTE 

30,24,18,10,0 

PBNZ 

r,2B 

Repeat  8 times. 

Initk 

OCTA 

BITMAP-GRAYMAP 

3H 

SRU 

t ,a,56 

corr 

GREG 

N-8 

LDBU 

t ,gtab,t 

cl 

GREG 

#4000100004000100 

SLU 

a, a, 8 

c2 

GREG 

#2010000002010000 

STBU 

t ,z  ,0 

c3 

GREG 

#0804020100000000 

INCL 

Z,1 

mul 

GREG 

#3333333333333333 

PBN 

a,3B 

(The  trick) 

mu2 

GREG 

#0f0f0f0f0f0f0f0f 

SUB 

k,k,corr 

h 

GREG 

#8080808080808080 

SUB 

s,s,l 

gtab 

GREG 

GTAB-#80 

PBNZ 

s , IB 

Loop  on  columns. 

LOC 

#100 

INCL 

k,7*N/8 

Loop  on  groups 

MakeGray 

LDA 

z, GRAYMAP 

PBN 

k,0B 

of  8 rows.  | 
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[Inspired  by  Neil  Hunt’s  DVIPAGE,  the  author  used  such  graymaps  extensively 
when  preparing  new  editions  of  The  Art  of  Computer  Programming  in  1992-1998.] 

189.  If  the  rows  of  the  bitmap  are  (Jfo,  Jfi, . . . . X&3 ) . do  the  following  operations  for 
k = 0,  1,  . . . , 5:  For  all  i such  that  0 < i < 64  and  i & 2k  = 0,  let  j = i + 2k  and  either 

(a)  set  t 4—  ( Xi  © (Xj  2fc))  & p&,k,  Xi  4—  Xi  © t,  Xj  4—  Xj  © (t  -C  2*);  or  (b)  set 
t 4-  Xikfi6,k,  u 4-  Xj&rpe,k,  Xi  4-  ({Xi  < 2k)  & g6,k)  \ u,  Xj  4-  ((Xj»2k)kpe,k)\t. 

[The  basic  idea  is  to  transform  2k  x 2k  submatrices  for  increasing  k.  as  in  exercise 
5-12.  Speedups  are  possible  with  MMIX,  using  MOR  and  MUX  as  in  exercise  208,  and 
using  LDTU/STTU  when  k = 5.  See  L.  J.  Guibas  and  J.  Stolfi,  ACM  Transactions  on 
Graphics  1 (1982),  204-207;  M.  Thorup,  J.  Algorithms  42  (2002),  217.  Incidentally, 
Theorem  P and  answer  54  show  that  Q(nlogn)  operations  on  n-bit  numbers  are  needed 
to  transpose  an  n x n bit  matrix.  An  application  that  needs  frequent  transpositions 
might  therefore  be  better  off  using  a redundant  representation,  maintaining  its  matrices 
in  both  normal  and  transposed  form.] 

190.  (a)  We  must  have  aj+1  = f(aj)  © a3_j  for  j > 1,  where  a0  = 0...0  and 
f(a ) = ((a«l)&l...l)ffia®(a»l).  The  elements  of  the  bottom  row  am  satisfy 
the  parity  condition  if  and  only  if  this  rule  makes  om+1  entirely  zero. 

(b)  True.  The  parity  condition  on  matrix  entries  a{j  is  atJ  = a(i_ ^ © 

i ( j—  l ) © ^(i+i)j,  where  ciij  — 0iff  = 0orf  = m©lorji  = 0 or  j = n 1 . If  two 

matrices  (atj)  and  (bjj)  satisfy  this  condition,  so  does  (dj)  when  Cij  = atj  © btj. 

(c)  The  upper  left  submatrix  consisting  of  all  rows  that  precede  the  first  all-zero 
row  (if  any)  and  all  columns  that  precede  the  first  all-zero  column  (if  any)  is  perfect. 
And  this  submatrix  determines  the  entire  matrix,  because  the  pattern  on  the  other  side 
of  a row  or  column  of  zeros  is  the  top/bottom  or  left/right  reflection  of  its  neighbor. 
For  example,  if  am/+1  is  zero,  then  am/+1 +j  = am/  + w for  1 < j < m . 

(d)  Starting  with  a given  vector  ai  and  using  the  rule  in  (a)  will  always  lead  to 
a row  with  am+i  = 0 ...  0.  Proof:  We  must  have  (aj,aj+ 1)  = (ak,ak+1)  for  some  0 < 
j < k < 22n,  by  the  pigeonhole  principle.  If  j > 0 we  also  have  (a3_i,  a,)  = (afc_  i,ak), 
because  aij-i  = f(aj)  © a^+r  = f(ak)  © ak+i  = ak- 1-  Therefore  the  first  repeated 
pair  begins  with  a row  ak  of  zeros.  Furthermore  we  have  o,  = ak-i  for  0 < i < k: 
hence  the  first  all-zero  row  am+i  occurs  when  m is  k — 1 or  k/2  — 1. 

Rows  ai , . . . , am  will  form  a perfect  pattern  unless  there  is  a column  of  0s.  There 
are  t > 0 such  columns  if  and  only  if  t + 1 is  a divisor  of  n + 1 and  ai  has  the  form 
a0aR0  . . . 0 a (t  even)  or  a0afl0  . . . 0c*fl  ( t odd),  where  |a|  + 1 = (n  + l)/(t  + 1). 

(e)  This  starting  vector  does  not  have  the  form  forbidden  in  (d). 

191.  (a)  The  former  is  c*i,  a2,  . . . if  and  only  if  the  latter  is  OouOaf , 0a20 aR, 

(b)  Let  the  binary  string  ao<xi  . . . cin-i  correspond  to  the  polynomial  ao  + ai*  + 
+ ajv- ix  and  let  y = x 1 + l+x.  Then  cro  = 0...0  corresponds  to  Fo(y); 

ai  = 10... 0 corresponds  to  F\(y );  and  by  induction  oij  corresponds  to  Fj(y),  mod 
xN  + 1 and  mod  2.  For  example,  when  IV  = 6 we  have  a2  = 110001  44  1 + x + x5 
because  a:-1  mod  ( x6  + 1)  = x5,  etc. 

(c)  Again,  induction  on  j. 

(d)  The  identity  in  the  hint  holds  by  induction  on  m,  because  it  is  clearly  true 
when  m = 1 and  m = 2.  Working  mod  2,  this  identity  yields  the  simple  equations 

F2k(y)  = yFk(y)2;  F2k-i(y)  = (Fk-i(y)  + Fk(y))2 . 

So  we  can  go  from  the  pair  Pk  = ( Fk-j(y ) mod  (xN+l),Fk(y)  mod  (i^+l))  to  the  pair 
Pk+i  in  0(n)  steps,  and  to  the  pair  P2k  in  0(n2)  steps.  We  can  therefore  compute 
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Fj (y)  mod  (xN  + 1)  after  O(logj)  iterations.  Multiplying  by  fa( x)  + fa(x  *)  and 
reducing  mod  xN  + 1 then  allows  us  to  read  off  the  value  of  a2. 

Incidentally,  Fn+ i(x)  is  the  special  case  Kn( x,  x, . . . , x)  of  a continuant  polyno- 
mial; see  Eq.  4.5.3-(4).  We  have  Fn+i(x)  = Y<k=o  = i~nU„(ix/ 2),  where 

Un  is  the  classical  Chebyshev  polynomial  defined  by  f/n(cos0)  = sin((n  + l)0)/sin0. 

192.  (a)  By  exercise  191(c),  c(q)  is  the  least  j > 0 such  that  ( x+x~1)Fj(x~1+l+x ) = 0 
(modulo  x2q  + 1),  using  polynomial  arithmetic  mod  2.  Equivalently,  it’s  the  smallest 
positive  j for  which  Fj(y)  is  a multiple  of  ( x2q  + l)/(x2  + 1)  = (1  + x + • • • + xq~1)2 , 
when  y = x~x  + l+x. 

(b)  Use  the  method  of  exercise  191(d)  to  evaluate  ((x  + x~1)Fj(y))  mod  ( x2q  + 1) 
when  j = M/p , for  all  prime  divisors  p of  M.  If  the  result  is  zero,  set  M <—  M / p and 
repeat  the  process.  If  no  such  result  is  zero,  c(q)  = M. 

(c)  We  want  to  show  that  c(2e)  is  a divisor  of  3 • 2C“1  but  not  of  3 • 2e~2  or 
2e_1.  The  latter  holds  because  F2e-i(y)  = y 2<=  1-1  is  relatively  prime  to  x2°+1  + 1.  The 
former  holds  because 


=-1(2/)  = y 


1F3(y)2 


- y 


l(i  + yf 


l(x  1+x)2 


which  is  = 0 modulo  x2  +1  but  not  modulo  x2‘+ 2 + 1. 

(d)  F2e_i(y)  = V2' ’ ~2k ■ Since  y = x_1(l+a;+a:2)  is  relatively  prime  to  a:?  + l, 

we  have  y~'  = clq  + aix + aq-\xq~1  (modulo  xq  + 1)  for  some  coefficients  cn;  hence 


2k  2k 

y =a0  + aix 


■ + a„-ix 


: ao  + a\X 


■■  + aq.1x2k+e^=y~2k+C 


(modulo  xq  + 1)  for  0 < k < e,  and  it  follows  that  F22e_1(y)  is  a multiple  of  x2q  + 1. 

(e)  In  this  case  c(q)  divides  4(22e  — 1).  Proof:  Let  xq  + 1 = f\(x)f2{x)  . . . fr(x) 
where  fi(x)  = x + 1,  fi(x)  = x2  + x + 1,  and  each  fi(x)  is  irreducible  mod  2.  Since 
q is  odd,  these  factors  are  distinct.  Therefore,  in  the  finite  field  of  polynomials  mod 
fj(x)  for  j > 3,  we  have  y~2k  = y~2k+r  as  in  (d).  Consequently  F22e_1(y)  is  a multiple 
of  f3{x) . . . fr(x)  = (xq  + 1)/(:e3  + 1).  So  F2(22e_1-)(y)  = yF22e_1{y)2  is  a multiple  of 
{x2q  + l)/(x2  + 1)  as  desired. 

(f)  If  Fc(,)(j/)  is  a multiple  of  x2q  + \,  it’s  easy  to  see  that  c(2q)  = 2c(q).  Otherwise 
F3c(q)(y)  is  a multiple  of  F3(y)  = (1  + y)2  = a;_2(l  + a;)4;  hence  F6cM(y)  is  a multiple 
of  xA’i  + 1 and  c(2 q)  divides  6 c(q).  The  latter  case  can  happen  only  when  q is  odd. 

Notes:  Parity  patterns  are  related  to  a popular  puzzle  called  “Lights  Out,”  which 
was  invented  in  the  early  1980s  by  Dario  Uri,  also  invented  independently  about  the 
same  time  by  Laszlo  Mero  and  called  XL. 25 . [See  David  Singmaster’s  Cubic  Circular, 
issues  7&8  (Summer  1985),  39-42;  Dieter  Gebhardt,  Cubism  For  Fun  69  (March 
2006),  23-25.]  Klaus  Sutner  has  pursued  further  aspects  of  this  theory  in  Theoretical 
Computer  Science  230  (2000),  49-73. 

193.  Let  6( 2t)(2j)  = <iij,  6(2i+i)(22)  = lij  © 5(2i)(2j'+i)  = © ctjy+i),  and 

b(2i+i)(2j+i)  = 0,  for  0 < i < m and  0 < j < n,  where  we  regard  dij  = 0 when  i = 0 
or  i = m + 1 or  j = 0 or  j = n + 1.  We  don’t  have  {b{2i)1, 6(2i)2, . . . , b(2i)(2n+i))  = 
(0,0,...,  0)  because  (an, . . . ,di„)  ^ (0, . . . , 0)  for  1 < i < m.  And  we  don’t  have 
(&(2t+i)i,b(2t+i)2,  • ■ • ,b(2i+i)(2n+i))  = (0, 0, . . . , 0)  because  adjacent  rows  (a  il , . . . , Clin  ) 
and  (a(i+1)!, . . . , a(i+1)n)  always  differ  for  0 < i < m when  m is  odd. 

194.  Set  Pi  t-  (1  -C  (n— *))  | (1  (i— 1))  for  1 < i < m,  where  m = [n/2].  Also  set 
7 i t—  (Pi  & Qiii)  + (P2  & otii ) + f (pm  & aim),  where  aij  is  the  jth  row  of  the  parity 


id 


r 


? 


re 


ZZI 
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pattern  that  begins  with  /3<;  vector  7 4 records  the  diagonal  elements  of  such  a matrix. 
Then  set  r «—  0 and  apply  subroutine  N of  answer  195  for  i «—  1,  2,  . . . , m.  The  resulting 
vectors  9i , . . . , 9r  are  a basis  for  all  n x n parity  patterns  with  8-fold  symmetry. 

To  test  if  any  such  pattern  is  perfect,  let  the  pattern  starting  with  9i  first  be  zero 
in  row  Cj . If  any  ct  = n + 1,  the  answer  is  yes.  If  lcm(ci, . . . ,cr)  < n,  the  answer 
is  no.  If  neither  of  these  conditions  decides  the  matter,  we  can  resort  to  brute-force 
examination  of  2r  — 1 nonzero  linear  combinations  of  the  9 vectors. 

For  example,  when  n = 9 we  find  71  = 111101111,  72  = 73  = 010101010,  74  = 
000000000,  75  = 001010100;  then  r = 0,  9i  = 011000110  , 92  = 000101000,  Ci  = c2  = 5. 
So  there  is  no  perfect  solution. 

In  the  author’s  experiments  for  n < 3000,  “brute  force”  was  needed  only  when 
n = 1709.  Then  r = 21  and  the  values  of  Ci  were  all  equal  to  171  or  855  except  that 
c2i  = 342.  The  solution  9\  ® 621  was  found  immediately. 

The  answers  for  1 < n < 383  are  4,  5,  11,  16,  23,  29,  30,  32,  47,  59,  62,  64,  65, 
84,  95,  101,  119,  125,  126,  128,  131,  154,  164,  170,  185,  191,  203,  204,  239,  251,  254, 
256,  257,  263,  314,  329,  340,  341,  371,  383. 

[A  fractal  similar  to  Fig.  20,  called  the  “mikado  pattern,”  appears  in  a paper  by 
H.  Eriksson,  K.  Eriksson,  and  J.  Sjostrand,  Advances  in  Applied  Math.  27  (2001),  365. 
See  also  S.  Wolfram,  A New  Kind  of  Science  (2002),  rule  150R  on  page  439.] 

195.  Set  /3i  <—  1 (m  — i)  and  7 i <—  07  for  1 < i < m;  also  set  r •*—  0.  Then  perform 
the  following  subroutine  for  i = 1,  2,  . . . , m: 

Nl.  [Extract  low  bit.]  Set  x «-  7,  & — j,.  If  x = 0,  go  to  N4. 

N2.  [Find  j.]  Find  the  smallest  j > 1 such  that  77  & x / 0 and  7 j & (x  — 1)  = 0. 

N3.  [Dependent?]  If  j < i , set  7,  ^ 7,  ® 7, , /3,  <-  & ® f3j,  and  return  to  Nl. 
(These  operations  preserve  the  matrix  equation  C = BA.)  Otherwise  termi- 
nate the  subroutine  (because  7,;  is  linearly  independent  from  71,  . . . , 7i_i). 

N4.  [Record  a solution.]  Set  r <—  r + 1 and  9r  t—  /3,.  | 

At  the  conclusion,  the  m — r nonzero  vectors  7 ; are  a basis  for  the  vector  space  of  all 
linear  combinations  of  Qi,  . . . , am;  they’re  characterized  by  their  low  bits. 

196.  (a)  #0a;  #cea3;  #e7ae97;  #f09d8581. 

(b)  If  \x  = \x',  the  result  is  clear  because  1 = 1'.  Otherwise  we  have  either 
Qi  < a'i  or  (aq  = a\  and  a2  < a2);  the  latter  case  can  occur  only  when  x > 216. 

(c)  Set  j <—  k\  while  07  © #80  < #40,  set  j <—  j — 1.  Then  a(x^)  begins  with  07. 

197.  (a)  #000a;  #03a3;  #7b97;  #d834dd41. 

(b)  Lexicographic  order  is  not  preserved  when,  say,  x = #ffff  and  x'  = * 10000. 

(c)  To  answer  this  question  properly  one  needs  to  know  that  the  2048  integers 
in  the  range  #d800  < x < #e000  are  not  legal  codepoints  of  UCS;  they  are  called 
surrogates.  With  this  understanding,  /3(x(,))  begins  at  /3fc  if  f3k  © #dc00  > #0400, 
otherwise  it  begins  at  f3k- 1. 

198.  a = #e50000,  b = 3,  c = #16.  (We  could  let  b = 0,  but  then  a would  be 
huge.  This  trick  was  suggested  by  P.  Raynaud-Richard  in  1997.  The  stated  constants, 
suggested  by  R.  Pournader  in  2008,  are  the  smallest  possible.) 

199.  We  want  07  > #cl;  28ai  + a2  < #f490;  and  either  (07  & — 07)  + 07  < #100  or 
ol  1 + «2  > #17f . These  conditions  hold  if  and  only  if 

(#cl  - ai)& (28Q7  +a2  - #f 490) & (((c*i  & -07)  + cn  - #100)  | (#17f -cn -a2))  < 0. 
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Markus  Kuhn  suggests  adding  the  further  clause  (*20  — ((28ai  + 0:2)  © *edaO))’, 
to  ensure  that  aua2  doesn’t  begin  the  encoding  of  a surrogate. 

200.  If  $0  = (x7  . . .£12:0)256  then  $3  is  set  to  the  symmetric  function  S2(£7,  £4,  £2). 

201.  MOR  x,c,x,  where  c = #f  Of  Of  Of  OOfOfOfOf  . 

202.  MOR  x,x,c,  where  c = #c0c030300c0c0303;  then  MOR  x,mone,x.  (See  answer  209.) 

203.  a = # 0008000400020001 , b = #0f0f Of Of Of Of Of Of , c = #0606060606060606, 
d = * 0000002700000000 , e = #2a2a2a2a2a2a2a2a.  (The  ASCII  code  for  0 is  6 + #2a; 
the  ASCII  code  for  a is  6 + #2a  + 10  + *27.) 

204.  p = # 8008400420021001 , q = * 8020080240100401  (the  transpose  of  p),  r = 
#4080102004080102  (a  symmetric  matrix),  and  m = #aa55aa55aa55aa55 . 

205.  Shuffle,  but  with  p -H-  q,  r = #0804020180402010,  m = #f0f  Of  Of  OOfOfOfOf . 

206.  Just  change  p to  #0880044002200110 . (Incidentally,  these  shuffles  can  also  be 
defined  as  permutations  on  z = (z63  ■ ■ ■ 2120)2  in  another  way:  The  outshuffle  maps 
Zj  eA  Z(2j)  mod  63,  for  0 < j < 63,  while  the  inshuffle  maps  Zj  i->  z^j+i)  mod  65-) 

207.  Do  MOR  y ,p,x;  M0Ry,y,p;  M0Rt,y,q;  PUTrM.ml;  MUXy.y.t;  MORt.t.q; 
PUT  rM,m2;  MUX  y,y,t.  In  both  cases  p = *2004801002400801;  for  triple  zip, 
q = *4020100804020180 , mi  = *4949494949494949 , m2  = * dbdbdbdbdbdbdbdb ; for  the 
inverse,  q = *0402018040201008,  mi  =*0707070707070707,  m2  = *3f 3f3f3f 3f 3f 3f 3f . 

208.  (Solution  by  H.  S.  Warren,  Jr.)  The  text’s  7-swap,  14-swap,  28-swap  method  can 
be  implemented  with  only  12  instructions: 

MORt.x.cl;  MORt.cl.t;  PUTrM.ml;  MUXy.x.t; 

MOR  t ,y , c2;  M0Rt,c2,t;  PUTrM,m2;  MUXy.y.t; 

M0Rt,y,c3;  M0Rt,c3,t;  PUTrM,m3;  MUXy.y.t; 

here  ci  = *4080102004080102,  c2  = *2010804002010804,  c3  = *0804020180402010, 
m i = *aa55aa55aa55aa55 , m2  = *cccc3333cccc3333,  m3  = *f0f0f Of OOfOfOfOf . 

209.  Four  instructions  suffice:  MXORy.p.x;  MX0R  x , mone  , x ; MXORx.x.q;  XORx.x.y; 
here  p = *80c0e0f 0f8f cfef f = q,  and  register  mone  = —1. 

210.  SLU  x.one.x;  MOR  x,b,x;  AND  x,x,a;  M0Rx,x,#ff;  here  register  one  = 1. 

211.  In  general,  element  ij  of  the  Boolean  matrix  product  AXB  is  \J{xki  \ aikbij  = 1}. 
For  this  problem  we  choose  ant  = [i  D k\  and  bij  = [ZCj];  the  answer  is  ‘MOR  t,f,a; 
MOR  t.b.t’  where  a = *80c0a0f088ccaaff  and  b = *ff5533110f 050301  = aT. 

(Notice  that  this  trick  gives  a simple  test  [/  = /]  for  monotonicity.  Furthermore, 
the  64-bit  result  (^3  . . . tifo)2  gives  the  coefficients  of  the  multilinear  representation 

f{x  1,  . . . ,£6)  = (t63  + t62£6  + • • • + fl£l£2£3£4£5  + t0XlX2X3X4X5Xe)  mod  2, 

if  we  substitute  MX0R  for  MOR,  by  the  result  of  exercise  7.1.1-11.) 

212.  If  ■ denotes  MX0R  as  in  (183)  and  b — (/3t  • • • /3i/3o)256  has  bytes  f3j,  we  can  evaluate 

c = (a  ■ ) © ( (a  « 8)  • ( +B1q  ) ) © ( (a  « 16)  • ( B2L  +fl{/ ))  © • • • 0 ( (a  « 56)  • (B?  +B% ) ) , 

where  B f = (q/3j)  & m,  Bf  = (((qfa)  « 8)  + ft)  & m,  q = *0080402010080402,  and 
m = *7f3f  If  Of  07030100 . (Here  qft  denotes  ordinary  multiplication  of  integers.) 
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213.  In  this  big-endian  computation,  register  nn  holds  -n,  and  register  data  points 
to  the  octabyte  following  the  given  bytes  an~i  . ..aiQo  in  memory  (with  an_i  first). 
The  constants  aa  — *8381808080402010  and  bb  = *339bcf 6530180c06  correspond  to 
matrices  A and  B,  found  by  computing  the  remainders  xk  mod  p(x)  for  72  < k < 80. 


SET  c,0 
LD0U  t,data,nn 
ADD  im,im,8 
BZ  nn,2F 
1H  MX0R  u , aa , t 
MXOR  v,bb,t 
ADD  rni,nn,8 


c <-  0. 

t 4-  next  octa. 
n 4—  n — 8. 
Done  if  n = 0. 
u 4—  t ■ A. 
v <—  t ■ B. 
n <—  n — 8. 


LD0U  t,data,nn 


X0R 

SLU 

SRU 

X0R 

X0R 

PBN 


u, u,c 
c,v,56 

v, v,8 
u,u,v 
t ,t  ,u 
nn,  IB 


t 4—  next  octa. 
u u © c. 
c 4—  v <§;  56. 
iit-u>8. 
u 4-  U ® V. 
t 4 — t ® u. 
Repeat  if  n > 0. 


A similar  method  finishes  the  job,  with  no  auxiliary  table  needed: 


2H 

SET 

nn,8 

n 4—  8. 

SRU 

00 

> 

> 

3H 

AND 

x,t, ffooo 

x 4—  high  byte. 

X0R 

t ,t  , V 

MXOR 

u,aaa,x 

u 4—  x ■ A' . 

SUB 

rm,nn,  1 

MXOR 

v , bbb , x 

v 4—  x ■ B' . 

PBP 

nn,3B 

SLU 

00 

-p 

-p 

t 4 — t 8. 

X0R 

t , t , c 

X0R 

t,t,u 

t 4 — t 0 u. 

SRU 

crc,t,48 

CM:»8. 
t ^ — t ® v. 
n 4—  n — 1. 
Repeat  if  n > 0. 

t 4 — t © C. 

Return  t S>  48. 


Here  aaa  = *8381808080808080,  bbb  = #0383c363331b0f 05 , and  ffooo  = #ff00...00. 


The  Books  of  the  Big-Endians  have  been  long  forbidden. 
— LEMUEL  GULLIVER,  Travels  Into  Several  Remote  Nations  of  the  World  (1726) 

214.  By  considering  the  irreducible  factors  of  the  characteristic  polynomial  of  X, 

we  must  have  Xn  = I where  n = 23  ■ 32  ■ 5 • 7 • 17  • 31  • 127  = 168661080.  Neill 
Clift  has  shown  that  l(n  — 1)  = 33  and  found  the  following  sequence  of  33  MXOR 
instructions  to  compute  Y = X^1  =.Y"_1:  MX0Rt,x,x;  MX0R$l,t,x;  MX0R$2,t,$l; 
MXOR  $3,  $2,  $2;  MX0Rt,$3,$3;  S6;  MX0Rt,t,$2;  S3;  MX0R$l,t,$l;  MX0Rt,$l,$3; 
S13;  MXOR  t , t , $1 ; S;  MX0Ry,t,x;  here  S stands  for  ‘MXOR  To  test  if  X is 

nonsingular,  do  MXOR  t ,y  ,x  and  compare  t to  the  identity  matrix  *8040201008040201 . 

215.  SADD  $0,x,0;  SADD$l,x,a;  NEG$0,32,$0;  2ADDU  $1  ,$1 , $0;  SLU$0,b,$l;  then 
BN  $0,Yes;  here  a = *aaaaaaaaaaaaaaaa  and  b = *2492492492492492. 

216.  Start  with  Sk  4—  0 and  tk  <r-  —1  for  0 < k < m.  Then  do  the  following  for 

1 ^ m’  If  xk  7^  0 and  Xk  < 2m,  set  l Xx^  and  si  4-  si  + x^\  if  ti  <0  or 

ti  Xk-,  also  set  ti  x Finally,  set  y i — 1 and  k i — 0;  while  y > tk  and  k <C  m,  set 
V 1-  V + Sk  and  k 4-  k + 1.  Double-precision  n-bit  arithmetic  is  sufficient  for  y and  sk- 
[This  pleasant  algorithm  appeared  in  D.  Eppstein’s  blog,  2008.03.22.] 

217.  See  R.  D.  Cameron,  U.S.  Patent  7400271  (15  July  2008);  Proc.  ACM  Symp. 
Principles  and  Practice  of  Parallel  Programming  13  (2008),  91-98. 

218.  Let  b be  any  integer  with  b mod  8 = 5.  Then  x = bL(x)  mod  2d  for  some  integer 
L(x),  depending  on  b,  whenever  0 < x < 2d  and  x mod  4 = 1 (see  Section  3.2. 1.2).  The 
following  algorithm  computes  s = 4L(a;),  given  a table  of  the  numbers  tk  = - 4L{2k  + l ) 
for  1 < k < d:  Set  s «—  0,  j <-  1;  then  while  x yf  1,  set  j 4-  j + 1,  and  if  xk.  (1<  j)  / 0 
also  set  x 4—  (x  + (x  <y'))  mod  2d,  st-(s  + (,)  mod  2d. 

Now  to  compute  o- xv  we  can  proceed  as  follows  (with  all  arithmetic  done  mod  2d): 

If  x & 2 / 0,  set  x 4 x and  a <—  ( — l)#&1a.  (Now  x mod  4 = 1.)  Set  s 4—  4 L(x)  ■ y, 

using  the  algorithm  above,  and  j 4-  1;  then  while  s yf  0,  set  j 4-  j + 1,  and  if 
s & (1  <C  j)  yf:  0 also  set  s «—  s + tj,  a 4—  a + (a  j).  The  desired  answer  is  then  a. 
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Suitable  numbers  tk  can  be  computed  by  setting  td-i  4—  l4C(d— 1)  and  proceeding 
as  follows  for  k = d- 2,  d- 3,  . . . , 2:  Set  x 4-  1 + (l«fc),  x 4-  x + (x<fc),  s 4-  0,  j 4-  fc; 
then  while  i/l  set  j 4—  j + 1,  and  if  / 0 also  set  x 4-  x + (x<j),  s 4-  s — £,■; 

finally  t*  4—  s » 1.  For  example,  when  d = 32  we  get  tk  = 2k  for  32  > k > 16,  fi5  = 
# 20008000,  ti4  = * 18004000,  ti3  = #0e002000,  t12  = #07801000,  tn  = #03e00800, 
tio  = #41f80400,  tg  = #18fe0200,  ts  = #0b7f8100,  t7  = #319fe080,  t6  = #5e8bf840, 
f 5 = #4a617e20,  f4  = #17c26f90,  t3  = #6119dle8,  t2  = #2c30267c.  (This  procedure 
finds  the  L' s for  some  integer  b,  without  revealing  the  actual  value  of  b itself!) 

[The  methods  of  this  exercise  have  interesting  connections  to  the  algorithms  of 
Briggs  and  Feynman  for  real-valued  logarithm  and  exponential  in  exercises  1.2.2-25  and 
1.2.2-28.  Our  broadword  procedure  for  xv  works  also  for  calculating  the  inverse  of  x, 
modulo  2d , when  y = — 1;  but  there’s  a direct  algorithm  available  for  that:  Set  z <—  1, 
j +-  0;  while  x / 1 set  j ■<—  j + 1,  and  if  x&(l«j)  / 0,  also  set  z <-  (z+(z<gj))  mod  2d, 
ie(x  + (x«  j))  mod  2d.  The  final  z is  the  inverse  of  the  original  odd  number  x.] 


SECTION  7.1.4 


1.  Here  are  the  BDDs  for  truth  tables  0000,  0001,  . . . , 1111, 

H Dm  eh  Eli]  Em  Em  Em  E E Qj  E Em  E E 

14434354453 

2.  (The  ordering  property  determines  the  direction  of  each 


showing  the  sizes  below: 

c/nE  L^l  I^E  E 

4 3 4 4 1 

arc.) 


3.  There  are  two  with  size  1 (namely  the  two  constant  functions);  none  with  size  2 
(because  two  sinks  cannot  both  be  reachable  unless  there’s  also  a branch  node);  and 
2 n with  size  3 (namely  Xj  and  Xj  for  1 < j < n). 

4.  Set  y <-  * Of  if  fffeff  iff  fe  &cx+*  20000002 , y <-  (r/>28)&#  10000001,  x'  4-  x©y. 
(See  7.1.3-(93).) 

5.  You  get  f(x !,...,*„)  = fD(x\ , ...  ,xn),  the  dual  of  / (see  exercise  7. 1.1 -2). 

6.  The  largest  subtables  of  1011000110010011,  namely  10110001,  10010011,  1011, 
0001,  1001,  0011,  are  all  distinct  beads;  squares  and  duplicates  don’t  appear  until  we 
look  at  the  subtables  {10, 11,00,01}  of  length  2.  So  g has  size  11. 

7.  (a)  If  the  truth  table  of  / is  croon  • • ■Ot2k_1,  where  each  aj  is  a binary  string  of 
length  2n~fc,  the  truth  table  of  gk  is  f30/32  . . ./S2fe_2,  where  f32j  = a2ja2j+ia2j+ia2j+1. 


W*m*m 
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(b)  Thus  the  beads  of  / and  gk  are  closely  related.  We  get  the  BDD  for  Qk  from 
the  BDD  for  / by  changing  (j)  to  (j-l)  for  1 < j < k,  and  replacing  by  (fc— i) . 

a a'  (k)  \ 


8.  (a)  Now  /?2 j = a.2ia2j+\OL2jjr-\a.2j.  (b)  Again  change  (J)  to  (j-l]  for  1 < j < k.  If 
Jfc)  is  present  in  / but  not  , replace  by  (fc-l)  ; otherwise  replace  (Jfc|)  by 


[E.  Dubrova  and  L.  Macchiarulo,  IEEE  Trans.  C-49  (2000),  1290-1292.] 


9.  There  is  no  solution  if  s = 1.  Otherwise  set  k 4—  s — 1,  j 4—  1,  and  do  the  following 
steps  repeatedly:  (i)  While  j < vk,  set  Xj  4-  1 and  j <—  j + 1;  (ii)  stop  if  k = 0;  (iii)  if 
hi t / 1,  set  Xj  4—  1 and  k 4—  hk,  otherwise  set  Xj  4—  0 and  k 4—  lk;  (iv)  set  j 4-  j + 1. 

10.  Let  Ik  = (w/c?  Ik-  hk)  for  0 < k < s and  I'k  = (v'k?  l'k : h'k ) for  0 < k < s'.  We  may 
assume  that  s = s';  otherwise  / ^ /'.  The  following  algorithm  either  finds  indices 
(to, . . . ,t3-i)  such  that.  Ik  corresponds  to  I’tk , or  concludes  that  / / /': 

11.  [Initialize  and  loop.]  Set  ts-i  4—  s — 1,  ti  4—  1,  to  4—  0,  and  tk  4—  —1  for 
2 < k < s — 2.  Do  steps  12-14  for  k = s — 1,  s — 2,  , . . , 2 (in  this  order).  If 
those  steps  “quit”  at  any  point,  we  have  / ^ /';  otherwise  / = 

12.  [Test  Ufc.]  Set  t 4—  tk.  (Now  t > 0;  otherwise  Ik  would  have  no  predecessor.) 
Quit  if  v't  / vk. 

13.  [Test  lk.]  Set  l 4—  lk.  If  ti  < 0,  set  f;  4—  l't;  otherwise  quit  if  l(  ^ tt. 

14.  [Test  h.k.\  Set  h 4—  hk.  If  th  < 0,  set  th  4 — h't\  otherwise  quit  if  h't  ^ th.  | 

11.  (a)  Yes,  since  ck  correctly  counts  the  number  of  settings  of  xVk  . . . xn  that  lead 
from  node  k to  node  1.  (In  fact,  many  BDD  algorithms  will  run  correctly-  but  more 
slowly  — in  the  presence  of  equivalent  nodes  or  redundant  branches.  But  reduction  is 
important  when,  say,  we  want  to  test  quickly  if  / = /'  as  in  exercise  10.) 

(b)  No.  For  example,  suppose  /3  = (1? 2: 1),  I2  = (I? 0:1),  /i  = (2?  1:1),  I0  = 
(2? 0: 0);  then  the  algorithm  sets  c2  4—  1,  c3  4-  |.  (But  see  exercise  35(b).) 

12.  (a)  The  first  condition  makes  K independent;  the  second  makes  it  maximally  so. 

(b)  None  when  n is  odd;  otherwise  there  are  two  sets  of  alternate  vertices. 

(c)  A vertex  is  in  the  kernel  if  and  only  if  it  is  a sink  vertex  or  in  the  kernel  of  the 
graph  obtained  by  deleting  all  sink  vertices  and  their  immediate  predecessors. 

[Kernels  represent  winning  positions  in  nim-like  games,  and  they  also  arise  in 
rc-person  games.  See  J.  von  Neumann  and  O.  Morgenstern,  Theory  of  Games  and  Eco- 
nomic Behavior  (1944),  §30.1;  C.  Berge,  Graphs  and  Hypergraphs  (1973),  Chapter  14.] 

13.  (a)  A maximal  clique  of  G is  a kernel  of  G,  and  vice  versa,  (b)  A minimal  vertex 
cover  U is  the  complement  V \ W of  a kernel  W,  and  vice  versa  (see  7-(6i)). 

14.  (a)  The  size  is  4(n  — 2)  + 2[n  = 3],  When  n > 6 these  BDDs  form  a pattern  in 
which  there  are  four  branch  nodes  for  variables  4,  5,  . . . , n — 2,  together  with  a fixed 
pattern  at  the  top  and  bottom.  The  four  branches  are  essentially 

(x\Xj-\  = 00)  (x\Xj-\  = 01)  (xiXj-i  = 10)  ( xiXj-i  = 11) 


(xiXj  = 00)  (xi  Xj  = 01) 


(x\Xj  = 10)  ( x\Xj  = 11) 
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(b)  Here  the  numbers  for  3 < n < 10  are  (7,9,14,17,22,30,37,45);  then  a fixed 
pattern  at  the  top  and  bottom  develops  as  in  (a),  with  nine  branch  nodes  for  each 
variable  in  the  middle,  and  the  total  size  comes  to  9(n  - 5).  The  nine  nodes  on  each 
middle  level  fall  into  three  groups  of  three, 


with  one  group  for  xxx2  = 00,  one  for  *1X2  = 01,  and  one  for  x\  = 1. 

15.  Both  cases  lead  by  induction  to  well  known  sequences  of  numbers:  (a)  The  Lucas 
numbers  Ln  — Fn+ 1 + Fn- 1 [see  E.  Lucas,  Theorie  des  Nombres  (1891),  Chapter  18]. 
(b)  The  Perrin  numbers,  defined  by  P3  = 3,  P4  = 2,  P5  = 5,  Pn  = P„_2  + Pn- 3.  [See 
R.  Perrin,  L’ Intermediate  des  Mathematiciens  6 (1899),  76-77.] 

16.  When  the  BDD  isn’t  [X] , all  solutions  are  generated  by  calling  List(  1,  root),  where 
List(j,p)  is  the  following  recursive  procedure:  If  v (p)  > j,  set  Xj  t—  0,  call  List(j  + l,p), 
set  Xj  t—  1,  and  call  List(j  + l,p).  Otherwise  if  p is  the  sink  node  [T],  visit  the 
solution  xi  ...x„.  (The  idea  of  “visiting”  a combinatorial  object  while  generating 
them  all  is  discussed  at  the  beginning  of  Section  7.2.1.)  Otherwise  set  Xj  0;  call 
List(j  + 1,  LO(p))  if  LO(p)  [X] ; set  xj  «—  1;  and  call  List(j  + 1,  HI(p))  if  Hl(p)  / [X] . 

The  solutions  are  generated  in  lexicographic  order.  Suppose  there  are  N of  them. 
If  the  kth  solution  agrees  with  the  (k— l)st  solution  in  positions  x\  . . . Xj-i  but  not  in  Xj , 
let  c(k)  = n — j;  and  let  c(l)  = n.  Then  the  running  time  is  proportional  to  c(fc), 

which  is  0(nN)  in  general.  (This  bound  holds  because  every  branch  node  of  a BDD 
leads  to  at  least  one  solution.  In  fact,  the  running  time  is  usually  O(N)  in  practice.) 

17.  That  mission  is  impossible,  because  there’s  a function  with  N — 22k  and  B(f)  = 
0( 22k)  for  which  every  two  solutions  differ  in  more  than  2k~l  bit  positions.  The  running 
time  for  any  algorithm  that  generates  all  solutions  for  such  a function  must  be  Q(23k), 
because  f2(2fc)  operations  are  needed  between  solutions.  To  construct  /,  first  let 

£l(x  1,  • ■ ■ ,xk,y0, . . . ,y2k_l)  = [y(t1...tk)2  =xiti  0 • • • ® xktk  for  0 < fi, . . . , tk  < l]. 

(In  other  words,  g asserts  that  yo  ■ ■ ■ y2k- 1 is  row  (xi  ■ ■ ■ xk)2  of  an  Hadamard  matrix; 
see  Eq.  4.6.4-(38).)  Now  we  let  f(x1,...>xk,y0,...,y2k_1,x'1,...,x'k,y'0,...,y'2k_1)  = 
9(x  1,  ■ ■ -,xk,y0, . . ■,y2k_1)  A p(xi, . . .,x'k,y'0, . . -,y'2k_1)-  Clearly  B(f)  = 0(22k)  when 
the  variables  are  ordered  in  this  way.  Indeed,  T.  Dahlheimer  observes  that  B(f)  = 
2 B(g)  - 2,  where  B(g)  = 2k  + 1 + 2min('c’1+rigJ'1)  = |22fc“1  + 2k  + |. 

18.  First,  (VFi, . . . , W5)  = (5, 4, 4, 4, 0).  Then  m2  = w 4 = 4 and  t2  = 1;  m3  — t3  = 0; 

m4  = max(m3]m2+w3)  — 1,  t4  = 1;  m5  = W4-W5  = 4,  fB  = 0;  m6  = w2  + W3-W3  = 

2,  t(,  = 1;  mj  = maxfms,  m4  + w2)  = 4,  <7  = 0;  ms  = max(m.7,  me  + wi)  = 4,  ts  = 0. 

Solution  xix2x3X4  = 0001. 

1-9.  5Z7=i  min(wj,0)  < J2j=vk  min(wj,0)  < mk  < YSj=Vk  max(w3,0)  = WVk  < Wx. 

20.  Set  wi  < 1,  then  w2j  <—  Wj  and  rcgy+i  < Wj  for  1 < j < n/2.  [This  method 

may  also  compute  wn+ 1.  The  sequence  is  named  for  works  of  A.  Thue,  Skrifter  udgivne 
af  Videnskabs-Selskabet  i Christiania,  Mathematisk-Naturvidenskabelig  Klasse  (1912), 
No.  1,  §7,  and  H.  M.  Morse,  Trans.  Amer.  Math.  Soc.  22  (1921),  84  100,  §14.] 

21.  Yes;  we  just  have  to  change  the  sign  of  each  weight  Wj.  (Or  we  could  reverse  the 
roles  of  LO  and  HI  at  each  vertex.) 
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22.  If  f(x)  = f(x')  = 1 when  / represents  a graph  kernel,  the  Hamming  distance 
u(x  © x')  cannot  be  1.  In  such  cases  vi  = v + 1 when  l ^ 0 and  vk  = v + 1 when  h / 0. 

23.  The  BDD  for  the  connectedness  function  of  any  connected  graph  will  have  exactly 
n — 1 solid  arcs  on  every  root-to-  [t|  path,  because  that  many  edges  are  needed  to  con- 
nect n vertices,  and  because  a BDD  has  no  redundant  branches.  (See  also  Theorem  S.) 

24.  Apply  Algorithm  B with  weights  ( w'12 , . . . , w'S9)  = (-W12-X, - wS9-x ),  where 
x is  large  enough  to  make  all  of  these  new  weights  w'uv  negative.  The  maximum  of 
^ V uv  will  then  occur  with  ^2  xuv  = 8,  and  those  edges  will  form  a spanning  tree 
with  minimum  ^ wuvx Uv  (We’ve  seen  a better  algorithm  for  minimum  spanning  trees 
in  exercise  2.3.4. I ll,  and  other  methods  will  be  studied  in  Section  7.5.4.  However,  this 
exercise  indicates  that  a BDD  can  compactly  represent  the  set  of  all  spanning  trees.) 

25.  The  answer  in  step  Cl  becomes  (1  + z)';”-1_1cs_i;  the  value  of  ck  in  step  C2 
becomes  (1  + z)vi~vk~1ci  + (1  + z)Vh~Vk~x  zch- 

26.  In  this  case  the  answer  in  step  Cl  is  simply  cs_i;  and  the  value  of  ck  in  step  C2 
is  simply  (1  - pVk)ct  + pVkch. 

27.  The  multilinear  polynomial  H(x  1,...,xn)  = F(xi,...,xn)  - G(x i,...,xn)  is 
nonzero  modulo  q , because  it  is  ±1  for  some  choice  of  integers  with  each  xk  E {0, 1}. 
If  it  has  degree  d (modulo  q),  we  can  prove  that  there  are  at  least  (q  — l)dqn~d  sets  of 
values  (qi, . . . , qn ) with  0 < qk  < q such  that  H(q\, . . . , qn)  mod  q / 0.  This  statement 
is  clear  when  d = 0.  And  if  xk  is  a variable  that  appears  in  a term  of  degree  d > 0,  the 
coefficient  of  xk  is  a polynomial  of  degree  d—  1,  which  by  induction  on  d is  nonzero  for 
at  least  (q  — l)d  lqn  d choices  of  (51, ... , qk-i,  qk+ 1,  • • • , qn)',  for  each  of  those  choices 
there  are  q — 1 values  of  qk  such  that  H(qi,  . . . , qn ) mod  q / 0. 

Hence  the  stated  probability  is  > (1  - 1 /q)d  > (1  — 1 /q)n.  [See  M.  Blum,  A.  K. 
Chandra,  and  M.  N.  Wegman,  Information  Processing  Letters  10  (1980),  80-82.] 

28.  F(p)  = (1  — p)nG(p/(l  — p)).  Similarly,  G{z)  = (1  + z)nF(z/(l  + z)). 

29.  In  step  Cl,  also  set  c'0  <—  0,  c\  f-  0;  return  cs_x  and  Cs_!.  In  step  C2,  set 
Ck  E-  (1  -p)ci  +pch  and  c'k  (1  - p)c\  - c,  + pc'h  + ch . 

30.  The  following  analog  of  Algorithm  B does  the  job  (assuming  exact  arithmetic): 
Al.  [Initialize.]  Set  Pn+1  1 and  P,  •<—  Pj+1  max(l  - Pj,Pj)  for  n > j > 1. 

A2.  [Loop  on  fc.]  Set  m-i  <—  1 and  do  step  A3  for  2 < k < s.  Then  do  step  A4. 
A3.  [Process  Ik.\  Set  v E-  vk,  l 4-  lk,  h E-  hk,  tk  E-  0.  If  l / 0,  set  mk  <- 

m;(  1 — pv)Pv+i/ PVI  ■ Then  if  h / 0,  compute  m E-  rrihPvPv+i/ Pvh\  and  if 
l = 0 or  m > mk , set  mk  E-  m and  tk  E-  1 . 

A4.  [Compute  the  x’s.]  Set  j E-  0,  k E-  s — 1,  and  do  the  following  operations 
until  j = n:  While  j < vk  - 1,  set  j E-  j + 1 and  xj  E-  [pj  > |];  if  k > 1,  set 
j E-  j + 1 and  Xj  E~  tk  and  k t—  (tk  =0?  lk:  hk).  \ 

31.  Cl'.  [Loop  over  k.]  Set  a0  t—  -L,  a\  E-  T,  and  do  step  C2'  for  k = 2,  3,  . . , , s - 1. 

Then  go  to  C3'. 

C2'.  [Compute  ak.]  Set  v <—  vk,  l E-  lk,  and  h <—  hk.  Set  /3  E-  on  and  j E-  Vi  — 1; 
then  while  j > v set  fJ  E-  ( Xj  o Xj)  • f3  and  j E-  j — 1.  Set  7 -f — ah  and 
j E-  Vh  — 1;  then  while  j > v set  7 t-  ( Xj  o Xj)  • 7 and  j E-  j - 1.  Finally  set 
ak  4-  ( xv  • /3)  o ( xv  • 7). 

C3'.  [Finish.]  Seta  E-  as_i  and  j E-  vs-i  — 1;  then  while  j > 0 set  a E-  (xjOXj)»a 
and  j E—  j — 1.  Return  the  answer  a.  | 
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This  algorithm  performs  o and  • operations  at  most  0(nB(f))  times.  The  upper  bound 
can  often  be  lowered  to  0(n ) + 0(B(f));  but  shortcuts  like  the  calculation  of  Wk  in 
step  B1  aren’t  always  available.  [See  O.  Coudert  and  J.  C.  Madre,  Proc.  Reliability  and 
Maint.  Conf.  (IEEE,  1993),  240-245,  §4;  O.  Coudert,  Integration  17  (1994),  126-127.] 

32.  For  exercise  25,  ‘o’  is  addition,  V is  multiplication,  ‘ _L’  is  0,  ‘T’  is  1,  %■’  is  1,  ‘x/ 
is  z.  Exercise  26  is  similar,  but  ‘x/  is  1 — pj  and  ‘x/  is  pj. 

In  exercise  29  the  objects  of  the  algebra  are  pairs  (c,c'),  and  we  have  (a,  a')  o 
(b,  b')  = (a  + b,a  + b1),  (a,  a')  • (6,  b ')  = ( ab , ab'  + a'b).  Also  ‘_L’  is  (0, 0),  ‘T’  is  (1, 0), 
‘Zj’  is  (1—  p,  —1),  and  ‘z/  is  (p,  1). 

In  exercise  30,  ‘o’  is  max,  V is  multiplication,  ‘_L’  is  — oo,  ‘T’  is  1,  ‘z/  is  1 - pj, 
xj ’ is  Pj-  Multiplication  distributes  over  max  in  this  case  because  the  quantities  are 
either  nonnegative  or  — oo;  we  must  define  0 • (— oo)  = — oo  in  order  to  satisfy  (22). 

(Additional  possibilities  abound,  because  associative  and  distributive  operators  are 
ubiquitous  in  mathematics.  The  algebraic  objects  need  not  be  numbers  or  polynomials 
or  pairs;  they  can  be  strings,  matrices,  functions,  sets  of  numbers,  sets  of  strings,  sets 
or  multisets  of  matrices  of  pairs  of  functions  of  strings,  etc.,  etc.  We  will  see  many 
further  examples  in  Section  7.3.  The  min-plus  algebra,  with  o = min  and  • = +,  is 
particularly  important,  and  we  could  have  used  it  in  exercise  21  or  24.  It  is  often  called 
tropical , implicitly  honoring  the  Brazilian  mathematician  Imre  Simon.) 

33.  Operate  on  triples  (c,  c',  c"),  with  (a,  a',  a")  o (6, 6',  b")  = (a  + 6,  a'  + b' , a"  + b") 
and  (a,  a'  ,a")  • (6,  b'  ,b")  = (ab,  a'b  + b'a,  a'b  + 2 a'b1  + ab").  Interpret  ‘_L’  as  (0, 0, 0), 
‘T’  as  (1,0,0),  ‘zj’  as  (1,0,0),  and  ‘x/  as  (l,Wj,wf). 

34.  Let  xVy  = max(x,y).  Operate  on  pairs  (c,  c'),  with  (a,  a')  o (b,  b')  = (aVb,  a!  \Zb') 
and  (a,  a')  • (b,b')  = (a  + b,  (a1  + b)  V (a  + b')).  Interpret  ‘_L’  as  (—00,  —00),  ‘T’  as 
(0,  —00),  lXj’  as  (0,w"),  and  ‘x3  ’ as  ( Wj,w'j  + w").  The  first  component  of  the  result 
will  agree  with  Algorithm  B;  the  second  component  is  the  desired  maximum. 

35.  (a)  The  supposed  FBDD  can  be  represented  by  instructions  Is-i,  . . . , /o  as  in 
Algorithm  C.  Start  with  Rq  4—  Hi  4—  0,  then  do  the  following  for  k = 2,  . . . , s — 1: 
Report  failure  if  vk  e Rik  U Rhk;  otherwise  set  Rk  4-  {i;*,}  U Rik  U Rhk.  (The  set  Rk 
identifies  all  variables  that  are  reachable  from  Ik.) 

(b)  The  reliability  polynomial  can  be  calculated  just  as  in  answer  26.  To  count 
solutions,  we  essentially  set  pi  = • • ■ = pn  = | and  multiply  by  2n:  Start  with  c0  4-  0 
and  ci  4—  2n , then  set  ck  4—  ( cik  + Chk)/2  for  1 < k < s.  The  answer  is  cs_i. 

36.  Compute  the  sets  Rk  as  in  answer  35(a).  Instead  of  looping  on  j as  stated  in  step 
C2'  of  answer  31,  set  /?  4-  an  and  then  /?  4-  (Xj  o Xj)  • /3  for  all  j € Rk\Ri\  {u};  treat 
7 in  the  same  manner.  Similarly,  in  step  C3'  set  a 4-  (xj  o Xj)  • a for  all  j £ Rs_i. 

37.  Given  any  FBDD  for  /,  the  function  G(z)  is  the  sum  of  ( 1 + z) n “ length p2solid  arcs  in  p 
over  all  paths  P from  the  root  to  [T] . [See  Theoretical  Comp.  Sci.  3 (1976),  371-384.] 

38.  The  key  fact  is  that  Xj  = 1 forces  / = 1 if  and  only  if  we  have  (i)  hk  = 1 whenever 

vk  = j;  (ii)  vk  = j in  at  least  one  step  fc;  (iii)  there  are  no  steps  with  ( vk  < j < vik  and 

lk  / 1)  or  ( vk  < j < vhk  and  hk  / 1). 

Kl.  [Initialize.]  Set  tj  4-  2 and  pj  4-  0 for  1 < j < n. 

K2.  [Examine  all  branches.]  Do  the  following  operations  for  2 < k < s:  Set  j 4—  vk 

and  q 4—  0.  If  /*:  = 1,  set  q 4 1;  otherwise  set  pj  4—  ma x(pj,vik).  If  hk  = 1, 

set  q 4-  +1;  otherwise  set  pj  4-  ma x(pj,vhk).  If  tj  = 2,  set  tj  4-  q;  otherwise 
if  tj  / q set  tj  4—  0. 
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K3.  [Finish  up.]  Set  m <—  vs-i,  and  do  the  following  for  j = 1,  2,  . . . , n:  If  j < m, 
set  tj  <—  0;  then  if  pj  > m,  set  m pj.  | 

[See  S.-W.  Jeong  and  F.  Somenzi,  in  Logic  Synthesis  and  Optimization  (1993),  154 -156.] 

39.  k(n  + 1 — k)  + 2,  for  1 < k < n.  (See  (26).) 

40.  (a)  Suppose  the  BDDs  for  / and  g have  respectively  aj  and  bj  branch  nodes  (J), 
for  1 < j < n.  Each  subtable  of  / of  order  n + 1 — k has  the  form  a/3yS,  where  a, 
/3,  7,  and  J are  subtables  of  order  n — 1 — k.  The  corresponding  subtables  of  g are 
aaSS ; hence  they  are  beads  if  and  only  if  a / S,  in  which  case  either  a/3-yS  is  a bead  or 
a/3  = 7<5  is  a bead.  Consequently  bk  < ak  + a/fe+i,  and  bk+i  = 0.  We  also  have  bj  < aj 
for  1 < j < k,  because  every  bead  of  g of  order  > n + 1 — 1:  is  “condensed”  from  at  least 
one  such  bead  of  /.  And  bj  < aj  for  j > k + 1,  because  the  subtables  on  ( Xk+2 , . . . , xn) 
are  identical  although  they  might  not  appear  in  g. 

(b)  Not  always.  The  simplest  counterexample  is  f(x  1,2:2,  *3,  X4)  = *2  A (13  V X4), 
h(xi , X2,  xi,  X4)  = X2  A (xi  V X4),  when  B(f)  = 5 and  B(h)  = 6.  (We  do,  however, 
always  have  B(h)  < 2 B(f).) 


41.  (a)  3n  — 3;  (b)  2 n.  (The  general  pat- 
terns are  illustrated  here  for  n = 6.  One  can 
also  show  that  the  “organ-pipe  ordering” 

/ rFl  Tp2  F3  F4  Fn-l  TFn-2  \ 

\Xn  Xj  Xn_jX2  • • • ■C[n/2J+[neven]  xrn/2l/ 


produces  the  profile  1,  2,  4,  ... , 2[n/2]—  2, 

2[n/2j  — 1,  . . . , 5,  3,  1,  2,  giving  the  total 
BDD  size  Q)  + 3;  this  ordering  appears  to 
be  the  worst  for  the  Fibonacci  weights.) 

The  functions  [Fnx  1 + • • • + Fixn  > t] 

have  been  studied  by  J.  T.  Butler  and  T.  Sasao,  Fibonacci  Quart.  34  (1996),  413-422. 

42.  (Compare  with  exercise  2.)  The  sixteen  roots  are  the  (T)  nodes  and  the  two  sinks: 


0011 


1100 


43.  (a)  Since  f(xi, . . . ,X2n)  is  the  symmetric  function  S„(a:i, . . . ,xn,xn+i, . . . ,x2n), 

we  have  B(/)  = 1 — (—  2 — |—  * - * — |—  (n-t-1)  — (-  * • - — (—  3 — F 2 — F 2 = + 2 n -f-  2. 

(b)  By  symmetry,  the  size  is  the  same  for  \i<E  1}  = Y^{xi  | i £ /}],  |/|  = n. 

44.  There  are  at  most  min(fc,2n+2-fc  — 2)  nodes  labeled  (k),  for  1 < fc  < n,  because 
there  are  2n+2-fc  — 2 symmetric  functions  of  (in,...,i„)  that  aren’t  constant.  Thus 
En  is  at  most  2 + ^2k=i  min(^ , 2n+2-fe  — 2),  which  can  be  expressed  in  closed  form  as 
(n  + 2 — bn)(n  + 1 — bn)/ 2 + 2(2bn  —b„),  where  bn  = A(n  + 4 — A(n  + 4))  and  An  = [lgnj. 

A symmetric  function  that  attains  this  worst-case  bound  can  be  constructed  in 
the  following  way  (related  to  the  de  Bruijn  cycles  constructed  in  exercise  3. 2. 2-7): 
Let  p(x)  — xd  + aixd~ 1 + • ■ • + a j be  a primitive  polynomial  modulo  2.  Set  h f-  1 
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for  0 < k < d;  tk  «—  ( a\tk-i  + • • • + adtk-d)  mod  2 for  d < k < 2d  + d — 2;  tk  t— 

(1  + aitk-i  H h adtk-d)  mod  2 for  2d  + d - 2 < k < 2d+1  + <7-3;  and  f2rf+1+d-3  <-  1- 

For  example,  when  p(x)  = x3  + x + 1 we  get  t0  . . . t16  = 11100101101000111. 

Then  (i)  the  sequence  ti  . . . t2d+d_3  contains  all  d-tuples  except  0d  and  ld  as 
substrings;  (ii)  the  sequence  t2d+d_2  . . . t2d+ i+d_4  is  a cyclic  shift  of  f0  . . . t2d_2;  and 
(hi)  tk  = 1 for  2 — l<fc<2d  + d — 3 and  2d+1  — 2 < k < 2d~*~1  + <7  — 3.  Consequently  the 
sequence  <o  • • • t2d+i+d_3  contains  all  (<7  + l)-tuples  except  0d+1  and  ld+1  as  substrings. 
Set  f(x)  = tux  to  maximize  B(f)  when  2d  + d — 4<n<  2d+1  + d — 3. 

Asymptotically,  En  = |n2  — nlgn  + O(n).  [See  I.  Wegener,  Information  and 
Control  62  (1984),  129-143;  M.  Heap,  ,7.  Electronic  Testing  4 (1993),  191-195.] 

45.  Module  Mi  has  only  three  inputs  (x1,y1,  z\),  and  only  three  outputs  u2  = X\,  v2  = 
2/1*1,  “2  = Z\X\.  Module  Mn_i  is  almost  normal,  but  it  has  no  input  port  for  zn- 1, 
and  it  doesn’t  output  un\  it  sets  zn-2  = xn-iyn-i ■ Module  Mn  has  only  three  inputs 
(vn,wn,xn),  and  one  output  y„-i  = xn  together  with  the  main  output,  wn  V vnxn- 
With  these  definitions  the  dependencies  between  ports  form  an  acyclic  digraph. 

(Modules  could  be  constructed  with  all  bk  = 0 and  ak  < 5,  or  even  with  ak  < 4 as 
we’ll  see  in  exercise  47.  But  (33)  and  (34)  are  intended  to  illustrate  backward  signals 
in  a simple  example,  not  to  demonstrate  the  tightest  possible  construction.) 

46.  For  6 < k < n - 3 there  are  nine  branches  on  (k),  corresponding  to  three  cases 
(x1,x1x2,x1x2)  times  three  cases  (xk-i,Xk-2Xk-i,xk-3Xk-2Xk-i).  The  total  BDD 
size  turns  out  to  be  exactly  9 n — 38,  if  n > 6. 

47.  Suppose  / has  qk  subtables  of  order  n — k,  so  that  its  QDD  has  qk  nodes  that  branch 
on  Xk+i ■ We  can  encode  them  in  ak  = pg<?fc"|  bits,  and  construct  a module  Mk+ 1 with 
bk  = bk+i  = 0 that  mimics  the  behavior  of  those  qk  branch  nodes.  Thus  by  (86), 

E2^26"  =^2rig«l  < J2(2qk-l)  = 2Q(f)  — (n  + 1)  < (n  + l)B(f). 

k= 0 fc=0  k=0 

(The  2m-way  multiplexer  shows  that  the  additional  factor  of  (n+1)  is  necessary;  indeed, 
Theorem  M actually  gives  an  upper  bound  on  Q(f).) 

48.  The  sums  uk  = x\  + • • • -\~xk  and  vk  = xk+i  + • • • + xn  can  be  represented  on  1 + A k 
and  1 + A (n  - k)  wires,  respectively.  Let  tk  = xk  A [uk  +vk  = k]  and  wk  = ti  V • • • V tk. 
We  can  construct  modules  A Ik  having  inputs  uk— 1 and  wk— 1 from  Mk— 1 together  with 
inputs  vk  from  Mk+i  \ module  Mk  outputs  uk  = uk-i  -\-xk  and  wk  = wk-iVtk  to  Mk+ 1 
as  well  as  vk-i  = vk  + xk  to  Mk- 1. 

If  p is  a polynomial,  X^it=o  2F><-afc ’bfc -*  = 2d°srl)  ;s  asymptotically  less  than 
[See  K.  L.  McMillan,  Symbolic  Model  Checking  (1993),  §3.5,  where  Theorem  M was 
introduced,  with  extensions  to  nonlinear  layouts.  The  special  case  b\  = • • • = bn  = 0 
had  been  noted  previously  by  C.  L.  Berman,  IEEE  Trans.  CAD-10  (1991),  1059-1066.] 

49.  s>4  s>  3 S>  2 S>!  S4  S3  S2  S2  So 


[See  I.  Semba  and  S.  Yajima,  Trans.  Inf.  Proc.  Soc.  Japan  35  (1994),  1663-1665.] 
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51.  In  this  case  B{ft)  = 3j  + 2 for  1 < j < n,  and  B(fn+1)  = 3n  + 1;  so  the  individual 
BDDs  are  only  about  1/3  as  big  as  they  are  within  (36).  But  almost  no  nodes  are 
shared — only  the  sinks  and  one  branch.  So  the  total  BDD  size  comes  to  (3n2  + 9n)/2. 

52.  If  the  BDD  base  for  {/1, . . . , fm}  has  s nodes,  then  B(f)  = s + m + 1 + [s  = 1], 

53.  Call  the  branch  nodes  a,  b,  c,  d,  e,  /,  g,  with  ROOT  = a.  After  step  R1  we 
have  HEAD [1]  = ~a,  AUX(a)  = ~0;  HEAD [2]  = ~6,  AUX(6)  = ~c,  AUX(c)  = ~0; 
HEAD  [3]  = ~d,  AUX(d)  = ~e,  AUX(e)  = ~/,  AUX(/)  = ~g,  AUX(g)  = ~0. 

After  R3  with  « = 3we  have  s = ~0,  AUX(O)  = ~e,  AUX(e)  = /,  AU  X (/)  = 0;  also 
AVAIL  = g,  L0 (g)  --  ~1,  HI  (g)  = d,  L0(d)  = ~0,  and  HI(d)  = a,  where  a was  the 
initial  value  of  AVAIL.  (Nodes  g and  d have  been  recycled  in  favor  of  1 and  0.)  Then  R4 
sets  s <—  e and  AUX(O)  <—  0.  (The  remaining  nodes  with  V = v start  at  s,  linked  via  AUX.) 

Now  R7,  starting  with  p = q = e and  s = 0,  sets  AUX(l)  « e,  L0(/)  e, 

HI(/)  -<—  g,  AVAIL  t—  /;  and  R8  resets  AUX(l)  <—  0. 

Then  step  R3  with  v = 2 sets  L0(b)  <-  0,  L0(c)  <-  e,  and  HI(c)  <-  1.  No 
further  changes  of  importance  take  place,  although  some  AUX  fields  temporarily  become 
negative.  We  end  up  with  Fig.  21. 

54.  Create  nodes  j for  1 < j < 2n_1  by  setting  V(j)  <-  [lgj],  L0( j)  <-  2 j - 1,  and 
HI ( j ) <-  2 j;  also  for  2n_1  < j < 2n  by  setting  V(j)  t-  n,  L0(j)  <-  f(x1,...,xn-1,0), 
and  HI  (j)  <—  f(xi, , x„_i,  1)  when  j = (lxi  . . . a;„_i)2  + 1.  Then  apply  Algorithm  R 

with  ROOT  = 2.  (We  can  bypass  step  R1  by  first  setting  AUX(j)  i j for  4 < j < 2n . 

then  HEAD [fc]  •(-  ~( 2k)  and  AUX(2,C_1  + 1)  < 1 for  1 <k<  n .) 

55.  It  suffices  to  construct  an  unreduced  diagram,  since  Algorithm  R will  then  finish 
the  job.  Number  the  vertices  1,  . . . , n in  such  a way  that  no  vertex  except  1 appears 
before  all  of  its  neighbors.  Represent  the  edges  by  arcs  ax,  . . . , ae,  where  ak  is  uk  — wk 
for  some  uk  < vk,  and  where  the  arcs  having  uk  = j are  consecutive,  with  sj  < k < Sj+i 
and  1 = si  < • • • < sn  = s„+i  — e + 1.  Define  the  “frontier”  Vk  = (1,  vi, . . . , vk}  fl 
{ uk t ■ ■ ■ - n]  for  1 < k < e,  and  let  Vo  = {1}-  The  unreduced  decision  diagram  will  have 
branches  on  arc  ak  for  all  partitions  of  VZ-i  that  correspond  to  connectedness  relations 
that  have  arisen  because  of  previous  branches. 

For  example,  consider  P3DP3,  where  (si,...,si0)  = (1,3,5,7,8,10,11,12,13,13) 
and  Vo  = {1},  Vi  = {1,2},  V2  = {1,2,3},  V3  = {2,3,4},  . . . , V12  = {8,9}.  The  branch 
on  ai  goes  from  the  trivial  partition  1 of  R0  to  the  partition  1|2  of  V)  if  1 -j-  2,  or  to 
the  partition  12  if  1 — 2.  (The  notation  T|2’  stands  for  the  set  partition  {1}  U {2}, 
as  in  Section  7.2. 1.5.)  From  1|2,  the  branch  on  a2  goes  to  the  partition  1|2|3  of  V2  if 
l-f-3,  otherwise  to  13|2;  from  12,  the  branches  go  respectively  to  partitions  12|3  and 
123.  Then  from  1|2|3,  both  branches  on  a 3 go  to  [X],  because  vertex  1 can  no  longer 
be  connected  to  the  others.  And  so  on.  Eventually  the  partitions  of  Ve  = Vi2  are  all 
identified  with  [X] , except  for  the  trivial  one-set  partition,  which  corresponds  to  [T] . 
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56.  Start  with  m 4-  2 in  step  Rl,  and  v0  i- v i <-  umax  + 1,  l0  <-  h0  i-  0,  h i-  hi  i-  1 
as  in  (8).  Assume  that  HI(0)  = 0 and  HI  ( 1 ) = 1.  Omit  the  assignments  that  involve 
AVAIL  in  steps  R3  and  R7.  After  setting  AUX(HI(p))  <-  0 in  step  R8,  also  set  vm  4-  v 
lm  <-  HI(L0(p))  i hm  HKHI(p)),  HI  (p)  m,  and  m •<—  m + 1.  At  the  end  of 
step  R9,  set  5 <—  m - [R00T  = 0]. 

57.  Set  L0  (ROOT)  4-  ~L0(R00T) . (We  briefly  complement  the  LO  field  of  nodes  that  are 
still  accessible  after  restriction.)  Then  for  v = V(ROOT),  . . . , wmax,  set  p <-  ~HEAD [n] , 
HEAD  M 4-  ~0,  and  do  the  following  while  p 0:  (i)  Set  p' t- ~AUX(p) . (ii)IfLO(p)  > 
0,  set  HI(p)  4—  AVAIL,  AUX(p)  4—  0,  and  AVAIL  4—  p (node  p can  no  longer  be  reached). 
Otherwise  set  L0(p)  4-  ~L0(p);  if  FIXM  = 0,  set  HI(p)  4-  L0(p);  if  FIX[n]  = 1,  set 
L0(p)  4-  HI  (p) ; if  L0(L0  (p) ) > 0,  set  L0(L0(p))  4-  ~L0(L0(p));  if  L0(HI(p))  > 0, 

set  L0  (HI  (p)  ) 4-  ~L0(HI  (p)  ) ; and  set  AUX(p)  4-  HEAD  [u]  , HEAD  [n]  4 p.  (iii)  Set 

p 4-  p . Finally,  after  finishing  the  loop  on  v,  restore  L0(0)  4-  0,  L0(1)  4-  1. 

58.  Since  l ^ h and  l ^ h , we  have  l o l1  ^ h o h' , l o a'  ^ h o a' , and  a o l'  yl  a o h! . 
Suppose  a o a'  = /3  op,  where  /?  = (v",l",h")  and  /?'  = (v'" ,1"' ,ti").  If  v"  = v'" 

we  have  v = v" , l o l'  = l"  o l"’,  and  hoh!  = h"  o ti" . If  v"  < v'"  we  have  v = v", 
l o a = l o/3',  and  ho  a = h"  o /3'.  Otherwise  we  have  v'  = v'" , a o l'  = /3  o and 
a o h'  = /3  o h!" . By  induction,  therefore,  we  have  a = f}  and  a = j3‘  in  all  cases. 

59.  (a)  If  h isn’t  constant  we  have  B(fog)  = 3 B(h)  - 2,  essentially  obtained  by  taking 
a copy  of  the  BDD  for  h and  replacing  its  sink  nodes  by  two  other  copies. 

(b)  Suppose  the  profile  and  quasi-profile  of  h are  (b0,...,bn)  and  (q0, . . . , qn\ 
where  bn  = qn  = 2.  Then  there  are  bkqk  branches  on  x2k+ i in  fog,  and  qkbk^l 
branches  on  x2k , corresponding  to  ordered  pairs  of  beads  and  subtables  of  h.  When 
the  BDD  for  h contains  a branch  from  a to  (3  and  from  a'  to  /3',  where  V(a)  = j, 
V(/3)  = k,  V(q  ) = j , and  V(/3')  = k' , the  BDD  for  fog  contains  a corresponding 
branch  with  V(aoa')  = 2 j - 1 from  a o a to  f3  o a'  when  j < j'  < k,  and  with 
V(ttoa')  = 2 j'  from  aoa'toao^'  when  j'  < j < k' . 

60.  Every  bead  of  order  n-j  of  the  ordered  pair  (f,g)  is  either  one  of  the  6^6'  ordered 
pairs  of  beads  of  / and  g,  or  one  of  the  bj(q'j  — 6' ) + ( q j — bj)b'j  ordered  pairs  that  have 
the  form  (bead,  nonbead)  or  (nonbead,  bead).  [This  upper  bound  is  achieved  in  the 
examples  of  exercises  59(b)  and  63.] 


61.  Assume  that  v — V(a)  < V(/3).  Let  ai,  . . . , ak  be  the  nodes  that  point  to  a, 
and  let  /3i,  . . . , /3 1 be  the  nodes  with  V(/3j)  < v that  point  to  /3;  an  imaginary  node  is 
assumed  to  point  to  each  root.  (Thus  k = in-degree(a)  and  l < in-degree(/3).)  Then 
the  melded  nodes  that  point  to  ao/3  are  of  three  types:  (i)  aiofy,  where  V(ati)  = V(f3j) 
and  either  (LO(a,)  = a and  LO(/3j)  = /3)  or  (Hl(a,)  = a and  HI  (ft)  = /3);  (ii)  aofo, 
where  V(«i)  < V(/3j)  for  some  i;  or  (iii)  on  o f),  where  V(an)  > for  some  j. 


62.  The  BDD  for  / has  one  node  on  each  level,  and  the  BDD  for  g has  two,  except  at 
the  top  and  bottom.  The  BDD  for  /V<?  has  four  nodes  on  nearly  every  level,  by  exercise 
14(a).  The  BDD  for  fog  has  seven  nodes  when  5 < j < n — 3,  corresponding 
to  ordered  pairs  of  subtables  of  (f,g)  that  depend  on  Xj  when  (aq ,...,xJ_1)  have 
fixed  values.  Thus  B{f)  = n + 0(1),  B(g)  = 2 n + 0(1),  B(f  o g)  = 7n  + 0(1),  and 
B(f  V g)  = An  + 0(1).  (Also  B(f  A g)  = 7n  + 0(1),  B(f  © g)  = In  + 0(1).) 

63.  The  profiles  of  / and  g are  respectively  (1,  2,2, ... , 2m_1, 2m_1, 2m,  1, 1, . . . , 1,  2) 

4n  and  B(g)  = 


and  (0, 1 

2m+1  +2 


2,2, 

-1 


r%m  — 1 c\m  — 1 


,2m-\  1,1,.  ..,1,2);  so  B(f)  = 2 


im+2 


1 

3n.  The  profile  of  f Ag  begins  with  (1,  2,4, ... , 22m~2,  2 


r)2m-l 


-2 


,m  — 1 \ 
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because  there’s  a unique  solution  x\  . . . x2m  to  the  equations 

((xl  © Z2)(X3  © X4)  ■ ■ • (x2m-l  © X2m))2  = P,  (( x2  © X3)  . . . (x2m_2  © X2m_i)x2m)2  = q 

for  0 < p,  q < 2m,  and  p = q if  and  only  if  x\  = x3  = ■ * • = x2m_i  = 0.  After  that  the 
profile  continues  (2m+1  - 2, 2m+1  - 2,  2m+1  - 4,  2m+1  - 6, . . . , 4,  2, 2);  the  subfunctions 
are  X2m+j  A x2m-|_fc  or  af2Tri.+j  A a:2m+fc  for  1 < j < k < 2m,  together  with  X2m+j  and 
X2m+j  for  2 < j < 2m.  All  in  all,  we  have  B(f  A g)  = 22m+1  + 2m^1  — 1 « 2n2. 

64.  The  BDD  for  any  Boolean  combination  of  /i,  /2,  and  /3  is  contained  in  the  meld 
/i»/2  0 /3,  whose  size  is  at  most  B{fi)B{f2)B(f3). 

65.  h = gl  /i : /0,  where  fc  is  the  restriction  of  / obtained  by  setting  Xj  4—  c.  The 
first  upper  bound  follows  as  in  answer  64,  because  B(fc)  < B(f).  The  second  bound 
fails  when,  for  example,  n = 2m  + 3m  and  /i  = Mm(x;y)?  Mm(x';j/):  Mm(x";y), 
where  x (x4 , . . . , xm),  x (x^,...,  xm) , x — (x4 , . . . , xm),  and  y — (yo , • • • , y2m  — i ) ; 
but  such  failures  appear  to  be  rare.  [See  R.  E.  Bryant,  IEEE  Trans.  C-35  (1986),  685; 
J.  Jain,  K.  Mohanram,  D.  Moundanos,  I.  Wegener,  and  Y.  Lu,  ACM/IEEE  Design 
Automation  Conf.  37  (2000),  681-686.] 

66.  Set  NT0P  <—  fo  + 1 — l and  terminate  the  algorithm. 

67.  Let  tk  denote  template  location  P00LSIZE  — 2k.  Step  SI  sets  LEFT(ti)  ■<—  5, 
RIGHT(ti)  C—  7,  l <—  1.  Step  S2  for  / = 1 puts  t\  into  both  LLIST[2]  and  HLIST[2]. 
Step  S5  for  l = 2 sets  LEFT(t2)  4-  4,  RIGHT (f2)  4-  5,  L(ti)  4-  t2;  LEFT(t3)  4- 
3,  RIGHT(t3)  4—  6,  H(fi)  4—  t3.  Step  S2  for  1 = 2 sets  L(t2)  4—  0 and  puts  t2  in 
HLIST[3] ; then  it  puts  t3  into  LLIST[3]  and  HLIST[3]  . And  so  on.  Phase  1 ends  with 
(LSTART  [0]  , . . . , LSTART  [4]  ) = (t0 , h , t3 , t5 , ta ) and 
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representing  the  meld  aow  in  Fig.  24  but  with  lox  = xol  = l and  T o T = T. 

Let  fk  = fo  + k.  In  phase  2,  step  S7  for  I.  = 4 sets  LEFT(t6)  4—  ~0,  LEFT(t7)  4—  t6, 

LEFT(tg)  4—  ~1,  and  RIGHT (te)  4—  RIGHT (t7)  4—  RIGHT (tg)  4 1.  Step  S8  undoes  the 

changes  made  to  LEFT(0)  and  LEFT(l) . Step  Sll  with  s = tg  sets  LEFT(f8)  4-  ~2, 
RIGHT(fs)  4—  tg,  V(/2)  4—  4,  L0(/2)  4—  1,  HI(/2)  4—  0.  With  s = t7  that  step  sets 
LEFT(t7)  4—  ~3,  RIGHT(t7)  4—  t7,  V(/3)  4—  4,  L0(/3)  4—  0,  HI(/3)  4—  1;  meanwhile 
step  S10  has  set  RIGHT  (tg)  4—  t7.  Eventually  the  templates  will  be  transformed  to 

k LEFT«fc)  RIGHT  (ft)  L(ffc)  H(tfc)  k LEFT  (tfc)  RIGHT  (tfc)  L(tfc)  H (tk) 
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(but  they 

can 

then  be  discarded).  The  resulting  BDD  for  /Ay  is 
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68.  If  LEFT(t)  < 0 at  the  beginning  of  step  S10,  set  RIGHT(f)  <- t,  q <-  NTOP,  NTOP  <- 
<J+1,  LEFT(t)  <-  ~(q-  /0),  L0(q)  <-  ~LEFT  (L(t)  ) , HI(<j)  <-  ~LEFT(H(f)),  V(q)  t-  l, 
and  return  to  S9. 

69.  Make  sure  that  NTOP  < TBOT  at  the  end  of  step  SI  and  when  going  from  Sll  to 
S9.  (It’s  not  necessary  to  make  this  test  inside  the  loop  of  Sll.)  Also  make  sure  that 
NTOP  < HBASE  just  after  setting  HBASE  in  step  S4. 

70.  This  choice  would  make  the  hash  table  a bit  smaller;  memory  overflow  would 
therefore  be  slightly  less  likely,  at  the  expense  of  slightly  more  collisions.  But  it  also 
would  slow  down  the  action,  because  make-template  would  have  to  check  that  NTOP  < 
TBOT  whenever  TBOT  decreases. 

71.  Add  a new  field.  EXTRA(f)  = a" , to  each  template  t (see  (43)). 

72.  In  place  of  steps  S4  and  S5,  use  the  approach  of  Algorithm  R to  bucket-sort  the 
elements  of  the  linked  lists  that  begin  at  LLISTfO  and  HLISTff] . This  is  possible  if  an 
extra  one-bit  hint  is  used  within  the  pointers  to  distinguish  links  in  the  L fields  from 
links  in  the  H fields,  because  we  can  then  determine  the  LO  and  HI  parameters  of  t’s 
descendants  as  a function  of  t and  its  “parity.” 

73.  If  the  BDD  profile  is  (bo, . . . ,bn),  we  can  assign  pj  = \bj- i/2e"|  pages  to  branches 
on  Xj.  Auxiliary  tables  of  pi  + • • • + pn+ 1 < \B(f)/ 2e]  + n short  integers  allow  us  to 
compute  V(p)  = T[7r(p)],  LO(p)  = L0(M[7r(p)j  + <r(p)),  HI(p)  = HI(M[7r(p)]  + <r(p)). 

For  example,  if  e = 12  and  n < 216,  we  can  represent  arbitrary  BDDs  of  up  to 
232  — 228  + 216  + 212  nodes  with  32-bit  virtual  LO  and  HI  pointers.  Each  BDD  requires 
appropriate  auxiliary  T and  M tables  of  size  < 220,  constructible  from  its  profile. 

[This  method  can  significantly  improve  caching  behavior.  It  was  inspired  by  the 
paper  of  P.  Ashar  and  M.  Cheong,  IEEE/ ACM  Intern  at.  Conf.  Computer-Aided  Design 
CAD-94  (1994),  622-627,  which  also  introduced  algorithms  similar  to  Algorithm  S.] 

74.  The  required  condition  is  now  pn(xi, . . . , x2n ) A[xi  = *2"  ] A - • -A[x2n-i  = x2n-i+1]. 

If  we  set  2/1  = xi,  2/2  = X3,  ...,  y2n-2  = x2n-i_i,  2/2t*_2+i  = x2n-i,  2/2n_2+2  = 
x2n-i_2,  •••)  y2n~1  = (49)  yields  the  equivalent  condition  gn-i(yi,  ■ ■ ■ , 2/2"-1 ) A 

[y2n-2  < 2f2"-2+i]  A [y2n~2-i  < 2/2n— 2+2 ] A • • • A [2/1  < y2n-i  ],  which  is  eminently  suitable 
for  evaluation  by  Algorithm  S.  (The  evaluation  should  be  from  left  to  right;  right-to-left 
would  generate  enormous  intermediate  results.) 

With  this  approach  we  find  that  there  are  respectively  1,  2,  4,  12,  81,  2646, 
1422564,  229809982112  monotone  self-dual  functions  of  1,  2,  ...,  8 variables.  (See 
Table  7. 1.1-3  and  answer  7.1.2-88.)  The  8-variable  functions  are  characterized  by  a 
BDD  of  130,305,082  nodes;  Algorithm  S needs  about  204  gigamems  to  compute  it. 

75.  Begin  with  p\(x\,x2)  = [*i<a:2],  and  replace  G2™(x\, . . . ,x2n)  in  (49)  by  the 
function  H2™  (xi, . . . , x2n)  = (x\  < x2  < X3  < X4]  A • • • A [x2n-3  < x2n-2  < X2»_i  < x2n  ]. 

(It  turns  out  that  B(pg)  — 3,683,424;  about  170  megamems  suffice  to  compute 
that  BDD,  and  pio  is  almost  within  reach.  Algorithm  C now  quickly  yields  the  exact 
numbers  of  regular  n-variable  Boolean  functions  for  1 < n < 9,  namely  3,  5,  10,  27, 
119,  1173,  44315,  16175190,  284432730176.  Similarly,  we  can  count  the  self-dual  ones, 
as  in  exercise  74;  those  numbers,  whose  early  history  is  discussed  in  answer  7.1.1-123, 
are  1,  1,  2,  3,  7,  21,  135,  2470,  319124,  1214554343,  for  1 < n < 10.) 

76.  Say  that  xo...Xj_i  forces  Xj  if  Xi  = 1 for  some  t C j with  0 < i < j.  Then 
X0X1  . . . X2»-i  corresponds  to  a clutter  if  and  only  if  Xj  = 0 whenever  xo  . • • Xj-\  forces 
Xj,  for  0 < j < 2n . And  pn(x  0, . . . , X2«-i)  = 1 if  and  only  if  Xj  = 1 whenever  xq  ■ ■ ■ +7-1 
forces  Xj.  So  we  get  the  desired  BDD  from  that  of  pn(x  1, . . . ,x2n)  by  (i)  changing  each 
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branch  (j)  to  (j-l),  and  (ii)  interchanging  the  LO  and  HI  branches  at  every  branch 
node  that  has  LO  = [X] . (Notice  that,  by  Corollary  7.1.1Q,  the  prime  implicants  of 
every  monotone  Boolean  function  correspond  to  clutters.) 

77.  Continuing  the  previous  answer,  say  that  the  bit  vector  x0  . . ,xk-\  is  consistent 
if  we  have  x3  = 1 whenever  x0  . . . Xj-i  forces  Xj,  for  0 < j < k.  Let  bk  be  the 
number  of  consistent  vectors  of  length  k.  For  example,  b4  = 6 because  of  the  vectors 
{0000,0001,0011,0101,0111,1111}.  Notice  that  exactly  c*,  = 6fc+i  — bk  clutters  <S  have 
the  property  that  k represents  their  “largest”  set,  max{s  | s represents  a set  of  5}. 

The  BDD  for  y,n(x\, . . . ,X2™)  has  bk-i  branch  nodes  (fc)  when  1 < k < 2n~1. 
Proof:  Every  subfunction  defined  by  xu  xk-i  is  either  identically  false  or  defines 
a consistent  vector  x\  ...xk-i-  In  the  latter  case  the  subfunction  is  a bead,  because 
it  takes  different  values  under  certain  settings  of  xk+i,  . . . , x2».  Indeed,  if  x\  . ..Xk-i 
forces  xk,  we  set  xk+i  <—■■■<—  x2n  <—  1;  otherwise  we  set  Xj  <—  y3  for  k < j < 2n , 
where 

20+ 1 = [%i+i  = 1 for  some  i C j with  i + 1 < k], 
noting  that  y2n+fc  = 0. 

On  the  other  hand  there  are  by  branches  (k)  when  k = 2n  -k'  and  0 < k'  < 2n_1. 
In  this  case  the  nonconstant  subfunctions  arising  from  x\,  . . . , xk- i lead  to  values  y3 
as  above,  where  the  vector  y0>yv...yy  is  consistent.  (Here  O'  = 2n,  T = 2"  - 1, 
etc.)  Conversely,  every  such  consistent  vector  describes  such  a subfunction;  we  can,  for 
example,  set  Xj  <-  0 when  j < k- 2n_1  or  2n_1  < j < k,  otherwise  x3  +-  y2„-i+j.  This 
subfunction  is  a bead  if  and  only  if  yk'  = 1 or  yy  . . . y^k-i)'  forces  yy . Thus  the  beads 
correspond  to  consistent  vectors  of  length  k';  and  different  vectors  define  different  beads. 

This  argument  shows  that  there  are  bk-\  - ck-\  branches  (k)  with  LO  = [X]  when 
1 < k < 2n_1  and  c2n_fc  such  branches  when  2n~1  < k < 2n.  Hence  exactly  half  of  the 
B(yn)  — 2 branch  nodes  have  LO  = [X] . 

78.  To  count  graphs  on  n labeled  vertices  with  maximum  degree  < d,  construct  the 
Boolean  function  of  the  (")  variables  in  its  adjacency  matrix,  namely  /\£=1  S<d( Xk), 
where  Xk  is  the  set  of  variables  in  row  k of  the  matrix.  For  example,  when  n = 5 
there  are  10  variables,  and  the  function  is  S<d(x  1,x2,x3,x4)  A S<d{x  1,xs,xe,x7)  A 
S<d(x 2,x5,x8,xg)  A S<d( x3,  x6,Xs,  x10)  A S<d(x4,x 7,  *9,  Xio).  When  n = 12  the  BDDs 
for  d = (1,2,  ...,10)  have  respectively  (5960,  137477,  1255813,  5295204,  10159484, 
11885884,  9190884,  4117151,  771673,  28666)  nodes,  so  they  are  readily  computed  with 
Algorithm  S.  To  count  solutions  with  maximum  degree  d,  subtract  the  number  of  solu- 
tions for  degree  < d—  1 from  the  number  for  degree  < d:  the  answers  for  0 < d < 11  are: 


1 

140151 

3568119351 

8616774658305 


3038643940889754 

211677202624318662 

3617003021179405538 

17884378201906645374 


29271277569846191555 

17880057008325613629 

4489497643961740521 

430038382710483623 


[In  general  there  are  tn  — 1 graphs  on  n labeled  vertices  with  maximum  degree  1,  where 
tn  is  the  number  of  involutions,  Eq.  5.1.4-(4o).j 

The  methods  of  Section  7.2.3  are  superior  to  BDDs  for  enumerations  such  as  these, 
when  n is  large,  because  labeled  graphs  have  n!  symmetries.  But  when  n has  a moderate 
size,  BDDs  produce  answers  quickly,  and  nicely  characterize  all  the  solutions. 

79.  In  the  following  counts,  obtained  from  the  BDDs  in  the  previous  answer,  each 
graph  with  k edges  is  weighted  by  266~k.  Divide  by  366  to  get  probabilities. 
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73786976294838206464 

553156749930805290074112 

598535502868315236548476928 

68379835220584550117167595520 

1380358927564577683479233298432 

7024096376298397076969081536512 


11646725483430295546484263747584 

7767741687870924305547518803968 

2514457534558975918608668688384 

452733615636089939218193403904 

45968637738881805341545676736 

2093195580480313818292294985 


80.  If  the  original  functions  / and  g have  no  BDD  nodes  in  common,  both  algorithms 
encounter  almost  exactly  the  same  subproblems:  Algorithm  S deals  with  all  nodes  of 


fog  that  aren’t  descended  from  nodes  of  the  forms  ao  E or  [X]  o /5,  while  (55)  also 
avoids  nodes  that  descend  from  the  forms  a o [T]  or  [t]  o B.  Furthermore,  (55)  takes 
shortcuts  when  it  meets  nontrivial  subproblems  AND  (f',g')  with  f = g';  Algorithm  S 
cannot  recognize  the  fact  that  such  cases  are  easy.  And  (55)  can  also  win  if  it  happens 
to  stumble  across  a relevant  memo  left  over  from  a previous  computation. 


81.  Just  change  ‘AND’  to  ‘XOR’  and  ‘A’  to  ‘©’  throughout.  The  simple  cases  are  now 
/ ®0  = / , 0©  g — g,  and  / © g = 0 if  / = g.  We  should  also  swap  / •O-pif/>g/0. 

Notes:  The  author  experimentally  inserted  further  memos  ‘/©r  = g’  and  ‘j®r  = 
/’  in  the  bottom  line;  but  these  additional  cache  entries  seemed  to  do  more  harm 
than  good.  Considering  other  binary  operators,  there’s  no  need  to  implement  both 
BUTNOT(/,g)  = / A g and  NOTBUT(/,cf)  = / A g,  since  the  latter  is  BUTNOT(g, /). 
Also,  XOR(l,OR(/,  <7))  may  be  better  than  an  implementation  of  NOR(/,  g)  = -i(/Vj). 

82.  A top-level  computation  of  F 4—  AND  (f,g)  begins  with  / and  g in  computer 
registers,  but  REF(/)  and  REF(<?)  do  not  include  “references”  such  as  those.  (We  do, 
however,  assume  that  / and  g are  both  alive.) 

If  (55)  discovers  that  / A g is  obviously  r,  it  increases  REF(r)  by  1. 

If  (55)  finds  / A g = r in  the  memo  cache,  it  increases  REF(r),  and  recursively 
increases  REF(L0(r))  and  REF(HI(r))  in  the  same  way  if  r was  dead. 

If  step  U1  finds  p = q,  it  decreases  REF(p)  by  1 (believe  it  or  not);  this  won’t  killp. 

If  step  U2  finds  r,  there  are  two  cases:  If  r was  alive,  it  sets  REF(r)  4—  REF(r)  + 1, 
REF(p)  4—  REF(p)  — 1,  REF(q)  4—  REF(q)  — 1.  Otherwise  it  simply  sets  REF(r)  4—  1. 

When  step  U3  creates  a new  node  r,  it  sets  REF(r)  4—  1. 

Finally,  after  the  top-level  AND  returns  a value  r that  we  wish  to  assign  to  F, 
we  must  first  dereference  F,  if  F / A;  this  means  setting  REF (F)  4-  REF (F)  - 1, 
and  recursively  dereferencing  L0(F)  and  HI(F)  if  REF(F)  has  become  0.  Then  we  set 
F 4—  r (without  adjusting  REF(r)). 

[Furthermore,  in  a quantification  routine  such  as  (65)  or  in  the  composition  rou- 
tine (72),  both  ri  and  rh  should  be  dereferenced  after  the  OR  or  MUX  has  computed  r.] 

83.  Exercise  61  shows  that  the  subproblem  / A g occurs  at  most  once  per  top-level 
call,  when  REF(/)  = REF(p)  = 1.  [This  idea  is  due  to  F.  Somenzi;  see  the  paper 
cited  in  answer  84.  Many  nodes  have  reference  count  1,  because  the  average  count 
is  approximately  2,  and  because  the  sinks  usually  have  large  counts.  However,  such 
cache-avoidance  did  not  improve  the  overall  performance  in  the  author’s  experiments, 
possibly  because  of  the  examples  investigated,  or  possibly  because  “accidental”  cache 
hits  in  other  top-level  operations  can  be  useful.] 


84.  Many  possibilities  exist,  and  no  simple  technique  appears  to  be  a clear  winner. 
The  cache  and  table  sizes  should  be  powers  of  2,  to  facilitate  calculating  the  hash 
functions.  The  size  of  the  unique  table  for  xv  should  be  roughly  proportional  to  the 
number  of  nodes  that  currently  branch  on  xv  (alive  or  dead).  It’s  necessary  to  rehash 
everything  when  a table  is  downsized  or  upsized. 
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In  the  author’s  experiments  while  writing  this  section,  the  cache  size  was  doubled 
whenever  the  number  of  insertions  since  the  beginning  of  the  most  recent  top-level 
command  exceeded  In  2 times  the  current  cache  size.  (At  that  point  a random  hash 
function  will  have  filled  about  half  of  the  slots.)  After  garbage  collection,  the  cache 
was  downsized,  if  necessary,  so  that  it  either  had  256  slots  or  was  at  least  1/4  full. 

It’s  easy  to  keep  track  of  the  current  number  of  dead  nodes;  hence  we  know  at 
all  times  how  much  memory  a garbage  collection  will  reclaim.  The  author  obtained 
satisfactory  results  by  inserting  a new  step  U2§  between  U2  and  U3:  “Increase  C by  1, 
where  C is  a global  counter.  If  C mod  1024  = 0,  and  if  at  least  1/8  of  all  current 
nodes  are  dead,  collect  garbage.” 

[See  F.  Somenzi,  Software  Tools  for  Technology  Transfer  3 (2001),  171-181  for 
numerous  further  suggestions  based  on  extensive  experience.] 

85.  The  complete  table  would  have  232  entries  of  32  bits  each,  for  a total  of  234 
bytes  («  17.2  gigabytes).  The  BDD  base  discussed  after  (58),  with  about  136  million 
nodes  using  zip-ordered  bits,  can  be  stored  in  about  1.1  gigabyte;  the  one  discussed  in 
Corollary  Y,  which  ranks  all  of  the  multiplier  bits  first,  needs  only  about  400  megabytes. 

86.  If  / = Oorgr  = h,  return  g.  If  / = 1,  return  h.  If  g = 0 or  f = g,  return  AND(/,/i). 
If  h = 1 or  / = h,  return  OR (f,g).  If  g = 1,  return  IMPLIES)/, ft);  if  h = 0,  return 
BUTNOTfg, /).  (If  binary  IMPLIES  and/or  BUTNOT  aren’t  implemented  directly,  it’s 
OK  to  let  the  corresponding  cases  propagate  in  ternary  guise.) 

87.  Sort  the  given  pointer  values  /,  g,  h so  that  f < g < h.  If  / = 0,  return  AND(g,  h). 
If  / = 1,  return  OR (g,h).  If  / = g or  g = h,  return  g. 

88.  The  trio  of  functions  ( f,g,h ) = (i?,o,  R\ , R2)  makes  an  amusing  example,  when 

Ra(%l,  ■ ■ • , Xn ) = l(xn  . . . £1)2  mod  3 / a]  = -R(2a+xi)  mod  3(^2,  • • • , Xn). 

Thanks  to  the  memos,  the  ternary  recursion  finds  fAgAh  = 0hy  examining  only  one 
case  at  each  level;  the  binary  computation  of,  say,  f A g = h definitely  takes  longer. 

More  dramatically,  let  f = Xi  A (x2?  F:  G),  g = x2  A (xi?  G:  F ),  and  h = 
xi?  x2  A F:  x2  A G,  where  F and  G are  functions  of  (x3, . . . ,x„)  such  that  B(F  A G)  = 
Q(B(F)B(G))  as  in  exercise  63.  Then  / A g,  g A h,  and  h A f all  have  large  BDDs,  but 
the  ternary  recursion  immediately  discovers  that  / A g A h = 0. 

89.  (a)  True;  the  left  side  is  (/ooV/0i)V(/10V/n),  the  right  side  is  (/ooV/i0)V(/0i  V/n). 

(b)  Similarly  true.  (And  Q’s  are  commutative  too.) 

(c)  Usually  false;  see  part  (d). 

(d)  Vxi 3x2f  = (f00  V foi)  A ( fw  V /n)  = (3x2Vxi /)  V (/00  A /n)  V (/01  A f10). 

90.  Change  3 jx  . . . 3 jm  to  Qji  . . . Qjm. 

91-  (a)  / 4- 1 = /,  / 4 -Xj  = fii  and  / ].Xj  = f0,  in  the  notation  of  (63). 

(b)  This  distributive  law  is  obvious,  by  the  definition  of  4..  (Also  true  for  V,  ®,  etc.) 

(c)  True  if  and  only  if  g is  not  identically  zero.  (Consequently  the  value  of 
f(xi, . . . ,xn)  / g for  g ^ 0 is  determined  solely  by  the  values  of  x3  4- g for  1 < j < n.) 

(d)  f(xi,  1,  0,  £4,  0, 1,  xt,  . . . , xn).  This  is  the  restriction  of  / with  respect  to 
x2  = 1,  x3  = 0,  X5  = 0,  xe  = 1 (see  exercise  57),  also  called  the  cofactor  of  / with 
respect  to  the  subcube  g.  (A  similar  result  holds  when  g is  any  product  of  literals.) 

(e)  f(x  1, . . . , xn-i,xi  © • • ffi  1).  (Consider  the  case  / = Xj,  for  1 < j < n.) 

(f)  xi?  /( 1 1):  /( 0,...,0). 

(§)  /(I)  X2 , . . . , Xn)  4.  g(x2 , . . . , Xn) . 

(h)  \{  f — x2  and  g = xi  V x2  we  have  f l g = xi  V x2. 
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(i)  CONSTRAIN(/,  g)  = “If  fig  has  an  obvious  value,  return  it.  Otherwise,  if 
/ 4-  9 — r is  in  the  memo  cache,  return  r.  Otherwise  represent  / and  g as  in  (52); 
set  r «—  CONSTRAIN(/h,  (/>,)  if  gi  = 0,  r «—  CONSTRAIN (/;,  gi)  if  gh  = 0,  otherwise 
r <-  UNIQUE(u,  CONSTRAIN (fi,gi),  CONSTRAIN (/h,  <?/»));  put  ‘f  i g = r’  into  the 
memo  cache,  and  return  r.”  Here  the  obvious  values  are/40  = 0|sr  = 0;/4,l  = /; 

1 1 9 = ala  = [p#o]. 

[The  operator  fig  was  introduced  in  1989  by  O.  Coudert,  C.  Berthet,  and  J.  C. 
Madre.  Examples  such  as  the  functions  in  (h)  led  them  to  propose  also  the  modified 
operator  f if  g,  “/  restricted  to  g”  which  has  a similar  recursion  except  that  it  uses 
/ JJ.(3ict,p)  instead  of  (xv?  fity-gr-  fhi-9h)  when  fi  — fh.  See  Lecture  Notes  in  Computer 
Science  407  (1989),  365-373.] 

92.  See  answer  91(d)  for  the  “if”  part.  Notice  also  that  (i)  x\  i g = xx  if  and  only  if 
9o  / 0 and  gi  / 0,  where  gc  = g(c,x2, . . . ,x„)\  (ii)  x„ig  = xn  if  and  only  if  Qx„g  = 0 
and  g / 0. 

Suppose  ng”  = (fig)”  for  all  / and  7r.  If  g ^ 0 isn’t  a subcube,  there’s  an  index  j 
such  that  g0  / 0 and  gx  / 0 and  Qxjg  / 0,  where  gc  = g(x  1, . . . , x,-i,  c,  xj+1, . . ,,i„). 
By  the  previous  paragraph,  we  have  (i)  Xj  ig  = x3  and  (ii)  Xjig  ^ Xj,  a contradiction. 

93.  Let  / = J(x\, . ■ ■ ,xn\  fi, . . . , fn)  and  g — J(xi, . . . , xn;  gi, . . . , gn),  where 

fv  Xn+l  V • • • V X5n  V T(X5ri+l,...,  XGn  j [u  1],  . . . , [u  — n]) , 

gv  = in+i  V • • • V x5n  V J(a:5n+1, . . . ,X6n',  [v=  l]+[u  — 1], . . . , [u  = n]+[u  — n]), 

and  J is  the  junction  function  of  exercise  52. 

If  G can  be  3-colored,  let  / = J(x1; . /i, ... , /„),  where 

fv  Xn+ 1 V • • * V X5n  V J (x^n+l , • * - , Xq n , fvl  j ‘ * • , fvn)i 

and  fvw  = [v  and  w have  different  colors].  Then  B(f)  < n + 3(5 n)  + 2. 

Conversely,  suppose  there’s  an  approximating  / such  that  B(f)  < 16n  + 2,  and 
let  fv  be  the  subfunction  with  x\  = [u  = l],  . . . , xn  = [v  = n\.  At  most  three  of 
these  subfunctions  are  distinct,  because  every  distinct  fv  must  branch  on  each  of  x„+i, 

■ ■ ■ , xt>„.  Color  the  vertices  so  that  u and  v get  the  same  color  if  and  only  if  fu  = /„; 
this  can  happen  only  if  u—f—v , so  the  coloring  is  legitimate. 

[M.  Sauerhoff  and  I.  Wegener,  IEEE  Transactions  CAD-15  (1996),  1435-1437.] 

94.  Case  1:  v / gv.  Then  we  aren’t  quantifying  over  xv;  hence  g = gh , and  / Eg  = 
xv?  fi  Eg:  fh  Eg. 

Case  2:  v = gv.  Then  g = xv  A gh  and  / E g = (/;  E gh)  V ( fh  E gh)  = r;  V rh.  In 
the  subcase  w / /«,  we  have  /;  = fh  = /;  hence  n — rh,  and  we  can  directly  reduce 
/ E g to  / E gh  (an  instance  of  “tail  recursion”). 

[Rudell  observes  that  the  order  of  quantification  in  (65)  corresponds  to  bottom- 
up  order  of  the  variables.  That  order  is  convenient,  but  not  always  best;  sometimes 
it’s  better  to  remove  the  3s  one  by  one  in  another  order,  based  on  knowledge  of  the 
functions  involved.] 

95.  If  ri  = 1 and  v = gv,  we  can  set  r <-  1 and  forget  about  rh.  (This  change  led  to  a 
100-fold  speedup  in  some  of  the  author’s  experiments.) 

96.  For  V,  just  change  E to  A and  OR  to  AND.  For  0,  change  E to  D and  OR  to  XOR; 
also,  if  v ^ fv,  return  0.  [Routines  for  the  yes/no  quantifiers  A and  N are  analogous  to  Q. 
Yes/no  quantifiers  should  be  used  only  when  m = 1;  otherwise  they  make  little  sense.] 
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97.  Proceeding  bottom-up,  the  amount  of  work  on  each  level  is  at  worst  proportional 
to  the  number  of  nodes  on  that  level. 

98.  The  function  NOTEND(x)  = 3y3z(ADJ(x,  j/)  A ADJ(x,z)  A [y/z])  identifies  all 
vertices  of  degree  > 2.  Hence  ENDPT(x)  = KER(x)  A-iNOTEND(x).  And  PAIR(x,y)  = 
ENDPT(x)  A ENDPT(y)  A ADJ (x,  y). 

[For  example,  when  G is  the  contiguous-USA  graph,  with  the  states  ordered  as 
in  (104),  we  have  B(NOTEND)  = 992,  B(ENDPT)  = 264,  and  jB(PAIR)  = 203.  Before 
applying  3y3z  the  BDD  size  is  50511.  There  are  exactly  49  kernels  of  degree  1.  The 
nine  components  of  size  2 are  obtained  by  mixing  the  following  three  solutions: 


The  total  cost  of  this  calculation,  using  the  stated  algorithms,  is  about  14  megamems, 
in  6.3  megabytes  of  memory  — only  about  52  memory  references  per  kernel.] 

99.  Find  a triangle  of  mutually  adjacent  states,  and  fix  their  colors.  The  BDD  size 
also  decreases  substantially  if  we  choose  states  of  high  degree  in  the  “middle”  levels. 
For  example,  by  setting  a mo  = 6mo  = <Jtn  = &tn  = «ar  = 6ar  = 1 we  reduce  the  25579 
nodes  to  only  4642  (and  the  total  execution  time  also  drops  below  2 megamems). 

[Bryant’s  original  manuscript  about  BDDs  discussed  graph  coloring  in  detail,  but 
he  decided  to  substitute  other  material  when  his  paper  was  published  in  1986.] 

100.  Replace  IND(xme>  • • • , ®ca)  by  IND(;i:me,  • • • , *ca)  A Si2(:tme.  • • • , £ca),  to  get  the 
12-node  independent  sets;  this  BDD  has  size  1964.  Then  use  (73)  as  before,  and  the 
trick  of  answer  99,  getting  a COLOR  function  with  184,260  nodes  and  12,554,677,864 
solutions.  (The  running  time  is  approximately  26  megamems.) 

101.  If  a state’s  weight  is  w,  assign  2w  and  w as  the 
respective  weights  of  its  a and  b variables,  and  use 
Algorithm  B.  (For  example,  variable  aWY  gets  weight 
2(23  + 25)  = 96.)  The  solution,  shown  here  with 
color  codes  (T)  (2)  © © ■ is  unique. 

102.  The  main  idea  is  that,  when  gj  changes,  all  results  in  the  cache  for  functions 
with  fv  > j remain  valid.  To  exploit  this  principle  we  can  maintain  an  array  of  “time 
stamps”  Gi  > G2  > ■ ■ ■ > Gn  > 0,  one  for  each  variable.  There’s  a master  clock  time 
G > G 1,  representing  the  number  of  distinct  compositions  done  or  prepared;  another 
variable  G'  records  whether  G has  changed  since  COMPOSE  was  last  invoked.  Initially 
G = G = Gi  = • • • = Gn  = 0.  The  subroutine  NEWG  (j,  g)  is  implemented  as  follows: 

Nl.  [Easy  case?]  If  g3  — g,  exit  the  subroutine.  Otherwise  set  gj  A-  g. 

N2.  [Can  we  reset?]  If  g ^ Xj,  or  if  j < n and  Gj+1  > 0,  go  to  N4. 

N3.  [Reset  stamps.]  While  j > 0 and  gj  = Xj,  set  Gj  A-  0 and  j A-  j - 1.  Then 

if  j = 0,  set  G A-  G — G',  G'  A-  0,  and  exit. 

N4.  [Update  G?]  If  G'  = 0,  set  G A-  G + 1 and  G'  a-  1. 

N5.  [New  stamps.]  While  j > 0 and  Gj  / G,  set  Gj  a-  G and  j a-  j - 1. 

Exit.  | 
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(Reference  counts  also  need  to  be  maintained  appropriately.)  Before  launching  a top- 
level  call  of  COMPOSE,  set  G'  «-  0.  Change  the  COMPOSE  routine  (72)  to  use  f[Gv] 
in  references  to  the  cache,  where  v = fv;  the  test  ‘v  > m’  becomes  ‘ Gv  = O’. 

103.  The  equivalent  formula  g(fi(xr, xn), . . . , fm(x  1, . . . , xn))  can  be  implemented 
with  the  COMPOSE  operation  (72).  (However,  Dull  was  vindicated  when  it  turned  out 
that  his  formula  could  be  evaluated  more  than  a hundred  times  faster  than  Quick  s,  in 
spite  of  the  fact  that  it  uses  twice  as  many  variables!  In  his  application,  the  computation 
°f  (2/1  — /i(xi, . . . , x„))  A • • • A (ym  = fm(x  1, . . . , xn))  A <7(2/1, . . . , ym)  turned  out  to  be 
much  easier  than  COMPOSE’s  computation  of  gj{fi, . . . , /m)  for  every  subfunction  g3 
of  g;  see,  for  example,  exercise  162.) 

104.  The  following  recursive  algorithm  COMPARE (f,g)  needs  at  most  0(B(f)B(g)) 
steps  when  used  with  a memo  cache:  If  / = g,  return  “=’.  Otherwise,  if  / = 0 or 
9 = 1,  return  *<’;  if  / = 1 or  g = 0,  return  *>’.  Otherwise  represent  / and  g as 
in  (52);  compute  ri  *—  COMPARE (fi,gi).  If  ri  is  ‘ || ’,  return  otherwise  compute 
rh  COMPARE [fh,gh)-  If  rh  is  ‘ return  ‘||’.  Otherwise  if  r/  is  *=’,  return  ru\  if  rh 
is  “=’,  return  n;  if  rj  = rk,  return  rj.  Otherwise  return  ‘ 

105.  (a)  A unate  function  with  polarities  (3/1, . . . ,yn)  has  kx3f  = 0 when  y3  = 1 and 
N Xjf  = 0 when  y3  = 0,  for  1 < j < n.  Conversely,  / is  unate  if  these  conditions  hold 
for  all  j.  (Notice  that  kXjf  = N Xjf  = 0 if  and  only  if  QXjf  = 0,  if  and  only  if  / doesn’t 
depend  on  x}.  In  such  cases  y3  is  irrelevant;  otherwise  y3  is  uniquely  determined.) 

(b)  The  following  algorithm  maintains  global  variables  (pi, ...  ,pn),  initially  zero, 
with  the  property  that  Pj  = +1  if  Vj  must  be  0 and  Pj  = -1  if  y}  must  be  1;  Pj  will 
remain  zero  if  / doesn’t  depend  on  x}.  With  this  understanding,  UNATE(/)  is  defined  as 
follows:  If  / is  constant,  return  true.  Otherwise  represent  / as  in  (50).  Return  false  if  ei- 
ther UNATE(/;)  or  UNATE(/h)  is  false-,  otherwise  set  r <—  COMPARE(/f,  fh)  using  exer- 
cise 104.  If  r is  ‘||’,  return  false.  If  r is  *<’,  return  false  if  pv  < 0,  otherwise  set  pv  <-  +1 
and  return  true.  Ifris‘>  , return  false  \i  pv  > 0,  otherwise  set  pv  < 1 and  return  free. 

This  algorithm  often  terminates  quickly.  It  relies  on  the  fact  that  f(x)  < g(x)  for 
all  x if  and  only  if  f(x®y)  < g{x@y)  for  all  x,  when  y is  fixed.  If  we  simply  want  to  test 
whether  or  not  / is  monotone,  the  p variables  should  be  initialized  to  +1  instead  of  0. 

106.  Define  HORN(/,  g,  h)  thus:  If  f > g,  interchange  / g.  Then  if  / = 0 or  h = 1, 
return  true.  Otherwise  if  g = 1 or  h = 0,  return  false.  Otherwise  represent  /,  g, 
and  h as  in  (59).  Return  true  if  HORN(/,,  gi,  hi),  HORN  (f,,gh,h,),  HORN  (fh,gi,h,), 
and  HORN (fh,gh,hh)  are  all  true;  otherwise  return  false.  [This  algorithm  is  due  to 
T.  Horiyama  and  T.  Ibaraki,  Artificial  Intelligence  136  (2002),  189-213,  who  also 
introduced  an  algorithm  similar  to  that  of  answer  105(b).] 

107.  Let  e$f$g$h  mean  that  e(x)  = f(y)  = g(z)  = 1 implies  h((xyz ))  = 1.  Then  / is  a 
Krom  function  if  and  only  if  /$/$/$/,  and  we  can  use  the  following  recursive  algorithm 
KROM(e,  f,g,h):  Rearrange  {e,f,g}  so  that  e < / < g.  Then  if  e = 0 or  h = 1,  return 
true.  Otherwise  if  / = 1 or  h = 0,  return  false.  Otherwise  represent  e,  /,  g,  h with  the 
quaternary  analog  of  (59).  Return  true  if  KROM (e,,  ft,  gi,  h,),  KROM {et,  fh  gh,  h,), 
KROM  {e,,fh,ghh,),  KROM  (e,,fh,gh,hh),  KROM  (eh,  fh  9l,  h,),  KROM(efe,  /,,  gh,  hh), 
KROM(eh,  fh,gi,  hh),  and  KROM(eh,  fh,  gh,  hh)  are  all  true ; otherwise  return  false. 

108.  Label  the  nodes  {1, . . . , s}  with  root  1 and  sinks  {s  - 1,  s};  then  (s-3)!  permuta- 
tions of  the  other  labels  give  different  dags  for  the  same  function.  The  stated  inequality 
follows  because  each  instruction  (vk?  lk : hk)  has  at  most  n(s  - l)2  possibilities,  for 
1 < k < s — 2.  (In  fact,  it  holds  also  for  arbitrary  branching  programs,  namely  for 
binary  decision  diagrams  in  general,  whether  or  not  they  are  ordered  and/or  reduced.) 
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Since  l/(s  — 3)!  < (s  — l)3/s!  and  s!  > (s/e)s,  we  have  (generously)  b(n,s ) < 
(nse)s . Let  sn  = 2n/(n  + 9),  where  9 = lge  = l/ln2;  then  lg b(n,sn)  < snlg(nsne)  = 
2n(l  — (lg(l  + 9/n))/(n  + 6))  = 2n  — 0(2 n/n2).  So  the  probability  that  a random  n- 
variable  Boolean  function  has  B(f)  < sn  is  at  most  l/2n(2"/n2).  And  that  is  really  tiny. 

109.  i/2n(2”/n2)  is  really  tiny  even  when  multiplied  by  n\. 

110.  Let  fn  — Mm(xn-m+i, . . . , 0, . . . , 0, *i, . . . , xn—m ) V (*n— m+i  A • • • A xn  A 

[0  . . . 0*i  . . . Xn  — m is  a square]),  when  2m  1+m— l<n<  2m  + m.  Each  term  of 
this  formula  has  2m  + m — n zeros;  the  second  term  destroys  all  of  the  2m-bit  squares. 
[See  H.-T.  Liaw  and  C.-S.  Lin,  IEEE  Transactions  C-41  (1992),  661-664;  Y.  Breitbart, 
H.  Hunt  III,  and  D.  Rosenkrantz,  Theoretical  Comp.  Sci.  145  (1995),  45-69.] 

111.  Let  pn  = A (n  — An),  and  notice  that  pn  = m if  and  only  if  2 m + m < n < 
2m+1  + m+  1.  The  sum  for  0 < k < n - pn  is  2"-''"  - 1;  the  other  terms  sum  to  22'1”. 

112.  Suppose  k = n — lg  n + lg  a.  Then 


exp 


If  a < | we  have  2n  n/“a/n  < l/(n2n+1);  hence  bk  = (2n/“  — 2n^2a^)(2 n~n^aa/n)  x 
(1  + 0(2-"/“))  = 2fe(l  - 0(2-n/(2“))).  And  if  a > 2 we  have  2 n~n/aa/n  > 2 "/2+1/n; 
thus  bk  = (22"“  - 22n  k~1)(l  + 0(exp(  — 2"/2/n))). 

[For  the  variance  of  bk , see  I.  Wegener,  IEEE  Trans.  C-43  (1994),  1262-1269.] 

113.  The  idea  looks  attractive  at  first  glance,  but  loses  its  luster  when  examined  closely. 
Comparatively  few  nodes  of  a BDD  base  appear  on  the  lower  levels,  by  Theorem  U; 
and  algorithms  like  Algorithm  S spend  comparatively  little  of  their  time  dealing  with 
those  levels.  Furthermore,  nonconstant  sink  nodes  would  make  several  algorithms  more 
complicated,  especially  those  for  reordering. 

114.  For  example,  the  truth  table  might  be  01010101  00110011  00001111  00001111. 


115.  Let  Nk  = &o  H hfefc-i  be  the  number  of  nodes  (7)  of  the  BDD  for  which  j < k. 

The  sum  of  the  in-degrees  of  those  nodes  is  at  least  Nk;  the  sum  of  the  out-degrees  is 
2 Nk;  and  there’s  an  external  pointer  to  the  root.  Thus  at  most  A^  + 1 branches  can 
cross  from  the  upper  k levels  to  lower  levels.  Every  subtable  of  order  n-k  corresponds 
to  some  such  branch.  Therefore  qk  < Nk  + 1. 

Moreover,  we  must  have  qk  < bk  - 1 b bn,  because  every  subtable  of  order  n-k 

corresponds  to  a unique  bead  of  order  < n — k. 

For  (124),  change  ‘BDD’  to  ‘ZDD\  ‘ bk ’ to  lzk’,  ‘bead’  to  ‘zead’,  and  to  ‘ q'k\ 

116.  (a)  Let  vk  = 22*  + 22""1  +-.-  + 22°.  Then  Q(f)  < min(2fc-1,  22"+1_t)  = 

Un  + Examples  like  (78)  show  that  this  upper  bound  cannot  be  improved. 


(b)  qk/bk  = 22-7(22 


- 22 


) for  0 < k < n\  qn  = bn 


Hence 


117.  qk  = 2k  for  0 < k < m,  and  qm+k  = 2m  + 2 — k for  1 < k < 2" 

Q(f)  = 22m  1 + 7-2m  1 — 1 ~ B(f) 78.  (Such  /’ s make  QDDs  unattractive  in  practice.) 

118.  If  n = 2m  — 1 we  have  hn(x  1, . . . , xn)  = Mm(zm_i, . . . , zo\ 0,  *1, . . . , xn),  where 
(•2m  — 1 • • • ^0)2  = *1  + • • • + x„  is  computable  in  5 n — 5 m steps  by  exercise  7.1.2-30, 
and  Mm  takes  another  2n  + 0(v/n)  by  exercise  7.1.2-39.  Since  hn(xi, . . . , x„)  = 
h„+k{x  1, . . . ,xn,0, . . . ,0),  we  have  C(hn)  < 14 n + 0(y/h)  for  all  n.  (A  little  more 
work  will  bring  this  down  to  7 n + O(7nlogn);  can  the  reader  do  better?) 

The  cost  of  /i4  is  6 = L{hA),  and  x2  ® ((*r  ® ( x2  A *4))  A (*3  ® (*2  A *4)))  is  a 
formula  of  shortest  length.  (Also  C(hs)  — 10  and  L(h5)  = 11.) 
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119.  True.  For  example,  S2, 3,5(2:  i,...,x6)  = h13{xux2, 0, 0, 1, 1, 0, 1, 0,  x3,  x4,  x5,  x6). 

120.  We  have  h*(x  i,...,xn)  = hn(y1, . . . , yn),  where  yj  = Xjn  for  1 < j < n.  And 
hn(yi,  ...,y„)  = yyi+...+yn  = yxl  + ...+Xn  = X(xl+...+Xn*)„. 

121.  (a)  If  yk  = xn+1_k  we  have  hn(y  i, . . . , yn)  = yuy  = yn_vx  = xn+1_(n_vx)  = x„x+1. 

(b)  If  x = (xi, . ,.,xn)  and  t e {0, 1}  we  have  hn+1(x,t)  = (f?  xvx+p.  xvx). 

(c)  No.  For  example,  ip  sends  0*11 i-*-  0fc_1101 0fc_2102l  i->  ■ • • 10*T  i->  0fcll. 

(In  spite  of  its  simple  definition,  ip  has  remarkable  properties,  including  fixed  points 
such  as  10011010000101011000111001011  and  11101111011001011101111101111.) 

(d)  In  fact,  hn(xi  . . ,x„)  = ari (!) , by  induction  using  recurrence  (b). 

(If  f(xi, . . . , xn)  is  any  Boolean  function  and  r is  any  permutation  of  the  binary 
vectors  Xl  . . . xn,  we  can  write  f(x)  = f(xr),  and  the  transformed  function  / may  well 
be  much  easier  to  work  with.  Since  f(x)  A g(x ) = f(xr)  A g(xr),  the  transform  of  the 
AND  of  two  functions  is  the  AND  of  their  transforms,  etc.  The  vector  permutations 
(si  . . . xn)n  = X\n  . . . xn„  that  merely  transform  the  indices,  as  considered  in  the  text, 
are  a simple  special  case  of  this  general  principle.  But  the  principle  is,  in  a sense,  too 
general,  because  every  function  / trivially  has  at  least  one  r for  which  / is  skinny 
in  the  sense  of  exercise  170;  all  the  complexity  of  / can  be  transferred  to  r.  Even 
simple  transformations  like  ip  have  limited  utility,  because  they  don’t  compose  well; 
for  example,  ipip  is  not  a transformation  of  the  same  type.  But  linear  transformations, 
which  take  x \ xT  for  some  nonsingular  binary  matrix  T,  have  proved  to  be  useful 
ways  to  simplify  BDDs.  [See  S.  Aborhey,  IEEE  Trans.  C-37  (1988),  1461-1465;  J.  Bern, 
C.  Meinel,  and  A.  Slobodova,  ACM/IEEE  Design  Automation  Conf.  32  (1995),  408- 
413;  C.  Meinel,  F.  Somenzi,  and  T.  Theobald,  IEEE  Trans.  CAD-19  (2000),  521-533.]) 

122.  For  example,  when  n = 7 the  recurrence  in  answer  121(b)  gives 


where  shaded  nodes  compute  the  subfunction  hDR  on  the  variables  that  haven’t  yet  been 
tested.  Simplifications  occur  at  the  bottom,  because  h2(x i,x2)  = xx  and  h%R(xx,x2)  = 
x2.  [See  D.  Sieling  and  I.  Wegener,  Theoretical  Comp.  Sci.  141  (1995),  283-310.] 

123.  Let  t = k — s = xi  + ■ ■ ■ + xk.  There’s  a slate  for  every  combination  of  s'  Is  and 
t'  0s  such  that  s'  + t!  = w,  s'  < s,  and  t'  < t.  The  sum  of  (“)  = (“)  over  all  such 
(s',t')  is  (97).  (Notice  furthermore  that  it  equals  2W  if  and  only  if  w < min(s,f).) 

124.  Let  m = n-k.  Each  slate  [r0, . . . ,rm]  corresponds  to  a function  of  (xk+1, . . . ,x„), 
whose  truth  table  is  a bead  except  in  four  cases:  (i)  [0, . . . , 0]  = 0;  (ii)  [1, . . . , 1]  = 1; 
(iii)  [0^,1]  = xn  (which  doesn’t  depend  on  1);  (iv)  [1, . . . , 1,  xk+1, 0, . . . , 0],  where 
there  are  p Is  so  that  xk+1  = rp,  is  S<p(xk+2,  ...,xn). 
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The  following  polynomial-time  algorithm  computes  qk  = q and  bk  = q — q by 
counting  all  slates.  A subtle  aspect  arises  when  the  entries  of  [ro, . . . , rm]  are  all  0 or  1, 
because  such  slates  can  occur  for  different  values  of  s;  we  don’t  want  to  count  them 
twice.  The  solution  is  to  maintain  four  sets 


Cab  = In  + • • • + rm_ i | ro  = a and  rm  = b in  some  slate}. 


The  value  of  07r  should  be  artificially  set  to  n + 1,  not  0.  Assume  that  0 < k < n. 

HI.  [Initialize.]  Set  m<— n — k,  q<^q'<— sir-  0,  Coo  4—  Coi  <—  Cio  <—  C n <—  0. 

H2.  [Find  v and  ie.]  Set  v <—  ^2^=1  [(*  + j)7r  < &]  and  w t—  v + [s7r  < k]  + 
[(s  + m) 7r  <k].  If  v = m — 1,  go  to  step  H5. 

H3.  [Check  for  nonbeads.]  Set  p < 1.  If  v m — 2,  go  to  H4.  Otherwise,  if 

m = 2 and  ( s + 1)tt  = n,  set  p «—  [(s  + 2)7r  < k].  Otherwise,  if  w = m and 
(s  + j)n  = k + 1 for  some  j € [1 . . m— 1],  set  p <—  j. 

H4.  [Add  binomials.]  For  all  s'  and  t!  such  that  s'  + t1  = w,  0 < s'  < s,  and 
0 < t'  < k — s,  set  q <—  q + (“)  and  q q + [s'  =p\.  Then  go  to  H6. 

H5.  [Remember  0-1  slates.]  Do  the  following  for  all  s'  and  t'  as  in  step  H4:  If 
(s  + m) 7r  < k,  set  Coo  t—  Coo  U s'  and  Coi  Coi  U (s'  — 1);  otherwise  set 
Coi  4—  Coi  U s'.  If  S7r  < k and  (s  + m) n < k,  set  Cio  t—  Cio  U (s'  — 1)  and 

Cn  <—  Cn  U (s'  — 2).  If  S7r  < k and  (s  + m)n  > k , set  Cn  <—  Cn  U (s'  — 1). 

H6.  [Loop  on  s.]  If  s < k,  set  s <—  s + 1 and  return  to  H2. 

H7.  [Finish.]  For  ab  = 00,  01,  10,  and  11,  set  q 4-  q+  ("“(T1)  for  all  r 6 Cab-  Also 
set  q 4 — q'  + [0  £ Coo]  T \rn—  1 G Cn].  | 

125.  Let  S(n,m ) = (]})  + ■ • • + (J^).  There  are  S(k  + 1 — s,  s)  — 1 nonconst.ant  slates 

when  0 < s < k and  s > 2k  — n + 2.  The  only  other  nonconstant  slates,  one  each, 

arise  when  s = 0 and  k < (n  — l)/2.  The  constant  slates  are  trickier  to  count,  but 
there  usually  are  S(n  + 1 — k,  2k  + 1 — n)  of  them,  appearing  when  s = 2k  — n or 
s — 2k  + 1 — n.  Taking  account  of  nitpicky  boundary  conditions  and  nonbeads,  we  find 

n — k 

bk  = S(n  — k,2k  — n)  + S(n  — k — s,  2k  + l — n+s) 

s= 0 

— minffc,  n — k)  — [n  = 2k\  — [3fc  > 2n  — 1]  — 1 

for  0 < k < n.  Although  S(n,m)  has  no  simple  form,  we  can  express  & fc  as 

-Bn/2  + Eo<m<n-2Kn(n  + 3 — to  — 2 fc)(^)  + (small  change)  when  n is  even,  and  the 
same  expression  works  when  n is  odd  if  we  replace  Bn/2  by  A(n+1)/2-  The  double  sum 
can  be  reduced  by  summing  first  on  k,  since  (k  + 1)(  !)  = (”■ + 1)0; 


£(<»+=- ■»)( u"  7r++,2)/2J  )-<*»+ ^( u" 


— m + 4)  / 2] 
m + 2 


))• 


And  the  remaining  sum  can  be  tackled  by  breaking  it  into  four  parts,  depending  on 
whether  m and/or  n is  odd.  Generating  functions  are  helpful:  Let  A(z)  = 
and  B(z)  = £*<„&"+: [)z\  Then  A(z)  = 1 + E,<n  + Efc<n  (V^1)*"  = 

1 + Efc<n  (n2kk)zn+1  + T,k<n  {Zk+i) zn+2  = 1 + zA(z ) + z2B(z).  A similar  derivation 


proves  that  B(z)  = zB(z)  + zA(z).  Consequently 


A(z) 


1 — z 


1-2 ■* 


1-22  + 22 


l-Z-22-24 


B(z 


2 + 2 


1-22  + 22 


1 — 2 — 22  — 24 
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Thus  An  = 2An-i  - An-z  + An-3  = An_i  + An- 2 + An- 4 for  n > 4,  and  Bn 
satisfies  the  same  recurrences.  In  fact,  we  have  An  = (3P2n+i  + 7P2n  — 2P2n-i)/23 
and  Bn  = (3P2n+2  + 7P2n+i  — 2P2„)/23,  using  the  Perrin  numbers  of  exercise  15. 

Furthermore,  setting  A'(z)  = Efc<n  (";>“**)  2"  and  B*(z ) = £*<„  MaM-*)*"-  we 
find  A*(z)  = z2A(2)P(2)  and  B*  (z)  = z2B(z)2.  Putting  it  all  together  now  yields  the 
remarkable  exact  formula 


B(hn) 


56  Pn+2  + 77  Pn+1  + 47Pn 

n2 

7n+  1 

23 

. 4 J 

L 3 J 

+ (n  mod  2)  — 10. 


Historical  notes:  The  sequence  (An)  was  apparently  first  studied  by  R.  Austin 
and  R.  K.  Guy,  Fibonacci  Quarterly  16  (1978),  84-86;  it  counts  binary  x\  . . . xn-\  with 
each  1 next  to  another.  The  plastic  constant  \ was  shown  by  C.  L.  Siegel  to  be  the 
smallest  “Pisot  number,”  namely  the  smallest  algebraic  integer  > 1 whose  conjugates 
all  lie  inside  the  unit  circle;  see  Duke  Math.  J.  11  (1944),  597-602. 

126.  When  n > 6,  we  have  bk  = Fyy jfe+7)/2J  + F|-(jc+7)/2]  — 4 for  1 < k < 2n/3,  and 
bk  = 2n~k+2  — 6 — [k  = n — 2]  for  4n/5  < k < n.  But  the  main  contributions  to  B(hn) 
come  from  the  2n/15  profile  elements  between  those  two  regions,  and  the  methods  of 
answer  125  can  be  extended  to  deal  with  them.  The  interesting  sequences 


A 


n — 


n — 2k 
3 k 


)■ 


Bn  = 


n — 2k 
3k + 1 


)■ 


have  respective  generating  functions  (1  — z)2/p{z),  (1  — z)z/p(z),  z2/p(z ),  where  p(z)  = 
(1  — zf  — z& . These  sequences  arise  in  this  problem  because  £][L0(Ln_2fe/3J)  = An  + 
Bn~  1 + Cn—2 - They  grow  as  an,  where  a ss  1.7016  is  the  real  root  of  (a— l)3a2  = 1. 

The  BDD  size  can’t  be  expressed  in  closed  form,  but  there  is  a closed  form  in  terms 
of  ^L"/3J  through  Ayn/3 j+4  that  is  accurate  to  0(2’I^4/y/n).  Thus  B(h„)  = 0(an^3). 

127.  (The  permutation  7r  = (3,  5,  7,  . . . , 2 n'  — 1,  n,  n — 1,  n — 2,  . . . , 2 n , 2 n — 2, 

. . . , 4,  2,  1),  n!  = [2n/5j,  turns  out  to  be  optimum  for  hn  when  12  < n < 24;  but  it 
gives  B(hioo)  = 1,366,282,025.  Sifting  does  much  better,  as  shown  in  answer  152;  but 
still  better  permutations  almost  surely  exist.) 

128.  Consider,  for  example,  M3(x4,  X2,  X7;  X6,  xi,  xs,  x3,  xg,  Xu,  x$,  Xio).  The  first  m 
variables  {x4,X2,xr}  are  called  “address  bits”;  the  other  2m  are  called  “targets.”  The 
subfunctions  corresponding  to  xi  = ci,  . . . , Xk  = Ck  can  be  described  by  slates  of 
options  analogous  to  (96).  For  example,  when  k = 2 there  are  three  slates  [xe,  0,  £9,  in], 
[£6, 1,  £9,  £11],  [£8,  £3,  £5,  £10],  where  the  result  is  obtained  by  using  (14X7)2  to  select 
the  appropriate  component.  Only  the  third  of  these  depends  on  £3;  hence  g2  = 3 and 
b2  = 1.  When  k = 6 the  slates  are  [0,0],  [0,1],  [1,0],  [1,1],  [£s,0],  [£s,l],  [^9,2:11], 
[0, £10],  and  [ 1 , £10] , with  components  selected  by  £7;  hence  qe  = 9 and  be  = 7. 

In  general,  if  the  variables  {£1, . . . , Xk}  include  a address  bits  and  t targets,  the 
slates  will  have  A = 2m-a  entries.  Divide  the  set  of  all  2m  targets  into  2“  subsets, 
depending  on  the  known  address  bits,  and  suppose  Sj  of  those  subsets  contain  j known 
targets.  (Thus  so  + si  + • • • + = 2“  and  si  + 2s2  + • • • + Asa  = t.  We  have 
(so,...,s4)  = (1,1, 0,0,0)  when  k = 2 and  a = t = 1 in  the  example  above;  and 
(so,si,s2)  = (1,2,1)  when  k = 6,  a = 2,  t = 4.)  Then  the  total  number  of  slates,  qk, 
is  2°so  + 2X s\  + •■•-(-  2a~xsa-i  + 2^[s^  > 0].  If  £k+i  is  an  address  bit,  the  number 
bk  of  slates  that  depend  on  £jt+i  is  qk  — 2 a^2[sa  >0].  Otherwise  bk  = 2C,  where  c is 
the  number  of  constants  that  appear  in  the  slates  containing  target  £jt+i. 
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129.  (Solution  by  M.  Sauerhoff;  see  I.  Wegener,  Branching  Programs  (2000),  Theorem 
6.2.13.)  Since  Pm(xu...,xm  2)  = Qm(Xl, . . . , xm2 ) A Sm(aq, . . . ,xm2)  and  B(Sm)  = 
m3  + 2,  we  have  B(P£l)  < (m3  + 2 )B(Q^n).  Apply  Theorem  K. 

(A  stronger  lower  bound  should  be  possible,  because  Qm  seems  to  have  larger 
BDDs  than  Pm.  For  example,  when  m = 5 the  permutation  (l7r, . . . ,257r)  = (3,  1,  5, 
7,  9,  2,  4,  6,  8,  10,  11,  12,  13,  14,  15,  16,  20,  23,  17,  21,  19,  18,  22,  24,  25)  is  optimum 
for  Qs;  but  B(Ql ) = 535,  while  S(P5)  = 229.) 

130.  (a)  Each  path  that  starts  at  the  root  of  the  BDD  and  takes  .s  HI  branches  and  t 
LO  branches  defines  a subfunction  that  corresponds  to  graphs  in  which  s adjacencies 
are  forced  and  t are  forbidden.  We  shall  show  that  these  (s/f ) subfunctions  are  distinct. 

If  subfunctions  g and  h correspond  to  different  paths,  we  can  find  k vertices  W 
with  the  following  properties:  (i)  W contains  vertices  w and  w'  with  w — w'  forced 
in  g and  forbidden  in  h.  (ii)  No  adjacencies  between  vertices  of  W are  forced  in  h or 
forbidden  in  g.  (iii)  If  u £ W and  v ^ W and  u — v is  forced  in  h,  then  u = w or 
u = w'.  (These  conditions  make  at  most  2 s + t = m-k  vertices  ineligible  to  be  in  W.) 

We  can  set  the  remaining  variables  so  that  u — v if  and  only  if  (w,  v } C IT,  when- 
ever adjacency  is  neither  forced  nor  forbidden.  This  assignment  makes  g = 1,  h = 0. 

(b)  Consider  the  subfunction  of  Cmj  fm/2]  in  which  vertices  {1, . . . , k}  are  required 
to  be  isolated,  but  u v whenever  k < u < [m/2]  < v < m.  Then  a fc-clique  on 
the  [m/2j  vertices  { [m/2]  + 1, . . . , m}  is  equivalent  to  an  [m/2]-clique  on  {1, . . . , m}. 
In  other  words,  this  subfunction  of  Cm, \m/l\  is  Cym/2p k- 

Now  chose  k sa  i/m/3  and  apply  (a).  [I.  Wegener,  JACM  35  (1988),  461-471.] 

131.  (a)  The  profile  can  be  shown  to  be  (1,  1,  2,  4,  ... , 2q~1,  (p-2)  x (29  -1,  qx  2q~1), 
2q  — 1,  2q  1 , . - - , 4,  2,  1,  2),  where  r x b denotes  the  r-fold  repetition  of  b.  Hence  the 
total  size  is  ( pq  + 2p  — 2q  + 2)2q~1  - p + 2. 

(b)  With  the  ordering  Xl,  x2,  ...,  xp,  3/11,  3/21,  ypl,  ...,  ylq,  y2q,  ...,  ypq , 
the  profile  comes  to  (1,  2,  4,  . . . , 2P  \ (q~l)p  x (2^),  2P~\  . . . , 4,  2,  1,  2),  making 
the  total  size  (pq  — p + 4)2P_1. 

(c)  Suppose  exactly  m = [min(p,  q)/2]  x’s  occur  among  the  first  k variables  in 
some  ordering;  we  may  assume  that  they  are  {aq, . . . ,xm}.  Consider  the  2m  paths  in 
the  QDD  for  C such  that  xj  = im+j  for  1 < j < p - m and  ytj  = [i  = j or  i = j + m 
or  j > m].  These  paths  must  pass  through  distinct  nodes  on  level  k.  Hence  qk  > 2m; 
use  (85).  [See  M.  Nikolskaia  and  L.  Nikolskaia,  Theor.  Comp.  Sci.  255  (2001),  615-625.] 

Optimum  orderings  for  (p,q)  = (4,4),  (4,5),  and  (5,4),  via  exercise  138,  are: 

xiyilX2y2lX3y3iy4iy12V22y32y42yi3y23y33y43yi4y24y34y44X4  (size  108); 

•Ei 3/11 2-2 2/21  :r3f/3i 3/41 3/123/223/323/423/13 2/233/33 3/433/143/243/3433443/153/25 3/353/452:4  (size  140); 
xryiiX2y2iyi2y22yi3y23yi4y24X3y3iy32y33y34X4y4iy42y5iys2y43y53y44y54xs  (size  167). 

132.  There  are  616,126  essentially  different  classes  of  5-variable  functions,  by  Table 
7.1. 1-5.  The  maximum  Bmin(/),  17,  is  attained  by  38  of  those  classes.  Three  classes 
have  the  property  that  B(f  *)  = 17  for  all  permutations  7r;  one  such  example,  ((a;2  ® 
X4  ® (aq  A (x3  V *4)))  A ((aq  ® x5)  V (a:3  ® x4)))  © (x5  A (aq  © (aq  V x2))),  has  the 
interesting  symmetries  /(aq,  aq,  aq,  a;4,  aq)  = f(x2,x3,x4,Xl,xs)  = f( x2,  xs,  Xl,  x3,  x4). 

Incidentally,  the  maximum  difference  Hmax(/)  - Bmin(f)  = 10  occurs  only  in  the 
“junction  function”  class  aq?  aq:  aq?  a:4:  aq,  when  Bmin  = 7 and  Bmax  = 17. 

(When  n = 4 there  are  222  classes;  and  Bmin(f)  = 10  in  25  of  them,  including  S2 
and  S2,4.  The  class  exemplified  by  truth  table  16ad  is  uniquely  hardest,  in  the  sense 
that  Bmin(/)  = 10  and  most  of  the  24  permutations  give  B(f7r)  = 11.) 
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133.  Represent  each  subset  X C { 1 , . . . , n } by  the  n-bit  integer  i(X ) = ^2x€X  2x  l, 
and  let  6,(x).x  be  the  weight  of  the  edge  between  X and  X U x.  Set  Co  t—  0,  and  for 
1 < i < 2n  set  Ci  «—  min{cj®j  + bi®j,x  \ j = 2X~1  and  i & j / 0}.  Then  = 

c 2«  — i + 2,  and  an  optimum  ordering  can  be  found  by  remembering  which  x = x(i) 
minimizes  each  Ci.  For  Bmax,  replace  ‘min’  by  ‘max’  in  this  recipe. 


The  maximum  profile,  (1,2,4, 2,2),  occurs  on  paths  such  as  0 — > {2}  — > {2,3}  — » 
{2,3,4}  — > {1,2,3, 4}.  The  minimum  profile,  (1,2,2, 1,2),  occurs  only  on  the  paths 

0 — > ({3}  or  {4})  — > {3,4}  — ► {1,3,4}  — > {1,2, 3, 4}.  (Five  of  the  24  possible  paths 
have  the  profile  (1,  2, 3,  2,  2)  and  are  unimprovable  by  sifting  on  any  variable.) 

135.  Let  do  — 1,  0i  = xi,  62  = x\  A X2 , and  9n  = xn ? Qn-i:  6n-3  for  n > 3.  One  can 
prove  that,  when  n > 4,  B(9%)  = n + 2 if  and  only  if  (nn, . . . , l7r)  = (1, . . . ,n).  The  key 
fact  is  that  if  k < n and  n > 5,  the  subfunctions  obtained  by  setting  Xk  <—  0 or  Xk  ■<— 

1 are  distinct,  and  they  both  depend  on  the  variables  {xi, . . . ,Xk-i,Xk+i,  ■ ■ ■ ,xn}, 
except  that  the  subfunction  for  xn-i  <—  0 does  not  depend  on  xn_2-  Thus  the  weights 
{xk}  — ► {xk,xi}  in  the  master  profile  chart  are  2 except  when  k = n or  (k,l)  = 
(n— l,n  — 2).  Below  {xn-i,  x„-2}  there  are  three  subfunctions,  namely  x„?  9„- 4:  9n- 3, 
in?  9n-5\  9n- 3,  and  0n_ 3;  all  of  them  depend  on  {xi, . . . , xn_3},  and  two  of  them  on  x„. 

136.  Let  n = 2 n — 1 and  m = 2m!  — 1.  The  inputs  form  an  m x n matrix,  and  we’re 
computing  the  median  of  m row-medians.  Let  Vt  be  the  variables  in  row  i.  If  X is 
a subset  of  the  mn  variables,  let  Xi  = X IT  V)  and  r\  = Y,  | . Subfunctions  of  type 
(»i  , . . . , Sm  ) arise  when  exactly  s,  elements  of  X,  are  set  to  1;  these  subfunctions  are 

(S1S2  ...Sm),  where  Si  = S>n/_Si (Vi\Xi)  and  0 < s,  < n for  1 < * < m. 

When  x ^ X,  we  want  to  count  how  many  of  these  subfunctions  depend  on  x.  By 
symmetry  we  may  assume  that  x = xm„.  Notice  that  the  symmetric  threshold  function 
S>t(x i, . . . , xn)  equals  0 if  t > n,  or  1 if  t < 0;  it  depends  on  all  n variables  if  1 < t < n. 
In  particular,  Sm  depends  on  x for  exactly  rm$n  = inin(rm  + 1 , n — rm)  choices  of  sm. 

Let  a,j  = f°r  0 < j < n.  Then  an  of  the  functions  {Si, . . . , Sm_i} 

are  constant,  and  an_i  + • • • + ani  of  them  might  or  might  not  be  constant.  Choosing 
Ci  to  be  nonconstant  gives  us  (rm$n)((a„  + <Zn- 1 + • • ■ + an'  — cn_  1 cn')$m)  times 

V • • ( an'  ) l“°2ai  . . . (n')0"'-1  (n  - l)c"'  (n  - 2)c"'+i  . . . lCn_1 

V On— 1 / V cn>  J 


distinct  subfunctions  that  depend  on  x.  Summing  over  {cn-i , . . . , c„< } gives  the  answer. 
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When  variables  have  the  natural  row-by-row  order,  these  formulas  apply  with 
rm  = k mod  n,  an  — [k/n\ , ao  = m — 1 — an.  The  profile  element  bk  for  0 < k < mn  is 
therefore  ([ k/n\$m)((k  mod  n)$n),  and  we  have  ^j^To  = (m'n1)2  + 2.  This  ordering 
is  optimum,  although  no  easy  proof  is  apparent;  for  example,  some  orderings  can 
decrease  bn+2  or  &2n-2  from  4 to  3 while  increasing  bk  for  other  k. 

Every  path  from  top  to  bottom  of  the  master  chart  can  be  represented  as  ao  -4 
Qi  -4  • • • -4  amn,  where  each  Qj  is  a string  r,i  . . . rjm  with  0 < rji  < ■ ■ ■ < rjm  < n, 

T] i 4 I -rjm  = j,  one  coordinate  increasing  at  each  step.  For  example,  one  path  when 

m-  5 and  n = 3 is  00000  -4  00001  -4  00011  -4  00111  -4  00112  -4  00122  -4  00123  -4 
01123  -4  11123  -4  11223  -4  12223  -4  12233  -4  12333  -4  22333  -4  23333  -4  33333.  We 
can  convert  this  path  to  the  “natural”  path  by  a series  of  steps  that  don’t  increase  the 
total  edge  weight,  as  follows:  In  the  initial  segment  up  to  the  first  time  rjm  = n,  do 
all  transitions  on  the  rightmost  coordinate  first.  (Thus  the  first  steps  of  the  example 
path  would  become  00000  -4  00001  -4  00002  -4  00003  -4  00013  -4  00113  -4  00123.) 
Then  in  the  final  segment  after  the  last  time  r,i  = 0,  do  all  transitions  on  the  leftmost 
coordinate  last.  (The  final  steps  would  thereby  become  01123  -4  01223  -4  02223  -4 
02233  -4  02333  -4  03333  -4  13333  -4  23333  -4  33333.)  Then,  after  the  first  n steps, 
normalize  the  second-last  coordinates  in  a similar  fashion  (00003  -4  00013  -4  00023  -4 
00033  -4  00133  — > 01133  — > 01233  -4  02233);  and  before  the  last  n steps,  normalize  the 
second  coordinates  (00133  -4  00233  -4  00333  -4  01333  -4  02333  -4  03333).  Et  cetera. 

[This  back-and-forth  proof  technique  was  inspired  by  the  paper  of  Bollig  and 
Wegener  cited  below.  Can  every  nonoptimal  ordering  be  improved  by  merely  sifting?] 

137.  If  we  add  a clique  of  c new  vertices  and  (^)  new  edges,  the  cost  of  the  opti- 

mum arrangement  increases  by  (‘’I]1).  So  we  may  assume  that  the  given  graph  has 
m edges  and  n vertices  {1, . . . ,rz},  where  m and  n are  odd  and  sufficiently  large.  The 
corresponding  function  /,  which  depends  on  mn  + m + 1 variables  xtj  and  sk  for 
1 < * < rn,  1 < j < n,  and  0 < k < m,  is  J(s0,  Si, . . . , sm;  h,gi, . . . ,gm),  where 
Qi  — ® VCivi  ) A f\{xiw  | w ^ {Mi,Vj}}  when  the  ith  edge  is  Ui  — Vi,  and  where 

h = ((*11  . . . Xmi)  ■ ■ ■ (xin  ■ ■ ■ Xmn ))  is  the  transpose  of  the  function  in  exercise  136. 

One  can  show  that  Bmin(/)  = min^  + )2 (n±l)2 +mn+m+2; 

the  optimum  ordering  uses  ( ) 2 ( ^ y 2 nodes  for  h , n + | m n - vl  7r  | nodes  for  g{ , one 
node  for  each  s*,,  and  two  sink  nodes,  minus  one  node  that  is  shared  between  h and 
some  gt.  [See  B.  Bollig  and  I.  Wegener,  IEEE  Trans.  C-45  (1996),  993-1002.] 

138.  (a)  Let  Xk  = {*i, . . . , **,}.  The  QDD  nodes  at  depth  k represent  the  subfunctions 
that  can  arise  when  constants  replace  the  variables  of  Xk.  We  can  add  an  rc-bit  field 
DEP  to  each  node,  to  specify  exactly  which  variables  of  Xn  \ Xk  it  depends  on.  For 
example,  the  QDD  for  / in  (92)  has  the  following  subfunctions  and  DEPs: 

depth  0:  0011001001110010  [1111]; 
depth  1:  00110010  [0111],  01110010  [0111]; 
depth  2:  0010  [0011],  0011  [0010],  0111  [0011]; 
depth  3:  00  [0000],  01  [0001],  10  [0001],  11  [0000]. 

An  examination  of  all  DEP  fields  at  depth  k tells  us  the  master  profile  weights  between 
Xk  and  Xk  U for  0 < k < l < n. 

(b)  Represent  the  nodes  at  depth  k as  triples  Nkp  = ( lkpi  hkp,  dkp)  for  0 < p < qk, 
where  ( lkp,hkp ) are  the  (LO,HI)  pointers  and  dkp  records  the  DEP  bits.  If  k < n, 
these  nodes  branch  on  xk+i,  so  we  have  0 < lkp,hkp  < qk+ 1;  but  if  k = n,  we  have 
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ln o = hn o = 0 and  ln\  = hn\  = 1 to  represent  [I]  and  [T] . We  define 
dkp  = ^ | Nkp  depends  on  xt}', 

hence  0 < dkp  < 2n~k . For  example,  the  QDD  (82)  is  equivalent  to  Noo  = (0,1,7); 
N10  = (0,1,3),  Nu  = (1,2,3);  N20  = (0,0,0),  N21  = (0,1,1),  N22  = (1,1,0);  N30  = 
(0,0,0),  JV31  = (1,1,0). 

To  jump  up  from  depth  b to  depth  a,  we  essentially  make  two  copies  of  the  nodes 
at  depths  b — 1,  b — 2,  . . . , a,  one  for  the  case  Xb+i  = 0 and  one  for  the  case  aj^+i  = 1. 
Those  copies  are  moved  down  to  depths  b,  b — 1,  . . . , a + 1,  and  reduced  to  eliminate 
duplicates.  Then  every  original  node  at  depth  a is  replaced  by  a node  that  branches  on 
Xb+i;  its  LO  and  HI  fields  point  respectively  to  the  0-copy  and  the  1-copy  of  the  original. 

This  process  involves  some  simple  (but  cool)  list  processing  to  update  DEPs  while 
bucket  sorting:  Nodes  are  unpacked  into  a work  area  consisting  of  auxiliary  arrays  r,  s, 
t,  u,  and  v , initially  zero.  Instead  of  using  lkp  and  hkp  for  LO  and  HI,  we  store  HI  in  cell 
Up  of  the  work  area,  and  we  let  vp  link  to  the  previous  node  (if  any)  with  the  same  LO 
field:  furthermore  we  make  si  point  to  the  last  node  (if  any)  for  which  LO  = l.  The  algo- 
rithm below  uses  UNPACK(p,  l,  h)  as  an  abbreviation  for  “up  4—  h,  vp  4—  si,  si  4—  p + 1.” 

When  nodes  of  depth  k have  been  unpacked  in  this  way  to  arrays  s,  u,  and  v, 
the  following  subroutine  ELIM(fc)  packs  them  back  into  the  main  QDD  structure  with 
duplicates  eliminated.  It  also  sets  rp  to  the  new  address  of  node  p. 

El.  [Loop  on  L]  Set  q 4—  0 and  th  4—  0 for  0 < h < qk+i ■ Do  step  E2  for  0 < l < qk+i- 

Then  set  qk  4—  q and  terminate. 

E2.  [Loop  on  p.\  Set  p 4—  si  and  s;  4—  0.  While  p > 0,  do  step  E3  and  set  p 4—  vp-\. 

Then  resume  step  El. 

E3.  [Pack  node  p — 1.]  Set  h 4—  up- 1.  (The  unpacked  node  has  (LO,  HI)  = ( l,h ).)  If 

th  / 0 and  lk(th-i)  = l,  set  rp_  1 4—  th  — 1.  Otherwise  set  Ikq  4—  l,  hkq  4—  h,  dkq  4— 

((d(fc+1);  |d(jc+1)/l)<l)  + [/  /h],  rp_i  4-  q,  q 4-  q+ 1,  th  4-  q.  Resume  step  E2.  | 

We  can  now  use  ELIM  to  jump  up  from  b to  a.  (i)  For  k = b — 1,6  — 2,  a, 
do  the  following  steps:  For  0 < p < qk,  set  l 4 — lkp,  h 4—  hkP\  if  k = b — 1, 
UNPACK(2p,  Ibi,  hbi)  and  UNPACK(2p+l,  Ibh,  hbh),  otherwise  UNPACK(2p,  r2i,  r2h)  and 
UNPACK(2p  + l,r2i+i,r2h+i)  (thereby  making  two  copies  of  Nkp  in  the  work  area). 
Then  ELIM(fc  + 1).  (ii)  For  0 < p < qa,  UNPACK(p, r2P, r2p+i)-  Then  ELIM(a). 
(iii)  If  a > 0,  set  l 4-  Z(a-i)p,  h,  4-  /i(0_1)p,  l(a-i)P  <-  n,  h(o-i)P  Th , for  0 < p < qa- 1. 

This  jump-up  procedure  garbles  the  DEP  fields  above  depth  a,  because  the  vari- 
ables have  been  reordered.  But  we’ll  use  it  only  when  those  fields  are  no  longer  needed. 

(c)  By  induction,  the  first  2n~2  steps  account  for  all  subsets  that  do  not  contain  n; 
then  comes  a jump-up  from  n — 1 to  0,  and  the  remaining  steps  account  for  all  subsets 
that  do  contain  n. 

(d)  Start  by  setting  yk  4—  k and  uik  4 — 2k  — 1 for  0 < k < n.  In  the  following 
algorithm,  the  y array  represents  the  current  variable  ordering,  and  the  bitmap  Wk  = 
Y^{2yj  I 0 < j < k}  represents  the  set  of  variables  on  the  top  k levels. 

We  augment  the  subroutine  ELIM(fc)  so  that  it  also  computes  the  desired  edge 
weights  of  the  master  profile:  Counters  Cj  are  initially  0 for  0 < j < n — k;  after 
setting  dkq  in  step  E3,  we  set  Cj  4—  Cj  + 1 for  each  j such  that  23  C dkq;  finally  we  set 
bwk, yk+j  + i 4 — Cj  for  0 < j < n — k,  using  the  notation  of  answer  133.  [To  speed  this  up, 
we  could  count  bytes  not  bits,  increasing  by  I for  0 < j < (n  — k)/ 8.] 
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We  initialize  the  DEP  fields  by  doing  the  following  for  k = n — 1,  n — 2,  . . . , 0: 
UNPACK(p,  Ifcp,  hkp)  for  0 < p <C  q k\  ELIM(Zc);  if  A’  0.  set  l i — l(k  — i)p,  b i — h(fc— i)pi 
l(k-i)P  n,  and  /i(*_i)p  «-  rfe,  for  0 < p < qk-\. 

The  main  loop  of  the  algorithm  now  does  the  following  for  1 < i < 2n_1:  Set 
a ~ 1 and  b 4-  ui  + pi.  Set  (ya,  ■■■,yb)  4-  ( yb,ya , ■ ■ ■ ,yb-i ) and  (w0+i, . . . ,wb)  4- 
{2Vb  + wa,  ■ ,2Vb  + Wb-i).  Jump  up  from  b to  a with  the  procedure  of  part  (b);  but 
use  the  original  (non-augmented)  ELIM  routine  for  ELIM(a)  in  step  (ii). 

(e)  The  space  required  for  nodes  at  depth  k is  at  most  Qk  = min(2fe,22"  k)-  we 
also  need  space  for  2 max(Q1, . . . , Qn)  elements  in  arrays  r,  u , v,  plus  max(Qi, . . . , Qn ) 
elements  in  arrays  s and  t.  So  the  total  is  dominated  by  0(2nn)  for  the  outputs  bw,x. 

Subroutine  ELIM(fc)  is  called  (£)  times  in  augmented  form,  for  0 < k < n.  and 

(£“!)  times  non-augmented.  Its  running  time  in  either  case  is  0(qk{n  - k)).  Thus  the 

total  comes  to  0(^2k  (£)2fc(n  — k))  = 0(3"n),  and  it  will  be  substantially  less  if  the 
QDD  never  gets  large.  (For  example,  it’s  0((1  + \f2)nn)  for  the  function  hn.) 

[The  first  exact  algorithm  to  determine  optimum  variable  ordering  in  a BDD  was 
introduced  by  S.  J.  Friedman  and  K.  J.  Supowit,  IEEE  Trans.  C-39  (1990),  710-713. 
They  used  extended  truth  tables  instead  of  QDDs,  obtaining  a method  for  m = 1 that 
required  0(3n/%/n)  space  and  0(3nn2)  time,  improvable  to  0(3nn).] 

139.  The  same  algorithm  applies,  almost  unchanged:  Consider  all  QDD  nodes  that 

branch  on  xa  to  be  at  level  0,  and  all  nodes  that  branch  on  xb+i  to  be  sinks.  Thus 
we  do  2b~a  jump-ups,  not  2n_1.  (The  algorithm  doesn’t  rely  on  the  assumptions  that 

<?o  = 1 and  qn  = 2,  except  in  the  space  and  time  analyses  of  part  (e).) 

140.  We  can  find  shortest  paths  in  a network  without  knowing  the  network  in  advance, 
by  generating  vertices  and  arcs  “on  the  fly”  as  needed.  Section  7.3  points  out  that  the 
distance  d(X,  Y ) of  each  arc  X — > Y can  be  changed  to  d'(X,Y)  = d(X.Y)  — l(X)+l(Y) 
for  any  function  l(X),  without  changing  the  shortest  paths.  If  the  revised  distances  d! 
are  nonnegative,  l(X)  is  a lower  bound  on  the  distance  from  A'  to  the  goal;  the  trick  is 
to  find  a good  lower  bound  that  focuses  the  search  yet  isn’t  difficult  to  compute. 

If  | A' | = Z,  and  if  a QDD  for  / with  A on  its  top  l levels  has  q nonconstant 
nodes  on  the  next  level,  then  /(A)  = ma x(q,n  — Z)  is  a suitable  lower  bound  for  the 
Bmin  problem.  [See  R.  Drechsler,  N.  Drechsler,  and  W.  Gunther,  ACM/IEEE  Design 
Automation  Conf.  35  (1998),  200-205.]  However,  a stronger  lower  bound  is  needed 
to  make  this  approach  competitive  with  the  algorithm  of  exercise  138,  unless  / has  a 
relatively  short  BDD  that  cannot  be  attained  in  very  many  ways. 


141.  False.  Consider  g(xx  V •••  V x6,  x7  V • • • V zi2,  (*i3  V • • • V xi6)  0 *i8,  x17,  xi9V 
■■■  v *22),  where  g{yi,...,y5)  = ((((jh  V ys)  A y4)  ® y3)  A (( y , A y2)  © y4  © j/5))  © y5. 
Then  B(g)  = 40  = Bmin(g)  can’t  be  achieved  with  {xi3, . . . , a:i6,  Xis}  consecutive. 
[M.  Teslenko,  A.  Martinelli,  and  E.  Dubrova,  IEEE  Trans.  C-54  (2005),  236-237.] 


142.  (a)  Suppose  m is  odd.  The  subfunctions  that  arise  after  (aq, . . . ,*m+1)  are  known 

are  [wm+2Xm+2  H 1-  wnxn  > 2Tn~1m  - 2m~2  - t],  where  0 < t < 2m.  The  subcases 

Xm+2  + • ■ ■ + xn  = (to  — l)/2  show  that  at  least  w2)  of  these  subfunctions  differ. 

But  organ-pipe  order,  (xxx2™ x\xT ~2 x\  ■ ■ ■ x2Z-2  2"'^2 x2^ ),  is  much 
better:  Let  tk  = aq  + (2m-l):r2+2:3+-  ■ ■ + (2m-2fc~1):c2fc+2fc-1:E2fc+i,  for  1 < k < m-1. 
The  remaining  subfunction  depends  on  at  most  2k  + 2 different  values,  \tk/2k]. 

(b)  Let  n = 1 + 4m2.  The  variables  are  x0  and  xtj  for  0 <i,j<  2m;  the  weights 
are  wo  = 1 and  Wij  = 2!  + 22m+1+;,m.  Let  X[  be  the  first  Z variables  in  some  ordering, 
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and  suppose  Xi  includes  elements  in  ii  rows  and  ji  columns  of  the  matrix  (xij).  If 
ma x(ii,ji)  = m , we  will  prove  that  qt  > 2m;  hence  B(f)  > 2m  by  (85). 

Let  I and  J be  subsets  of  {1, ... , 2m)  with  \I\  = |J|  = m and  Xt  C x0  U {xtj  \ 
i £ 7,  j e J}\  let  /'  and  J'  be  the  complementary  subsets.  Choose  m elements 
X'  1=  Xi  \ xo,  in  different  rows  (or,  if  i;  < m,  in  different  columns).  Consider  2m 
paths  in  the  QDD  defined  as  follows:  x0  = 0,  and  xi3  = 0 if  a Uj  6 Xi  \ X' \ also 
xi'j  = xij'  = xi'j'  = Xij  for  * 6 I,  j 6 J,  where  i <r+  i1  and  j O j'  are  matchings 
between  I 4-/  /'  and  J 4-4  J' . Then  there  are  2m  distinct  values  t = ft1.  .,  r waxa ; 
but  Ylo<i,i<2mWijXij  = (22m  — l)(l+22m+1m)  on  each  path.  The  paths  must  pass 
through  distinct  nodes  on  level  l.  Otherwise,  if  t ^ t' , one  of  the  lower  subpaths  would 
lead  to  [X] , the  other  to  [T] . 

[These  results  are  due  to  K.  Hosaka,  Y.  Takenaga,  T.  Kaneda,  and  S.  Yajima, 
Theoretical  Comp.  Sci.  180  (1997),  47-60,  who  also  proved  that  \Q(f)  — Q(fR)\  < n. 
Do  self-dual  threshold  functions  always  satisfy  also  | B(f)  — B(fR)  \ < n?] 


143.  In  fact,  the  algorithm  of  exercises  133  and  138  proves  that  organ-pipe  order  is 
best  for  these  weights:  (1,  1023,  1,  1022,  2,  1020,  4,  1016,  8,  1008,  16,  992,  32,  960,  64, 
896,  128,  768,  256,  512)  gives  the  profile  (1,  2,  2,  4,  3,  6,  4,  8,  5,  10,  4,  8,  3,  6,  2,  4, 
1,  2,  2,  1,  2)  and  B(f)  = 80.  The  worst  ordering,  (1022,  896,  512,  64,  8,  1,  4,  32,  1008, 
1020,  768,  992,  1016,  1023,  960,  256,  128,  16,  2,  1),  makes  B(f)  = 1913. 

(One  might  think  that  properties  of  binary  notation  are  crucial  to  this  example. 

Rut  It.t  t.2,.4^.8^16^1  60^116  2M  224  5M  «20  649  fi64  672  676  678  679\  ■ , 

.DUl  \pO\X2X^X^Xr>XQ  Xj  Xg  Xg  3^10  #11  #12  #13  ^14  #15  #16  *^17  #18  ^19  #20  / ^ elCtu- 

ally  the  same  function,  by  exercise  7.1. 1-103(1). ) 


144.  (5,7,7,10,6,9,5,4,2);  the  QDD-not-BDD  nodes  correspond  to  fi,  /2,  /3,  0,  1. 

145.  Bmin  = 31  is  attained  in  (36).  The  worst  ordering  for  + (2/33/22/12/0)2 

is  2/o , 2/i)  2/2)  2/3)  2:2,  a;i,  xo,  X3 , making  Bmax  = 107.  Incidentally,  the  worst  ordering 
for  the  24  inputs  of  12-bit  addition,  (a:n  . . .*0)2  + (2/11  • • -2/0)2,  turns  out  to  be  t/o,  2/i, 
• ■ • , 2/11,  *10,  a:g,  xe,  X4,  x3,  x5,  x2,  xt,  xi,  x9,  xo,  Xu,  yielding  Bmax  = 39111. 

[B.  Bollig,  N.  Range,  and  I.  Wegener,  Lecture  Notes  in  Comp.  Sci.  4910  (2008), 
174-185,  have  proved  that  -Bmjn  = 9n  — 5 for  addition  of  two  n-bit  numbers  whenever 
n > 1,  and  also  that  Bm-m(MTn)  = 2 n — 2m  + 1 for  the  2m-way  multiplexer.] 

146.  (a)  Obviously  b0  < q0;  and  if  q0  = b0  + a0,  then  61  < 2 b0  + a0  = b0  + q0.  Also 
9o  — 6o  = a0  < bi+q2  < qi,  the  number  of  strings  of  length  2 on  a <j2-Ietter  alphabet;  sim- 
ilarly 60  + 61  +92  < {bi+q2)2.  (The  same  relations  hold  between  qk,  qk+2,  6fe,  and  bk+ 1.) 

(b)  Let  the  subfunctions  at  level  2 have  truth  tables  a3  for  1 < j < q2 , and  use 
them  to  construct  beads  ft,  ... , /3bl  at  level  1.  Let  (71, ... , 7,2+6l ) be  the  truth  tables 
(aiQi, . . , , a92a92,  ft , . . . ,fti).  If  b0  < 6i/2,  let  the  functions  at  level  0 have  truth 
tables  {fti-ift;  | 1 < i < 60}  U {ftft  | 2 b0  < j < bi}  U {7 jjj  \ 1 < j < b0  + q0  - h). 
Otherwise  it’s  not  difficult  to  define  b0  beads  that  include  all  the  ft s,  and  use  them  at 
level  0 together  with  the  nonbeads  {7,-7,-  | 1 < j < qo  — 6o}- 

147.  Before  doing  any  reordering,  we  clear  the  cache  and  collect  all  garbage.  The 
following  algorithm  interchanges  levels  (u)  <->  (v)  when  v = u+ 1.  It  works  by  creating 
linked  lists  of  solitary,  tangled,  and  hidden  nodes,  pointed  to  by  variables  S,  T,  and 
H (initially  A),  using  auxiliary  LINK  fields  that  can  be  borrowed  temporarily  from  the 
hash-table  algorithm  of  the  unique  lists  as  they  are  being  rebuilt. 

Tl.  [Build  5 and  T.\  For  each  (u)-node  p,  set  q /-  L0(p),  r 4-  HI(p),  and  delete  p 
from  its  hash  table.  If  V (<y)  / v and  V(r)  v ( p is  solitary),  set  LINK(p)  4-  S and 
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S 4—  p.  Otherwise  ( p is  tangled),  set  REF(g)  4—  REF(g)  — 1,  REF (r)  4-  REF(r)  - 1, 
LINK(p)  4-  T,  and  T 4-  p. 

T2.  [Build  H and  move  the  visible  nodes.]  For  each  @-node  p,  set  q 4—  L0(p) , 
r 4—  HI (p) , and  delete  p from  its  hash  table.  If  REF (p)  = 0 (p  is  hidden),  set 
REF(g)  4—  REF(g)  — 1,  REF(r)  4— REF(r)  — 1,  LINK(p)  4—  H,  and  H 4—  p;  otherwise 
(p  is  visible)  set  V(p)  4—  u and  INSERT(u,p). 

T3.  [Move  the  solitary  nodes.]  While  S / A,  set  p 4-  S,  S 4—  LINK(p),  V(p)  4—  v, 
and  INSERT(n,p). 

T4.  [Transmogrify  the  tangled  nodes.]  While  T / A,  set  p 4-  T,  T 4-  LINK(p),  and 
do  the  following:  Set  q 4-  L0(p),  r 4-  HI(p).  If  V(g)  > v,  set  go  4-  gx  4-  g; 
otherwise  set  g0  4-  L0(g)  and  gi  4-  HI(g).  If  V(r)  > v,  set  r0  4-  n 4-  r; 
otherwise  set  r0  4—  L0(r)  and  ri  4-  HI(r).  Then  set  L0(p)  4-  UNIQUE(v,  g0,  r0), 
HI(p)  4-  UNIQUE(n,gi,ri),  and  INSERT(u,p). 

T5.  [Kill  the  hidden  nodes.]  While  H / A,  set  p 4-  H,  H 4-  LINK(p),  and  recycle 
node  p.  (All  of  the  remaining  nodes  are  alive.)  | 

The  subroutine  INSERT(u,p)  simply  puts  node  p into  x„’s  unique  table,  using  the  key 
(L0(p) , HI  (p) );  this  key  will  not  already  be  present.  The  subroutine  UNIQUE  in  step 
T4  is  like  Algorithm  U,  but  instead  of  using  answer  82  it  treats  reference  counts  quite 
differently  in  steps  U1  and  U2:  If  U1  finds  p = g,  it  increases  REF  (p)  by  1;  if  U2  finds  r, 
it  simply  sets  REF(r)  4—  REF(r)  + 1. 

Internally,  the  branch  variables  retain  their  natural  order  1,  2,  . . . , n from  top  to 
bottom.  Mapping  tables  p and  n represent  the  current  permutation  from  the  external 
user’s  point  of  view,  with  p = tt~;  thus  the  user’s  variable  xv  appears  on  level  vn  — 1, 
and  node  UNIQUE(u,p,  g)  on  level  v — 1 represents  the  user’s  function  ( xvp ? p:  q).  To 
maintain  these  mappings,  set  j 4-  up,  k 4—  vp,  up  4—  k,  vp  4—  j,  jn  4—  v,  kn  4—  u. 

148.  False.  For  example,  consider  six  sinks  and  nine  source  functions,  with  extended 
truth  tables  1156,  2256,  3356,  4456,  5611,  5622,  5633,  5644,  5656.  Eight  of  the  nodes 
are  tangled  and  one  is  visible,  but  none  are  hidden  or  solitary.  There  are  16  newbies: 
15,  16,  25,  26,  35,  36,  45,  46,  51,  61,  52,  62,  53,  63,  54,  64.  So  the  swap  takes  15  nodes 
into  31.  (We  can  use  the  nodes  of  B(x 3 © X4,X3  © *4)  for  the  sinks.) 

149.  The  successive  profiles  are  bounded  by  (bo,  61, ... , bn ),  (bo  + bi,2bo,  62,  • • • , bn), 
(b0  + bi,  2bo  + b2, 4fe0,  f>3,  ■ • ■ , b„),  . . . , (2°bo  + bi, . . . , 2k~2b0  + bk-\,  2k~1b0,bk,  ■ ■ ■ , bn). 

Similarly,  we  also  have  B(f? , . . . , /£)  < B(fx, . . . , /m)  + 2(b0  + • • - + bfc_i)  in  addi- 
tion to  Theorem  J+,  because  swaps  contribute  at  most  2bk-i,  2bk-2,  • • • , 2bo  new  nodes. 

150.  We  may  assume  that  m = 1,  as  in  exercise  52.  Suppose  we  want  to  jump  Xk  to 
the  position  that  is  jth  in  the  ordering,  where  j / k.  First  compute  the  restrictions 
of  / when  Xk  = 0 and  x*  = 1 (see  exercise  57);  call  them  g and  h.  Then  renumber 
the  remaining  variables:  If  j < k,  change  (xj, . . . ,xk-i)  to  (xj+i, . . . , xk);  otherwise 
change  (xfc+i, . . . ,Xj)  to  (xk, . . .,Xj- 1).  Then  compute  / 4-  (x}  A g)  V (xj  A h),  using 
the  linear-time  variant  of  Algorithm  S in  exercise  72. 

To  show  that  this  method  has  the  desired  running  time,  it  suffices  to  prove  the 
following:  Let  g(x  1, . . . ,xn)  and  h(x %, . . . ,xn ) be  functions  such  that  g(x)  = 1 implies 
Xj  = 0 and  h(x)  = 1 implies  Xj  = 1.  Then  the  meld  g oh  has  at  most  twice  as  many 
nodes  as  g V h.  But  this  is  almost  obvious,  when  truth  tables  are  considered:  For 
example,  if  n = 3 and  j = 2,  the  truth  tables  for  g and  h have  the  respective  forms 
abOOcdOO  and  OOstOOuv.  The  beads  /3  of  g V h on  levels  < j correspond  uniquely  to  the 
beads  /3'  0/3"  of  g oh  on  those  levels,  because  (3  = /3'  V f3"  can  be  “factored”  in  only 
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one  way  by  putting  Os  in  the  appropriate  places.  And  the  beads  j3  of  g V h on  levels 
> j correspond  to  at  most  two  beads  of  g o h,  namely  to  /3  o [T]  and/or  [Xj  o /3. 

[See  P.  Savicky  and  I.  Wegener,  Acta  Informatica  34  (1997),  245-256,  Theorem  1.] 

151.  Set  tk  4—  0 for  1 < k < n,  and  make  the  swapping  operation  Xj-i  4-4  Xj  also  swap 
tj- 1 4-4  tj.  Then  set  k 4—  1 and  do  the  following  until  k > n:  If  tk  = 1 set  k <—  k + 1; 
otherwise  set  t*,  4—  1 and  sift  £*,. 

(This  method  repeatedly  sifts  on  the  topmost  variable  that  hasn’t  yet  been  sifted. 
Researchers  have  tried  fancier  strategies,  such  as  to  sift  the  largest  level  first;  but  no 
such  method  has  turned  out  to  dominate  the  simple-minded  approach  proposed  here.) 

152.  Applying  Algorithm  J as  in  answer  151  yields  B(hi00)  = 1,382,685,050  after 
17,179  swaps,  which  is  almost  as  good  as  the  result  of  the  “hand-tuned”  permuta- 
tion (95).  Another  sift  brings  the  size  down  to  300,451,396;  and  further  repetitions 
converge  down  to  just  231,376,264  nodes,  after  a total  of  232,951  swaps. 

If  the  loops  of  steps  J2  and  J5  are  aborted  when  S > 1.05s,  the  results  are  even 
better(l),  although  fewer  swaps  are  made:  1,342,191,700  nodes  after  one  sift  reduce 
eventually  to  208,478,228  after  139,245  total  swaps.  Moreover,  Filip  Stappers  used 
sifting  together  with  random  swapping  in  September  2010  to  get  the  value  of  B(hi00) 
down  to  only  198,961,868,  with  the  following  “current  champion”  permutation  n: 

3 4 6 8 10  12  14  16  18  20  22  24  27  28  30  32  35  37  39  41 

43  45  47  49  51  53  54  83  85  98  99  100  79  77  81  75  73  95  71  97 

69  96  57  91  67  59  65  60  63  62  64  61  66  87  58  68  56  94  93  70 

92  72  90  74  76  78  80  89  88  86  84  82  55  52  50  48  46  44  42  40 

38  36  34  33  31  29  26  25  23  21  19  17  15  13  11  9 7 5 1 2 

Incidentally,  if  we  sift  the  variables  of  hioo  in  order  of  profile  size,  so  that  £60  is 
sifted  first,  then  £59,  x^i,  £58,  £57,  £62,  £56,  etc.  (wherever  they  currently  happen  to 
be),  the  resulting  BDD  turns  out  to  have  2,196,768,534  nodes. 

Simple  “downhill  swapping”  instead  of  full  sifting  is  of  no  use  whatever  for  hwo'- 
The  (1°°)  swaps  X 1 4-4  £2,  x3  +4  £1,  £3  4-4  £2,  ...,  £100  4-4  £1,  ...,  £100  4-4  £99 
completely  reverse  the  order  of  all  variables  without  changing  the  BDD  size  at  any  step. 

153.  Each  gate  is  easily  synthesized  using  recursions  like  (55).  About  1 megabyte  of 
memory  and  3.5  megamems  of  computation  suffice  to  construct  the  entire  BDD  base  of 
8242  nodes.  Using  exercise  138  we  may  conclude  that  the  ordering  £7,  £3,  xg,  xi,  o9,  01, 
03,  07,  £4,  xq , o6,  04,  02,  o8,  £2,  £8,  o5,  £5  is  optimum,  and  that  Bmin(j/i, . . . , t/9)  = 5308. 

Reordering  of  variables  is  not  advisable  for  a problem  such  as  this,  since  there 
are  only  18  variables.  For  example,  autosifting  whenever  the  size  doubles  would  require 
more  than  100  megamems  of  work,  just  to  reduce  8242  nodes  to  about  6400. 

154.  Yes:  CA  was  moved  between  ID  and  OR  at  the  last  sifting  step,  and  we  can  work 
backwards  all  the  way  to  deduce  that  the  first  sift  moved  ME  between  MA  and  RI. 

155.  The  author’s  best  attempt  for  (a)  is 

ME  NH  VT  MA  CT  RI  NY  DE  NJ  MD  PA  DC  VA  DH  WV  KY  NC  SC  GA  FL  AL  IN  MI  IA 
IL  M0  TN  AR  MS  TX  LA  CO  WI  KS  SD  ND  NE  OK  WY  MN  ID  MT  NM  AZ  OR  CA  WA  UT  NV 

giving  B(fi)  = 403,  B(f£ ) = 677,  B(fi,f£)  = 1073;  and  for  (b)  the  ordering 

NH  ME  MA  VT  CT  RI  NY  DE  NJ  MD  PA  VA  DC  OH  WV  KY  TN  NC  SC  GA  FL  AL  IN  MI 
IL  IA  AR  M0  MS  TX  LA  CO  KS  OK  WI  SD  NE  ND  MN  WY  ID  MT  AZ  NM  UT  OR  CA  WA  NV 

gives  B(/D  = 352,  B(ff)  = 702,  B(fi,fi)  = 1046. 
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156.  One  might  expect  two  “siftups”  to  be  at  least  as  good  as  a single  sifting  process 
that  goes  both  up  and  down.  But  in  fact,  benchmark  tests  by  R.  Rudell  show  that  siftup 
alone  is  definitely  unsatisfactory.  Occasional  jump-downs  are  needed  to  compensate  for 
variables  that  temporarily  jump  up,  although  their  optimum  final  position  lies  below. 

157.  A careful  study  of  answer  128  shows  that  we  always  improve  the  size  when  the  first 
address  bit  that  follows  a target  bit  is  jumped  up  past  all  targets.  [But  simple  swaps 
are  too  weak.  For  example,  M2(x\,  x6;  x2,  x3,  *4,  *5)  and  M3{xi,  xw,  sen;  x2,  x3, . . . , x9) 
are  locally  optimal  under  the  swapping  of  Xj-\  Xj  for  any  j .} 

158.  Consider  first  the  case  when  m = 1 and  n = 3f  — 1 > 5.  Then  if  nn  = k,  the 
number  of  nodes  that  branch  on  j is  a,j  if  jn  < k,  bj  if  jn  = k,  and  an+2-j  if  jn  > k, 
where 

aj  — j — 3max(j  - 2f,0),  bj  = min (j,t,n  + 1 — j). 

The  cases  with  {x\, . . . , xn-\}  consecutive  are  k = 1 and  B(f1T)  = 3t2  + 2;  k = n and 
B(fn)  = 3f2  + l.  But  when  k = [n/2]  we  have  B(fn)  = [3t/2j  ( [3f/2]  — l)+n—  [t/2J  +2. 

Similar  calculations  apply  when  m > 1:  We  have  B(fn)  > 6(p^3)  + B(gn)  when 
7r  makes  {*1, . . . , xp}  consecutive,  but 

B(D  « 2(p/2)  + %B(gn) 

when  7r  puts  {xp+\ ,...,xp+m}  in  the  middle.  Since  g is  fixed,  pB(gn)  = O(n)  as 
n — > 00. 

[If  g is  a function  of  the  same  kind,  we  obtain  examples  where  symmetric  variables 
within  g are  best  split  up,  and  so  on.  But  no  Boolean  functions  are  known  for  which 
the  optimum  B(fn)  is  less  than  3/4  of  the  best  that  is  obtainable  under  the  constraint 
that  no  blocks  of  symmetric  variables  are  split.  See  D.  Sieling,  Random  Structures  & 
Algorithms  13  (1998),  49-70.] 

159.  The  function  is  almost  symmetric,  so  there  are  only  nine  possibilities.  When 
the  center  element  x is  placed  in  position  (1,2, . . . , 9)  from  the  top,  the  BDD  size  is 
respectively  (43, 43, 42, 39,  36, 33,  30, 28,  28). 

160.  (a)  Compute  Afeo  A^=o (~'Lij(X)),  a Boolean  function  of  64  variables  — for  ex- 
ample, by  applying  COMPOSE  to  the  relatively  simple  L function  of  exercise  159, 
100  times.  With  the  author’s  experimental  programs,  about  320  megamems  and  35 
megabytes  are  needed  to  find  this  BDD,  which  has  251,873  nodes  with  the  normal 
ordering.  Then  Algorithm  C quickly  finds  the  desired  answer:  21,929,490,122.  (The 
number  of  11  x 11  solutions,  5,530,201,631,127,973,447,  can  be  found  in  the  same  way.) 

(b)  The  generating  function  is  1 + 64z  -)-  201622  + 39740a3  H b80z45 -t-8246,  and 

Algorithm  B rapidly  finds  the  eight  solutions  of  weight  46.  Three  of  them  are  distinct 
under  chessboard  symmetry;  the  most  symmetric  solution  is  shown  as  (A0)  below. 

(c)  The  BDD  for  A,8=i  Aj=ihLij(X))  has  305,507  nodes  and  21,942,036,750 
solutions.  So  there  must  be  12,546,628  wild  ones. 

(d)  Now  the  generating  function  is  40.Z14  + 936z15  + 10500z16  b 16z55  + z56; 

examples  of  weight  14  and  56  appear  below  as  (Al)  and  (A2). 

(e)  Exactly  28  of  weight  27  and  54  of  weight  28,  all  tame;  see  (A3). 

(f)  There  are  respectively  (26260,  5,  347,  0,  122216)  solutions,  found  with  about 
(228,  3,  32,  1,  283)  megamems  of  calculation.  Among  the  lightest  and  heaviest  solutions 
to  (1)  are  (A4)  and  (A5);  the  nicest  solution  to  (2)  is  (A6);  (A7)  and  (A9)  solve  (3) 
lightly  and  (5)  heavily.  Pattern  (4),  which  is  based  on  the  binary  representation  of  n, 
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has  no  8 x 8 predecessor;  but  it  does,  for  example,  have  the  9 x 10  in  (A8): 


(AO)  (Al)  (A2)  (A3)  (A4)  (A5)  (A6)  (A7)  (A8)  (A9) 

161.  (a)  With  the  normal  row-by-row  ordering  (m,  x12, . . . , x„(„_i),  xnn),  the  BDD 
has  380,727  nodes  and  characterizes  4,782,725  solutions.  The  computational  cost  is 
about  2 gigamems,  in  100  megabytes.  (Similarly,  the  29,305,144,137  still  Lifes  of  size 
10  x 10  can  be  enumerated  with  14,492,923  nodes,  after  fewer  than  50  gigamems.) 

(b)  This  solution  is  essentially  unique;  see  (Bl)  below.  There’s  also  a unique  (and 
obvious)  solution  of  weight  36. 

(c)  Now  the  BDD  has  128  variables,  with  the  ordering  (xn,yu, . . . ,xnn,ynn). 
We  could  first  set  up  BDDs  for  [L(X)  = Y\  and  [L(Y)  = X],  then  intersect  them;  but 
that  turns  out  to  be  a bad  idea,  requiring  some  36  million  nodes  even  in  the  7x7 
case.  Much  better  is  to  apply  the  constraints  Lij(X)  = ytJ  and  Lij(Y)  = Xij  row  by 
row,  and  also  to  add  the  lexicographic  constraint  X < Y so  that  still  Lifes  are  ruled 
out  early.  The  computation  can  then  be  completed  with  about  20  gigamems  and  1.6 
gigabytes;  there  are  978,563  nodes  and  582,769  solutions. 

(d)  Again  the  solution  is  unique,  up  to  rotation;  see  the  “spark  plug”  (B2)  AA  (B3). 
(And  (B4)  AA  (B5)  is  the  unique  7x7  flip-flop  of  constant  weight  26.  Life  is  astonishing.) 


(Bl)  (B2)  (B3)  (B4)  (B5)  (B6) 


162.  Let  T(X)  = [X  is  tame]  and  Ek(X)  = [X  escapes  after  k steps].  We  can  compute 
the  BDD  for  each  Ek  by  using  the  recurrence 

Ei(X)  = -,T(X);  Ek+1(X)  = 3 Y(T(X)  A (L(X)  = Y]  A Ek(Y)). 

(Here  3Y  stands  for  3 yu  3yi2  • • • 3r/66-  As  noted  in  answer  103,  this  recurrence  turns 
out  to  be  much  more  efficient  than  the  rule  Ek+i  = T(X)  A Ek(Lii(X), . . . , L66(A)), 
although  the  latter  looks  more  “elegant.”)  The  number  of  solutions,  \Ek\,  is  found 
to  be  (806544  • 216,  657527179  • 24,  2105885159,  763710262,  331054880,  201618308, 
126169394,  86820176,  63027572,  41338572,  30298840,  17474640,  9797472,  5258660, 
3058696,  1416132,  523776,  204192,  176520,  62456,  13648,  2776,  2256,  440,  104,  0) 
for  k = (1,  2,  . . . , 26);  thus  J2lh  \Ek\  = 67,166,017,379  of  the  236  = 68,719,476,736 
possible  configurations  eventually  escape  from  the  6x6  cage.  (One  of  the  104  procras- 
tinators in  E2 5 is  shown  in  (B6)  above.) 

BDD  techniques  are  excellent  for  this  problem  when  k is  small;  for  example, 
B(E\)  = 101  and  B(E2)  = 14441.  But  Ek  eventually  becomes  a complicated  “nonlocal” 
function:  The  size  peaks  at  B(Ee)  = 28,696,866,  after  which  the  number  of  solutions 
gets  small  enough  to  keep  the  size  down.  More  than  80  million  nodes  are  present  in  the 
formula  T(X)  A [L(X)  = Y]  AE$(Y)  before  quantification;  this  stretches  memory  limits. 
Indeed,  the  BDD  for  Vfc=i  Ek{X)  takes  up  more  space  than  its  233-byte  truth  table. 
Therefore  a “forward”  method  for  this  exercise  would  be  preferable  to  the  use  of  BDDs. 
(Cages  larger  than  6x6  appear  to  be  impossibly  difficult,  by  any  known  method.) 

163.  Suppose  first  that  o is  A.  We  obtain  the  BDD  for  / = g A h by  taking  the  BDD 
for  g and  replacing  its  [T]  sink  by  the  root  of  the  BDD  for  h.  To  represent  also  /,  make 
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a separate  copy  of  the  BDD  for  g,  and  use  a BDD  base  for  both  h and  h\  replace  the 
[X]  in  the  copy  by  [T] , and  replace  the  [T]  in  the  copy  by  the  root  of  the  BDD  for  h. 
This  decision  diagram  is  reduced  because  h isn’t  constant. 

Similarly,  if  o is  ®,  we  obtain  a BDD  for  / = g<$h  (and  possibly  /)  from  the  BDD 
for  g (and  possibly  g)  after  replacing  [±]  and  [T]  by  the  roots  of  BDDs  for  h and  h. 

The  other  binary  operations  o are  essentially  the  same,  because  B(f)  = B(f).  For 
example,  if  / = gDh  = gAh,  we  have  B(f)  = B(f)  = B(g)+B(h,)- 2 = B(g)  + B(h)-2. 

164.  Let  Ux(xi)  = Vi(x\)  = xi,  Un+X (xi, . . . , xn+1)  = xi  ® Vn(x2, . . . , xn+i),  and 
V„+i(xi, . . . ,xn+i)  = Un(x  i,...,xn)  A xn+x.  Then  one  can  show  by  induction  that 
B(f)  < B(Un ) = 2^{n+^2~\  4-2L(n+1)/2J  — 1 for  all  read-once  /,  and  also  that  we  always 
have  B(f,  f ) < B(Vn,  Vn ) = 2^n/2^+1  + 2 + 1 —2.  (But  an  optimum  ordering  reduces 
these  sizes  dramatically,  to  B(U„)  = |_§ n + 2j  and  B(Vn,Vn")  = 2 n + 2.) 

165.  By  induction,  we  prove  also  that  B{u2m,u2m)  = 2,nF2rn+3+2,  B(u2m+X,u2rn+X)  = 
2 + F2m+3  + 2,  B(v2m,  v2m)  = 2m+1F2m+i  + 2,  B(v2rn+X , h2m+i)  = 2m+1F2m+3  + 2. 

166.  We  may  assume  as  in  answer  163  that  o is  either  A or  ®.  By  renumbering, 
we  can  also  assume  that  jcr  = j for  1 < j < n,  hence  /CT  = /.  Let  (60,...,6„) 
be  the  profile  of  /,  and  (b'0, . . . ,b'n)  the  profile  of  (/,/);  let  (cl7T , . . . , c{n+1)n)  and 
(ci c'(n+1)jr)  be  the  profiles  of  f”  and  (/",/"),  where  {n  + l)n  = n + 1.  Then 
cjn  is  the  number  of  subfunctions  of  fn  = g7*  o hn  that  depend  on  xjn  after  setting 
the  variables  {xi^, . . .jX^-i^}  to  fixed  values.  Similarly,  c'„  is  the  number  of  such 
subfunctions  of  fn  or  /\  We  will  try  to  prove  that  bj„ < cj7r  and  b'j7T^1  < c'j7t  for  all  j. 

Case  1:  o is  A.  We  may  assume  that  mr  = n,  since  A is  commutative.  Case 
la:  1 < jix  < k.  Then  bj„_1  and  i count  subfunctions  in  which  only  the  variables 
XiV  with  1 < i < j and  1 < in  < k are  specified.  These  subfunctions  of  g A h or 
g\J  h have  counterparts  that  are  counted  in  cj7r  and  c'jn,  because  h"  is  not  constant  in 
any  subfunction  when  nn  = n.  Case  lb:  k < jn  < n.  Then  bj7r_1  and  count 

subfunctions  of  h or  h,  which  have  counterparts  counted  in  cJ7r  and 

Case  2:  o is  ®.  We  may  assume  that  In  = 1,  since  ® is  commutative.  Then  an 
argument  analogous  to  Case  1 applies.  [Discrete  Applied  Math.  103  (2000),  237-258.] 

167.  Let  / = /in;  proceed  recursively  to  compute  ctJ  = Bmin(f,j),  c'i:j  = Bmin(/ij,  fij), 
and  a permutation  mj  of  {i, . . . ,j}  for  each  subfunction  fij(xi, . . . ,Xj)  as  follows:  If 

* = i;  we  have  fij( Xi)  = xp,  let  c4j-  = 3,  c^-  = 4,  mj  = i.  Otherwise  i < j,  and 
we  have  fij(xi, . . . ,Xj)  = fik(xi, . . . , Xk)  o f(k+i)j(xk+i,  ■ ■ ■ , Xj)  for  some  k and  some 
operator  o.  If  o is  like  A,  let  aj  = cik  + c{k+1)j  - 2,  and  either  (c'l3  = 2cik  + c'(k+1)j  - 4, 
nij  = nikn(k+X)j)  or  (c^  = 2c^k+1^  + c'ik  — 4,  TCij  = n^k+X)f nk),  whichever  minimizes 
c'ir  If  o is  like  ®,  let  cC  = c'ik  + c[k+1)j  - 2,  and  either  (Cij  = clk  + c[k+1)j  - 2, 
ntj  = nikn(k+i)j)  or  (ci3  = c(fc+1)j  + c'ik  - 2,  np  = n(k+1)jnik),  whichever  minimizes  cXj. 

(The  permutations  n^  represented  as  strings  in  this  description  would  be  repre- 
sented as  linked  lists  inside  a computer.  We  could  also  construct  an  optimum  BDD 
for  / recursively  in  0(Dmin(/))  steps,  using  answer  163.) 

168.  (a)  This  statement  transforms  and  simplifies  the  recurrences  (112)  and  (113). 

(b)  True  by  induction;  also  x > n. 

(c)  Easily  verified.  Notice  that  T is  a reflection  about  the  22|°  line  y = (^2-l)x. 

(d)  If  2 6 Sk  and  z'  G Sn-k  we  have  |z|  = qp  and  |z'|  = q'B,  where  q < k and 
q'  < n - k by  induction.  By  symmetry  we  may  let  q = (1  - 5)t  and  q'  = (1  + 6)t,  where 

* = |(<?  + Q1)  ^ \n-  Then  if  the  first  hint  is  true,  we  have  \z»z'\  < (2t)0  < n0 . And  we 
also  will  have  \z  o z'|  < n0 , by  (c),  since  |zT|  = |z|. 
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To  prove  the  first  hint,  we  note  that  the  maximum  | z • z'\  occurs  when  y = y' . 
For  when  y > y we  have  \z»z'\2  - (x  + x1  +y')2  +y2  = r2  + 2(x'  + y')x  + (x'  + y')2\  the 
largest  value,  given  z' , occurs  when  y = y' . A similar  argument  applies  when  y'  > y. 

Now  when  y = y'  we  have  y = \/ rr'  sin  6 for  some  6;  and  one  can  show  that 
x + x'  < (r  + r')  cos  6.  Thus  z%  z = (x  + x1  + y,y)  lies  in  the  ellipse  of  the  second  hint. 
On  that  ellipse  we  have  (a  cos  d + 6 sin  9)2  + (b  sin#)2  = a2/2  + b2  + u sin  2d  + v cos  2d  = 
a2/ 2 + b2  +w  sin(20  + r),  where  u = ab,  v = |a2  — b2 , w2  = u2  + v2,  and  cost  = u/w. 
Hence  \z»z'\2  < \a2+b2+w.  And  4w2  = (r+r')4+4(rr')2  < (r2  + (2v/5-2)rr'+r'2)2,  so 

\z  • z |2  < r2  + (Vb  + 1 )rr  + r'2 , r = (1  — 6)^ , r = (1  + 5)13 . 

The  remaining  task  is  to  prove  that  this  quantity  is  at  most  22p  = 2</>2;  equivalently, 
M2)  < ft(2f3),  where  ft(a)  = (e*/a  + e-*/“)°  - 2“  and  t = /31n((l  - <5)/(l  + 5)). 
One  can  show,  in  fact,  that  ft  is  an  increasing  function  of  a when  a > 2.  [See  G. 
Bennett,  AMM  117  (2010),  334-351.  The  0(np)  bound  on  Sn  seems  to  require  a 
delicate  analysis;  an  earlier  attempt  by  Sauerhoff,  Wegener,  and  Werchner  was  flawed. 
The  proof  given  here  is  due  to  A.  X.  Chang  and  V.  I.  Spitkovsky  in  2007.] 

169.  This  conjecture  has  been  verified  for  m < 7.  [Many  other  curious  properties  also 
remain  unexplained.  A paper  that  describes  what  is  known  so  far  is  currently  being 
prepared  by  members  of  the  “curious  research  group.”] 

170.  (a)  22n_1.  There  are  four  choices  at  (j)  when  1 < j < n,  namely  LO  = [X|  or 
LO  = [T]  or  HI  = [Xj  or  HI  = [t]  ; and  there  are  two  choices  for  (n). 

(b)  2n_1,  since  half  the  choices  at  each  branch  are  ruled  out. 

(c)  Indeed,  if  t = (tx  . . . tn)2  we  have  LO  = [±]  at  (j)  when  tj  = 1 and  HI  = [T]  at 
(7)  when  tj  = 0.  (This  idea  was  applied  to  random  bit  generation  in  exercise  3.4.1-25. 
Since  there  are  2n_1  such  values  of  t,  we’ve  shown  that  every  monotone,  skinny  function 
is  a threshold  function,  with  weights  (2n_1, . . . , 2, 1}.  The  other  skinny  functions  are 
obtained  by  complementing  individual  variables.) 

(d)  ft(x)  = [(x)2  <t]  = [(x)2  >t]  = [(x)2  >2n  -l-t]  = /2"-t(z). 

(e)  By  Theorem  7.1.1Q,  the  shortest  DNF  is  the  OR  of  the  prime  implicants,  and 
its  general  pattern  is  exhibited  by  the  case  n = 10  and  t = (1100010111)2:  (ziAx2Az3)V 
(zi Az2Aa;4)  V ( ziAa^Azs ) V (xiAx2/\XgAxz)  V (ziAa^AzeAisAzgAzio).  (One  term  for 
each  0 in  t,  and  one  more.)  The  shortest  CNF  is  the  dual  of  the  shortest  DNF  of  the 
dual,  which  corresponds  to  2n  - t = (0011101001)2:  (aq)  A (z2)  A (zsVzWxsVzs)  A 
(Z3VZ4VZ5VX7VX8)  A (Z3VZ4VZ5VZ7VZ9)  A (z3Vz4Vz5Va:7Vzio). 

171.  Note  that  the  classes  of  read-once,  regular,  skinny,  and  monotone  functions  are 
each  closed  under  the  operations  of  taking  duals  and  restrictions.  A skinny  function  is 
clearly  read-once;  a monotone  threshold  function  with  w\  > • • • > w„  is  regular;  and  a 
regular  function  is  monotone.  We  must  show  that  a regular  read-once  function  is  skinny. 

Suppose  f(x\, ...  ,xn)  = g(xil , ... , Xik)  o h(xj1 , . . . , Xjt ),  where  o is  a nontriv- 
ial binary  operator  and  we  have  H < •••  < ik,  j 1 < •••  < jh  k + l = n,  and 
|*i,  • - • , h,  ji,  • ■ ■ ,ji}  = {1, . . . ,n}.  (This  condition  is  weaker  than  being  “read-once.”) 
We  can  assume  that  n = 1 . By  taking  restrictions  and  using  induction,  both  g and  h 
are  skinny  and  monotone;  thus  their  prime  implicants  have  the  special  form  in  exercise 
170(e).  The  operator  o must  be  monotone,  so  it  is  either  V or  A.  By  duality  we  can 
assume  that  o is  V. 

Case  1:  f has  a prime  implicant  of  length  1.  Then  Xi  is  a prime  implicant  of  /, 
by  regularity.  Hence  f(x  1, . . . , xn ) =iiV  /( 0,  z2, . . . , xn),  and  we  can  use  induction. 
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Case  2:  All  prime  implicants  of  g and  h have  length  > 1.  Then  xJI  A • • • Ax',p  is  a 
prime  implicant,  for  some  p > 2,  but  Xj1-iAxj2 A • ■ • A Xjp  is  not,  contradicting  regular- 
ity. [See  T.  Eiter,  T.  Ibaraki,  and  K.  Makino,  Theor.  Comp.  Sci.  270  (2002),  493-524.) 

172.  By  examining  the  CNF  for  ft  in  exercise  170(e),  we  see  that  when  t = (t\  . . . tn) 2 

the  number  of  Horn  functions  obtainable  by  complementing  variables  is  one  more  than 
the  number  for  (t2  - • • tn) 2 when  ti  = 0,  but  twice  that  number  when  t\  = 1.  Thus  the 
example  t = (1100010111)2  corresponds  to  2 x (2  x (l  + (l  + (l  + (2  x (l  + (2  x (2x  2)))))))) 
Horn  functions.  Summing  over  all  t gives  sn  where  sn  = (2n_2  + sn-i)  + 2sn_i,  where 
si  = 2;  and  the  solution  to  this  recurrence  is  3n  — 2"  . 

To  make  both  / and  / Horn  functions,  assume  (by  duality)  that  t mod  4 = 3. 
Then  we  must  complement  Xj  if  and  only  if  tj  =0,  except  for  the  string  of  Is  at  the 
right  of  t.  For  example,  when  t = (1100010111)2,  we  should  complement  x3,  x4 , x5, 
xt,  and  then  at  most  one  of  {x8,  xg,  xio}.  This  gives  p(t  + 1)  + 1 >3  choices  related 
to  ft.  Summing  over  all  t with  t mod  4 = 3 gives  2n  — 1;  so  the  answer  is  2ri+1  — 2. 

173.  Consider  monotone  functions  Brst.  We  can  write  t = (0ai  1“2  . . . 0a2fc“1 1"2'1  )2, 

where  ai  -| \-a2k  = n,  ai  > 0,  aj  > 1 for  1 < j < 2k,  and  a2k  > 2 when  t mod  4 = 3. 

When  t mod  4=1,2 n — t has  this  form.  Then  ft  has  ai ! a2!  ...  a2*,!  automorphisms,  so 
it  is  equivalent  to  n!/(ai!a2!  . . . a2fc!)  — I others,  none  of  which  are  skinny.  Summing 
over  all  t gives  2 (Pn  — nPn-\)  monotone  Boolean  functions  that  are  reorderable  to 
skinny  form,  when  n > 2,  where  P„  is  the  number  of  weak  orderings  (exercise  5. 3. 1-3). 
[See  J.  S.  Beissinger  and  U.  N.  Peled,  Graphs  and  Combinatorics  3 (1987),  213-219.) 

Every  such  monotone  function  corresponds  to  2”  different  unate  functions  that 
are  equally  skinny,  when  variables  are  complemented.  (These  are  the  functions  with  the 
property  that  all  of  their  restrictions  are  canalizing,  known  also  as  “unate  cascades,” 
“1-decision  list  functions,”  or  “generalized  read-once  threshold  functions.”) 

174.  (a)  Assign  the  numbers  0,  . . . , n— 1,  n,  n+1  to  nodes  (T),  . . . , (n ),  [T],  [X];  and 
let  the  (LO,Hl)  branches  from  node  k go  to  nodes  (a2k+i,  a2k+2)  for  0 < k < n.  Then 
define  pk  as  follows,  for  1 < k < 2 n:  Let  l = [(k  — 1)/2J  and  Pi  = {pi, . . . ,p2i}.  Set 
Pk  tifc  if  flfc  ^ Pi ; otherwise,  if  a*,  is  the  mth  smallest  element  of  Pi  n {Z  + 1 , . . . , n + 1 } , 
set  pk  to  the  mth  smallest  element  of  {n  + 2, . . . , n + l + 1}  \ Pi.  (This  construction  is 
due  to  T.  Dahlheimer.) 

(b)  The  inverse  pf1  . . ,p2lJ  of  a Dellac  permutation  satisfies  2 (k  — n)  ~ 1 < Pk1  < 
2k.  It  corresponds  to  a Genocchi  derangement  q\ . . . q2n+2  when  q2  = 1,  q2n+i  = 2n  + 2, 
and  q2k+2  = 1 +pf\  q2k-i  = 1 + Pkln  for  1 < k < n. 

(c)  Given  a permutation  q\  . . .q2n+2,  let  rk  be  the  first  element  of  the  sequence 
Qk\  'if1 1 1 •••  that,  is  > k.  This  transformation  takes  Genocchi  permutations  into 

”k 

Dumont  pistols,  and  has  the  property  that  qk  = k if  and  only  if  r*,  = k £ {r\, . . . , r^-i}. 

(d)  Each  node  (j,  k)  represents  a set  of  strings  r\  . . .rj,  where  (1,0)  = {1}  and  the 
other  sets  are  defined  by  the  following  transition  rules:  Suppose  ri  . . .rj  € (j,  k),  and  let 
l = 2k.  If  k = 0 then  ( j + 1,  k)  contains  1 rf  . . . rj1"  when  j is  even,  2 rf  . . . r+  when  j is 
odd,  where  r+  denotes  r + 1.  If  k > 0 then  ( j + 1,  k)  contains  r f . . . rf  (Z+lJr^j  . . . r t 
when  j is  even,  rf  . . . rfLi(l)rf  ...rf  when  j is  odd,  where  r*  denotes  r + 1 when 
r > Z,  r — 1 when  r < l.  Going  vertically,  if  l < j — 3 and  j is  odd,  (j,k  + 1) 
contains  r4  . . . nri+2ri+3(l+3)ri+4  . . . rj . On  the  other  hand  if  k = 1 and  j is  even, 
(j,  0)  contains  r2r±r3  . . .rj.  Finally  if  k > 1 and  j is  even,  (j,  k — 1)  contains  the 
string  r[  . . . r[_3(l— 2)r[_2ri'_1r[+1  . . . r' , where  r'  denotes  l when  r = l — 2,  otherwise 
r'  = r.  (One  can  show  that  the  elements  of  (2 j,  k)  are  the  Dumont  pistols  for  Genocchi 
permutations  of  order  2 j whose  largest  fixed  point  is  2k.) 
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All  of  these  constructions  are  invertible.  For  example,  the  path  (1,0)  — > (2,0)  — ► 

(3.0)  ->  (3,1)  -4  (4,1)  -4  (5,1)  ->  (6,1)  -4  (7,1)  -4  (7,2)  -4  (7,3)  -4  (8,3)  -4  (8,2)  -4 

(8.1)  -4-  (8,0)  corresponds  to  the  pistols  1 -4  22  — » 133  -4  333  -4  4244  -4  53355  -4 

624466  -4  7335577  -4  7355577  -4  7355777  -4  82448688  -4  82646888  -4  82466888  -4 
28466888.  The  latter  pistol,  which  can  be  represented  by  the  diagram  cor- 

responds to  the  Genocchi  derangement  qi  . . . q8  = 61537482.  And  this  derangement 
corresponds  to  pf1  . . .p$l  = 231546  and  the  Dellac  permutation  pj  . . ,p6  — 312546. 
That  permutation,  in  turn,  corresponds  to  ai...a6  = 312343,  which  stands  for  the 
thin  BDD 

P- 

Let  djk  be  the  number  of  pistols  in  (j,  k),  which  is  also  the  number  of  directed 
paths  from  (1,  0)  to  (j,  k).  These  numbers  are  readily  found  by  addition,  beginning  with 

38227  38227  • • • 

2073  2073  38227  76454  • • • 

155  155  2073  4146  36154  112608  ••• 

17  17  155  310  1918  6064  32008  144616  ••• 

3 3 17  34  138  448  1608  7672  25944  170560  ••• 

1 1 3 6 14  48  104  552  1160  8832  18272  188832  ••• 

1 1 1 2 2 8 8 56  56  608  608  9440  9440  198272  • • • ; 

and  the  column  totals  D3  = dik  are  (D1,D2,...)  = (1,1,2,3,8,17,56,155,608, 
2073, 9440, 38227, 198272, 929569, . . . ).  The  even-numbered  elements  of  this  sequence, 
D 2n,  have  long  been  known  as  the  Genocchi  numbers  G2n+2 • The  odd-numbered 
elements,  D2n+i,  have  therefore  been  called  “median  Genocchi  numbers.”  The  number 
Sn  of  thin  BDDs  is  d(2n+2)0  = D2n+i. 

References:  L.  Euler  discussed  the  Genocchi  numbers  in  the  second  volume  of 
his  Institutiones  Calculi  Differentialis  (1755),  Chapter  7,  where  he  showed  that  the 
odd  integers  G2n  are  expressible  in  terms  of  the  Bernoulli  numbers:  In  fact,  G2n  = 
(2“"  + 1 — 2)|J32n|,  and  ztan|  = G2nz2n/ (2n)\.  A.  Genocchi  examined  these 

numbers  further  in  Annali  di  Scienze  Matematiche  e Fisiche  3 (1852),  395-405;  and 
L.  Seidel,  in  Sitzungsberichte  math.-phys.  Classe,  Akademie  Wissen.  Miinchen  7 (1877), 
157-187,  discovered  that  they  could  be  computed  additively  via  the  numbers  djk  . Their 
combinatorial  significance  was  not  discovered  until  much  later;  see  D.  Dumont,  Duke 
Math.  J.  41  (1974),  305-318;  D.  Dumont  and  A.  Randrianarivony,  Discrete  Math.  132 
(1994),  37-49.  Meanwhile  H.  Dellac  had  proposed  an  apparently  unrelated  problem, 
equivalent  to  enumerating  what  we  have  called  Dellac  permutations;  see  L’ Intermediate 
des  Math.  7 (1900),  9-10,  328;  Annales  de  la  Faculte  sci.  Marseille  11  (1901),  141-164. 

There’s  also  a direct  connection  between  thin  BDDs  and  the  paths  of  (d),  discov- 
ered in  2007  by  Thorsten  Dahlheimer.  Notice  first  that  unrestricted  Dumont  pistols  of 
order  2n  + 2 correspond  to  thin  BDDs  that  are  ordered  but  not  necessarily  reduced, 
because  we  can  let  n . . .r2nr2n+1r2n+2  = (2ai) . . . (2a2n)(2n+2)(2n+2).  The  number 
of  such  pistols  in  which  min{i  | r2i_x  = r2i}  = l turns  out  to  be  d(2„+2)(„+x_j). 

To  prove  this,  we  can  use  new  transition  rules  instead  of  those  in  answer  (d): 
Suppose  ri  . . . rj  e (j,  k),  and  let  l = j — 2k.  Then  ( j + 1,  k ) contains  rf  . . . r,+ r;+  . . . r+ 
when  j is  odd,  rf  . . . r)*L1(Z— l)rf  . . . rf  when  j is  even.  If  j is  odd,  ( j , k + 1)  contains 
lrir3...r,  when  l = 3,  and  when  l > 3 it  contains  r'i  ■ ■ ■ r{_4(/-4)r{_3r-{_2r,' . . . r' , 
where  r'  = r + 2[r  = Z— 4],  Finally,  if  j is  even  and  k > 0,  ( j,k  — 1)  contains 
ri  . . . ri-iqri+2n+2  . . . rj,  where  q = l if  n = n+1,  otherwise  q = r/+1. 
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With  these  magic  transitions  the  path  above  corresponds  to  1 — » 22  — » 313  — > 
133  -4  2244  -A  31355  4-  424466  -4  5153577  -4  5135577  -A  1535577  -4  22646688  -4 
26446688  -4  26466688  -4  26466888;  so  <n  . . . aB  = 132334. 

175.  This  problem  seems  to  require  a different  approach  from  the  methods  that  worked 
when  b0  = ■ ■ ■ = bn-i  = 1.  Suppose  we  have  a BDD  base  of  N nodes  including  the  two 
sinks  jX]  and  [T|  together  with  various  branches  labeled  (2) , . . . , (n) , and  assume  that 
exactly  s of  the  nodes  are  sources  (having  in-degree  zero).  Let  c(b , s,  t,  N)  be  the  number 
of  ways  to  introduce  b additional  nodes  labeled  (l) , in  such  a way  that  exactly  s + b — t 
source  nodes  remain.  (Thus  0 < t < 26;  exactly  t of  the  old  source  nodes  are  now 
reachable  from  a (T)  branch.)  Then  the  number  of  nonconstant  Boolean  functions 
/(si,. . . ,xn)  having  the  BDD  profile  (b0,  ■■■  ,b„)  is  equal  to  T(b0, . . . , 6n_  i;  1),  where 

T(bo;  s)  = 2[s  = 6o  = 1]  + [s  = 2] [fro  = 0]  + [s  — 2][6o  — 2]; 

26q 

T(b0,  . • • , bn— i;  s)  = ^ ^ c(6o,  s-\-t— bo,  £,  &i-f-  • • • +6n_i-f  2)  T(b\ , . . . , 6n— 1;  s+t—bo). 

t=max(0,6o  — s) 


One  can  show  that  c(b,  s,t,  N)  = Ylf—o  arbPtr{s,  N) /b\,  where  we  have  (N(N  — 1))~  = 
E“o  *rbNr  and  ptr(s,N)  = £,  - »)-*  = Ek  {DOHN  - s)*=*  = 

r![iuV]e(w~s)2(u;e2-w  + l)3. 

176.  (a)  If  p + p we  have  ]Ca£yiii)eB[Vt>(p)  = K,b(p' )]  < |A|  |B|/2!,  by  the  definition 
of  universal  hashing.  Let  ri(a,b)  be  the  number  of  p £ P such  that  ha,b{p)  = i . Then 

Y Y r*(a’6)2  = Y Y Y [ha,b(p)  = ha,b(p')} 

aeA,beB  0<i<21  a£A,beBpePp'€P 

< |-P| \A\ |B|  + Y Y = 2‘|A||B|(l  + ~). 

p€P  p'£P 


On  the  other  hand  J2l= &1  ri(a,b)2  = Y?iAo  (ri(aA)  - 2‘/|/|)2  + 22t/|/|  > 22t/|/|,  for 
any  a and  b.  Similar  formulas  apply  when  there  are  Sj(a,b ) solutions  to  ha,b{q)  = j- 
So  there  must  be  a 6 A and  b € B such  that 


+ < Yri(a'b'>2  + Ysi(a'b'>2  - 2*+1  ( 1 "h  ) 


< 


22t  ( 22t 
~¥  + (1  - e)2r 


2 2t 

m 1 \j\ 

(b)  The  middle  l bits  of  aqk  + b and  aqk+ 2 + b differ  by  at  least  2,  so  the  middle 
l — 1 bits  of  aqk  and  aqk+2  must  be  different. 

(c)  Let  q and  q be  different  elements  of  Q*  with  (g(  q)  — g(q))  mod2,_1  > 2,_2. 
(Otherwise  we  can  swap  q -<4  q .)  If  l > 3,  the  condition  g{p)+g{q)  = 2,_1  implies  that 
fq(p)  =0.  Now  we  have  (g(p)  +g(q'))  mod  2i_1  = (g(q')  - g(q))  mod  2(_1;  furthermore 
g{q)  and  g(p)  are  both  even.  Therefore  no  carry  can  propagate  to  change  the  middle 
bit,  and  we  have  }q'(p)  = 1. 

(d)  The  set  Q"  has  at  least  (1  — e)2,_1  elements,  and  so  does  the  analogous  set  P" . 
At  most  2i_2  elements  of  Q”  have  g(q)  odd;  and  at  most  2i_1  + 1 — \P"\  of  the  elements 


with  g(q)  even  are  not  in  Q* . Thus  |Q*|  > (1  — e)2 


1 — 2,_1  — 1 + (1  — e)2( 


(1  - 4e)2  - 1,  and  we  have  Bmin(Znia ) > (1  — 4e)2  1 - 2 by  (85). 

Finally,  choose  l — t — 4 and  e = 1/9.  The  theorem  is  obvious  when  n < 14. 
177.  Suppose  k > n/2  and  x = 2k+1xh  + xi,  y = 2 kyh  + yi-  Then  ( xy  » k)  mod  2n 
depends  on  2xkyi , xiyk,  and  xiyi  » k,  modulo  2n~k,  so  q^k+i  < 2" 


Summing  up,  we  get  qk  < 


0<fc<6n/5 


2fc  + E 


6n/5<fc<2n 


k — — fc+n- 

23n  — 2 [fc/2j  — ffc/2] 
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If  n = 5t  + (0, 1,2,3, 4)  the  total  comes  to  exactly  (2r6ri/5l  • (19, 10, 12, 13, 17)  — 12)/7. 
[M.  Sauerhoft,  in  Discrete  Applied  Math.  158  (2010),  1195-1204,  has  proved  the  lower 
bound  Q( 26n/5)  for  this  ordering.] 

178.  We  can  write  x = 2 kxk  + xi  as  in  the  proof  of  Theorem  A;  but  now  xi  = xi  + 
(x  mod  2),  where  xi  is  even  and  x mod  2 is  not  yet  known.  Similarly  y = 2kyh  + yi  = 
2kVh  + ili  + {y  mod  2).  Let  zt  = xiyi  mod  2k . At  level  2k  - 2,  for  n/2  < k < n,  we  need 
only  “remember”  three  ( n - fc)-bit  numbers  xi  mod  2n~k,  yt  mod  2n~k,  (xtyi  » k)  mod 
2n_fc,  and  three  “carries”  a = (x,  + £,)  » k,  c2  = (in  + zt)  3 >k,c3  = ( xi  +yt+  zt)  » k. 
These  six  quantities  tell  us  the  middle  bit,  once  xh,  yh,  x mod  2,  and  y mod  2 are  known. 

There  are  only  six  possibilities  for  the  carries:  cic2c3  = 000,  001,  Oil,  101,  111, 
or  112.  Thus  q2k-2  < 6 • 2(n_fc_1)+(*>-fc-i)+0-fc/  Similarly,  when  n/2  < k < n — 1,  we 
have  q2k-i  < 6 • 2("  k -)  + ('1  k !)+("-*)_  With  these  estimates,  together  with  q k < 2k , 
we  §et  T,ll o4  Qk  < (26<  • (37, 86, 184,  464, 1024)  - 268)/28  when  n = 5t  + (0, 1,  2, 3, 4). 

The  actual  BDD  sizes,  for  the  function  / of  Theorem  A and  the  function  g of  this 
exercise,  are  B(f)  = (169,  381,  928,  2188,  5248,  12373,  29400,  68777,  162768,  377359, 
879709)  and  B(g)  = (165,  352,  806,  1802,  4195,  9774,  22454,  52714,  121198,  278223, 
650188)  for  6 < n < 16;  so  this  variant  appears  to  save  about  25%.  A slightly  better 
ordering  is  obtained  by  testing  (lo-bit(a:),  hi-bit(j/),  hi-bit(x),  lo-bit(y))  on  the  last  four 
levels,  giving  B(h)  = B(g)~  20  for  n > 6.  Then  B(h)/Bmin(f)  « (1.07,  1.05,  1.04,  1.04, 
1.04,  1.01,  1.02)  for  6 < n < 12,  so  this  ordering  may  be  close  to  optimal  asn->oo. 

180.  By  letting  am+ x = am+2  = • • • = 0,  we  may  assume  that  m > p.  Let  a = 
(ap  . . . a i)2,  and  write  x = 2kXh  + xi  as  in  the  proof  of  Theorem  A.  If  p < n,  we  have 
qk  < 2p~k  for  0 < k < p,  because  the  given  function  / = Z$n(a;x)  depends  only  on 
a,  Xh,  and  (axi  k)  mod  2p-fc.  We  may  therefore  assume  that  p > n. 

Consider  the  multiset  A = {2kxha  mod  2P_1  | 0 < xh  < 2n~k}.  Write  A = 
{2P_1  - «i, . . . , 2P_1  - qs},  where  s = 2n~k  and  0 < ak  < ■ ■ ■ < as  = 2P_1,  and  let 
aa+i  = oti  + 2P  1 for  0 < i < s.  Then  qk  < 2s,  because  / depends  only  on  a,  Xh,  and 
the  index  i £ [0 . . 2s)  such  that  cti  < axi  mod  2P  < 1. 

Consequently  ^2k=0  qk  < min(2fc,  2n+1_fc)  = 2Ln/2j  + 1 + 2rn/2l+1  -3. 

181.  For  every  (x\, . . . ,xm ) only  0(n)  further  nodes  are  needed,  by  exercise  170. 

182.  Yes;  B.  Bollig  [Lecture  Notes  in  Comp.  Sci.  4978  (2008),  306-317]  has  shown 
that  it  is  n(2rl//43“ ) . Incidentally,  Bmin(Li2,i2)  = 1158  is  obtained  with  the  strange  or- 
dering L I2,i2(xi8, 3^7,  £16,  a:i5,  Z14,  a:i2,  Xio,  xs,x6,x4,  x2,  aq;  aqg,  X20,  *21,  a:22,  «23,  a:i3, 
Xu,  X9,  X7,  x$,  X3,  X24)‘,  and  Bmax(L  1 2 , i2 ) = 9302  arises  with  Li2ix2(a:24,  a:23,  x2o,  aqg, 
X22,  Xu,  xe,  XT,  Xs,  Xg,  xw,  Xi3;  Xi,x2,  x3,  xA,  x5,  x2i,x1s,  Xn,  £16,  £i5,  £14,  £12).  Simi- 
larly T?min (L/8, 16 ) = 606  and  Bmax(Ls,ie)  = 3415  aren’t  terribly  far  apart.  Could 
Bmin(Lm.n)  and  Bmax(Lm,n)  both  conceivably  be  ©(2mm(m,n))? 


183.  The  profile  (b0,bi,...)  begins  (1,  1,  1,  2,  3, 

5,  7,  11,  15,  23,  31,  47,  63,  95,  . . . ).  When  k>  0 
there’s  a node  on  level  2k  for  every  pair  of  inte- 
gers (a,  b)  such  that  2fc~1  < a,  b < 2k  and  ab  < 

22k  1 < (a  + 1 )(b  + 1);  this  node  represents  the 
function  [((a  + x)/2k)((b  + y)/2k)  > i].  When 
b is  given,  in  the  appropriate  range,  there  are 
[2 2k~1/b]  — [22fc— 1 /(6  + 1)J  choices  for  a;  hence 
l>2k  = J22k-l<b<2k(['22k  1/fe]  - [22k  1/(b+l)J), 
which  telescopes  to  2k  — 1.  A similar  argument  shows  that  b2k+i 


: 2k  + 2fe_1 


- 1. 
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184.  Two  kinds  of  beads  contribute  to  6m(i-i)+j-i:  One  for  every  choice  of  i columns , 

at  least  one  of  which  is  <j;  and  one  for  every  choice  of  i—  1 columns,  missing  at  least  one 
element  > j.  Thus  6m(i_1)+j_1  = (("*)-  (m+l~3) ) + ( Summing  over 
1 < < m gives  B(Pm ) = (2m  — 3)2m  + 5.  (Incidentally,  qk  = bk  + 1 for  2 < k < m2.) 

The  ZDD  has  simply  zm(i-i)+j_i  = ("*7,  ) for  1 < i,j  < m,  one  for  every  choice 
of  i — 1 columns  / j;  hence  Z(Pm)  = m2m_1  + 2 « \B(Prn).  (The  lower  bound  of 
Theorem  K applies  also  to  ZDD  nodes,  because  only  such  nodes  get  tickets;  therefore 
the  natural  ordering  of  variables  is  optimum  for  ZDDs.  The  natural  ordering  might  be 
optimum  also  for  BDDs;  this  conjecture  is  known  to  be  true  for  m < 5.) 

185.  Suppose  f(x)  = £„x  for  some  binary  vector  to...tn.  Then  the  subfunctions 
of  order  d > 0 correspond  to  the  distinct  substrings  ti  . . .ti+d-  Such  substrings  r 
correspond  to  beads  if  and  only  if  r ^ 0d+1  and  r / ld+1;  they  correspond  to  zeads  if 
and  only  if  r / 0d+1  and  r / 10d. 

Thus  the  maximum  Z(f)  is  the  function  Sn  of  answer  44.  To  attain  this  worst 
case  we  need  a binary  vector  of  length  2d+1  +d  — 2 that  contains  all  (d+  l)-tuples  except 
0d+1  and  10d  as  substrings;  such  vectors  can  be  characterized  as  the  first  2d+1+d  — 2 
elements  of  any  de  Bruijn  cycle  of  period  2d+1,  beginning  with  0dl. 

186.  Xi  A X2  A £3  A X4  A Xg  A Xg. 

187.  (These  diagrams  should  be  compared  with  the  answer  to  exercise  1.) 


188.  To  avoid  nested  braces,  let  e,  a,  b,  and  ab  stand  for  the  subsets  0,  {1},  {2}, 
and  {1,2}.  The  families  are  then  0,  {a6},  {a},  {a,ab},  {6},  {b,  ab},  {a,  6},  {a,  b,  ab}, 
{e},  {e,  ab},  {e,  a},  {e,  a,  ab},  {e,  6},  {e,  b,  ab},  {e,  a,  6},  {e,  a,  b,  ab},  in  truth-table  order. 

189.  When  n = 0,  only  the  constant  functions;  when  n > 0,  only  0 and  x\  A • • • A xn. 
(But  there  are  many  functions,  such  as  X2  A (xi  V X3),  with  (bo, ... , bn ) = (z 0, . . . , zn).) 

190.  (a)  Only  x\  0 • • • ® xn  and  1 © xi  © • • • ® xn,  for  n > 0.  (b)  This  condition  holds 

if  and  only  if  all  subtables  of  order  1 are  either  01  or  11.  So  there  are  22"-1  solutions 
when  n > 0,  namely  all  functions  such  that  }(x\, . . . 1)  = 1. 

191.  The  language  Ln  of  truth  tables  for  all  such  functions  has  the  context-free  gram- 
mar Lq  — » 1;  Ln+i  — > LnLn  \ Ln O2".  The  desired  number  ln  = \Ln\  therefore  satisfies 
lo  = 1,  ln+ 1 = ln(ln  + I);  so  (l0,h,l 2,...)  is  the  sequence  (1,  2,  6,  42,  1806,  3263442, 
10650056950806,  . . . ).  Asymptotically,  ln  = 9 2”  — 4 — e,  where  0 < e < S-2"/ 8 and 


6 = 1.59791  02180  31873  17833  80701  18157  45531  23622+. 


[See  CMath  exercises  4.37  and  4.59,  where  ln  + 1 is  called  en+i  (a  “Euclid  number”)  and 
9 is  called  E2 . The  numbers  ln  + 1 were  introduced  by  J.  J.  Sylvester  in  connection  with 
his  study  of  Egyptian  fractions,  Amer.  J.  Math.  3 (1880),  388.  Notice  that  a monotone 
decreasing  function,  like  a function  representing  independent  sets,  always  has  zn  = 1.] 

192.  (a)  10101101000010110. 

(b)  True,  by  induction  on  |r|,  because  a / (3  / 0"  if  and  only  if  az  ^ /3Z  ^ 0". 

(c)  The  beads  of  / of  order  k are  the  zeads  of  fz  of  order  k , for  0 < k < n. 
Hence  the  beads  of  fz  are  also  the  zeads  of  (fz)z  = f.  Therefore,  if  (bo,  ■ ■ ■ , bn)  and 
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(z0,  are  the  profile  and  z-profile  of  / while  b'n ) and  ( z'0 , . . . , z'n)  are  the 

profile  and  z-profile  of  fz , we  have  6*.  = z*  and  zk  = b'k  for  0 < k < n. 

(We  also  have  zn  = z'n,  but  they  might  both  be  1 instead  of  2.  The  quasi-profiles 
of  / and  fz  may  differ,  but  only  by  at  most  1 at  each  level,  because  of  all-0  subtables.) 

193.  S>k(x i,...,*„),  by  induction  on  n.  (Hence  we  also  have  S>k(x i,...,xn)  = 
Sk(x i, . . . , xn).  Exercise  249  gives  similar  examples.) 

194.  Define  cq  . . .a2n  as  in  answer  174,  but  use  the  ZDD  instead  of  the  BDD.  Then 
(1, . . . , 1)  is  the  z-profile  if  and  only  if  (2ai) . . . (2a2n)  is  an  unrestricted  Dumont  pistol 
of  order  2 n.  So  the  answer  is  the  Genocchi  number  G2n+2- 

195.  The  z-profile  is  (1, 2, 4, 4, 3,  2,  2).  We  get  an  optimum  z-profile  (1,  2, 3,  2, 3, 2, 2) 
from  M2(x 4,  *2;  *5,*6,  *3,  *i),  and  a pessimum  z-profile  (1,  2,4, 8, 12, 2, 2)  comes  from 
M2(x5,X6',xi,X2,X3,X4)  as  in  (78).  (Incidentally,  the  algorithm  of  exercise  197  can  be 
used  to  show  that  Zmin(M4)  = 116  is  obtained  with  the  strikingly  peculiar  ordering 

Mi(x 8,2:5,  *17,  *2;  *20,  *19,  *18,  *16,  *15,  *13,  *14,  *12,  *11,  *9,  *10,  *4,  *7,  *6,  *3,*l)!) 

196.  For  example,  Mm(*i, . . . , *m;  em+i, . . . , en),  where  n = m + 2m  and  ej  is  the 
elementary  function  of  exercise  203.  Then  we  have  Z(f)  = 2(n  — m)  + 1 and  Z(f ) = 
(n  — m + 7 )(n  — m)/2  — 2. 

197.  The  key  idea  is  to  change  the  significance  of  the  DEP  fields  so  that  dkp  is  now 
YJ{2t~k~1  | Nkp  supports  *(},  where  we  say  that  g(x i, . . . ,*m)  supports  xj  if  there  is 
a solution  to  g(x i, . . . , *m)  = 1 with  Xj  = 1. 

To  implement  this  change,  we  introduce  an  auxiliary  array  (Co,  • • • , C n),  where  we 
will  have  (*,  = <3  if  Nkq  denotes  the  subfunction  0 and  Cfc  — -1  if  that  subfunction 
does  not  appear  on  level  k.  Initially  Cn  <—  0,  and  we  set  Cfc  ~ 1 at  the  beginning 
of  step  El.  In  step  E3,  the  operation  of  setting  dkq  should  become  the  following:  “If 
d(k+\)h  # Cfc+i , set  dkq  <—  ((d(fc+i )/  | d( k+i)h)  <S  1)  + 1;  otherwise  set  dkq  d(k+X)i  1. 
Also  set  Cfc  «—  q if  d(jfe+i)i  = d(t+ i)/,  = Cfc+i-” 

(The  master  z-profile  chart  can  be  used  as  before  to  minimize  z0  + • • • + zn_i; 
but  additional  work  is  needed  to  consider  zn  if  the  absolute  minimum  is  important.) 

198.  Reinterpreting  (50),  we  represent  an  arbitrary  family  of  sets  / as  (*„?  fy.  fh), 
where  v = fv  indexes  the  first  variable  that  f supports;  see  answer  197.  Thus  ft  is  the 
subfamily  of  / that  doesn’t  support  xv,  and  fh  is  the  subfamily  that  does  (but  with  *„ 
deleted).  We  also  let  fv  = 00  if  / has  no  support  (i.e.,  if  / is  either  0 or  {0},  represented 
internally  by  [T]  or  [T] ; see  answer  200).  In  (52),  v = min (fv,gv)  now  indexes  the 
first  variable  supported  by  either  / or  g;  thus  fh  = 0 if  /„  > gv,  and  gh  = 0 if  /„  < gv. 

Subroutine  AND(/, g),  ZDD-style,  is  now  the  following  instead  of  (55):  “Repre- 
sent / and  g as  in  (52).  While  fv  / gv,  return  0 if  either  / = 0 or  g = 0;  otherwise 
set  / Si  if  fv  < Sf,  set  g «-  gi  if  /„  > gv.  Swap  / g if  / > g.  Return  / if 
/ = g or  / = 0.  Otherwise,  if  / A g = r is  in  the  memo  cache,  return  r.  Otherwise 
compute  r,  <-  AND and  rh  <-  AND (fh,gh);  set  r <-  ZUNIQUE(u,  ru  rh),  using 
an  algorithm  like  Algorithm  U except  that  the  first  step  returns  p when  q = 0 instead 
of  when  q = p;  put  ‘/  A g = r'  into  the  memo  cache,  and  return  r.”  (See  also  the 
suggestion  in  answer  200.) 

Reference  counts  are  updated  as  in  exercise  82,  with  slight  changes;  for  example, 
step  U1  will  now  decrease  the  reference  count  of  [T]  (and  only  of  this  node),  when 
S = 0.  It  is  important  to  write  a “sanity  check”  routine  that  double-checks  all  reference 
counts  and  other  redundancies  in  the  entire  BDD/ZDD  base,  so  that  subtle  errors  are 
nipped  in  the  bud.  The  sanity  checker  should  be  invoked  frequently  until  all  subroutines 
have  been  thoroughly  tested. 


660  ANSWERS  TO  EXERCISES 


7.1.4 


199.  (a)  If  / = g,  return  /.  If  / > g,  swap  / 4A  g.  If  / = 0,  return  g.  If  / V g = r is  in 
the  memo  cache,  return  r.  Otherwise 

set  v 4-  /„,  r{  4-  OR rh  4-  OR (fh,gh),  if  /„  = gv; 

set  v 4-  fv , n 4-  OR  {fug),  rh  4-  fh,  increase  REF(/h)  by  1,  if  /„  < gv; 

set  v 4-  gv,  rt  4-  OR (f,gi),  rh  4-  gh,  increase  REF(gh)  by  1,  if  /„  > gv. 

Then  set  r 4—  ZUNIQUE(u, n, r>,);  cache  it  and  return  it  as  in  answer  198. 

(b)  If  f = g,  return  0.  Otherwise  proceed  as  in  (a),  but  use  (®,  XOR)  not  (V,  OR). 

(c)  If  / = 0 or  / = g,  return  0.  If  g = 0,  return  /.  Otherwise,  if  gv  < /„,  set 
g 4—  gi  and  begin  again.  Otherwise 

set  r,  4-  BUTNOT(//,  <7;),  rh  4-  BUTNOT {fh,gh),  if  fv  = gv- 

set  n 4-  BUTNOT(/;,g),  rh  4-  fh,  increase  REF(/h)  by  1,  if  fv  < gv. 

Then  set  r 4—  ZUNIQUE(/t,, r/, r/,)  and  finish  as  usual. 

200.  If  / = 0,  return  g.  If  / = h,  return  OR (/,</).  If  g = h.,  return  g.  If  g = 0 or 
f = g,  return  AND (/,  fi).  If  h = 0,  return  BUTNOT(g,  /).  If  fv  < gv  and  fv  < hv,  set 
/ 4—  fi  and  start  over.  If  hv  < /„  and  hv  < gv,  set  h 4—  hi  and  start  over.  Otherwise 
check  the  cache  and  proceed  recursively  as  usual. 

201.  In  applications  of  ZDDs  where  projection  functions  and/or  the  complementation 
operation  are  permitted,  it’s  best  to  fix  the  set  of  Boolean  variables  at  the  beginning, 
when  everything  is  being  initialized.  Otherwise,  every  external  function  in  a ZDD  base 
must  change  whenever  a new  variable  enters  the  fray. 

Suppose  therefore  that  we’ve  decided  to  deal  with  functions  of  (x\, , . . , xn),  where 
N is  prespecified.  In  answer  198,  we  let  /„  = N + 1,  not  oo,  when  / = 0 or  / = {0}. 
Then  the  tautology  function  1 = p has  the  (IV  + l)-node  ZDD  (?)  ' (?)  '"•  • -"''(/v)""fTl , 
which  we  construct  as  soon  as  N is  known.  Let  tj  be  node  (J)  of  this  structure,  with 
fiv+i  = [T].  The  ZDD  for  Xj  is  now  (TX_'  • '—(J) — tj+ 1 Qlb  thus  the  ZDD  base  for  the 
set  of  all  Xj  will  occupy  (JV^‘1)  nodes  in  addition  to  the  representations  of  0 and  p. 

If  N is  small,  all  N projection  functions  can  be  prepared  in  advance.  But  N is 
large  in  many  applications  of  ZDDs;  and  projection  functions  are  rarely  needed  when 
“family  algebra”  is  used  to  build  the  structures  as  in  exercises  203-207.  So  it’s  generally 
best  to  wait  until  a projection  function  is  actually  required,  before  creating  it. 

Incidentally,  the  partial-tautology  functions  tj  can  be  used  to  speed  up  the  synthe- 
sis operations  of  exercises  198-199:  If  v = fv  < gv  and  / = f„,  we  have  AND(/,  g)  = g, 
OR (f,g)  = f,  and  (if  v < hv)  also  MUX(/,g,  h)  = h,  MUX(g,  h,  f ) = OR(g,  h). 

202.  In  the  transmogrification  step  T4,  change  lq0  4—  qi  4—  q'  to  ‘qo  4—  q,  qi  4—  0’  and 
Vo  4—  ri  4-  r’  to  V0  4—  r,  r\  4—  0’.  Also  use  ZUNIQUE  instead  of  UNIQUE;  within  T4, 
this  subroutine  increases  REF(p)  by  1 if  step  U1  finds  q = 0. 

A subtler  change  is  needed  to  keep  the  partial-tautology  functions  of  answer  201 
up  to  date,  because  of  their  special  meaning.  Correct  behavior  is  to  keep  tu  unchanged 
and  set  tv  4—  L0(fu). 

203.  (a)  fUg  = {{1,  2},  {1, 3},  (1, 2, 3},  {3}}  = (e,  U ((e2  U (e3  U e))  U e3))  U e3;  the 
other  is  (ei  LI  e2)  U e,  because  / n g = (ei  U (e2  U e))  Ue3Ue  and  / EE  ei  = ex  U e2  U e3. 

(b)  (/  U g){z)  = 3x  By  (/( x)  A g{y)  A (z  = x V y));  (f  H g)(z)  = Bx  By  ( f(x ) A 
g(y)  A (z  = x A y));  (f  EB  g){z)  = Bx  By  ( f(x ) A g(y)  A (z  = x © y)).  Another  formula  is 

(/  ffl  g)(z ) = V{/(«  © y ) I g(y)  = 1}  = \/{g(z  © x)  \ f(x)  = 1}. 

(c)  Both  (i)  and  (ii)  are  true;  also  / ffl  (g  U h)  = (/  ffl  g)  U (/  ffl  h).  Formula  (iii) 
fails  in  general,  although  we  do  have  / U (g  n h)  C (/  U g)  n (/  U h).  Formula  (iv)  makes 
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little  sense;  the  right-hand  side  is  (/U/)U(/U/i)U(gU/)U(jUli),  by  (i).  Formula 
(v)  is  true  because  all  three  parts  are  0.  And  (vi)  is  true  if  and  only  if  / / 0. 

(d)  Only  (ii)  is  always  true.  For  (i),  the  condition  should  be  / n g C e,  since 
/ n g = 0 implies  / _L  g.  For  (iii),  notice  that  \f  U g\  = \ f n g\  = \f  ffl  g|  = 1 whenever 
I/I  = Iffl  = 1-  Finally,  in  statement  (iv),  we  do  have  / _L  g =>■  f L\  g = f m g\  but  the 
converse  fails  when,  say,  f = g = e iUt. 

(e)  / = 0 in  (i)  and  / = e in  (ii);  also  e E0  g = g for  all  g.  There’s  no  solution 
to  (iii),  because  / would  have  to  be  {{1,2,3,...}}  and  we  are  considering  only  finite 
sets.  But  in  the  finite  universe  of  answer  201  we  have  / = {{1, . . . , TV}}.  (This  family  U 
has  the  property  that  (/  03  U)  U (g  Eg  U)  = (/  n g)  EE]  U .)  The  general  solution  to  (iv) 
is  / = ei  U e2  U /',  where  /'  is  an  arbitrary  family;  similarly,  the  general  solution 
to  (v)  is  / = (er  U /')  U (e2  U /")  U (ei  U e2  U (/'  U /"  U /"')),  where  /',  /",  and  /"' 
are  arbitrary.  In  (vi),  / = ((((ei  U e2)  Ue)U  /')  U ((ei  U e2)  U /''))  U (e3  U e),  where 
f U /"  1 ei  Ue2  Ue3;  this  representation  follows  from  exercise  204(f).  In  (vii),  |/|  = 1. 
Finally,  (viii)  characterizes  Horn  functions  (Theorem  7.1.1H). 

204.  (a)  This  relation  is  obvious  from  the  definition.  (Also  (fUg)/h  D (f/h)  U (g/h).) 

(b)  //e2  = {{1}, 0}  = ei  Ue;  //e i = e2  Ue3;  f/e  = /;  hence  f/(e i Ue)  = e2  Ue3. 

(c)  Division  by  0 gives  trouble,  because  all  sets  a belong  to  //0.  (But  if  we 
restrict  consideration  to  families  of  subsets  of  {1, . . . , N},  as  in  exercises  201  and  207, 
we  have  //0  = p;  also  p/p  = e,  and  //p  = 0 when  / / p.)  Clearly  f/e  = f.  And 
f/f  = e when  / / 0.  Finally,  (f  mod  g)/g  = 0 when  g / 0,  because  a € (f  mod  g)/g 
and  )3eg  implies  that  a U P € /,  a € f/g,  and  aU^  (f/g)  Uj-a  contradiction. 

(d)  If  (3  E g,  we  have  P U a e f and  P n a = 0 for  all  a e f/g ; this  proves  the 
hint.  Hence  f/g  C f/(f/(f/g)).  Also  f/h  C f/g  when  h D g,  by  (a);  let  h = f/(f/g). 

(e)  Let  f //g  be  the  family  in  the  new  definition.  Then  f/g  C f//g,  because 
9 U (f/g)  ^ / and  g _L  (f/g).  Conversely,  if  a e f //g  and  Peg,  we  have  a e h for 
some  h with  g U h C / and  g _L  h;  consequently  a U P e f and  ad  P = 9. 

(f)  If  / has  such  a representation,  we  must  have  g = f/ej  and  h = / mode.,. 
Conversely,  those  families  satisfy  e,  1 j U h.  (This  law  is  the  fundamental  recursive 
principle  underlying  ZDDs  — just  as  the  unique  representation  / = (xj?  g:  h ),  with  g 
and  h independent  of  Xj,  underlies  BDDs.) 

(g)  Both  true.  (To  prove  them,  represent  / and  g as  in  part  (f).) 

[R.  K.  Brayton  and  C.  McMullen  introduced  the  quotient  and  remainder  opera- 
tions in  Proc.  Int.  Symp.  Circuits  and  Systems  (IEEE,  1982),  49-54,  but  in  a slightly 
different  context:  They  dealt  with  families  of  incomparable  sets  of  subcubes.] 

205.  In  all  cases  we  construct  a recursion  based  on  exercise  204(f).  For  example,  if 
fu  = gv  = v,  we  have  / U g = (v?  /,  U g,-.  (/,  U gh)  U (fh  U gt)  U (fh  U Sh));  fUg  = 
(v?  (fingi)U(fingh)U(fhngi):  fhngh)-,  fmg  = (v?  (fimg,)U(fhmgh):  (fhBgi)U(fimgh)). 

(a)  If  fv  < gv  or  (/„  = gv  and  / > g),  swap  / -H-  g.  If  / = 0,  return  /;  if  / = e, 
return  g.  If  / U g = r is  in  the  memo  cache,  return  r.  If  fv  > gv,  set  r,  e-  JOIN (f,gi) 
and  rh  e-  JOlN(f,gh);  otherwise  set  rt  e-  JOIN (fi,gi),  rth  4-  JOIN (ft,gh),  rM  e- 
JOIN(/h,g(),  rhh  <—  JOIN (fh,gh),  rh  4—  OROR (rih,rhi,rhh),  and  dereference  rih,  thi , 
Thh-  Finish  with  r 4—  ZUNIQUE^, n, r^);  cache  it  and  return  it  as  in  exercise  198. 

(We  could  also  compute  rh  via  the  formula  OR(n/,,  JOIN(/h, OR(g;, gh))),  or  via 
OR(r/i;,  JOIN(OR(/(,  fh),gh))-  Sometimes  one  way  is  much  better  than  the  other  two.) 

The  DISJOIN  operation,  which  produces  the  family  of  disjoint  unions  {a  U P \ 
a e /,  P e g,  a n P = 0},  is  similar  but  with  rhh  omitted. 
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(b)  If  fv  < gv  or  (/„  = gv  and  / > g),  swap  /f>j.  If  / < e,  return  /.  (We 
consider  0 < e and  e < all  others.)  Otherwise,  if  MEET(/,  g)  hasn’t  been  cached,  there 
are  two  cases.  If  fv  > gv,  set  rh  <—  OR (gi,gh),  r <—  MEET(/,rh),  and  dereference  r/,; 
otherwise  proceed  analogously  to  (a)  but  with  l o h.  Cache  and  return  r as  usual. 

(c)  This  operation  is  similar  to  (a),  but  ri  <—  OR (ru,rhh)  and  77  <—  OR(nii,rw). 

(d)  First  we  implement  the  important  simple  cases  f /ev  and  / mod  e„: 

{If  fv  = v,  return  //,;  if  fv  > v,  return  0.  Otherwise  look  for 
f /ev  = r in  the  cache;  if  it  isn’t  present,  compute  it  via 
r <-  ZUNIQUE(/„,  EZDIV(/;,  v),  EZDIV(/h,  v)). 

{If  fv  = v,  return  /;;  if  fv  > v,  return  /.  Otherwise  look  for 
/ mod  ev  — r in  the  cache;  if  it  isn’t  present,  compute  it  via 
r <-  ZUNIQUE(/„,  EZMOD (/i , r) , EZMOD (//,,  v)). 

Now  DIV(/,  g)  = “If  3 = 0,  see  below;  if  g = e,  return  /.  Otherwise,  if  / < e,  return  0; 
if  / = g,  return  e.  If  gi  = 0 and  gh  = e,  return  EZDIV(/, gv).  Otherwise,  if  f /g  = r is 
in  the  memo  cache,  return  r.  Otherwise  set  r;  «—  EZDIV(/, gv),  r <—  DYV(ri,  gh),  and 
dereference  r;.  If  r 0 and  gi  ^ 0,  set  rh  t—  EZMOD(/,3„)  and  n ■<—  DIV(rh,3i),  deref- 
erence rh,  set  rh  <—  r and  r <—  AND(r;,  rh),  dereference  r;  and  rh-  Insert  'f/g  = r’  in  the 
memo  cache  and  return  r.”  Division  by  0 returns  p if  there  is  a fixed  universe  {1, . . . , N } 
as  in  exercise  201.  Otherwise  it’s  an  error  (because  the  universal  family  p doesn’t  exist). 

(e)  If  g = 0,  return  /.  If  g = e,  return  0.  If  ( gi,gh ) = (0,  e),  return  EZMOD (f,gv)- 
If  / mod  3 = r is  cached,  return  it.  Otherwise  set  r <—  DIV(/, g)  and  r/,  «—  JOIN(r, 3), 
dereference  r,  set  r <—  BL'TNOT(/, rv, ) , and  dereference  rh-  Cache  and  return  r. 

[S.Minato  gave  EZDIV(/, v),  EZREM(/, v),  and  DELTA(/, ev)  in  his  original 
paper  on  ZDDs.  His  algorithms  for  JOIN(/,  3)  and  DIV(/,  3)  appeared  in  the  sequel, 
ACM/IEEE  Design  Automation  Conf.  31  (1994),  420-424.] 

206.  The  upper  bound  0(Z(f)3Z(g)3)  is  not  difficult  to  prove  for  cases  (a)  and  (b), 
as  well  as  0(Z(f)2 Z(g)2)  for  case  (c).  But  are  there  examples  that  take  such  a long 
time?  And  can  the  running  time  for  (d)  be  exponential?  All  five  routines  seem  to  be 
reasonably  fast  in  practice. 

207.  If  / = a1  U • • • U e,,  and  k > 0,  let  SYM(/,  v,  k)  be  the  Boolean  function  that  is 
true  if  and  only  if  exactly  k of  the  variables  {xit , . . . ,Xit}  fl  {xv,xv+i, . . . } are  1 and 
xi  = • • • = xv-i  = 0.  We  compute  (e^  U • • • U e,, ) § k by  calling  SYM(/,  1,  k). 

SYM(/,  v,  k)  = “While  fv  < v,  set  f <—  fi-  If  /„  = N + 1 and  k > 0,  return  0. 
If  fv  = N + 1 and  k = 0,  return  the  partial-tautology  function  tv  (see  answer  201).  If 
/§u§fc  = r is  in  the  cache,  return  r.  Otherwise  set  r <—  SYM (/,  /„  + 1,  k).  If  k > 0,  set 
q ■<—  SYM(/i,/„  + l,k  — 1)  and  r «—  ZUNIQUE(/„, r, 3).  While  fv  > v,  set  — 1, 

increase  REF(r)  by  1,  and  set  r <—  ZUNIQUE(/„,  r,  r).  Put  ‘/  § v § k = r1  in  the  cache, 
and  return  r.”  The  running  time  is  0((k  + 1 )N).  Notice  that  0 § 0 = p. 

208.  Just  omit  the  factors  2Va~1~x , 2Vl~Vk~1 , and  2Vh~Vk~1  from  steps  Cl  and  C2. 
(And  we  get  the  generating  function  by  setting  Ck  <-  ci+zch  in  step  C2;  see  exercise  25.) 
The  number  of  solutions  equals  the  number  of  paths  in  the  ZDD  from  the  root  to  [T] . 

209.  Initially  compute  5n  «—  J_  and  Sj  e—  (xj+i  o Xj+i)  • <5j+i  for  n > j > 1.  Then, 
where  answer  31  says  ‘a  «—  (xj  o Xj)  • a’,  change  it  to  ‘a  «—  (xj  • a)  o ( Xj  • Sj)’.  Also 
make  the  analogous  changes  with  /3  and  7 in  place  of  a. 
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210.  In  fact,  when  x = x\  . . . x„  we  can  replace  vx  in  the  definition  of  g by  any  linear 
function  c(x)  = Cixi  + • • • + cnxn,  thus  characterizing  all  of  the  optimal  solutions  to 
the  general  Boolean  programming  problem  treated  by  Algorithm  B. 

For  each  branch  node  x of  the  ZDD,  with  fields  V (re) , LO(.x).  HI  (x) , we  can  com- 
pute its  optimum  value  M(x)  and  new  links  L(x),  H(x)  as  follows:  Let  mi  = M(L0(x)) 
and  mh  = cV(x) +M(HI  (x) ) , where  M([I])  = -oo  andM([T|)  = 0.  Then  L(x)  <-  L0(x) 
if  mi  > mh,  otherwise  L(x)  *-  [X| ; H(x)  -f-  HI  (a;)  if  mt  < mh,  otherwise  H(x)  <-  [X]. 

The  ZDD  for  g is  obtained  by  reducing  the  L and  H links  accessible  from  the  root. 

Notice  that  Z(g)  < Z(f),  and  the  entire  computation  takes  0(Z(f))  steps.  (This  nice 
property  of  ZDDs  was  pointed  out  by  O.  Coudert;  see  answer  237.) 

211.  Yes,  unless  the  matrix  has  all-zero  rows.  Without  such  rows,  in  fact,  the  profile 
and  2-profile  of  / satisfy  bk  > qu  — 1 > Zk  for  0 < k < n,  because  the  only  level-fc 
subfunction  independent  of  Xk+i  is  the  constant  0. 

212.  The  best  alternative  in  the  author’s  experiments  was  to  make  ZDDs  for  each 

term  T)  = S i(Xj)  in  (129),  using  the  algorithm  of  exercise  207,  and  then  to  AND  them  ,d 

together.  For  example,  in  problem  (128)  we  have  Xi  = {xi,x2},  X2  = {xi,x3,x4}, 

■ ■ ■ , Xe4  = {xio5 , X112 };  to  make  the  term  Si  (X2)  = S i(xi,  x3,  x4),  whose  ZDD  has  115 
nodes,  just  form  the  5-node  ZDD  for  ei  U (e3  U e4)  and  compute  T2  <-  (ei  U e3  U e4)  § 1. 

But  in  what  order  should  the  ANDs  be  done,  after  we’ve  got  the  individual  terms 
T\ , . . . , Tn  of  (129)?  Consider  problem  (128).  Method  1:  T\  Ti  A T2,  Ti  «—  Ti  A T3, 

. . . , Ti  <—  Ti  A Tq4 . This  “top-down”  method  fills  in  the  upper  levels  first,  and  takes 
about  6.2  megamems.  Method  2:  T64  <-  Te4  A T63,  T64  <—  T64  A T62,  ...,  T64  t- 
T64  A Ti.  By  filling  in  the  lower  levels  first  (“bottom-up”),  the  time  goes  down  to 
about  1.75  megamems.  Method  3:  T2  <-  T2  A Tx,  T4  <-  T4  A T3,  . . . , T64  t-  T64  A T63; 

T4  <-  T4  A T2,  T8  <-  T8  A Te,  . . . , T64  t-  T64  A T62;  T8  ■<—  Ts  A T4,  T16  <-  Ti6  A T12,  . . . , 

Tf34  <—  T64  ATeo;  . . . ; Ta4  T64  A T32.  This  “balanced”  approach  also  takes  about  1.75 
megamems.  Method  J,:  T33  t—  T33  A Ti,  T34  T34  A T2,  . . . , T64  T64  A T32;  T49  e- 

T49  A T33,  T50  t-  T50  A T34,  . . . , T64  <-  T64  A T48;  T57  t-  T57  A T49,  T58  <- T58AT50,  ..., 

T(34  <—  T64  A T56;  . . . ; Tq4  <—  T64  A Tq3  . This  is  a much  better  way  to  balance  the  work, 
needing  only  about  850  kilomems.  Method  5:  An  analogous  balancing  strategy  that  uses 
the  ternary  ANDAND  operation  turns  out  to  be  still  better,  costing  just  675  kilomems. 

(In  all  five  cases,  add  190  kilomems  for  the  time  to  form  the  64  initial  terms  Tj.) 

Incidentally,  we  can  reduce  the  ZDD  size  from  2300  to  1995  by  insisting  that 
X]  = 0 and  x2  = 1 in  (128)  and  (129),  because  the  “transpose”  of  every  covering  is 
another  covering.  This  idea  does  not,  however,  reduce  the  running  time  substantially. 

The  rows  of  (128)  appear  in  decreasing  lexicographic  order,  and  that  may  not  be 
ideal.  But  dynamic  variable  ordering  is  unhelpful  when  so  many  variables  are  present. 

(Sifting  reduces  the  size  from  2300  to  1887,  but  takes  a long  time.) 

Further  study,  with  a variety  of  exact  cover  problems,  would  clearly  be  desirable. 

213.  It  is  a bipartite  graph  with  30  vertices  in  one  part  and  32  in  the  other.  (Think  of 

a chessboard  as  a checkerboard : Every  domino  joins  a white  square  to  a black  square,  t- 

and  we’ve  removed  two  black  squares.)  A row  sum  of  (1, ... , 1, 1,  *,  *)  has  Is  in  at  least 
31  “white”  positions,  so  its  last  two  coordinates  must  be  either  (2,1)  or  (3,2). 

214.  Add  further  constraints  to  the  covering  condition  (128),  namely  /\ji  1 S>i(Yj), 
where  Y)  is  the  set  of  Xi  that  cross  the  jth  potential  fault  line.  (For  example,  Yi  = 
{x2,x4,x6,x8,xi0,xi2,xi4,xis}  is  the  set  of  ways  to  place  a domino  vertically  in  the 
top  two  rows  of  the  board;  each  \Yj\  = 8.)  The  resulting  ZDD  has  9812  nodes,  and 
characterizes  25,506  solutions.  Incidentally,  the  BDD  size  is  26,622.  [Faultfree  domino 
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tilings  ofraxn  boards  exist  if  and  only  if  mn  is  even,  m > 5,  n > 5,  and  ( m , n)  / 
(6,6);  see  R.  L.  Graham,  The  Mathematical  Gardner  (Wadsworth  International,  1981), 
120  126.  The  solution  in  (127)  is  the  only  8x8  example  that  is  symmetric  under  both 
horizontal  and  vertical  reflection;  see  Fig.  29(b)  for  symmetry  under  90°  rotation.] 

215.  This  time  we  add  the  constraints  /\j=i  S>i(Zj),  where  Zj  is  the  set  of  four  place- 
ments Xi  that  surround  an  internal  corner  point.  (For  example,  Z\  = {xi , x2,  X4,  a?i6}-) 
These  constraints  reduce  the  ZDD  size  to  66.  There  are  just  two  solutions,  one  the 
transpose  of  the  other,  and  they  can  readily  be  found  by  hand.  [See  Y.  Kotani, 
Puzzlers’  Tribute  (A.  K.  Peters,  2002),  413-420.  The  set  of  all  tatami  tilings  has 
been  characterized  by  Dean  Hickerson;  the  corresponding  generating  functions  have 
been  obtained  by  Frank  Ruskey  and  Jennifer  Woodcock,  Electronic  J.  Combinatorics 
16,1  (2009),  #R126.] 

216.  (a)  Assign  three  variables  (ai,6i,Ci)  to  each  row  of  (128),  corresponding  to  the 
domino’s  color  if  row  i is  chosen.  Every  branch  node  of  the  ZDD  for  / in  (129)  now 
becomes  three  branch  nodes.  We  can  take  advantage  of  symmetry  under  transposition 
by  replacing  / by  / A xi\  this  reduces  the  ZDD  size  from  2300  to  1995,  which  grows  to 
5981  when  each  branch  node  is  triplicated. 

Now  we  AND  in  the  adjacency  constraints,  for  all  682  cases  where  rows  i 

and  i!  are  adjacent  domino  positions.  Such  constraints  have  the  form  -i((aj  A <v)  V 
(bi  A fv)  V (Cj  A Cj/)),  and  we  apply  them  bottom-up  as  in  Method  2 of  answer  212. 
This  computation  inflates  the  ZDD  until  it  reaches  more  than  800  thousand  nodes;  but 
eventually  it  settles  down  and  ends  up  with  size  584,205. 

The  desired  answer  turns  out  to  be  13,343,246,232  (which,  of  course,  is  a multiple 
of  3!  = 6,  because  each  permutation  of  the  three  colors  yields  a different  solution). 

(b)  This  question  is  distinct  from  part  (a),  because  many  coverings  (including 
Fig.  29(b))  can  be  3-colored  in  several  ways;  we  want  to  count  them  only  once. 

Suppose  /(ai,  &i,ci, . . . ,am,  6m,cm)  = f(xi, . . . ,x3rn)  is  a function  with  Oj  = 
X31-2,  bi  = X3i-i,  and  c,  = X3 <,  such  that  f(x  1, . . . ,X3m)  = 1 implies  a,  + bi  + a < 1 
for  1 < i < m.  Let’s  define  the  uncoloring  $/  of  / to  be 

$/(xi,  . . . ,Xm)  = 3r/l  • ■ • 3r/3m(/(yi,  ■ • • ,V3m) 

A (xi  = 2/1  + 2/2  + 2/3)  A ■ • ■ A (xm  = V3m-2  + 2/3m-l  + t/3m))- 

A straightforward  recursive  subroutine  will  compute  the  ZDD  for  $/  from  the  ZDD 
for  /.  This  process  transforms  the  584,205  nodes  obtained  in  part  (a)  into  a ZDD  of 
size  33,731,  from  which  we  deduce  the  answer:  3,272,232. 

(The  running  time  is  1.2  gigamems  for  part  (a),  plus  1.3  gigamems  to  uncolor; 
the  total  memory  requirement  is  about  44  megabytes.  A similar  computation  based  on 
BDDs  instead  of  ZDDs  cost  13.6  + 1.5  gigamems  and  occupied  185  megabytes.) 

217.  The  separation  condition  adds  4198  further  constraints  of  the  form  -i(x;  A av), 
where  rows  i and  % specify  adjacent  placements  of  congruent  pieces.  Applying  these 
constraints  while  also  evaluating  the  conjunction  A^i  Si(Xj)  turned  out  to  be  a bad 
idea,  in  the  author’s  experiments;  even  worse  was  an  attempt  to  construct  a separate 
ZDD  for  the  new  constraints  alone.  Much  better  was  to  build  the  512,227-node  ZDD 
as  before,  then  to  incorporate  the  new  constraints  one  by  one,  first  constraining  the 
variables  at  the  lowest  levels.  The  resulting  ZDD  of  size  31,300,699  was  finally  com- 
pleted after  286  gigamems  of  work,  proving  that  exactly  7,099,053,234,102  separated 
solutions  exist. 
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We  might  also  ask  for  strongly  separated  solutions,  where  congruent 
pieces  are  not  allowed  to  touch  even  at  their  corners;  this  requirement 
adds  1948  more  constraints.  There  are  554,626,216  strongly  separated 
coverings,  findable  after  245  gigamems  with  a ZDD  of  size  4,785,236. 

(But  standard  backtracking  finds  them  faster  and  with  neglible  memory.) 

218.  This  is  an  exact  cover  problem.  For  example,  the  matrix  when  n = 3 is 

001001010  (—2—2) 

010001001  (-3—3) 

010010010  (-2—2-) 

010100100  (-1-1—) 

100010001  (3  — 3-) 

100100010  (2—2—) 

101000100  (1-1—-) 

and  in  general  there  are  3n  columns  and  (2n2-1)  - (2)  rows.  Consider  the  case  n = 12: 

The  ZDD  on  187  variables  has  192,636  nodes.  It  can  be  found  with  a cost  of  300 
megamems,  using  Method  4 of  answer  212  (binary  balancing);  Method  5 turns  out  to 
be  25%  slower  than  Method  4 in  this  case.  The  BDD  is  much  larger  (2,198,195  nodes) 
and  it  costs  more  than  900  megamems. 

Thus  the  ZDD  is  clearly  preferable  to  the  BDD  for  this  problem,  and  it  identifies 
the  L 12  = 108,144  solutions  with  reasonable  efficiency.  (However,  the  “dancing  links” 
technique  of  Section  7.2.2  is  about  four  times  faster,  and  it  needs  far  less  memory.) 

219.  (a)  1267;  (b)  2174;  (c)  2958;  (d)  3721;  (e)  4502.  (To  form  the  ZDD  for  WORDS(n) 
we  do  n-  1 ORs  of  the  7-node  ZDDs  for  wi  Uh2  U i3  U c4  Uh5,  ti  U h2  U e3  U r4  U e5,  etc.) 

220.  (a)  There  is  one  a2  node  for  the  descendants  of  each  initial  letter  that  can  be 
followed  by  a in  the  second  position  (aargh,  babel,  . . . , zappy);  23  letters  qualify, 
all  except  q,  u,  and  x.  And  there’s  one  62  node  for  each  initial  letter  that  can  be  followed 
by  b (abbey,  ebony,  oboes).  However,  the  actual  rule  isn’t  so  simple;  for  example,  there 
are  three  z2  nodes,  not  four,  because  of  sharing  between  czars  and  tzars. 

(b)  There’s  no  vb  because  no  five-letter  word  ends  with  v.  (The  SGB  collection 
doesn’t  include  arxiv  or  webtv.)  The  three  nodes  for  11)5  arise  because  one  stands  for 
cases  where  the  letters  < W5  must  be  followed  by  w (agio  and  many  others);  another 
node  stands  for  cases  where  either  w or  y must  follow  (stra,  or  resa,  or  when  we’ve 
seen  alio  but  not  allot);  and  there’s  also  a UI5  node  for  the  case  when  unse  is  not 

HI  li'  fw 

followed  by  e or  t,  because  it  must  then  be  followed  by  either  w or  x.  Similarly,  the 
two  nodes  for  x5  represent  the  cases  where  x is  forced,  or  where  the  last  letter  must  be 
either  x or  y (following  rela).  There’s  only  one  2/5  node,  because  no  four  letters  can 
be  followed  by  both  y and  z.  Of  course  there’s  just  one  z5  node,  and  two  sinks. 

221.  We  compute,  for  every  possible  zead  C,  the  probability  that  ( will  occur,  and 

sum  over  all  £.  For  definiteness,  consider  a zead  that  corresponds  to  branching  on  r'3 , 
and  suppose  it  represents  a subfamily  of  10  three-letter  suffixes.  There  are  exactly 
( 10  ) ~ (Ji°8)  ~ 1-3  x 1031  such  zeads,  and  by  the  principle  of  inclusion  and  exclusion 
they  each  arise  with  probability  (T) (-f)fe+1  O18*™™  ^o*84*)/^®™?76)  ~ 2-5  x 

10-32.  [Hint:  \{r,s,t,u,v,w,x,y,z}\  = 9,  676  = 262,  and  6084  = 9 x 262.]  Thus  such 
zeads  contribute  about  0.33  to  the  total.  The  r3-zeads  for  subfamilies  of  sizes  1,  2,  3,  4, 

5,  • ■ • , contribute  approximately  11.5,  32.3,  45.1,  41.9,  29.3,  . . . , by  a similar  analysis; 
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so  we  expect  about  188.8  branches  on  r3  altogether,  on  average.  The  grand  total 
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plus  2 for  the  sinks,  comes  to  ss  7151.986.  The  average  z-profile  is  ss  (1.00,  . . . , 1.00; 
25.99,  . . . , 25.99;  188.86,  . . . , 171.43;  86.31,  . . . , 27.32;  3.53,  . . . , 1.00;  2.00). 

222.  (a)  It’s  the  set  of  all  subsets  of  the  words  of  F.  (There  are  50,569  such  subwords, 
out  of  275  = 14,348,907  possibilities.  They  are  described  by  a ZDD  of  size  18,784, 
constructed  from  F and  p via  answer  205(b)  at  a cost  of  about  15  megamems.) 

(b)  This  formula  gives  the  same  result  as  F n p,  because  every  member  of  F 
contains  exactly  one  element  of  each  Xj.  But  the  computation  turns  out  to  be  much 
slower  — about  370  megamems  — in  spite  of  the  fact  that  Z(X ) = 132  is  almost  as 
small  as  Z(p)  = 131.  (Notice  that  \p\  = 2130  while  |X|  = 265  « 223'5.) 

(c)  (F / P)  LI  P,  where  P = ti  U U3  U hs  is  the  pattern.  (The  words  are  touch, 
tough,  truth.  This  computation  costs  about  3000  mems  with  the  algorithms  of  answer 
205.)  Other  contenders  for  simple  formulas  are  FnQ,  where  Q describes  the  admissible 
words.  If  we  set  Q = ti  U X2  U U3  U X4  U hs,  we  have  Z(Q)  = 57  and  the  cost  once 
again  is  ~ 3000p.  With  Q = (ti  UU3  Uhs)  §3,  on  the  other  hand,  we  have  Z(Q ) = 132 
and  the  cost  rises  to  about  9000  mems.  (Here  |Q|  is  262  in  the  first  case,  but  2127  in 
the  second  — reversing  any  intuition  gained  from  (a)  and  (b)!  Go  figure.) 

(d)  Ffl((ViU  • • • U V5)  §/c).  The  number  of  such  words  is  (24,  1974,  3307,  443,  9,  0) 
for  k = (0,  . . . , 5),  respectively,  from  ZDDs  of  sizes  (70,  1888,  3048,  686,  34,  1).  (“See 
exercise  7-34  for  the  words  F mod  yx  mod  y2  mod  ■ ■ ■ mod  y5,”  said  the  author  wryly.) 

(e)  The  desired  patterns  satisfy  P = (Fn  p)  HQ,  where  Q = ((AA  U • ■ • LDY5)  §3). 
We  have  Z(Q)  = 386,  Z(P)  = 14221,  and  |P|  = 19907. 

(f)  The  formula  for  this  case  is  trickier.  First,  P2  = F n F gives  F together  with 
all  patterns  satisfied  by  two  distinct  words;  we  have  Z(P2)  = 11289,  |P2|  = 21234,  and 
|P2n<3|  = 7753.  But  P2DQ  is  not  the  answer;  for  example,  it  omits  the  pattern  *atc*, 
which  occurs  eight  times  but  only  in  the  context  *atch.  The  correct  answer  is  given  by 
P2nQ,  where  P2  = (P2\F)flp.  Then  Z(P2)  = 8947,  Z(P2nQ)  = 7525,  |P2nQ|  = 10472. 

(g)  Gi  U ■ ■ ■ U Gg,  where  Gj  = (F/(bj  U o j))  Ubj.  The  answers  are  bared,  bases, 
basis,  baths,  bobby,  bring,  busts,  herbs,  limbs,  tribs. 

(h)  Patterns  that  admit  all  vowels  in  second  place:  b*lls,  b*nds,  m*tes,  p*cks. 

(i)  The  first  gives  all  words  whose  middle  three  letters  are  vowels.  The  second 
gives  all  patterns  with  first  and  last  letter  specified,  for  which  there’s  at  least  one 
match  with  three  vowels  inserted.  There  are  30  solutions  to  the  first,  but  only  27  to 
the  second  (because,  e.g.,  louis  and  luaus  yield  the  same  pattern).  Incidentally,  the 
complementary  family  p \ F has  2130  - 57  57  members,  and  46316  nodes  in  its  ZDD. 

223.  (a)  d(a,ix)  + d(/3,/x)  + d( 7,^)  = 5,  since  d(a,fi)  = [on  //rx]  -f 1-  [05/^5], 

(b)  Given  families  /,  g,  h,  the  family  {p  | ^ = ( a/3j ) for  some  a G /,  /3  € g, 

7 G /i  with  a / n,  (3  ^ /r,  7 / fj,,  and  a ft  /3  PI  7 = 0}  can  be  defined  recursively  to 
allow  ZDD  computation,  if  we  consider  eight  variants  in  which  subsets  of  the  inequality 
constraints  are  relaxed.  In  the  author’s  experimental  system,  the  ZDDs  for  medians 
of  WORDS  (n)  for  n = (100,  1000,  5757)  have  respectively  (595,  14389,  71261)  nodes 
and  characterize  (47,  7310,  86153)  five-letter  solutions.  Among  the  86153  medians 
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when  n = 5757  are  chads,  stent,  blogs,  ditzy,  phish,  bling,  and  tetch;  in  fact, 
tetch  = (fetch  teach  total)  arises  already  when  n = 1000.  (The  running  times  of 
about  (.01,  2,  700)  gigamems,  respectively,  were  not  especially  impressive;  ZDDs  are 
probably  not  the  best  tool  for  this  problem.  Still,  the  programming  was  instructive.) 

(c)  When  n = 100,  exactly  (1,  14,47)  medians  of  WORDS  (n)  belong  to  WORDS  (100) , 
WORDS(IOOO) , W0RDS(5757),  respectively;  the  solution  with  most  common  words  is 
while  = (white  whole  still).  When  n = 1000,  the  corresponding  numbers  are  (38, 
365,  1276);  and  when  n = 5757  they  are  (78,  655,  4480).  The  most  common  English 
words  that  aren’t  medians  of  three  other  English  words  are  their,  first,  and  right. 

224.  Every  arc  u — > v of  the  dag  corresponds  to  a vertex  v of  the  forest.  The  ZDD 
has  exactly  one  branch  node  for  every  arc.  The  LO  pointer  of  that  node  leads  to  the 
right  sibling  of  the  corresponding  vertex  v,  or  to  [X]  if  v has  no  right  sibling.  The  HI 
pointer  leads  to  the  left  child  of  v , or  to  [T]  if  v is  a leaf.  The  arcs  can  be  ordered  in 
many  ways  (e.g.,  preorder,  postorder,  level  order),  without  changing  this  ZDD. 

225.  As  in  exercise  55,  we  try  to  number  the  vertices  in  such  a way  that  the  “frontier” 
between  early  and  late  vertices  remains  fairly  small;  then  we  needn’t  remember  too 
much  about  what  decisions  were  made  on  the  early  vertices.  In  the  present  case  we 
also  want  the  source  vertex  s to  be  number  1. 

In  answer  55,  the  relevant  state  from  previous  branches  corresponded  to  an 
equivalence  relation  (a  set  partition);  but  now  we  express  it  by  a table  maie[i]  for 
j < i < l,  where  j = Uk  is  the  smaller  vertex  of  the  current  edge  Uk  — Vk  and  where 
l = max{i>i,. . . ,r>fc_x}.  Let  mate[i]  = i if  vertex  i is  untouched  so  far;  let  mate[i\  = 0 
if  vertex  i has  been  touched  twice  already.  Otherwise  mate[i\  = r and  mate[r]  = i,  if 
previous  edges  form  a simple  path  with  endpoints  {i,r}.  Initially  we  set  mate[i]  <-  i for 
1 < i < n,  except  that  mate[  1]  <-  t and  mate[t ] «-  1.  (If  t > l,  the  value  of  mate[t]  need 
not  be  stored,  because  it  can  be  determined  from  the  values  of  mate[i]  for  j < i < l .) 

Let  j'  = Uk+ 1 and  l'  = max{ui, . . . , u*,}  be  the  values  of  j and  l after  edge  k 
has  been  considered;  and  suppose  Uk  = j,  Vk  = m,  mate[j ] = j,  mate[m]  = m.  We 
cannot  choose  edge  j — m if  j = 0 or  m = 0.  Otherwise,  if  j / m,  the  new  mate  table 
after  choosing  edge  j — m can  be  computed  by  doing  the  assignments  mate[j]  <—  0, 
mate[m]  «—  0,  mate[j\  <—  m,  mate[m]  «—  j (in  that  order). 

Otherwise  we  have  j = m and  m = j;  we  must  contemplate  the  endgame.  Let 
* be  the  smallest  integer  such  that  i > j,  i / m,  and  either  i > l'  or  mate[i\  ^ 0 and 
mate[i ] ^ i.  The  new  state  after  choosing  edge  j — m is  0 if  * < l1,  otherwise  it  is  e. 

Whether  or  not  the  edge  is  chosen,  the  new  state  will  be  0 if  mate[i\  ^ 0 and 
mate[i\  ^ i for  some  i in  the  range  j < i < j'. 

For  example,  here  are  the  first  steps  for  paths  from  1 to  9 in  a 3x3  grid  (see  (132)): 


k 

j 

l 

m 
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9 
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003956784 

where  mate 1 

describes  the  next  state  if  edge 

j — 

- m 

is  chosen.  The  state  transitions 

mate^.ij-^  matey  v are  9 (12?  92:  09);  92  (13?  0:  29);  09  i->  (13?  93:  0); 

29  (24?  294:  492);  93  (24?  934:  039). 

After  all  reachable  states  have  been  found,  the  ZDD  can  be  obtained  by  reducing 
equivalent  states,  using  a procedure  like  Algorithm  R.  (In  the  3x3  grid  problem, 
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57  branch  nodes  are  reduced  to  28,  plus  two  sinks.  The  22-branch  ZDD  illustrated  in 
the  text  was  obtained  by  subsequently  optimizing  with  exercise  197.) 

226.  Just  omit  the  initial  assignments  lm.ate[  1]  «—  t,  mate[t\  «—  1.’ 

227.  Change  the  test  ‘ mate[i ] ^ 0 and  mate[i\  / i'  to  just  lmate[i]  / 0’  in  two  places. 
Also,  change  li  < l”  to  ‘i  < n’. 

228.  Use  the  previous  answer  with  the  following  further  changes:  Add  a dummy  vertex 
d = n + 1,  with  new  edges  v — d for  all  v ^ s;  accepting  this  new  edge  will  mean  “end 
at  v .”  Initialize  the  mate  table  with  mate[l]  «—  d,  mate[d]  <—  1.  Leave  d out  of  the 
maximization  when  calculating  l and  V . When  beginning  to  examine  a stored  mate 
table,  start  with  mate[d]  <—  0 and  then,  if  encountering  mate[i]  = d,  set  mate[d]  i. 

229.  149,692,648,904  of  the  latter  paths  go  from  VA  to  MD;  graph  (133)  omits  DC. 
(However,  the  graphs  of  (18)  have  fewer  Hamiltonian  paths  than  (133),  because  (133) 
has  1,782,199  Hamiltonian  paths  from  CA  to  ME  that  do  not  go  from  VA  to  MD.) 

230.  The  unique  minimum  and  maximum  routes  from  ME  both  end  at  WA: 


Let  g(z)  = ^zmlles(r);  summed  over  all  routes  r.  The  average  cost,  g’(l)/g(l)  = 
1022014257375/68656026  ~ 14886.01,  can  be  computed  rapidly  as  in  answer  29. 

(Similarly,  g"  ( 1)  = 15243164303013274,  so  the  standard  deviation  is  « 666.2.) 

231.  The  algorithm  of  answer  225  gives  a proto-ZDD  with  8,062,831  branch  nodes;  it 
reduces  to  a ZDD  with  3,024,214  branches.  The  number  of  solutions,  via  answer  208, 
is  50,819,542,770,311,581,606,906,543. 

232.  With  answer  227  we  find  h = 721,613,446,615,109,970,767  Hamiltonian  paths 
from  a corner  to  its  horizontal  neighbor,  and  d = 480,257,285,722,344,701,834  of  them 
to  its  diagonal  neighbor;  in  both  cases  the  relevant  ZDD  has  about  1.3  million  nodes. 
The  number  of  oriented  Hamiltonian  cycles  is  2 h + d=  1,923,484,178,952,564,643,368. 
(Divide  by  2 to  get  the  number  of  undirected  Hamiltonian  cycles.) 

Essentially  only  two  king’s  tours  achieve  the  maximal  length  8 + 56\/2: 


233.  A similar  procedure  can  be  used  but  with  mate[i]  = r and  mate[r]  = —i  when 
the  previous  choices  define  an  oriented  path  from  i to  r.  Process  all  arcs  Uk  — >i’k  and 
Uk< — vk  consecutively  when  Uk  = j < vk  = m.  Define  j = —j  if  mate[j)  = j,  otherwise 
j = mate\j\.  Choosing  j — >m  is  illegal  if  j > 0 or  rh  < 0.  The  updating  rule  for  that 
choice,  when  legal,  is:  mate[j]  «-  0,  mate[m]  «-  0,  mate[—j]  «-  rh,  mate[rh]  <—  j. 

234.  The  437  oriented  cycles  can  be  represented  by  a ZDD  of  « 800  nodes.  The  short- 

est are,  of  course,  AL — >-LA — > AL  and  MN — » NM — >-MN.  There  are  37  of  length  17  (the 
maximum),  such  as  (ALARINVTNMIDCOKSC)  i.e.,  AL — >LA — > >-SC — >Ck — >AL. 
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Incidentally,  the  directed  graph  in  question  is  the  arc  digraph  D"  of  the  digraph  D 
on  26  vertices  {A,B, . . . , Z } whose  49  arcs  are  A — > L , A — * R . . . . , W — > Y.  Every  oriented 
walk  of  D*  is  an  oriented  walk  of  D.  and  conversely  (see  exercise  2.3.4.2-21);  but  the 
oriented  cycles  of  D*  are  not  necessarily  simple  in  D.  In  fact,  D has  only  37  oriented 
cycles,  the  longest  of  which  is  unique:  (ARINMOKSDC). 

If  we  extend  consideration  to  the  62  postal  codes  in  exercise  7-54(c),  the  number 
of  oriented  cycles  rises  to  38336,  including  the  unique  1-cycle  (A),  as  well  as  192  that 
have  length  23,  such  as  ( APRI ALASCTNMNVINCOKSDCA ) . About  17000  ZDD  nodes  suffice 
to  characterize  the  entire  family  of  oriented  cycles  in  this  case. 

235.  The  digraph  has  7912  arcs;  but  we  can  prune  them  dramatically  by  removing 
arcs  from  vertices  of  in-degree  zero,  or  arcs  to  vertices  of  out-degree  zero.  For  example, 
owner  tnerdy  goes  away,  because  nerdy  is  a dead  end;  in  fact,  all  successors  of  owner 
are  likewise  eliminated,  so  crown  is  out  too.  Eventually  we’re  left  with  only  112  arcs 
among  85  words,  and  the  problem  can  basically  be  done  by  hand. 

There  are  just  74  oriented  cycles.  The  unique  shortest  one,  slant  — > antes  — > 
tesla  Aslant,  can  be  abbreviated  to  ‘(slante)’  as  in  the  previous  answer.  The  two 
longest  are  (aw)  and  ((3w),  where  a = picastepsomaso,  (3  — pointrotherema,  and 

uj  = nicadrearedidoserumoreliciteslabsitaresetuplenactoricedarerunichesto. 

236.  (a)  Suppose  a € / and  /3  € g.  If  a C /3,  then  a € f n g.  If  a n /3  G /,  then 
an/3  £ f Sg.  A similar  argument,  or  the  use  of  part  (b),  shows  that  f \<g  = f\(f  U g). 

Notes:  The  complementary  operations  uf'\g  = f\(f\g)  = {a€f\aDf3 
for  some  /3  6 g}v  for  supersets,  and  “/  ^ g = / \ (/  T g)  = {a  G / | a C /3  for  some 
0 € g}”  for  subsets,  are  also  important  in  applications.  They  were  omitted  from  this 
exercise  only  because  five  operations  are  already  rather  intimidating.  The  superset 
operation  was  introduced  by  O.  Coudert,  J.  C.  Madre,  and  H.  Fraisse  [ACM/IEEE 
Design  Automation  Conference  30  (1993),  625-630],  The  identity  /\j  = /n(/Ug) 
was  noted  by  H.  G.  Okuno,  S.  Minato,  and  H.  Isozaki  [ Information  Processing  Letters 
66  (1998),  195-199],  who  also  listed  several  of  the  laws  in  (d). 

(b)  Elementary  set  theory  suffices.  (The  first  six  identities  appear  in  pairs,  each 
of  which  is  equivalent  to  its  mate.  Strictly  speaking,  fc  involves  infinite  sets,  and  U 
is  the  AND  of  infinitely  many  variables;  but  the  formulas  hold  in  any  finite  universe. 
Notice  that,  when  cast  in  the  language  of  Boolean  functions,  fc(x)  = f(x)  is  the 
complement  of  fD,  the  Boolean  dual;  see  exercise  7. 1.1-2.  Is  there  any  use  for  the  dual 
of  /a,  namely  {a  | /3  6 / implies  aU^  / U}^?  If  so,  we  might  denote  it  by  /b.) 

(c)  All  true  except  (ii),  which  should  have  said  that  x\  = x°iC  = x\c  = ec  = U. 

(d)  The  “identities”  to  cross  out  here  are  (ii),  (viii),  (ix),  (xiv),  and  (xvi);  the 

others  are  worth  remembering.  Regarding  (ii)-(vi),  notice  that  / = /r  if  and  only  if 
/ = if  and  only  if  / is  a clutter.  Formula  (xiv)  should  be  / \ <?,  the  dual 

of  (xiii).  Formula  (xvi)  is  almost  right;  it  fails  only  when  / = 0 or  g = 0.  Formula  (ix) 
is  perhaps  the  most  interesting:  We  actually  have  fM  = f if  and  only  if  / is  a clutter. 

(e)  Assuming  that  the  universe  of  all  vertices  is  finite,  we  have  (i)/  = p\j  and 
(ii)  9 = (p  \ /)'*',  where  p is  the  universal  family  of  exercises  201  and  222,  because  g is 
the  family  of  minimal  dependent  sets.  (Purists  should  substitute  pv  = U„ev(e  U ev ) 
for  p in  these  formulas.  The  same  relations  hold  in  any  hypergraph  for  which  no  edge 
is  contained  in  another.) 

237.  MAXMAL(/)  = “If  / = 0 or  / = e,  return  /.  If  f^  = r is  cached,  return  r.  Oth- 
erwise set  r <—  MAXMAL(/z),  rh  t—  MAXMAL (/h),  rj  t—  NONSUB(r, r/j),  dereference  r, 
and  r •<—  ZUNIQUE(/„, r;, r^);  cache  and  return  r.” 
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MINMAL(/)  = “If  / = 0 or  / = e,  return  /.  If  f^=r  is  cached,  return  r.  Oth- 
erwise set  r;  4—  MINMAL(/;),  r 4—  MINMAL(/h),  r/,  4—  NONSUP(r,  n),  dereference  r, 
and  r 4—  ZUNIQUE(/„, r;, rh)\  cache  and  return  r.” 

NONSUB(/,g)  = “If  <7  = 0,  return  /.  If  / = 0 or  / = e or  / = g,  return  0. 
If  / 9 = r is  cached,  return  r.  Otherwise  represent  / and  g as  in  (52).  If  v < gv, 

set  r;  4—  NONSUB(/j,g),  rh  4—  fh,  and  increase  REF(//,)  by  1;  otherwise  set  rh  4— 
NONSUB (fi,gi),  r 4—  NONSUB (fi,gh),  n 4—  AND(r,r^),  dereference  r and  rh,  and  set 
rh  4—  NONSUB (fh,gh)-  Finally  r 4—  ZUNIQUE(u,  ri,  r^);  cache  and  return  r.” 

NONSUP  (/,g)  = “If  g = 0,  return  /.  If  / = 0 or  g = e or  / = g,  return  0. 
If  fv  > g„,  return  NONSUP(/, gi).  If  / N g = r is  cached,  return  r.  Otherwise  set 
v = fv.  If  v < gv , set  r;  4—  NONSUP(/;,g)  and  4—  NONSUP(/h,g);  otherwise  set 
77  4—  NONSUP (fh,gh),  t 4—  NONSUP(/h, g;),  rh  4—  AND(r,r;),  dereference  r and  n, 
and  set  n 4—  NONSUP(/j, g;).  Finally  r 4—  ZUNIQUE(u, rj,  r^);  cache  and  return  r.” 

MINHIT(/)  = “If  / = 0,  return  e.  If  / = e,  return  0.  If  fi=r  is  cached,  return  r. 
Otherwise  set  r 4—  OR(fi,  fh),  ri  4 — MINHIT(r),  dereference  r,  r 4—  MINHIT(/i), 
rh  4—  NONSUP(r,  r;),  dereference  r,  and  r 4—  ZUNIQUE(/„,  n,  r^);  cache  and  return  r.” 

As  in  exercise  206,  the  worst-case  running  times  of  these  routines  are  unknown. 
Although  NONSUB  and  NONSUP  can  be  computed  via  JOIN  or  MEET  and  BUTNOT, 
by  exercise  236(a),  this  direct  implementation  tends  to  be  faster.  It  may  be  preferable 
to  replace  lf  = e ’ by  ‘e  € /’  in  MINMAL  and  MINHIT;  also  ‘g  = e’  by  ‘e  £ g’  in  NONSUP. 

[Olivier  Coudert  introduced  and  implemented  the  operators  /T,  //'g,  and  / \g 
in  Proc.  Europ.  Design  and  Test  Conf.  (IEEE,  1997),  224-228.  He  also  gave  a recursive 
implementation  of  the  interesting  operator  / © g = (/  U g)^;  however,  in  the  author’s 
experiments,  much  better  results  have  been  obtained  without  it.  For  example,  if  / is 
the  177-node  ZDD  for  the  independent  sets  of  the  contiguous  USA,  the  operation  g 4— 
JOIN  (/,  /)  costs  about  350  kilomems  and  h 4—  MAXMAL(g)  costs  about  3.6  megamems; 
but  more  than  69  gigomems  are  needed  to  compute  h 4—  MAXJOIN(/, /)  all  at  once. 
Improved  caching  and  garbage-collection  strategies  may,  of  course,  change  the  picture.] 

238.  We  can  compute  the  177-node  ZDD  for  the  family  / of  independent  sets,  using 
the  ordering  (104),  in  two  ways:  With  Boolean  algebra  (67),  / = ->  \/  v(xu  A xv); 
the  cost  is  about  1.1  megamems  with  the  algorithms  of  answers  198-201.  With  family 
algebra,  on  the  other  hand,  we  have  / = p \ Uu  — „(e«  ^ e,))  by  exercise  236(e);  the 
cost,  via  answer  237,  is  less  than  175  kilomems. 

The  subsets  that  give  2-colorable  and  3-colorable  subgraphs  are  g — f U / and 
h = g LI  /,  respectively;  the  maximal  ones  are  g ^ and  h' . We  have  Z(g ) = 1009, 
Z(gt)  = 3040,  Z{h)  = 179,  Z(hf)  = 183,  |g|  = 9,028,058,789,780,  |gT|  = 2,949,441, 
|/i|  = 543,871,144,820,736,  and  |/f7|  = 384.  The  successive  costs  of  computing  g,  g^ , 
h,  and  hf  are  approximately  350  Kp  (kilomems),  3.6  M/r,  1.1  Mp,  and  230  Kg,.  (We 
could  compute  hf  by,  say,  (g^  U /)!;  but  that  turns  out  to  be  a bad  idea.) 

The  maximal  induced  bipartite  and  tripartite  subgraphs  have  the  respective 

generating  functions  7654z25  + b 9040233  + 689234  and  128z43  + 84z44  + 112z45  + 

36z46  + 24z47.  Here  are  typical  examples  of  the  smallest  and  largest: 


(Compare  with  the  smallest  and  largest  “1-partite”  subgraphs  in  7-(6i)  and  7-(62).) 
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Notice  that  the  families  g and  h tell  us  exactly  which  induced  subgraphs  can  be 
2-colored  and  3-colored,  but  they  don’t  tell  us  how  to  color  them. 

239.  Since  h = ((ei  U • • ■ U e49)  § 2)  \ g is  the  set  of  nonedges  of  G , the  cliques  are 
/ = p\h,  and  the  maximal  cliques  are  /f.  For  example,  we  have  Z(f)  = 144  for  the 
214  cliques  of  the  USA  graph,  and  Z(f^)  = 130  for  the  60  maximal  ones.  In  this  case 
the  maximal  cliques  consist  of  57  triangles  (which  are  easily  visible  in  (18)),  together 
with  three  edges  that  aren’t  part  of  any  triangle:  AZ  — MM,  WI  — MI,  NH  — ME. 

Let  fk  describe  the  sets  coverable  by  k cliques.  Then  fi  = /,  and  /fe+1  = fk  U f 
for  k > 1.  (It’s  not  a good  idea  to  compute  /16  as  fs  U fa;  much  faster  is  to  do  each 
join  separately,  even  if  the  intermediate  results  are  not  of  interest.) 

The  maximum  elements  of  fk  in  the  USA  graph  have  sizes  3,  6,  9,  ... , 36,  39, 
41,  43,  45,  47,  48,  49  for  1 < k < 19;  these  maxima  can  readily  be  determined  by  hand, 
in  a small  graph  such  as  this.  But  the  question  of  maximal  elements  is  much  more 
subtle,  and  ZDDs  are  probably  the  best  tool  for  investigating  them.  The  ZDDs  for 
fi,  • • • , /19  are  quickly  found  after  about  30  megamems  of  calculation,  and  they  aren’t 
large:  ma xZ(fk)  = Z(fn)  = 9547.  Another  400  megamems  produces  the  ZDDs  for 
/i , • • • , fl9,  which  likewise  are  small:  maxZ(fl)  = Z(fl1)  = 9458. 

We  And,  for  example,  that  the  generating  function  for  fls 
is  12z4'  + 13z48;  eighteen  cliques  suffice  to  cover  all  but  one  of 
the  49  vertices,  if  we  leave  out  CA,  DC,  FL,  IL,  LA,  MI,  MN,  MT, 

SC,  TN,  UT,  WA,  or  WV.  There  also  are  twelve  cases  where  we  can 
maximally  cover  47  vertices;  for  example,  if  all  but  NE  and  NM  are 
covered  by  18  cliques,  then  neither  of  those  states  are  covered.  An  unusual  example  of 
maximal  clique  covering  is  illustrated  here:  If  the  29  “black”  states  are  covered  by  12 
cliques,  none  of  the  “white”  states  will  also  be  covered. 

240.  (a)  In  fact,  the  subformula  f(x)  = f\v(xv  V M u_v  xu ) of  (68)  precisely  character- 
izes the  dominating  sets  x.  And  if  any  element  of  a kernel  is  removed,  it  isn’t  dominated 
by  the  others.  [C.  Berge,  Theorie  des  graphes  et  ses  applications  (1958),  44.] 

(b)  The  Boolean  formula  of  part  (a)  yields  a ZDD  with  Z(f)  = 888  after  about 
1.5  M/i  of  computation;  then  another  1.5  Mg  with  the  MINMAL  algorithm  of  answer 
237  gives  the  minimal  elements,  with  Z(f^)  = 2082. 

A more  clever  way  is  to  start  with  h = \Jv(ev  U\Ju_v  eu),  and  then  to  compute 
h\  because  h®  = /•*-.  However,  cleverness  doesn’t  pay  in  this  case:  About  80  Kg  suffice 
to  compute  h,  but  the  computation  of  h®  by  the  MINHIT  algorithm  costs  about  350  M/r. 

Either  way,  we  deduce  that  there  are  exactly  7,798,658  minimal  dominating  sets. 

More  precisely,  the  generating  function  has  the  form  192zn + 58855z12  H h4170z18  + 

40z19  (which  can  be  compared  to  80Z11  + 7851z12  + ■ ■ ■ + 441z18  + 18z19  for  kernels). 

(c)  Proceeding  as  in  answer  239,  we  can  determine  the  sets  of  vertices  dk  that  are 

dominated  by  subsets  of  size  k = 1,  2,  3,  ... , because  dk+\  = dk  U dk.  Here  it’s  much 
faster  to  start  with  dk  = p n h instead  of  d\  = h,  even  though 
Z(pflh)  = 313  while  Z(h ) = 213,  because  we  aren’t  interested  in 
details  about  the  small-cardinality  members  of  dk . Using  the  fact 
that  the  generating  function  for  d7  is b61z42+z43,  one  can  ver- 

ify that  the  illustrated  solution  is  unique.  (Total  cost  « 300  M/r.) 

241.  Let  g be  the  family  of  all  728  edges.  Then,  as  in  previous  exercises,  / = p \ g is 
the  family  of  independent  sets,  and  the  cliques  are  c = p ((((J„  ev)  § 2)  \ g).  We  have 
Z(g)  = 699,  Z(f)  = 20244,  Z(c)  = 1882. 
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(a)  Among  |/|  = 118969  independent  sets,  there  are  |/t|  = 10188  kernels,  with 
z(fr)  = 8577  and  generating  function  728a5+6912a6  + 2456a7-|-9228.  The  92  maximum 
independent  sets  are  the  famous  solutions  to  the  classic  8-queens  problem,  which  we 
shall  study  in  Section  7.2.2;  example  (Cl)  is  the  only  solution  with  no  three  queens  in  a 
straight  line,  as  noted  by  Sam  Loyd  in  the  Brooklyn  Daily  Eagle  (20  December  1896). 
The  728  = 91  x8  minimum  kernels  were  first,  listed  by  C.  F.  de  Jaenisch,  Traite  des  ap- 
plications de  1 analyse  math,  au  jeu  des  echecs  3 (1863),  255-259,  who  ascribed  them  to 
“Mr  de  R***.”  The  upper  left  queen  in  (CO)  can  be  replaced  by  king,  bishop,  or  pawn, 
still  dominating  every  open  square  [H.  E.  Dudeney,  The  Weekly  Dispatch  (3  Dec  1899)]. 
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(b)  Here  Z(cr)  = 866;  the  310  maximal  cliques  are  described  in  exercise  7-129. 

(c)  These  subsets  are  computationally  more  difficult:  The  ZDD  for  all  dominating 
sets  d has  Z(d)  = 12,663,505,  |d|  = 18,446,595,708,474,987,957;  the  minimal  ones  have 
Z(dl)  = 11,363,849,  Id4-]  = 28,281,838,  and  generating  function  4860a5  + 1075580a6  + 
14338028a7  + 11978518a8+873200z9-(-11616aln-|-36a11.  One  can  compute  the  ZDD  for  d 
in  1.5  Gp  by  Boolean  algebra,  and  then  the  ZDD  for  dl  in  another  680  Gp;  alternatively, 
the  “clever”  approach  of  answer  240  obtains  dl  in  775  Gp  without  computing  d.  The 
11-queen  arrangement  in  (C5)  is  the  only  such  minimal  dominating  set  that  is  confined 
to  three  rows.  H.  E.  Dudeney  presented  (C4),  the  only  5-queen  solution  that  avoids  the 
central  diamond,  in  Tit  Bits  (1  Jan  1898),  257.  The  set  of  all  4860  minimum  solutions 
was  first  enumerated  by  K.  von  Szily  [Deutsche  Schachzeitung  57  (1902),  199];  his  com- 
plete list  appears  in  W.  Ahrens,  Math.  Unterh  alt  ungen  und  Spiele  1 (1910),  313-318. 

(d)  Here  it  suffices  to  compute  (c  n d)1  instead  of  c n (d4),  if  we  don’t  already 
know  d4,  because  c Dp  = c.  We  have  Z(cf~)d4)  = 342  and  jcDd4]  = 92,  with  generating 
function  20a5  + 56a6  + 16a7.  Once  again,  Dudeney  was  first  to  discover  all  20  of  the 
5-queen  solutions  [The  Weekly  Dispatch  (30  July  1899)]. 

(e)  We  have  Z(f  U /)  = 91,780,989  at  a cost  of  24  Gp;  then  Z((f  U /)r)  = 
11,808,436  after  another  290  Gp.  There  are  27,567,390  maximal  induced  bipartite  sub- 
graphs, with  generating  function  109894a10 -(-2561492a11  + 13833474a12 +9162232a13 + 
1799264a14  + 99408a15-|-1626216.  Any  8 independent 
queens  can  be  combined  with  their  mirror  reflection 
to  obtain  a 16-queen  solution,  as  (Cl)  yields  (C9). 

But  the  disjoint  union  of  minimum  kernels  is  not  al- 
ways a maximal  induced  bipartite  subgraph;  for  ex- 
ample, consider  the  union  of  (CO)  with  its  reflection: 
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Parts  (a),  (b),  (d),  and  possibly  (c)  can  be  solved  just  as  well  without  the  use 
of  ZDDs;  see,  for  example,  exercise  7.1.3-132  for  (a)  and  (b).  But  the  ZDD  approach 
seems  best  for  (e).  And  the  computation  of  all  the  maximal  tripartite  subgraphs  of  Qs 
may  be  beyond  the  reach  of  any  feasible  algorithm. 

[In  larger  queen  graphs  Qn,  the  smallest  kernels  and  the  minimum  dominating 
sets  are  each  known  to  have  sizes  either  \n/ 2]  or  \n/ 2]  + 1 for  12  < n < 120.  See 
P.  R.  J.  Ostergard  and  W.  D.  Weakley,  Electronic  J.  Combinatorics  8 (2001),  #R29; 
D.  Finozhenok  and  W.  D.  Weakley,  Australasian  J.  Combinatorics  37  (2007),  295-300. 
The  largest  minimal  dominating  sets  have  been  investigated  by  A.  P.  Burger,  E.  J. 
Cockayne,  and  C.  M.  Mynhardt,  Discrete  Mathematics  163  (1997),  47-66.] 

242.  These  are  the  kernels  of  an  interesting  3-regular  hypergraph  with  1544  edges.  Its 
4.113,975,079  independent  subsets  / (that  is,  its  subsets  with  no  three  collinear  points) 
have  Z(f)  = 52,322,105,  computable  with  about  12  gigamems  using  family  algebra 
as  in  answer  236(e).  Another  575  Gp  will  compute  the  kernels  /4,  for  which  we  have 
Z(f^)  = 31,438,750  and  |/^|  = 66,509,584;  the  generating  function  is  228z8  + 8240z9  + 
728956z10  + 9888900z“  + 32215908z12  + 20739920z13  + 2853164z14  + 73888zls  + 380z16 . 
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[The  problem  of  finding  an  independent  set  of  size  16  was  first  posed  by  H.  E.  Dudeney 
in  The  Weekly  Dispatch  (29  Apr  1900  and  13  May  1900),  where  he  gave  the  leftmost 
pattern  shown  above.  Later,  in  the  London  Tribune  (7  Nov  1906),  Dudeney  asked 
puzzlists  to  find  the  second  pattern,  which  has  two  points  in  the  center.  The  full  set  of 
maximum  kernels,  including  57  that  are  distinct  under  symmetry,  was  found  by  M.  A. 
Adena,  D.  A.  Holton,  and  P.  A.  Kelly,  Lecture  Notes  in  Math.  403  (1974),  6-17,  who 
also  noted  the  existence  of  an  8-point  kernel.  The  middle  pattern  above  is  the  only  such 
kernel  with  all  points  in  the  central  4x4.  The  other  two  patterns  yield  kernels  that 
have  respectively  (8, 8, 10, 10, 12, 12, 12)  points  in  n x n grids  for  n = (8, 9, ... , 14);  they 
were  found  by  S.  Ainley  and  described  in  a letter  to  Martin  Gardner,  27  Oct  1976.] 

243.  (a)  This  result  is  readily  verified  even  for  infinite  sets.  (Notice  that,  as  a Boolean 
function,  fn  is  the  least  Horn  function  that  is  D /,  by  Theorem  7.1. 1H.) 

(b)  We  could  form  f<'2'>  = /n/,  then  /<4^  = / ® fl/^2*,  . . . , until  y (2fc+1 ) = f(2k\ 
using  exercise  205.  But  it’s  faster  to  devise  a recurrence  that  goes  to  the  limit  all  at 
once.  If  / = fo  U (ei  U /i)  we  have  fn  = f U (ex  U fP),  where  /'  = /0n  U (/0n  n fP). 
[An  alternative  formula  is  /'  = (/0  U /i)n  \ (fP  S /0);  see  S.  Minato  and  H.  Arimura, 
Transactions  of  the  Japanese  Society  for  Artificial  Intelligence  22  (2007),  165-172.] 

(c)  With  the  first  suggestion  of  (b),  the  computation  of  F^2\  F^4\  and  F^  — 
F(4)  costs  about  (610  + 450  + 460)  megamems.  In  this  example  it  turns  out  that 
F(4)  = F(3),  and  that  just  three  patterns  belong  to  F^  \ F^2\  namely  c***f,  *k*t*, 
and  ***sp.  (The  words  that  match  ***sp  are  clasp,  crisp,  and  grasp.)  A direct 
computation  of  Fn  using  the  recurrence  based  on  fP  n fP  costs  only  320  Mp;  and 
in  this  example  the  alternative  recurrence  based  on  (/0  U /i)n  costs  470  Mp.  The 
generating  function  is  1 + 124z  + 2782 z2  + 7753z3  + 4820z4  + 5757z5. 

244.  To  convert  Fig.  22  from  a BDD  to  a ZDD,  we  add  appropriate  nodes  with  LO  = HI 
where  links  jump  levels,  obtaining  the  z-profile  (1,  2,  2,  4,  4,  5,  5,  5,  5,  5,  2,  2,  2). 
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To  convert  it  from  a ZDD  to  a BDD,  we  add  nodes  in  the  same  places,  but  with 
HI  = [T|,  obtaining  the  profile  (1,  2,  2,  4,  4,  5,  5,  5,  5,  5,  2,  2,  2).  (In  fact,  the 
connectedness  function  and  the  spanning  tree  function  are  ^-transforms  of  each  other; 
see  exercise  192.) 

245.  See  exercise  7.1.1  26.  (It  should  be  interesting  to  compare  the  performance  of 
the  Fredman-Khachiyan  algorithm  in  exercise  7.1.1-27  with  the  ZDD-based  algorithm 
MINHIT  in  answer  237,  on  a variety  of  different  functions.) 

246.  If  a nonconstant  function  doesn’t  depend  on  xi,  we  can  replace  xi  in  the  formulas 
by  xv,  as  in  (50).  Let  P and  Q be  the  prime  implicants  of  functions  p and  q.  (For 
example,  if  P = e2  U (e3  LI  e 4)  then  p = x 2 V (3:3  A 14)-)  By  (137)  and  induction  on  |/|, 
the  function  / described  in  the  theorem  is  sweet  if  and  only  if  p and  q are  sweet  and 
PI(/o)  H PI(/i)  = 0.  The  latter  equality  holds  if  and  only  if  p C q. 

247.  We  can  characterize  them  with  BDDs  as  in  (49)  and  exercise  75;  but  this  time 

CT„(xi,.  . . ,X2»)  = crn-l(xi,  . . . ,X2"-l)  A 

/ f 2kl 

((x2  A • ••  A x2»)  v [an-i(x2, . . . ,x2n)  A /\  (x2j+i  V \J x2i+2)  ))  • 

3=0  iCj 

The  answers  \an\  for  0 < n < 7 are  (2,  3,  6,  18,  106,  2102,  456774,  7108935325).  (This 
computation  builds  a BDD  of  size  B(a7)  = 7,701,683,  using  about  900  megamems  and 
725  megabytes  altogether.) 

248.  False;  for  example,  ( x\  V x 2 ) A (x2  V X3)  isn’t  sweet.  (But  the  conjunction  is  sweet 
if  / and  g depend  on  disjoint  sets  of  variables,  or  if  xi  is  the  only  variable  on  which 
they  both  depend.) 

249.  (Solution  by  Shaddin  Dughmi  and  Ian  Post.)  A nonzero  monotone  Boolean 
function  is  ultrasweet  if  and  only  if  its  prime  implicants  are  the  bases  of  a matroid;  see 
Section  7.6.1.  By  extending  answer  247  we  can  determine  the  number  of  ultrasweet 
functions  f(x  1, . . . , xn)  for  0 < n < 7:  (2,  3,  6,  17,  69,  407,  3808,  75165). 

250.  Exhaustive  analysis  shows  that  ave  B(f)  = 76726/7581  « 10.1;  ave  Z(PI(/))  = 
71513/7581  « 9.4;  Pr(Z(PI(/))  > B{f))  = 151/7581  « .02;  and  max  Z(PI(f))/B(f)  = 
8/7  occurs  uniquely  when  / is  (xiAa:4)  V (xiAx5)  V (x2Ax3Ax4)  V (x2Ax5). 

251.  More  strongly,  could  it  be  that  limsup  ^(Pl(/))/B(/)  = 1? 

252.  The  ZDD  should  describe  all  words  on  {e1,e'1).. . ,en,e'n}  that  have  exactly  j 

unprimed  letters  and  k — j primed  letters,  and  no  occurrences  of  both  e;  and  e)  in  the 
same  word,  for  some  set  of  pairs  (j,  k).  For  example,  if  n = 9 and  f(x)  = vvx,  where 
v = 110111011,  the  pairs  are  (0,8),  (3,6),  and  (8,8).  Regardless  of  the  set  of  pairs,  the 
z-profile  elements  will  all  be  0(n2),  hence  Z(Pl(f))  = 0(n3).  (We  order  the  variables 
so  that  Xi  and  x\  are  adjacent.)  And  f(x)  = 5Ln/3J |2n/3J  (x)  has  Z(pl(f ))  = ^{n3)- 

253.  Let  I(/)  be  the  family  of  all  implicants  of  /;  then  PI(/)  = I (f)1.  The  formula 
!(/)  = l(/o  A /1)  U (e'i  Ul(/o))  U (eiUIj/i))  is  easy  to  verify.  Thus  I (/)'*'  = A U (ej  U 
(PI(/o)  N A))  U (ex  U (Pl(/i)  N A)),  as  in  exercise  237.  But  PI(/o)  \ A = PI(/o)  \ A, 
since  A C I(/). 

[This  recurrence  for  prime  implicants  is  due  to  O.  Coudert  and  J.  C.  Madre, 
ACM/IEEE  Design  Automation  Conf.  29  (1992),  36-39.  Partial  results  had  previously 
been  formulated  by  B.  Reusch,  IEEE  Trans.  C-24  (1975),  924-930.] 
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254.  By  (53)  and  (137),  we  need  to  show  that  Pl(gh)  \ Pl{fh  U gi)  = (PI(gh)\PI(ffl))  \ 
(F>I(/n)\Pl(/0)-  But  both  of  these  are  equal  to  PI(gh)  \ (PI  (fh)  U PI  (s'/)),  because 
fl  ^ fh  C gh  and  ft  C g,  C gh. 

[This  recurrence  produces  a ZDD  directly  from  the  BDDs  for  / and  g,  and  it 
yields  PI(g)  when  / = 0.  Thus  it  is  easier  to  implement  than  (137),  which  requires  also 
the  set-difference  operator  on  ZDDs.  And  it  sometimes  runs  much  faster  in  practice.] 

255.  (a)  A typical  item  a like  e2  U e$  U e6  has  a very  simple  ZDD.  We  can  readily 
devise  a BUMP  routine  that  sets  g 4—  g ® a and  returns  [a  € g],  given  ZDDs  g and  a. 

To  insert  a into  the  multifamily  /,  start  with  k 4—  c 4—  0;  then  while  c = 0,  set 
c 4—  BUMP(/fc)  and  k 4—  k + 1.  To  delete  a,  assuming  that  it  is  present,  start  with 
k 4—  0 and  cf-  1;  while  c = 1,  set  c 4—  BUMP(/t,)  and  k 4—  k + 1. 

(b)  Suppose  fk  and  gk  are  0 for  k > m.  Set  k 4-  0 and  t 4-  0 (the  ZDD  [T]). 
While  k < m,  set  hk  4-  fk  © gk  © t and  t 4—  ( fkgkt ).  Finally  set  hm  4 - t. 

[This  representation  and  its  insertion  algorithm  are  due  to  S.  Minato  and  H.  Ari- 
mura,  Pioc.  Workshop,  Web  Information  Retrieval  and  Integration  (IEEE,  2005),  4-11.] 

256.  (a)  Reflect  the  binary  representation  from  left  to  right,  and  append  0s  until  the 

number  of  bits  is  2n  for  some  n.  The  result  is  the  truth  table  of  the  corresponding 
Boolean  function  f(x  1, . . . ,xn),  with  xk  corresponding  to  22"  6 U.  When  x = 41, 

for  example,  10010100  is  the  truth  table  of  (x1Ax2/\x3)  V (xiAx2Ax3)  V (iiAx2As3). 

(b)  If  x < 22  , we  have  Z(x)  <Un  = 0(2n/n),  by  (79)  and  exercise  192. 

(c)  There’s  a simple  recursive  routine  ADD(i,y,  c),  which  takes  a “carry  bit”  c 
and  pointers  to  the  ZDDs  for  x and  y and  returns  a pointer  to  the  ZDD  for  x + y + c. 
This  routine  is  invoked  at  most  4 Z(x)Z(y)  times. 

(d)  We  cannot  claim  that  Z(x  — y)  = 0(Z(x)Z(y)),  because  Z(x  — y)  = n + 1 
and  Z(x)  = 3 and  Z(y)  = 1 when  x = 22  and  y = 1.  But  by  computing  x — y = 
(x  + 1 + ((22  — 1)  © y))  — 22  when  y < x < 22  , we  can  show  that  Z(x  — y)  = 
0(Z(x)  Z(y)  log  log  x).  (See  the  ZDD  nodes  tj  in  answer  201.)  So  the  answer  is  “yes.” 

(e)  No.  For  example,  if  x = (222  +k  - 1)/ (22^  — 1),  we  have  Z(x)  = 2k  + 1 but 
Z(x2)  = 3 • (22  — 1)  = U2k+k+1  — 2,  where  U2k+k+1  is  the  largest  possible  ZDD  size 
for  numbers  with  lglg*2  < 2k  + k + 1 (see  part  (b)). 

[This  exercise  was  inspired  by  Jean  Vuillemin,  who  began  to  experiment  with 
such  sparse  integers  about  1993.  Unfortunately  the  numbers  that  are  of  greatest  im- 
portance in  combinatorial  calculations,  such  as  Fibonacci  numbers,  factorials,  binomial 
coefficients,  etc.,  rarely  turn  out  to  be  sparse  in  practice.] 

257.  See  Proc.  Europ.  Design  and  Test  Conf.  (IEEE,  1995),  449-454.  With  signed  coef- 
ficients one  can  use  { — 2, 4,  —8, . . . } instead  of  {2, 4, 8, . . . },  as  in  negabinary  arithmetic. 

[In  the  special  case  where  the  degree  is  at  most  1 in  each  variable  and  where 
addition  is  done  modulo  2,  the  polynomials  of  this  exercise  are  equivalent  to  the 
multilinear  representations  of  Boolean  functions  (see  7.1.1-(ig)),  and  the  ZDDs  are 
equivalent  to  “binary  moment  diagrams”  (BMDs).  See  R.  E.  Bryant  and  Y.-A.  Chen, 
ACM/IEEE  Design  Automation  Conf.  32  (1995),  535-541.] 

258.  If  n is  odd,  the  BDD  must  depend  on  all  its  variables,  and  there  must  be  at  least 
|4gn]  of  them.  Thus  B(f)  > [lgn]+2  when  n > 1,  and  the  skinny  functions  of  exercise 
170(c)  achieve  this  bound.  If  n is  even,  add  an  unused  variable  to  the  solution  for  n/2. 

The  ZDD  question  is  easily  seen  to  be  equivalent  to  finding  a shortest  addition 
chain,  as  in  Section  4.6.3.  Thus  the  smallest  Z(f)  for  |/|  = n is  l(n)  + 1,  including  [T] . 
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259.  The  theory  of  nested  parentheses  (see,  for  example,  exercise  2.2. 1-3)  tells  us  that 
Nn(x)  = 1 if  and  only  if  xi  + ■ ■ • + xk  > x\  + • • ■ + xk  for  0 < k < 2n,  with  equality 

when  k = 2 n.  Equivalently,  k - n < x i + \-  xk  < k/2  for  0 < k < 2n.  So  the 

BDD  for  Nn  is  rather  like  the  BDD  for  Sn(x),  but  simpler;  in  fact,  the  profile  elements 
are  bk  = [k/2\  + 1 for  0 < k < n and  bk  = n + 1 - \k/2\  for  n < k < 2 n.  Hence 
B(Nn)  = bo  + • • • + &2n-i  + 2 = (nJ2)  + 1-  The  z-profile  has  zk  = bk  — [k  even]  for 
0 < k < 2 n,  because  of  HI  branches  to  [X]  on  even  levels;  hence  Z(Nn)  = B(N„)  — n. 

[An  interesting  BDD  base  for  the  n + 1 Boolean  functions  that  correspond  to  Cnn, 
C(n-i)(n+i)>  • • • , Co(2n)  in  7.2.1.6-(2i)  can  be  constructed  by  analogy  with  exercise  49.] 

260.  (a, b)  Arrange  the  variables  xnt0,  xn,i,  . ..,  x„_i>0,  ...,  aq.o,  from  top 

to  bottom.  Then  the  HI  branch  from  the  ZDD  root  of  Rn  is  the  ZDD  root  of  R„-\. 
(This  ordering  actually  turns  out  to  minimize  Z(Rn)  for  n < 6,  probably  also  for 
all  n.)  The  z-profile  is  1,  . . . , 1;  n - 2,  . . . , 2,  1,  1;  n - 3,  . . . , 2,  1,  1;  . . . ; hence 
Z(R„)  = (”)  + 2n  + 1 |n3  and  .Z(-Rioo)  = 161,901.  The  ordinary  profile  is  1,  2,  2, 

3,  4,  . . . , n — 1;  n — 1,  2n  — 4,  2n  — 5,  . . . , n — 1;  n — 2,  2n  — 6,  . . . , n — 2;  . . . ; altogether 
B(Rn)  = 3(”)  + + 3 for  n > 2,  and  B(Ri0o)  = 490,153. 

[See  I.  Semba  and  S.  Yajima,  Trans.  Inf.  Proc.  Soc.  Japan  35  (1994),  1666-1667. 
Incidentally,  the  method  of  exercise  7.2.1.5-26  leads  to  a ZDD  for  set  partitions  that  has 
only  (2)  variables  and  (!])  + 1 nodes.  But  the  connection  between  that  representation 
and  the  partitions  themselves  is  less  direct,  thus  harder  to  restrict  in  a natural  way.] 
(c)  Now  there  are  573  variables  instead  of  5050  when  n = 100;  the  number  of 
variables  in  general  is  nl  - 2l  + 1,  where  l = [lgn],  by  Eq.  5.3.1-(3).  We  examine  the 
bits  of  an,  a„_i,  . . . , with  the  most  significant  bit  first.  Then  B(R'100)  = 31,861,  and 
one  can  show  that  B(R'n)  = (![ )l  - \ Al  - |2'  - i/(n  - 1)  + 1 + § for  n > 2.  The  ZDD  size 
is  more  complicated,  and  appears  to  be  roughly  60%  larger;  we  have  Z(R'100)  = 50,154. 

261.  Given  a Boolean  function  f(x i,...,xn),  the  set  of  all  binary  strings  x\...xn 
such  that  f(xi, . . . ,xn)  = 1 is  a finite  language,  so  it  is  regular.  The  minimum-state 
deterministic  automaton  A for  this  language  is  the  QDD  for  /.  (In  general,  when  L is 
regular,  the  state  of  A after  reading  x\  . . . xk  accepts  the  language  {a  \ xx  . . . xka  € L}.) 

[The  quoted  theorem  was  discovered  in  a more  general  context  by  D.  A.  Huffman, 
Journal  of  the  Franklin  Institute  257  (1954),  161-190,  and  independently  by  E.  F. 
Moore,  Annals  of  Mathematics  Studies  34  (1956),  129-153.] 

An  interesting  example  of  the  connection  between  this  theory  and  the  theory  of 
BDDs  can  be  found  in  early  work  by  Yuri  Breitbart  that  is  summarized  in  Doklady 
Akad.  Nauk  SSSR  180  (1968),  1053-1055.  Lemma  7 of  Breitbart’s  paper  states,  in  es- 
sence, that  Bmin(ip)  = Q(2n/4),  where  ip  is  the  function  of  2 n variables  x = (aq, . . . ,xn) 
and  y = (y\,,..,yn)  defined  by  ip(x,y)  = xvy  ® yvx,  with  the  understanding  that 
xo  = Vo  = 0.  (Notice  that  ip  is  sort  of  a “two-sided”  hidden  weighted  bit  function.) 

262.  (a)  If  a denotes  the  function  or  subfunction  /,  we  can  for  example  let  C(a)  = 
a ® 1 denote  /,  assuming  that  each  node  occupies  an  even  number  of  bytes.  Then 
C(C(a))  = a,  and  a link  to  a denotes  a nonnormal  function  if  and  only  if  a is  odd; 
a & —2  always  points  to  a node,  which  always  represents  a normal  function. 

The  LO  pointer  of  every  node  is  even,  because  a normal  function  remains  normal 
when  we  replace  any  variable  by  0.  But  the  HI  pointer  of  any  node  might  be  comple- 
mented, and  an  external  root  pointer  to  any  function  of  a normalized  BDD  base  might 
also  be  complemented.  Notice  that  the  [T]  sink  is  now  impossible. 

(b)  Uniqueness  is  obvious  because  of  the  relation  to  truth  tables:  A bead  is  either 
normal  (i.e.,  begins  with  0)  or  the  complement  of  a normal  bead. 
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(c)  In  diagrams,  each  complement  link  is  conveniently  indicated  by  a dot: 


1332323333232331 


(d)  There  are  22’"  -1  — 22’"  1-1  normal  beads  of  order  m.  The  worst  case,  B°(f)  < 

B°(fn ) = 1 + min(2fc,  22"  A-i—  22”  1 1_1)  = (f7„+i  — l)/2,  occurs  with  the 

functions  of  answer  110.  For  the  average  normalized  profile,  change  22"_fc  — 1 in  (8o) 
to  22  — 2,  and  divide  the  whole  formula  by  2;  again  the  average  case  is  very  close 

to  the  worst  case.  For  example,  instead  of  (8i)  we  have 

(1.0,  2.0,  4.0,  8.0,  16.0,  32.0,  64.0,  127.3,  103.9,  6.0,  1.0,  1.0). 

(e)  We  save  [T],  one  (e),  two  (fT)s,  and  three  (4)s,  leaving  45  normalized  nodes. 

(f)  It’s  probably  best  to  have  subroutines  AND,  OR,  BUTNOT  for  the  case  where 
/ and  g are  known  to  be  normal,  together  with  a subroutine  GAND  for  the  general  case. 
The  routine  GAND  (/,<?)  returns  AND (f,g)  if  / and  g are  even,  BUTNOT(/,  C(g))  if  / 
is  even  but  g is  odd,  BUTNOT (g,C(f))  if  g is  even  but  / is  odd,  C(OR(C(/),  C(g)))  if 
/ and  g are  odd.  The  routine  AND (f,g)  is  like  (55)  except  that  ru  <—  GAND (/h,  <?/,); 
only  the  cases  / = 0,  g = 0,  and  / = g need  be  tested  as  “obvious”  values. 

Notes:  Complement  links  were  proposed  by  S.  Akers  in  1978,  and  independently 
by  J.  P.  Billon  in  1987.  Although  such  links  are  used  by  all  the  major  BDD  packages, 
they  are  hard  to  recommend  because  the  computer  programs  become  much  more 
complicated.  The  memory  saving  is  usually  negligible,  and  never  better  than  a factor 
of  2;  furthermore,  the  author’s  experiments  show  little  gain  in  running  time. 

With  ZDDs  instead  of  BDDs,  a “normal  family”  of  functions  is  a family  that 
doesn’t  contain  the  empty  set.  Shin-ichi  Minato  has  suggested  using  C(a)  to  denote 
the  family  / © e,  instead  of  /,  in  ZDD  work. 

263.  (a)  If  Hx  = 0 and  x / 0,  we  can’t  have  ux  = 1 or  2 because  the  columns  of  H 
are  nonzero  and  distinct.  [R.  W.  Hamming,  Bell  System  Tech.  J.  29  (1950),  147-160.] 

(b)  Let  rk  be  the  rank  of  the  first  k columns  of  H , and  Sk  the  rank  of  the  last  k 
columns.  Then  bk  = 2rk+Sn~k~rn  for  0 < k < n,  because  this  is  the  number  of  elements 
in  the  intersection  of  the  vector  spaces  spanned  by  the  first  k and  last  n — k columns. 
In  the  Hamming  code,  r*,  = 1 + AA  and  Sk  = min(m,  2 + A (k  — 1))  for  k > 1;  so  we  find 
B(f)  = (n2  + 5)/2.  [See  G.  D.  Forney,  Jr.,  IEEE  Trans.  IT-34  (1988),  1184-1187.] 

(c)  Let  qk  = 1 -pk-  Maximizing  n*=i  p^k=Vk^q^k*Vk^  is  the  same  as  maximizing 
ELi  WkXk,  where  wu  = (2 yk  — 1)  log (pk/qk),  so  we  can  use  Algorithm  B. 

Notes:  Coding  theorists,  beginning  with  unpublished  work  of  Forney  in  1967, 
have  developed  the  idea  of  a code’s  so-called  trellis.  In  the  binary  case,  the  trellis  is  the 
same  as  the  QDD  for  /,  but  with  all  nodes  for  the  constant  subfunction  0 eliminated. 
(Useful  codes  have  distance  > 1;  then  the  trellis  is  also  the  BDD  for  /,  but  with  [T] 
eliminated.)  Forney’s  original  motivation  was  to  show  that  the  decoding  algorithm  of 
A.  Viterbi  [IEEE  Trans.  IT-13  (1967),  260-269]  is  optimum  for  convolutional  codes. 
A few  years  later,  L.  R.  Bahl,  J.  Cocke,  F.  Jelinek,  and  J.  Raviv  [IEEE  Trans.  IT-20 
(1974),  284-287]  extended  trellis  structure  to  linear  block  codes  and  presented  further 
optimization  algorithms.  See  also  the  papers  of  G.  B.  Horn  and  F.  R.  Kschischang 
[IEEE  Trans.  IT-42  (1996),  2042-2048];  J.  Lafferty  and  A.  Vardy  [IEEE  IVans.  C-48 
(1999),  971-986]. 
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264.  Procedures  that  combine  the  “bott.om-up”  methods  of  Algorithm  B with  “top- 
down”  methods  that  optimize  over  predecessors  of  a node  might  be  more  efficient  than 
methods  that  go  strictly  in  one  direction. 

265.  Compute  counts  Cj  bottom-up  as  in  Algorithm  C,  using  n-bit  arithmetic.  Then 
proceed  top-down,  by  starting  with  k <—  s — 1,  j «—  1,  m «—  m — 1,  and  repeating 
the  following  steps  (during  which  we  will  have  0 < m < 2 Vk~]Ck)'.  If  Vk  > j,  set 
Xj  <—  \m/2Vk~i~1Ck\ , m m mod  2 Vk~-'~1Ck,  j <—  j + 1;  otherwise  if  k = 1,  terminate; 
otherwise  set  l <-  Ik,  h 4-  hk,  and  if  m < 2v‘~Vk~1ci  set  Xj  4—  0,  k 4—  l,  j 4—  j + 1; 
otherwise  set  Xj  4—1 , m 4—  m — 2 v‘~Vk~1ci,  k 4—  h,  j 4—  j + 1. 

266.  In  fact,  the  ZDD  is  obtained  directly  from  the  standard  “left  child,  right  sibling” 
binary  tree  for  F (see  7.2.1.6-(4))  if  we  use  the  left  child  link  for  HI  and  the  right  sibling 
link  for  LO;  null  links  are  changed  to  point  to  |T] , except  that  the  LO  link  of  the  root 
of  the  rightmost  tree  (the  final  node  in  postorder)  should  point  to  [T] . 

267.  The  ZDD  size  of  d(F)  can  be  computed  as  follows,  using  an  auxiliary  function 
C(T)  defined  recursively  on  trees:  If  |T|  = 1 (that  is,  if  T has  only  one  node),  £(T)  = 1. 
Otherwise  T consists  of  a root  together  with  k > 1 subtrees,  T\,  . ...  rI\,  and  we  define 
C (T)  = 1 + C(7i)  + ■ • ■ + C(7fc)  + |T|  — |Tfc|.  Then  if  F consists  of  k > 1 trees  T),  . . . , 
Tfc,  we  have  Z(d(F))  = 1 + £(T\)  + • ■ ■ + C(7fc)  + [|7fc|  = l]. 

The  minimum  size,  n,  clearly  occurs  when  F consists  of  n one-node  trees.  The 
maximum  size,  [n2/ 4J  + 2n  + 1,  occurs  for  n = 2m  — 1 in  a tree  for  which  node  k has 
two  children,  k + 1 and  k + m for  1 < k < m;  the  case  n = 2m  is  similar. 

For  the  average  size,  consider  the  generating  function  Z(w,z)  = ^ , 

summed  over  all  trees  T.  The  definition  of  £ yields  the  functional  equation  Z(w,z ) = 
wz+w2zZ(w,  z)/(l—Z(w,  wz)).  Differentiation  with  respect  to  w and  to  z,  then  setting 
w = 1,  tells  us  that  Z(  1,  z)  = (1  — s)/2,  Zw(  1,  z)  = z/s+z/s2 , and  Zz(  1,  z)  = 1/s,  where 
s = y/1  — 4z.  The  generating  function  ^2Fwz(-d^F^ z^F\  summed  over  all  nonempty 
forests  F,  is  ( wZ(w,z ) + w3z  — w2z)/{  1 — Z(w,z)).  Differentiating  with  respect  to  w 
and  setting  w 4-  1,  we  obtain  z/(  1 — 4 z)  -\-2z/ \J\  — 4 z;  hence  the  average  of  Z(d(F))  is 

(4n_1  + 2nCn-i) /Cn  = \^n3/2  + f + 0(n1/2), 
where  Cn  is  the  Catalan  number  7. 2.1. 6 (15). 

SECTION  7.2. 1.1 

1.  Let  mj  = Uj  — lj  + 1,  and  visit  (01  + h, . . . , an  +l„ ) instead  of  visiting  (ai, . . . , an) 
in  Algorithm  M.  Or,  change  la,j  ■<—  0’  to  la,j  •<—  lj ’ and  Laj  = mj  — 1’  to  ‘dj  = Uj1  in 
that  algorithm,  and  set  lo  0,  uo  <—  1 in  step  Ml. 

2.  (0,0, 1,2, 3, 0,2, 7,  0,9). 

3.  Step  M4  is  performed  mi  m2  . . . mr  times  when  j = k:  therefore  the  total  is 
J2k= 0 n-=i  mj  = mi. . .m„(  1 + l/m„  + l/m„m„_i  + ■ • • + l/mn  . . . mi).  If  all  mj 
are  2 or  more,  this  is  less  than  2mi  . . . mn.  [Thus,  we  should  keep  in  mind  that  fancy 
Gray-code  methods,  which  change  only  one  digit  per  visit,  actually  reduce  the  total 
number  of  digit  changes  by  at  most  a factor  of  2.] 

4.  Nl.  [Initialize.]  Set  a,j  <r-  mj  — 1 for  0 < j < n,  where  mo  = 2. 

N2.  [Visit.]  Visit  the  n-tuple  (aj, . . . ,o„). 

N3.  [Prepare  to  subtract  one.]  Set  j t—  n. 

N4.  [Borrow  if  necessary.]  If  aj  = 0,  set  aj  <—  mj  — 1,  j «—  j — 1,  and  repeat  this 
step. 
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N5.  [Decrease,  unless  done.]  If  j = 0,  terminate  the  algorithm.  Otherwise  set 
a,j  «—  a,j  — 1 and  go  back  to  step  N2.  | 

5.  Bit  reflection  is  easy  on  a machine  like  MMIX,  but  on  other  computers  we  can 
proceed  as  follows: 

Zl.  [Initialize.]  Set  j <—  k <—  0. 

Z2.  [Swap.]  Interchange  A[j  + 1]  <->  A[k  + 2n_1].  Also,  if  j < k,  interchange 
A[j]  -H-  A[fc]  and  A[j  + 2"_1  + 1]  -H-  A[k  + 2n_1  + 1], 

Z3.  [Advance  k.]  Set  k <—  k + 2,  and  terminate  if  k > 2n_1. 

Z4.  [Advance  j.]  Set  h «—  2n~2.  If  j > h,  repeatedly  set  j <—  j — h and  h <—  h/2 
until  j < h.  Then  set  j j + h.  (Now  j = (b0  . . . bn-i)2  if  k = {bn-i  . . . 60)2-) 
Return  to  Z2.  | 

6.  If  g((0bn-i  . . .6160)2)  = (0(6„_!)  . . . (62©&i)(61®60))2  then  g((lbn-i  . . .6160)2)  = 
2"  + g((06„_i  . . .6160)2)  = (l(6n_i)  . . . (62  © 61) (61  © 60))2,  where  6 = 6 © 1. 

7.  To  accommodate  2 r sectors  one  can  use  g(k)  for  2n  — r < k < 2"  + r,  where 
n = [lgr],  because  g(2n  - r)  © g( 2n  + r-  1)  = 2"  by  (5).  [G.  C.  Tootill,  Proc.  IEE 
103,  Part  B Supplement  (1956),  434.]  See  also  exercise  26. 

8.  Use  Algorithm  G with  n <—  n — 1 and  include  the  parity  bit  a 00  at  the  right.  (This 
yields  #(0),  jr(2),  g( 4),  . . . .) 

9.  Replace  the  rightmost  ring,  since  1,(101 1000)  is  odd. 

10.  An  + Bn  = gl-1]( T - 1)  = [2’1+1/3]  and  An  = Bn+n.  Hence  An  = [2n/3  + n/2j 
and  Bn  = [2n/3  — n/2] . 

Historical  notes:  The  early  Japanese  mathematician  Yoriyuki  Arima  (1714-1783) 
treated  this  problem  in  his  Shuki  Sanpo  (1769),  Problem  44,  observing  that  the  ra- 
ring puzzle  reduces  to  an  (ra  — l)-ring  puzzle  after  a certain  number  of  steps.  Let 
Cn  = An  — 1 = Bn  — _B„_i  + 1 be  the  number  of  rings  removed  during  this 

reduction.  Arima  noticed  that  Cn  — 2Cn-i  — [ra  even];  thus  he  could  compute  An  = 
Ci  + C2  + • • • + Cn  for  ra  = 9 without  actually  knowing  the  formula  C„  = [2n_1/3] . 

More  than  two  centuries  earlier,  Cardano  had  already  mentioned  the  “complicati 
annuli”  in  his  De  Subtilitate  Libri  XXI  (Nuremberg:  1550),  Book  15.  He  wrote  that 
they  are  “useless  yet  admirably  subtle,”  stating  erroneously  that  95  moves  are  needed 
to  remove  seven  rings  and  95  more  to  put  them  back.  John  Wallis  devoted  seven 
pages  to  this  puzzle  in  the  Latin  edition  of  his  Algebra  2 (Oxford:  1693),  Chapter  111, 
presenting  detailed  but  nonoptimum  methods  for  the  nine-ring  case.  He  included  the 
operation  of  sliding  a ring  through  the  bar  as  well  as  putting  it  on  or  off,  and  he  hinted 
that  shortcuts  were  available,  but  he  did  not  attempt  to  find  a shortest  solution. 

11.  The  solution  to  Sn  = S„_2  + 1 + Sn_2  + Sn_i  when  Si  = S2  - 1 is  S„  = 
2n_1  — [ra  even].  [Math.  Quest.  Educational  Times  3 (1865),  66-67.] 

12.  (a)  The  theory  of  ra  — 1 Chinese  rings  proves  that  Gray  binary  code  yields  the 
compositions  in  a convenient  order  (4,  31,  211,  22,  112,  1111,  121,  13): 

Cl.  [Initialize.]  Set  t 4—  0,  j <—  1,  si  «—  ra.  (We  assume  that  ra  > 1.) 

C2.  [Visit.]  Visit  si  ...  Sj.  Then  set  t <—  1 — t,  and  go  to  C4  if  t = 0. 

C3.  [Odd  step.]  If  Sj  > 1,  set  Sj  <—  Sj  — 1,  Sj+i  «—  1,  j <—  j + 1;  otherwise  set 
j <—  j — 1 and  Sj  <—  Sj  + 1.  Return  to  C2. 

C4.  [Even  step.]  If  > 1,  set  Sj_i  e-  Sj-i  - 1,  sj+ 1 •(-  Sj,  Sj  <-  1,  j 4-  j + 1; 
otherwise  set  j <—  j — 1,  Sj  «—  Sj+i,  Sj- 1 «—  Sj-i  + 1 (but  terminate  if 
j — 1 = 0).  Return  to  C2.  | 
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(b)  Now  qi,  . . . , qt  — i represent  rings  on  the  bar: 

Bl.  [Initialize.]  Set  t t—  1,  <jo  4—  n.  (We  assume  that  n > 1.) 

B2.  [Visit.]  Set  qt  4—  0 and  visit  (<jo  — <?i)  • • • (qt-i  — qt )•  Go  to  B4  if  f is  even. 

B3.  [Odd  step.]  If  qt-i  = 1,  set  t 4—  t — 1;  otherwise  set  qt  4—  1 and  t 4—  t + 1. 

Return  to  step  B2. 

B4.  [Even  step.]  If  qt-2  = qt- i + 1,  set  qt- 2 4—  qt-i  and  t 4—  t — 1 (but  terminate 
if  t = 2);  otherwise  set  qt  4-  qt- 1,  qt- 1 4-  qt  + 1,  t 4-  t + 1.  Return  to  B2.  | 

These  algorithms  [see  J.  Misra,  ACM  Trans.  Math.  Software  1 (1975),  285]  are  loopless 
even  in  their  initialization  steps. 

13.  In  step  Cl,  also  set  C 4—  1.  In  step  C3,  set  C 4—  SjC  if  Sj  > 1,  otherwise 

C t—  C/ (sj-i  + 1).  In  step  C4,  set  C 4-  Sj-iC  if  Sj-i  > 1,  otherwise  C 4—  C /{sj- 2 + 1). 
Similar  modifications  apply  to  steps  Bl,  B3,  B4.  Sufficient  precision  is  needed  to 

accommodate  the  value  C = n!  for  the  composition  1 . . . 1;  we  are  stretching  the 

definition  of  looplessness  by  assuming  that  arithmetic  operations  take  unit  time. 

14.  VI.  [Initialize.]  Set  j 4—  0. 

V2.  [Visit.]  Visit  the  string  ai  . . . a3. 

V3.  [Lengthen.]  If  j < n,  set  j 4-  j + 1,  aj  4-  0,  and  return  to  V2. 

V4.  [Increase.]  If  aj  < rrij  — 1,  set  aj  4—  a,j  + 1 and  return  to  V2. 

V5.  [Shorten.]  Set  j 4-  j — 1,  and  return  to  V4  if  j > 0.  | 

15.  Jl.  [Initialize.]  Set  j 4—  0. 

J2.  [Even  visit.]  If  j is  even,  visit  the  string  ai  . . . aj. 

J3.  [Lengthen.]  If  j < n,  set  j 4r-  j + 1,  aj  4-  0,  and  return  to  J2. 

J4.  [Odd  visit.]  If  j is  odd,  visit  the  string  ai  . . .aj. 

J5.  [Increase.]  If  aj  < m3  - 1,  set  a,  <-  + I and  return  to  J2. 

J6.  [Shorten.]  Set  j t—  j — 1,  and  return  to  J4  if  j > 0.  | 

This  algorithm  is  loopless,  although  it  may  appear  at  first  glance  to  contain  loops;  at 
most  four  steps  separate  consecutive  visits.  The  basic  idea  is  related  to  exercise  2.3. 1-5 
and  to  “prepostorder”  traversal  (Algorithm  7.2.1.6Q). 

16.  Suppose  LINK(j  - 1)  = j + nbj  for  1 < j < n and  LINK(j  - 1 + n)  = j + n(l  - bj) 
for  1 < j < n.  These  links  represent  (01, . . . , an)  if  and  only  if  g(ax  . . . an)  = 61  . . . bn, 
so  we  can  use  a loopless  Gray  binary  generator  to  achieve  the  desired  result. 

17.  Put  the  concatenation  of  3-bit  codes  (g(j),  g(k))  in  row  j and  column  k,  for  0 < 
j • k < 8.  [It  is  not  difficult  to  prove  that  this  is  essentially  the  only  solution,  except  for 
permuting  and/or  complementing  coordinate  positions  and/or  rotating  rows,  because 
the  coordinate  that  changes  when  moving  north  or  south  depends  only  on  the  row, 
and  a similar  statement  applies  to  columns.  Karnaugh’s  isomorphism  between  the 
4-cube  and  the  4x4  torus  can  be  traced  back  to  The  Design  of  Switching  Circuits 
by  W.  Keister,  A.  E.  Ritchie,  and  S.  H.  Washburn  (1951),  page  174.  Incidentally, 
Keister  went  on  to  design  an  ingenious  variant  of  Chinese  rings  called  SpinOut,  and  a 
generalization  called  The  Hexadecimal  Puzzle,  U.S.  Patents  3637215-3637216  (1972).] 

18.  Use  2-bit  Gray  code  to  represent  the  digits  Uj  = (0, 1,  2,  3)  respectively  as  the  bit 
pairs  U2j-iu'2j  = (00,01,  II,  10).  [C.  Y.  Lee  introduced  his  metric  in  IRE  Trans.  IT-4 
(1958),  77-82.  A similar  m/2-bit  encoding  works  for  even  values  of  m;  for  example, 
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when  m = 8 we  can  represent  (0, 1,  2, 3, 4, 5, 6,  7)  by  (0000,0001,0011,0111, 1111, 1110, 
1100, 1000).  But  such  a scheme  leaves  out  some  of  the  binary  patterns  when  m > 4.] 

19.  (a)  A modular  Gray  quaternary  algorithm  needs  slightly  less  computation  than 
Algorithm  M,  but  it  doesn’t  matter  because  256  is  so  small.  The  result  is  z®  + zf  + 

4 + zf  + 14(44  + 44)  + 56z02i^223(4  + 4)(4  + 4)- 

(b)  Replacing  (20,21,22,23)  by  (l,z,z2,z)  gives  1 + 112z6  + 304  + 112z10  + z16; 
thus  all  of  the  nonzero  Lee  weights  are  > 6.  Now  use  the  construction  in  the  previous 
exercise  to  convert  each  (uo,  Hi,  112, 113,  «4,  Ms,  «6,  n<x>)  into  a 16-bit  number. 

20.  Recover  the  quaternary  vector  (u0,  mi,  m2,  m3,  u4,  u5,  m6,  Moo)  from  u',  and  use  Al- 
gorithm 4.6. ID  to  find  the  remainder  of  u0  + u\x  H b u6x6  divided  by  g(x),  mod  4; 

that  algorithm  can  be  used  in  spite  of  the  fact  that  the  coefficients  do  not  belong  to  a 
field,  because  g(x)  is  monic.  Express  the  remainder  as  xj  + 2xk  (modulo  g(x)  and  4), 
and  let  d = (k  - j ) mod  7,  s = (u0  b m6  + Moo)  mod  4. 

Case  1:  s = 1.  If  k = 00,  the  error  was  x3  (in  other  words,  the  correct  vector  has 

Uj  «—  (uj  - 1)  mod  4);  otherwise  there  were  three  or  more  errors. 

Case  2:  s = 3.  If  j = k the  error  was  — x3;  otherwise  > 3 errors  occurred. 

Case  3:  s = 0.  If  j = k = 00,  no  errors  were  made;  if  j = oc  and  k < 00, 
at  least  four  errors  were  made.  Otherwise  the  errors  were  xa  - xb,  where  a = (j  + 
(oo,6,5,2,3, 1,4,0))  mod  7 according  as  d = (0, 1, 2, 3, 4, 5,6, 00),  and  b = (j+2d)  mod  7. 

Case  4 • s = 2.  If  j = oc  the  errors  were  2xk . Otherwise  the  errors  were 

x 3 + x°°,  if  k = 00; 

— x3  — x°°,  if  d = 0; 

xa  + xb,  if  d e {1,2, 4},  a = (j  — 3d)  mod  7,  b = (j  — 2d)  mod  7; 

—xa  — xb,  if  d € {3, 5, 6},  a = (j  — 3d)  mod  7,  b = ( j — d)  mod  7. 

Given  u — (1100100100001111)2,  we  have  u = (2, 0, 3, 1, 0, 0,  2, 2)  and  2 + 3x2  + 
x3  + 2a:6  = 1 + 3a:  + 3a;"  = a:5  + 2x’6;  also  s = 2.  Thus  the  errors  are  x2  + x3 , and 
the  nearest  errorfree  codeword  is  (2, 0, 2, 0, 0,  0,  2,  2).  Algorithm  4.6.1D  tells  us  that 
2 -b  2a:2  + 2a;6  = (2  + 2x  + 2a:3)gr(a:)  (modulo  4);  so  the  eight  information  bits  correspond 
to  (v0,vi,V2,v3)  = (2,  2,  0,2).  [A  more  intelligent  algorithm  would  also  say,  “Aha:  The 
first  16  bits  of  7r .”  ] 

For  generalizations  to  other  efficient  coding  schemes  based  on  quaternary  vectors, 
see  the  classic  paper  by  Hammons,  Kumar,  Calderbank,  Sloane,  and  Sole,  IEEE  Trans. 
IT-40  (1994),  301-319. 

21.  (a)  C(e)  = 1,  C(0a)  = C(la)  = C(  a),  and  C(*a)  = 2C(a)  - [10...  0 Go], 
Iterating  this  recurrence  gives  C(a)  = 2t  — 2t~1et  — 2t~2et-i  — •••  — 2°ei,  where 
ej  = [10  ...  0 G ay]  and  ay  is  the  suffix  of  a following  the  jth  asterisk.  In  the  example 
we  have  ai  = *10**0*,  02  = 10**0*,  . . . , £*5  = e;  thus  ei  = 0,  e2  = 1,  e3  = 1,  e±  = 0, 
and  e5  = 1 (by  convention),  hence  C(**10**0*)  = 25  — 24  — 22  — 21  = 10. 

(b)  We  may  remove  trailing  asterisks  so  that  t = t' . Then  et  = 1 implies  et_i  = 
• • • = ei  =0.  [The  case  C(a)  — 2t'~x  occurs  if  and  only  if  a ends  in  10-’**’.] 

(c)  To  compute  the  sum  of  C(a)  over  all  f-subcubes,  note  that  (")  clusters  begin  at 
the  n-tuple  0. . .0,  and  ("4)  begin  at  each  succeeding  n-tuple  (namely  one  cluster  for 
each  t-subcube  containing  that  n-tuple  and  specifying  the  bit  that  changed).  Thus  the 
average  is  ((")  + (2n  — l)(nt  1))/2n_t  (")  = 2t(\  — t/n)  + 2t~n(t/n).  [The  formula  in  (c) 
holds  for  any  n-bit  Gray  path,  but  (a)  and  (b)  are  specific  to  the  reflected  Gray  binary 
code.  These  results  are  due  to  C.  Faloutsos,  IEEE  Trans.  SE-14  (1988),  1381-1393.] 
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22.  Let  a*-’  and  ft*k  be  consecutive  lieves  of  a Gray  binary  trie,  where  a and  ft  are 
binary  strings  and  j < k.  Then  the  last  k — j bits  of  a are  a string  a such  that  a 
and  fta  are  consecutive  elements  of  Gray  binary  code,  hence  adjacent.  [Interesting 
applications  of  this  property  to  cube-connected  message-passing  concurrent  computers 
are  discussed  in  A VLSI  Architecture  for  Concurrent  Data  Structures  by  William  J. 
Dally  (Kluwer,  1987),  Chapter  3.] 

23.  2-'  = g(k)  ® gift)  = g(k  © l ) implies  that  l = k © (^_1](2J)  = k © (22+1  — 1).  In 
other  words,  if  k = (6n-i  • • • 60)2  we  have  l = (fen-i  • • • bj+ibj  . . . bo)2- 

24.  Defining  g(k)  = fc©  \_k/2\  as  usual,  we  find  g(k)  = g(  — 1 — fc);  hence  there  are  two 

2-adic  integers  k such  that  g(k)  has  a given  2-adic  value  l.  One  of  them  is  even,  the 
other  is  odd.  We  can  conveniently  define  to  be  the  solution  that  is  even;  then 

(8)  is  replaced  by  bj  = dj- 1 © • • • © 00,  for  j > 0.  For  example,  </_1'(l)  = —2  by  this 
definition;  when  l is  a normal  integer,  the  “sign”  of  is  the  parity  of  l. 

25.  Let  p = k © /;  exercise  7. 1.3-3  tells  us  that  2^lsp-*+1  — p < |fc  — /|  < p.  We 
have  n(g(p))  = v(g(k)  © g(l))  = t if  and  only  if  there  are  positive  integers  j\ , . . . , jt 
such  that  p = (1J10J21-'3  . . . (0  or  l)Jt)2-  The  largest  possible  p < 2n  occurs  when 
ji  = n + 1 — t and  j'2  = •••  = jt  = 1,  yielding  p — 2n  — [2*/3].  The  smallest 
possible  q = 2^gpJ+1  — p = (1J20-'3  ...  (1  or  0)^)2  + 1 occurs  when  j‘2  = • • • = jt  = 1, 
yielding  q = [2*/3l.  [C.  K.  Yuen,  IEEE  Trans.  IT-20  (1974),  668;  S.  R.  Cavior,  IEEE 
Trans.  IT-21  (1975),  596.  The  analogous  bound  for  the  modular  m- ary  Gray  code  is 
[mt/(m2  — 1)],  and  this  formula  holds  also  for  the  reflected  m- ary  Gray  code  when  m 
is  even;  see  van  Zanten  and  Suparta,  IEEE  Trans.  IT-49  (2003),  485-487;  Proc.  South 
East  Asian  Math.  Soc.  Conf.  (Yogyakarta:  Gadjah  Mada  University,  2003),  98-105.] 

26.  Let  N = 2nt  + • • • + 2ni  where  nt  > ■ ■ ■ > n 1 > 0;  also,  let  Tn  be  any  Gray  code 
for  {0, 1, . . . ,2n  — 1}  that  begins  at  0 and  ends  at  1,  except  that  To  is  simply  0.  Use 

r£,  2nt+rntl,  ...,  2nt+---  + 2n3+r£,  2nt+---  + 2"2+rni,  if  t is  even; 

rn„  2n*+r£_1,  ...,  2r“  + ---  + 2n3  + r*,  2nt  + ---+2n2  + rni,  if  t is  odd. 

27.  In  general,  if  k = (6n-i  • • • b 0)2,  the  ( k + l)st  largest  element  of  Sn  is  equal  to 

1/(2  - (— l)a-V(2 /(2  - ( — l)ai/ (2  - (-ID) . . .)), 

corresponding  to  the  sign  pattern  g(k ) = (a„-i  ...0.0)2-  Thus  we  can  compute  any 
element  of  Sn  in  O(n)  steps,  given  its  rank.  Setting  k = 2100  — 1010  and  n = 100 
yields  the  answer  373065177/1113604409.  [Whenever  f(x)  is  a positive  and  monotonic 
function,  the  2n  elements  /(±/(. . . ±/(±  x ) . . .))  are  ordered  according  to  Gray  binary 
code,  as  observed  by  H.  E.  Salzer,  CACM  16  (1973),  180.  In  this  particular  case  there 
is,  however,  another  way  to  get  the  answer,  because  we  also  have 

Sn  = //2,  ±2, . . . , ±2,  ±1// 

using  the  notation  of  Section  4.5.3;  continued  fractions  in  this  form  are  ordered  by 
complementing  alternate  bits  of  fc.] 

28.  (a)  As  t = 1,  2,  . . . , bit  dj  of  median(G()  runs  through  the  periodic  sequence 

0 0*1  1*0  0* 

with  asterisks  at  every  21+Jth  step.  Thus  the  strings  that  correspond  to  the  binary 
representations  of  [(£  — 1 )/2j  and  [</2j  are  medians.  And  those  strings  are  in  fact 
“extreme”  cases,  in  the  sense  that  all  medians  agree  with  the  common  bits  of  |_(t—  l)/2j 
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and  [t/ 2J,  hence  asterisks  appear  where  they  disagree.  For  example,  when  t = 100  = 
(01100100)2  and  n — 8,  we  have  median(G  100)  = 001100**. 

(b)  Since  G2f  = 2G(  U (2 Gt  + 1),  we  may  assume  that  t = (a„_2  . . . aiaol)2  is 
odd.  If  a is  g(p)  and  /3  is  g(q)  in  Gray  binary,  we  have  p = (p„-i  . . .po)2  and  q = 
(Pn-i  • • -Pj  + iPj  ■ ■ -P0)2',  and  an_ian_2  = 01  = Pn-iPn-2-  We  cannot  have  p < t < q, 
because  this  would  imply  that  j = n — 1 and  pn-3  = pn- 4 = • • • = p0  — 1.  [See  A.  J. 
Bernstein,  K.  Steiglitz,  and  J.  E.  Hopcroft,  IEEE  Trans.  IT-12  (1966),  425-430.] 

29.  Assuming  that  p ^ 0,  let  l — [lg pj  and  Sa  = {s  | 2(a  < s < 2 l(a  + 1)}  for 
0 < a < 2n~l . Then  (A  ® p)  — k has  a constant  sign  for  all  k G Sa,  and 


E 

kesa 


(k  0 p)  — k 


= 2'|Sa|  = 22'. 


Also  Jl(p(A)  © P)  = k © gl  11  (p),  and  [lgp[  1](p)J  = LlgpJ.  Therefore 


^ E 1](9(k)®p)-k  E E 1](P))~k  = ^7  E 22'  = 2'- 

k = 0 a=0  k £ Sa  a= 0 

[See  Morgan  M.  Buchner,  Jr.,  Bell  System  Tech.  J.  48  (1969),  3113-3130.] 

30.  The  cycle  containing  A-  > 1 has  length  2Llslg,fc|  + 1,  because  it  is  easy  to  show  from 
Eq.  (7)  that  if  A = (bn-i  ■ ■ ■ 60)2  we  have 


g12  '(A)  = (cn_x  . . . c0)2,  where  Cj  = bj  © bj+2i . 

To  permute  all  elements  A such  that  [lg  AJ  = t,  there  are  two  cases:  If  t is  a power  of  2, 
the  cycle  containing  2[A/2J  also  contains  2[A/2J  + 1,  so  we  must  double  the  cycle  leaders 
for  1—1.  Otherwise  the  cycle  containing  2[A/2]  is  disjoint  from  the  cycle  containing 
2|_A/2J  + 1,  so  Lt  — (2Lt~\)  U ( 2Lt-\  + 1)  = (Lf_1*)2.  This  argument,  discovered  by 
Jorg  Arndt  in  2001,  establishes  the  hint  and  yields  the  following  algorithm: 

PI.  [Initialize.]  Set  t <—  1,  m <—  0.  (We  may  assume  that  n > 2.) 

P2.  [Loop  through  leaders.]  Set  r <—  in.  Perform  Algorithm  Q with  A = 2*  + r; 
then  if  r > 0,  set  r <—  (r  — 1 ) &m  and  repeat  until  r = 0.  [See  exercise  7.1.3-79.] 

P3.  [Increase  IgA.]  Set  t 4—  t + 1.  Terminate  if  t is  now  equal  to  n;  otherwise  set 
m «—  2m  +[!&(!—  1)^0]  and  return  to  P2.  | 

Ql.  [Begin  a cycle.]  Set  s «-  Xk,  l <—  A,  j ■<—  / © [1/2] . 

Q2.  [Follow  the  cycle.]  While  j / A,  set  Xi  <—  Xj,  l 4-  j,  and  j «-  l © [1/2] . Then 
set  Xi  <—  s.  | 

31.  We  get  a field  from  /„  if  and  only  if  we  get  one  from  /j,2' , which  takes  (an_i  . . . a0)2 
to  ((a„_i  © an_2)(o„_i  © an_3)(an_2  © an- 4)  • • • (<22  © «o)(ai))2.  Let  cn(x)  be  the 
characteristic  polynomial  of  the  matrix  A defining  this  transformation,  mod  2;  then 
ci (x)  = x + 1,  c2(x)  = x2  + x + 1,  and  Cj+i(x)  = xcj(x)  + Cj-i(x).  Since  c„(A)  is  the 
zero  matrix,  by  the  Cayley  Hamilton  theorem,  a field  is  obtained  if  and  only  if  cn(x)  is 
a primitive  polynomial,  and  this  condition  can  be  tested  as  in  Section  3.2.2.  The  first 
such  values  of  n are  1,  2,  3,  5,  6,  9,  11,  14,  23,  26,  29,  30,  33,  35,  39,  41,  51,  53,  65,  69, 
74,  81,  83,  86,  89,  90,  95. 

[Running  the  recurrence  backwards  shows  that  c-j-i(x)  = Cj(x),  hence  Cj(x) 
divides  C(2j+i)k+j (x);  for  example,  c3k+ i(x)  is  always  a multiple  of  x + 1.  All  numbers  n 
of  the  form  2j A + j + A are  therefore  excluded  when  j > 0 and  A > 0.  The  polynomials 
cig(a:),  C5o(x),  cgS(x),  and  cgg(x)  are  irreducible  but  not  primitive.] 


684  ANSWERS  TO  EXERCISES 


7.2. 1.1 


32.  Mostly  true,  but  false  at  the  points  where  wk(x)  changes  sign.  (Walsh  originally 
suggested  that  wk{x)  should  be  zero  at  such  points;  but  the  convention  adopted  here 
is  better,  because  it  makes  simple  formulas  like  (i5)-(ig)  valid  for  all  x.) 

33.  By  induction  on  fc,  we  have 

wk(x)  = ieLJfe/2J  ( 2x ) = r1(2x)i>1+i'2?'2(2a;)b2+*>3  . . . = n{x)bo+bl  r2{x)bl+b2 r3{x)b2+b:<  . . . 

for  0 < x < because  r,(2x)  = rj  + i(x)  and  r\(x)  = 1 in  this  range.  And  when 
\ < x < 1, 

wk{x)  = (-l)rfc/2lu>Lfc/2j  {2x  - 1)  = r1(a:)!>0+t’1ri(2x  - l)6l+f>2r2(2a;  - l)b2+(’3  . . . 

= ri  (x)bo+bl  r2(x)6l+i>2r3(a;)i’2+i>3  . . . 


because  \k/2 ] eJo+Ji  (modulo  2)  and  rj( 2x  — 1)  = r,+i (x  — i)  = rj+1(x)  for  j > 1. 

34.  pk(x)  = n,->0  rAi(x);  hence  wk(x ) = pk(x)p[k/2i  (*)  = Pg(k)(x).  [R.  E.  A.  C. 
Paley,  Proc.  London  Math.  Soc.  (2)  34  (1932),  241-279.] 

35.  If  j = (a„_i  . . . 00)2  and  k = (bn-\  . . . 60)2,  the  element  in  row  j and  column  k is 
(_l)/0.fc),  where  f(j,  k ) is  the  sum  of  all  arbs  such  that:  r = s (Hadamard);  r+s  = n—  1 
(Paley);  r + s = norn-l  (Walsh). 

Let  Rn , Fn,  and  G„  be  permutation  matrices  for  the  permutations  that  take 
j = (an-i  ■ ■ ■ 00)2  to  k = (ao  ■ ■ ■ an- 1)2,  k = 2n  — 1 — j = (a„-i  . . . 00)2,  and  k = 
9[~1](j)  = {(an- 1)  • ■ ■ (a„_i  © • • • ® ao))2,  respectively.  Then,  using  the  direct  product 
of  matrices,  we  have  the  recursive  formulas 


-Rrz  + l — 


f Rn  ® (1  0)\ 

V Rn  ® (0  1)  J ’ 


Gn+l 


(Gn  0 \ 

\ 0 GnFn  J ’ 


Lfn+l  — Hn  ® 


Pn  + 1 — 


( Pn®  (1  1)\ 


wn+ 1 


( Wn®{  1 1)_  \ 
\FnWn  ® (1  1)  J ■ 


Thus  Wn  = GTnPn  = PnGn ; Hn  = PnRn  = RnPn ; and  Pn  = WnGTn  = GnWn  = 
HnRn  = RnR-n • 

36.  Tl.  [Hadamard  transform.]  For  k = 0,  1,  . . . , n — 1,  replace  the  pair  (X3 . XJ+2k ) 

by  (Xj  + Xj+2k,Xj  — Xj+2k)  for  all  j with  \j/2k\  even,  0 < j < 2n . (These 
operations  effectively  set  XT  4-  HnXT .) 

T2.  [Bit  reversal.]  Apply  the  algorithm  of  exercise  5 to  the  vector  X.  (These 
operations  effectively  set  XT  4—  RnXT,  in  the  notation  of  exercise  35.) 

T3.  [Gray  binary  permutation.]  Apply  the  algorithm  of  exercise  30  to  the  vec- 
tor X.  (These  operations  effectively  set  XT  4—  G„XT.)  | 

If  n has  one  of  the  special  values  in  exercise  31,  it  may  be  faster  to  combine  steps  T2 
and  T3  into  a single  permutation  step. 

37.  If  k = 2ei  +•  • - + 2et  with  ei  > • • • > et  > 0,  the  sign  changes  occur  at  Sei  U-  • -USe, , 
where 


So  = 


Si 


Se  = 


f 2 j + 1 
l 2e+! 


0 <j<  2C}. 


Therefore  the  number  of  sign  changes  in  (0  . . x)  is  [2ejx+~\.  Setting  x = l/(k+l) 

gives  l+0(t)  changes;  so  the  Z th  is  at  a distance  of  at  most  0(v{k))/ 2^lsfcJ  from  l/(k+ 1). 


7.2.1. 1 


ANSWERS  TO  EXERCISES  685 


[This  argument  makes  it  plausible  that  infinitely  many  pairs  (k,l)  exist  with 
I zki  ~ l/{k  + 1)1  = f2((log/c)/fc).  But  no  explicit  construction  of  such  “bad”  pairs 
is  immediately  apparent.] 

38.  Let  t0(x)  = 1 and  tk(x)  = u;L3xJ  r2fc/3l  tLA:/3J  (3a;),  where  u = e2,ri/3.  Then  tk{x) 
winds  around  the  origin  |/c  times  as  x increases  from  0 to  1.  If  sk(x)  = wL3*1!  is  the 
ternary  analog  of  the  Rademacher  function  rk(x),  we  have  tk( x)  = fjj >0  Sj+i(x)bi~bi+1 
when  k = (6n_i  . . . 60)3,  as  in  the  modular  ternary  Gray  code. 

39.  (a)  Let’s  call  the  symbols  {x0,  xi, . . . , x7}  instead  of  {a,b,c,d,e,f,g,h}.  We  want 
to  find  a permutation  p of  (0,1,...,  7}  such  that  the  matrix  with  (-l)i-*a:p(j-)efc  in 
row  j and  column  k has  orthogonal  rows;  this  condition  is  equivalent  to  requiring  that 

(j  ® /)  • (p(j)  ©p(j'))  = 1 (modulo  2),  for  0 < j < j’  < 8. 

One  solution  is  p( 0) . . . p( 7)  = 017  2 5 6 3 4,  yielding  the  identity  ( a 2 + b2  + c2  + d2  + 
e2  + f2+g2  + h2)(A2  + B2  + C2  + D2  + E2  + F2  + G2  + H2)  = A2  + B2  + C2  +V2  + 

£2  + T2  + G2  + H2,  where  id 

/-4\  / a b c d e f g /i  \ / A 

B b -a  d -c  f -e  h -g  B 

c h g -f  -e  d c -b  -a  C 

B _ c —d  -a  b g -h  -e  f D 

£ f e h g —b  —a  —d  —c  E 

T g -h  e -f  -c  d -a  b F 

G I d c —b  —a  —h  —g  f e G 

\ki/  \ e -f  -g  h -a  b c -d  / \H 

[This  identity  was  discovered  by  C.  F.  Degen,  Memoires  de  l’Acad.  Sci.  St.  Petersbourg 
(5)  8 (1818),  207-219.  The  related  octonions  are  discussed  in  an  interesting  survey  by 
J.  C.  Baez,  Bull.  Amer.  Math.  Soc.  39  (2002),  145-205;  42  (2005),  213,  229-243.  See 
also  J.  H.  Conway  and  D.  A.  Smith,  On  Quaternions  and  Octonions  (2003).] 

(b)  There  is  no  16  x 16  solution.  The  closest  one  can  come  is 

P( 0)  . . . p(  15)  = 0 1 11  2 14  15  13  4 9 10  7 12  5 6 3 8, 

which  fails  if  and  only  if  j ©/  = 5.  (See  Philos.  Mag.  34  (1867),  461-475.  In  §9,  §10, 

§11,  and  §13  of  this  paper,  Sylvester  stated  and  proved  the  basic  results  about  what 
has  somehow  come  to  be  known  as  the  Hadamard  transform  — although  Hadamard 
himself  gave  credit  to  Sylvester  [Bull,  des  Sciences  Mathematiques  (2)  17  (1893),  240- 
246].  Moreover,  Sylvester  introduced  transforms  of  mn  elements  in  §14,  using  mth 
roots  of  unity.) 

40.  Yes;  this  change  would  in  fact  run  through  the  swapped  subsets  in  lexicographic 
binary  order  rather  than  in  Gray  binary  order.  (Any  5x5  matrix  of  0s  and  Is  that 
is  nonsingular  mod  2 will  generate  all  32  possibilities  when  we  run  through  all  linear 
combinations  of  its  rows.)  The  most  important  thing  is  the  appearance  of  the  ruler 
function,  or  some  other  Gray  code  delta  sequence,  not  the  fact  that  only  one  aj  changes 
per  step,  in  cases  like  this  where  any  number  of  the  a,j  can  be  changed  simultaneously 
at  the  same  cost. 

41.  At  most  16;  for  example,  fired,  fires,  finds,  fines,  fined,  fares,  fared,  wares, 
wards,  wands,  wanes,  waned,  wines,  winds,  wires,  wired.  We  also  get  16  from  paced/ 
links  and  paled/mints;  perhaps  also  from  a word  mixed  with  an  antipodal  nonword. 


686  ANSWERS  TO  EXERCISES 


7.2. 1.1 


42.  Suppose  n < 22  + r + 1,  and  let  s = 2r . We  use  an  auxiliary  table  of  2r+s  bits 
for  0 < j < 2s  and  0 < k < s,  representing  focus  pointers  as  in  Algorithm  L,  together 
with  an  auxiliary  s-bit  “register”  j = (js_i  . ,.j0) 2 and  an  (r  + 2)-bit  “program  counter” 
P = (Pr+i  ■ ■ ■ Po ) 2 • At  each  step  we  examine  the  program  counter  and  possibly  the  j 
register  and  one  of  the  / bits;  then,  based  on  the  bits  seen,  we  complement  a bit  of  the 
Gray  code,  complement  a bit  of  the  program  counter,  and  possibly  change  a j or  / bit, 
thereby  emulating  step  L3  with  respect  to  the  most  significant  n — r — 2 bits. 

For  example,  here  is  the  construction  when  r = 1: 


P2P1P0 

Change  Set 

P2P1P0 

Change 

Set 

0 0 0 

ao,Po  jo  foo 

} 3 <—  fo 

1 1 0 

a0,po 

fj 0 /(j+l)0  \ 

0 0 1 

dli  Pi  j 1 «-  foi 

1 1 1 

ai,pi 

fi  1 <-  /(j+i)i  I 

0 1 1 

aoiPo  foo  <—  0 ) 

> fo^O 

1 0 1 

a0,Po 

/(j  + l)0  0’  + l)i 

0 1 0 

a2,P2  foi  <-  0 J 

10  0 

dj+3,P2 

fu+ 1)1  0+1) 

<-/; 


3 + 1 


■3  + 1 


The  process  stops  when  it  attempts  to  change  bit  an. 

[In  fact,  we  need  change  only  one  auxiliary  bit  per  step  if  we  allow  ourselves  to 
examine  some  Gray  binary  bits  as  well  as  the  auxiliary  bits,  because  pr  . . ,p0  = ar  . . . a0, 
and  we  can  set  f0  <—  0 in  a more  clever  way  when  j doesn’t  have  its  final  value  2s  - 1. 
This  construction,  suggested  by  Fredman  in  2001,  improves  on  another  that  he  had 
published  in  SICOMP  7 (1978),  134-146.  With  a more  elaborate  construction  it  is 
possible  to  reduce  the  number  of  auxiliary  bits  to  O(n).] 

43.  This  number  was  estimated  by  Silverman,  Vickers,  and  Sampson  [IEEE  Trans.  IT- 
29  (1983),  894-901]  to  be  about  7 x 1022.  Exact  calculation  might  be  feasible  because 
every  6-bit  Gray  cycle  has  only  five  or  fewer  segments  that  lie  in  a 5-cube  corresponding 
to  at  least  one  of  the  six  coordinates.  (In  unpublished  work,  Steve  Winker  had  used  a 
similar  idea  to  evaluate  d( 5)  in  less  than  15  minutes  on  a “generic”  computer  in  1972.) 

44.  Every  n-bit  Gray  cycle  defines  a pair  of  perfect  matchings  (see  exercise  55). 

45.  (a)  (000  002  012  010  090  094  0b4  ...  112  102  100),  in  hexadecimal,  32  elements  in 
all.  Notice  that  the  signatures  of  elements  in  each  cycle  run  through  the  Gray  code  F4. 

(b)  A ground  vertex  v is  preceded  in  its  cycle  by  its  sibling  v © 2.  If  v is  a ground 
vertex  in  a different  cycle  from  its  sibling  u = v ® 1,  we  can  join  the  cycles  by  deleting 
{u®2  — u,  u©2  — v}  and  inserting  {u  — v,  «©2 — v©2}.  Repeat  for  all  ground  v. 

(c)  Consider  the  multigraph  G'  whose  vertices  are  the  cycles  and  whose  edges  go 
from  the  cycle  of  v to  the  cycle  of  v + 1 for  all  even  ground  vertices  v.  Every  vertex  of 
G'  has  even  degree,  so  the  edges  are  a union  of  cycles  in  G' . Thus  any  edge  of  G'  can 
be  deleted  without  changing  the  connected  components. 

(d)  It’s  not  difficult  to  construct  a path  P = i/0'  — — • • • through 

vertices  of  G with  vo  = V-i  = 0 that  passes  through  all  such  v with  a(v)  < 1,  and  such 
that  a(v^)  6 {0,1,  2, 4, 8}  for  all  i.  Take  the  cycle  from  (b)  that  contains  v ^ and  call 
it  the  “working  cycle”  W;  then  do  the  following  for  i = 1,  2,  . . . , until  W includes  all 
vertices:  If  v — ^ W,  suppose  u = has  ui  ^ vi.  Case  1:  rt©c  — u is  an  edge 

of  W,  for  c = 1 or  c = 2.  Take  a cycle  for  the  equivalence  class  of  v that  has  the  edge 
V®c  — v.  Delete  those  edges  and  insert  {u  — v,  u®c  — n©c}.  Case  2:  Otherwise 
Case  1 must  have  applied  to  w = and  u on  the  previous  step.  If  c = 1 then  W 

contains  the  edge  u© 2 — u©3.  We  can  find  a cycle  with  w©2  — u©3,  and  replace 
those  edges  by  {u©2  — u©3,  u©3  — u©3}.  A similar  edge-swap  works  when  c = 2. 

(e)  The  final  cycle  W allows  us  to  reconstruct  Mi(v)(v).  When  l(v)  / 0 the 
function  A4((„)  is  equivalent  to  t = 23r~1  independent  matchings  of  the  r-cube,  because 
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there  are  t ways  to  choose  the  Vi  for  i ^ l having  the  correct  signature.  So  the  number 
of  different  cycles  is  at  least  M(r)12*  (see  exercise  44). 

46.  There  are  fc-bit  signatures  <r(v).  When  a{v)  = g(j)  in  Gray  binary  code,  l(v)  = 
(p(j  + 1)  + [j  / 2k  - 1])  [j  + 2 is  not  a power  of  2].  At  least  M(r)(2*’-*:)t  cycles  arise, 
where  t = 2('c_1)(r~1)+2.  [Information  Processing  Letters  109  (2009),  267-272.] 

47.  The  bounds  (^)2’"1  < 2r-x !/(2T-1/r)2''_1  < M(r)  < r^~^T  = (r+0(log r))2”"1 
are  proved  in  Section  7.5.1.  Hence  d(n)1/2"  < n/e  + O(logn)  by  exercise  44. 

The  lower  bound  from  exercise  46,  if  we  let  Gj  be  an  r^-cube,  is 


(M(n)2 


(M(r2)2"  ■*—*'}*-  ~.(M(r3)2 

• . . . • (M^-,)2""'*-1'^1)2  • (M(rkr-rk-k+1)2  ; 


and  it’s  better  to  choose  r j m (n  — 2)/2-i  ^ k ^ for  1 < j < k instead  of  using  cubes 
of  roughly  the  same  size.  Let  ctj  = r,-/e  be  a lower  bound  on  M(rj)2l~rj . The  lower 
bound  on  d(n)1/2”  simplifies  to 


-k/ 2k  1/4  1/8 

*1  a2  a3 


1/2- 


\/2k-1  n\/2k~l  _ 2-2+(fc-4)/2*  (n-2\l  k/2 


(^)  ' (— (J))- 


and  this  is  n/(4e)  + O(logn)2  when  k = lgn  + 0(1). 

49.  Take  any  Hamiltonian  path  P from  0 ...  0 to  1 ...  1 in  the  (2 n - l)-cube,  such 
as  the  Savage-Winkler  code,  and  use  OP,  IP.  (All  such  cycles  are  obtained  by  this 
construction  when  n = 1 or  n = 2,  but  many  more  possibilities  exist  when  n > 2.) 

50.  a1(n+l)a?na1j1a2na2{n+l)ot2  . . ■ ji-iatfna? (n+l)atnaf  jt_1  . . ,j1a^n. 

51.  Let  Cj  = 2[(2n  1 +j)/n\  and  c'  = 2|_(2n+1  +j)/(n  + 2)J.  If  n ^ 3,  it  is  not  difficult 
to  verify  that  4Cj  > 8L2n_1/nJ  > 2[2n+1/(n  + 2)]  > c'k  for  0 < j < n and  0 < k < n + 2. 
Therefore  we  can  apply  Theorem  D to  any  n-bit  Gray  cycle  with  transition  counts  Cj, 
underlining  bj  copies  of  j and  putting  an  underlined  digit  0 last,  where  bj  = 2c j — 
2c(j+2+d)  mod  (n+2)  — [.?  = 0]  and  d is  chosen  so  that  c'd  = c'd+1.  This  construction  works 

because  l = b0  + ■ ■ ■ + fon_i  = 2(c0  + • • • + c„_j)  — |(cf,  H 1-  c'n+1  -c'd-  c'd+1)  - 1 = 

cd  - 1 is  odd.  [Corollary  B was  discovered  by  T.  Bakos  in  the  1950s,  and  proved  in 
detail  by  A.  Adam  in  Truth  Functions  (Budapest:  1968),  28-37.  Adam’s  book  also 
presents  a proof  by  G.  Poliak  that,  in  fact,  c'0  = c\  for  all  n;  hence  we  may  take  d=  0. 
See  also  J.  P.  Robinson  and  M.  Cohn,  IEEE  Trans.  C-30  (1981),  17-23.] 

52.  The  number  of  different  code  patterns  in  the  smallest  j coordinate  positions  is  at 
most  c0  + • • • + Cj- 1. 

53.  Theorem  D produces  only  cycles  with  Cj  = cJ+1  for  some  j,  so  it  can’t  produce 
the  counts  (2,4, 6,8, 12).  The  extension  in  exercise  50  gives  also  Cj  = Cj+i  — 2,  but  it 
can’t  produce  (6,10,14,18,22,26,32).  The  sets  of  numbers  satisfying  the  conditions 
of  exercise  52  are  precisely  those  obtainable  by  starting  with  {2, 2, 4, . . . , 2n-1}  and 
repeatedly  replacing  some  pair  {cj,ck}  for  which  Cj  < ck  by  the  pair  {cj  + 2 ,ck-  2}. 

54.  Suppose  the  values  are  {pi, . . . ,pn},  and  let  Xjk  be  the  number  of  times  pj  occurs 
in  (ai, . . . , ak).  We  must  have  (in, . . . , xnk)  = (in, . . . , xni ) (modulo  2)  for  some  k < l. 
But  if  the  p’s  are  prime  numbers,  varying  as  the  delta  sequence  of  an  n-bit  Gray  cycle, 
the  only  solution  is  k = 0 and  l = 2n.  [AMM  60  (1953),  418;  83  (1976),  54.] 

55.  In  fact,  given  any  perfect  matching  Q of  AT2" , one  can  find  in  0(2")  steps  a perfect 
matching  R of  the  n-cube  such  that  Q U R is  a Hamiltonian  cycle  of  K2n  ■ [See  J.  Fink, 
J.  Comb.  Theory  B97  (2007),  1074-1076;  Elect.  Notes  Disc.  Math.  29  (2007),  345-351.] 
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56.  [Bell  System  Tech.  J.  37  (1958),  815-826.]  The  112  canonical  delta  sequences  yield 


Class  Example  t 
A 0102101302012023  2 
B 0102303132101232  2 
C 0102030130321013  2 


Class  Example  t 
D 0102013201020132  4 
E 0102032021202302  4 
F 0102013102010232  4 


Class  Example  t 
G 0102030201020302  8 
H 0102101301021013  8 
I 0102013121012132  1 


Here  B is  the  balanced  code  (Fig.  33(b)),  G is  standard  Gray  binary  (Fig.  30(b)),  and 
H is  the  complementary  code  (Fig.  33(a)).  Class  H is  also  equivalent  to  the  modular 
(4, 4)  Gray  code  under  the  correspondence  of  exercise  18.  A class  with  t automorphisms 
corresponds  to  32  x 24 ft  of  the  2688  different  delta  sequences  <50<5i  . . . <515. 

Similarly  (see  exercise  7.2.3-00),  the  5-bit  Gray  cycles  fall  into  237,675  different 
equivalence  classes. 

57.  With  Type  1 only,  480  vertices  are  isolated,  namely  those  of  classes  D,  F.  G in  the 
previous  answer.  With  Type  2 only,  the  graph  has  384  components,  288  of  which  are 
isolated  vertices  of  classes  F and  G.  There  are  64  components  of  size  9,  each  containing 
3 vertices  from  E and  6 from  A ; 16  components  of  size  30,  each  with  6 from  FI  and  24 
from  C;  and  16  components  of  size  84,  each  with  12  from  D,  24  from  B,  48  from  I.  With 
Type  3 (or  Type  4)  only,  the  entire  graph  is  connected.  [Similarly,  all  91,392  of  the  4-bit 
Gray  paths  are  connected  if  path  a/3  is  considered  adjacent  to  path  aR/3.  Vickers  and 
Silverman,  IEEE  Trans.  C-29  (1980),  329-331,  have  conjectured  that  Type  3 changes 
will  suffice  to  connect  the  graph  of  n-bit  Gray  cycles  for  all  n > 3.] 

58.  If  some  nonempty  substring  of  /3/3  involves  each  coordinate  an  even  number  of 
times,  that  substring  cannot  have  length  |/3|,  so  some  cyclic  shift  of  /?  has  a prefix  7 
with  the  same  evenness  property.  But  then  a doesn’t  define  a Gray  cycle,  because  we 
could  change  each  n of  7 back  to  0. 

59.  If  a is  nonlocal  in  exercise  58,  so  is  /3/3,  provided  that  q > 1 and  that  0 occurs 
more  than  q + 1 times  in  a.  Therefore,  starting  with  the  a of  (30)  but  with  0 and  1 
interchanged,  we  obtain  nonlocal  cycles  for  n > 5 in  which  coordinate  0 changes  exactly 
6 times.  [Mark  Ramras,  Discrete  Math.  85  (1990),  329-331.]  On  the  other  hand,  a 4- 
bit  Gray  cycle  cannot  be  nonlocal  because  it  always  has  a run  of  length  2;  if  8k  = 5k+2 , 
elements  {us,,  vk+i,  vk+2,  Vfc+3}  form  a 2-subcube. 

60.  Use  the  construction  of  exercise  58  with  9 = 1. 

61.  The  idea  is  to  interleave  an  m-bit  cycle  U = (uq,  u 1,  u2, . . . ) with  an  n-bit  cycle 
V — (v0,  vi,  V2, . . . ),  by  forming  concatenations 


W ( Ui0Vj0 , m1  Vjx , Ui2  Vj2 , . . . ),  ik  — ao  + • • • + ak-\ , jk  = Oo  + • • • + Ufc_i, 

where  a0aio2  ...  is  a periodic  string  of  control  bits  aaa ...;  we  advance  to  the  next 
element  of  U when  ak  = 0,  otherwise  to  the  next  element  of  V. 

If  a is  any  string  of  length  2m  < 2n,  containing  s bits  that  are  0 and  f = 2m  - s 
bits  that  are  1,  W will  be  an  (m  + n)-bit  Gray  cycle  if  s and  t are  odd.  For  we  have 
ik+i  = ik  (modulo  2m)  and  jk+i  = jk  (modulo  2n)  only  if  l is  a multiple  of  2m,  since 
ik  + jk  = k.  Suppose  l = 2mc;  then  jk+,  = jk  + tc,  so  c is  a multiple  of  2n . 

(a)  Let  a = 0111;  then  runs  of  length  8 occur  in  the  left  2 bits  and  runs  of  length 
> L3r(n)J  occur  in  the  right  n bits. 

(b)  Let  s be  the  largest  odd  number  < 2 mr(m)/(r(m)  +r(n)).  Also  let  t = 2m  -s 
and  ak  = [(fc  + l)t/2mJ  - |fct/2mJ,  so  that  ik  = \ks/2m]  and  jk  = [kt/2m\.  If 
a run  of  length  l occurs  in  the  left  m bits,  we  have  ik+1+1  > ik  + r(m ) + 1,  hence 


7.2.1. 1 


ANSWERS  TO  EXERCISES  689 


/ + 1 > 2 mr(m)/s  > r(m)  + r(n).  And  if  it  occurs  in  the  right  n bits  we  have  jk+i+i  > 
jk  + r(n)  + 1,  hence 


l + 1 > 2 mr(n)/t  > 2mr(n)/(2Tnr(n)/(r(m)  + r(n ))  + 2) 


= r{m)  + r(n) 


2 (r(m)  + r(n ))2 
2 mr(n)  + 2 (r(m)  + r(n )) 


> r(m)  + r(n)  — 1 


because  r(m)  < r(n). 

The  construction  often  works  also  in  less  restricted  cases.  See  the  paper  that 
introduced  the  study  of  Gray-code  runs:  L.  Goddyn,  G.  M.  Lawrence,  and  E.  Nemeth, 
Utilitas  Math.  34  (1988),  179-192. 

63.  Set  ak  <-  k mod  4 for  0 < k < 210,  except  that  ak  = 4 when  k mod  16  = 15  or 

k mod  64  = 42  or  k mod  256  = 133.  Also  set  (jo,ji, h,j3,  j4)  (0,2, 4, 6, 8).  Then 

for  k = 0,  1,  . . . , 1023,  set  5k  «—  jak  and  jak  <—  1 + 4ak  — jak-  (This  construction 
generalizes  the  method  of  exercise  61.) 

64.  (a)  Each  element  uk  appears  together  with  {vk,vk+2™,  ■ ■ ■ ,»s+2m(2«-i-1)}  and 
{ufe+i,Ufc+i+2m,...,ufc+i+2m(2"-i-i)}-  Thus  the  permutation  o0  . . ,02™-\  must  be  a 
2 x-cycle  containing  the  n-bit  vertices  of  even  parity,  times  an  arbitrary  permutation 
of  the  other  vertices.  This  condition  is  also  sufficient. 

(b)  Let  tj  be  the  permutation  that  takes  v eA  v ® 21 , and  let  7r j{u,w)  be  the 
permutation  ( uw)tj . If  u®w  = 21  + 2j  then  tt j(u,w)  takes  u ^ u®2i  and  w®2\ 
while  v i-A  v ® 2j  for  all  other  vertices  v,  so  it  takes  each  vertex  to  a neighbor. 

If  S is  any  set  C {0, . . . , n — 1},  let  a(S)  be  the  stream  of  all  permutations  Tj  for 
all  j G {0, . . . , n — 1}  \ S,  in  increasing  order  of  j,  repeated  twice;  for  example,  if  n = 5 
we  have  cr ( { 1 , 2 } ) = T0T3T4T0T3T4 . Then  the  Gray  stream 


E(i,  j,u)  = a({i,  j})7ri(u,u02,©2J)cr({Li})r:,(T({j}) 

consists  of  6n  - 8 permutations  whose  product  is  the  transposition  (u  u® 2i®2j). 
Moreover,  when  this  stream  is  applied  to  any  n-bit  vertex  v,  its  runs  all  have  length 
n — 2 or  more. 

We  may  assume  that  n > 5.  Let  So  . . .52™-i  be  the  delta  sequence  for  an  n-bit 
Gray  cycle  (vo,«i,. . . , U2«-i)  with  all  runs  of  length  3 or  more.  Then  the  product  of 
all  permutations  in 

2n~1-l 

£=  17  (^(^2k-l,S2k,V2k-l)  £(S2k,S2k+l,V2k)) 

k=  1 


is  (vi  V3)(v2  V4)  . . . (v2n  —3  V2«-i)(v2’»-2  Uo)  = (u2«-i  . . . Vi)(v2n~2  ■ ■ ■ Vo),  so  it  satisfies 
the  cycle  condition  of  (a). 

Moreover,  all  powers  (tr(0)E)‘  produce  runs  of  length  > n - 2 when  applied  to 
any  vertex  v.  By  repeating  individual  factors  a({i,j})  or  a({j})  in  E as  many  times 
as  we  wish,  we  can  adjust  the  length  of  <r(0)E,  obtaining  2n  + (2n_1  - l)(12n  - 16)  + 
2(n  — 2)a  + 2(n  — 1)6  for  any  integers  a,  b > 0;  thus  we  can  increase  its  length  to  exactly 
2m,  provided  that  2m  > 2n+  (2n  1—l)(12n  — 16)  + 2(n2  — 5n  + 6),  by  exercise  5.2.1-21. 

(c)  The  bound  r(n)  > n — 41gn  + 8 can  be  proved  for  n > 5 as  follows.  First 
we  observe  that  it  holds  for  5 < n < 33  by  the  methods  of  exercises  60-63.  Then  we 
observe  that  every  integer  IV  > 33  can  be  written  as  N = m + n or  IV  = m + n + 1,  for 
some  m > 20,  where 


n = m — [4  lg  mj  + 10. 
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If  m > 20,  2m  is  sufficiently  large  for  the  construction  in  part  (b)  to  be  valid;  hence 
r(N)  > r(m  + n)  > 2 min (r(m),n  — 2)  > 2(m  — [41gmJ  + 8) 

= ra  + n + 1 - [41g7V-l+eJ+8 
>!V-41g7V  + 8 

where  e = 41g(2m/IV)  < 1 + [N  = m + n].  [Electronic  Journal  of  Combinatorics  10 
(2003),  #R27,  1 -10.]  Recursive  use  of  (b)  gives,  in  fact,  r(1024)  > 1000. 

65.  A computer  search  reveals  that  eight  essentially  different  patterns  (and  their 
reverses)  are  possible.  One  of  them  has  the  delta  sequence  01020314203024041234 
214103234103,  and  it  is  close  to  two  of  the  others. 

66.  (Solution  by  Mark  Cooke.)  One  suitable  delta  sequence  is  012345607012132435 
65760710213534626701537412362567017314262065701342146560573102464537 
57102043537614073630464273703564027132750541210275641502403654250136 
02541615604312576032572043157624321760452041751635476703564757062543 
7242132624161523417514367143164314.  (Solutions  for  n > 8 are  still  unknown.) 

67.  Let  V2k+i  = V2k  and  V2k  = Ow/t,  where  (uo,ui, . . . , u2n-i_1)  is  any  (n—  l)-bit  Gray 
cycle.  [See  Robinson  and  Cohn,  IEEE  Trans.  C-30  (1981),  17-23.] 

68.  Yes.  The  simplest  way  is  probably  to  take  (n  — l)-trit  modular  Gray  ternary  code 
and  add  0 ...  0,  1 ...  1,  2 ...  2 to  each  string  (modulo  3).  For  example,  when  n = 3 the 
code  is  000,  111,  222,  001,  112,  220,  002,  110,  221,  012,  120,  201,  . . . , 020,  101,  212. 

69.  (a)  We  need  only  verify  the  change  in  h when  bits  bj-i  ...b0  are  simultaneously 
complemented,  for  j — 1,  2,  . . . ; and  these  changes  are  respectively  (1110)2,  (1101)2, 
(0111)2,  (1011)2,  (10011)2,  (100011)2,  •••  ■ To  prove  that  every  n-tuple  occurs,  note 
that  0 < h(k ) < 2n  when  0 < k < 2n  and  n > 3;  also  h[_1]  ((a„_i  . . . a0)2)  = 
(bn- 1 ■ ■ ■ M2,  where  b0  = a0  © cq  ® o2  © • • • , bx  = a0,  b2  = a2  ® a3  © a4  ® • • • , 
&3  = ao  © ai  ® a3  ® • ■ ■ , and  bj  = <ij  © aj+i  © ■ ■ • for  j > 4. 

(b)  Let  h(k)  = (. . . a2aia0)2  where  a3  = bj  © bj+ 1 © b0[j  < t]  © bt-i[t  - 1 <j<t], 

70.  As  in  (32)  and  (33),  we  can  remove  a factor  of  n!  by  assuming  that  the  strings  of 
weight  1 occur  in  order.  Then  there  are  14  solutions  for  n = 5 starting  with  00000,  and 
21  starting  with  00001.  When  n = 6 there  are  46,935  of  each  type  (related  by  reversal 
and  complementation).  When  n = 7 the  number  is  much,  much  larger,  yet  very  small 
by  comparison  with  the  total  number  of  7-bit  Gray  codes. 

71.  Suppose  that  an(J+1)  differs  from  anj  in  coordinate  tj , for  0 < j < n — 1.  Then 

tj  = j 7r„,  by  (44)  and  (38).  Now  Eq.  (34)  tells  us  that  t0  = n - 1;  and  if  0 < j < n - 1 
we  have  tj  = (( j - l)7rn_i)7r„_i  by  (40).  Thus  tj  = jan tt^_1  for  0 < j < n - 1,  and 

the  value  of  (n  — l)7rn  is  whatever  is  left.  (Notations  for  permutations  are  notoriously 

confusing,  so  it  is  always  wise  to  check  a few  small  cases  carefully.) 

72.  The  delta  sequence  is  0102132430201234012313041021323. 

73.  Let  Qnj  = P^j  and  denote  the  sequences  (41),  (42)  by  Sn  and  Tn.  Thus  Sn  = 
PnoQniPnv  ■ ■ ■ and  Tn  = QnoPniQn2  . . . , if  we  omit  the  commas;  and  we  have 

Sn+l  = 0 Pn0  0 Qnl  1 Qlo  1 Pnl  0 Pn2  0 Qn3  1 Ql2  1 Pn*3  0 Pni  . . . , 

Tn+ 1 = 0QnO  lPZo  0 Pnl  0Qn2  lQn!  1 p:2  0 Pn3  0 Qn4  1 Q*3 

where  7r  = 7rn,  revealing  a reasonably  simple  joint  recursion  between  the  delta  sequences 
An  and  En  of  S„  and  Tn.  Namely,  if  we  write 


An  — 4>\  d\  (j)2  0,2  ■ ■ ■ 4>n- 1 dn- 1 4>n, 


En  = Ipl  bi  1p2b2..  .tpn-l  1 %l>n, 
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where  each  tf>j  and  04  is  a string  of  length  — 1,  the  next  sequences  are 

An+l  = 01  0 1 02  n 0l7T  bin  027T  n 03  a3  04  Tl  037T  637T  047T  Tl  . . . 

Sn+i  =0i  n 0i7r  n 02  f>2  03  n 0 2?r  02ir  0 37r  n 04  fo4  05  n 047r  a4ir  057r  n . . . 

For  example,  we  have  A3  = 0102101  and  10  = 0212021,  if  we  underline  the  a’s 
and  6’s  to  distinguish  them  from  the  0’s  and  0’s;  and 

A4  = 010213  07r27rl7r27r07r  313  l7r  = 01021321012313  0, 

£4  = 03  07T  3 1 2 0 2 1 3 07r27rl7r07rl7T  = 03231202132102  0; 

here  a304  and  b304  are  empty.  Elements  have  been  underlined  for  the  next  step. 

Thus  we  can  compute  the  delta  sequences  in  memory  as  follows.  Here  p[j]  = jnn 
for  1 < j < n;  sk  = 5k,  tk  = ek,  and  uk  = [5*,  and  ek  are  underlined],  for  0 < k < 2n—  1. 

XI.  [Initialize.]  Set  n 4-  1,  p[0]  4-  0,  s0  4-  t0  4-  u0  4-  0. 

X2.  [Advance  n .]  Perform  Algorithm  Y below,  which  computes  the  arrays  s',  t', 
and  u for  the  next  value  of  n;  then  set  n 4-  n + 1. 

X3.  [Ready?]  If  n is  sufficiently  large,  the  desired  delta  sequence  An  is  in  array  s'; 
terminate.  Otherwise  keep  going. 

X4.  [Compute  tt„.]  Set  p'[0]  = n - 1,  and  p'[j]  = p[p[j  - 1]]  for  1 < j < n. 

X5.  [Prepare  to  advance.]  Set  p[j)  4-  p'[j]  for  0 < j < n;  set  sk  4-  s'k,  tk  4-  t'k, 

and  uk  4-  u'k  for  0 < k < 2n  - 1.  Return  to  X2.  | 

In  the  following  steps,  “Transmit  stuff]/,  j)  while  Uj  = 0”  is  an  abbreviation  for  “If 
Uj  = 0,  repeatedly  stuff]/,  j),  l 4-  / + 1,  j 4-  j + 1,  until  Uj  ^ 0.” 

Yl.  [Prepare  to  compute  An+1.]  Set  j 4-  k 4-  / 4-  0 and  u2"-i  4 1. 

Y2.  [Advance  j]  Transmit  s'z  4-  Sj  and  u\  4-  0 while  u , = 0.  Then  go  to  Y5  if 

Uj  < 0. 

Y3.  [Advance  j and  k.]  Set  s;  4—  Sj,  u\  4—  1,  / 4—  / + 1,  j 4—  j + 1.  Then  transmit 
s',  4-  sj  and  u\  4-  0 while  Uj  = 0.  Then  set  s',  4-  n,  u',  4-  0,  / 4-  / + 1.  Then 
transmit  s',  4-  p[tk]  and  u',  4-  0 while  uk  = 0.  Then  set  s',  4-  p[tk],  u,  4-  1, 

/ 4-  / + 1,  k 4-  k + 1.  And  once  again  transmit  s',  4-  p[/jt]  and  u ',  4-  0 while 
uk  = 0. 

Y4.  [Done  with  An+i?]  If  uk  < 0,  go  to  Y6.  Otherwise  set  s',  4—  n,  u',  4—  0, 
l l + 1,  j 4—  j + 1,  k 4—  k + 1,  and  return  to  Y2. 

Y5.  [Finish  An+4.]  Set  s',  4—  n,  u',  4—  1,  / 4—  / + 1.  Then  transmit  s'(  4—  p[i[fc]]  and 

u';  4—  0 while  uk  = 0. 

Y6.  [Prepare  to  compute  En+1.]  Set  j 4-  k 4-  / 4-  0.  Transmit  t',  4-  tk  while 
uk  = 0.  Then  set  4—  n,  / 4—  / + 1. 

Y7.  [Advance  j.]  Transmit  i'(  4—  p[s3]  while  Uj  = 0.  Then  terminate  if  Uj  < 0; 
otherwise  set  i'z  4-  n,  / 4-  / + 1,  j 4-  j + 1,  k 4-  fc  + 1. 

Y8.  [Advance  &.]  Transmit  t',  4-  tk  while  uk  = 0.  Then  go  to  Y10  if  uk  < 0. 

Y9.  [Advance  k and  j .}  Set  t',  4—  Z 4—  / + 1,  k 4—  k + 1.  Then  transmit  f'(  4—  ij, 
while  uk  = 0.  Then  set  /[  4—  n,  / 4—  / + 1.  Then  transmit  t',  4—  p[sj]  while 
ui  — 0.  Then  set  l'(  4—  p[sj\,  / 4-  / + 1,  j 4—  j + 1.  Return  to  Y7. 

\ 10.  [Finish  F/n+i-]  Set  t';  4—  n,  Z 4—  Z + l.  Then  transmit  f';  4—  pfs^]  while  Uj  = 0.  | 
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To  generate  the  monotonic  Savage-Winkler  code  for  fairly  large  n,  one  can  first  generate 
Aio  and  E\q,  say,  or  even  A20  and  E20.  Using  these  tables,  a suitable  recursive  pro- 
cedure will  then  be  able  to  reach  higher  values  of  n with  very  little  computational 
overhead  per  step,  on  the  average. 

74.  If  the  monotonic  path  is  Vo,  . . . , v2n-i  and  if  vk  has  weight  j,  we  have 

"9/)+  (O'  + "M)  mod  2)  <k<  2V'(\  U ) + (0  + u(vo))  mod  2)  - 2. 
<>oKj  7^oK^~2tJ 

Therefore  the  maximum  distance  between  vertices  of  respective  weights  j and  j + 1 
is  2(("_11)  + ("J1)  + (j+i))  - 1-  The  maximum  value,  approximately  3 • 2n+1/y/2n n, 
occurs  when  j is  approximately  n/2.  [This  is  only  about  three  times  the  smallest  value 
achievable  in  any  ordering  of  the  vertices,  which  is  Y%Zo  (b/2j)  by  exercise  7.10-00.] 

75.  The  trend-free  canonical  delta  sequences  all  turn  out  to  yield  Gray  cycles : 

0123012421032101210321040123012(1) 

0123012421032101301230141032103(1) 

0123012421032102032103242301230(2) 

0123012421032102123012343210321(2) 

0123012423012302012301242301230(2) 

0123410121030143210301410123410(3) 

(The  second  and  fourth  of  these  are  cyclically  equivalent.) 

76.  If  wo,  ... , v2u  — 1 is  trend-free,  so  is  the  (n  + l)-bit  cycle  0uo,  lu0,  l«i,  Otq,  0v2,  lv2, 
...,  l"2«-i , 0w2"-i.  Figure  34(g)  shows  a somewhat  more  interesting  construction, 
which  generalizes  the  first  solution  of  exercise  75  to  an  (n  + 2)-bit  cycle 

oor"i?,  oir'*,  nr',  iof",  lor,  nr'",  oir'"fl,  oorH 

where  T is  the  n-bit  sequence  g(  1),  ...,  g( 2n_1)  and  T'  = F©g(l),  T"  = T©t;(2n~1), 
F'"  = r © g(2"  1 + 1).  [An  n-bit  trend-free  design  that  is  almost  a Gray  code,  having 
just  four  steps  in  which  u(vk  ® ufc+1)  = 2,  was  found  for  all  n > 3 by  C.  S.  Cheng, 
Proc.  Berkeley  Conf.  Neyman  and  Kiefer  2 (Hayward,  Calif.:  Inst,  of  Math.  Statistics, 
1985),  619-633.] 

77.  Replace  the  array  (o„_i, . . . , o0)  by  an  array  of  sentinel  values  (»„_!,...,  s0),  with 
sj  <-  mj  - 1 in  step  HI.  Set  aj  <-  (a2  + 1)  mod  rrij  in  step  H4.  If  aj  = Sj  in  step  H5, 
set  Sj  <-  (sj  - 1)  mod  mj,  /j  <-  fj+1,  fJ+1  t-  j + 1. 

78.  For  (50),  notice  that  Bj+ 1 is  the  number  of  times  reflection  has  occurred  in 
coordinate  j,  because  we  bypass  coordinate  j on  steps  that  are  multiples  of  mj  ...  m0. 
Hence,  if  bj  < m,j  — 1,  an  increase  of  bj  by  1 causes  aj  to  increase  or  decrease  by  1 as 
appropriate.  Furthermore,  if  bi  = rrn  - 1 for  0 < i < j,  changing  all  these  b,  to  0 when 
incrementing  bj  will  increase  each  of  Bo,  . . . , Bj  by  1,  thereby  leaving  the  values  do, 

. . . , aj- 1 unchanged  in  (50). 

For  (51),  note  that  B3  = mjBj+1+bj  = mjBj+1  + aj  + (mj  - l)Bj+1  = a3  + BJ+1 
(modulo  2);  hence  Bj  = aj  + aj+i  + • • • , and  (51)  is  obviously  equivalent  to  (50). 

In  the  modular  Gray  code  for  general  radices  (m„_  1, . . . , mo),  let 

ffln-l,  • • • , a2 , <X1,  do 

mn- 1,  . . . , m2,  mi,  m0 . 


= 


7.2.1. 1 


ANSWERS  TO  EXERCISES  693 


when  k is  given  by  (46).  Then  aj  = ( bj  — Bj+i)  rnodm,,  because  coordinate  j has 
increased  modulo  m,  exactly  Bj  - Bj+1  times  if  we  start  at  (0,  ...,0).  The  inverse 
function,  which  determines  the  b’s  from  the  modular  Gray  a’s,  is  bj  = (aj  + aj+1  + 
aj+ 2 + • • • ) mod  rrij  in  the  special  case  that  each  rrij  is  a divisor  of  rrij+i  (for  example, 
if  all  rrij  are  equal).  But  the  inverse  has  no  simple  form  in  general;  it  can  be  computed 
by  using  the  recurrences  bj  = (aj  + %i)modmj,  Bj  = m-jBj+i  + bj  for  j = n - 1, 
. . . , 0,  starting  with  Bn  = 0. 

[Reflected  Gray  codes  for  radix  m > 2 were  introduced  by  Ivan  Flores  in  IRE 
Trans.  EC-5  (1956),  79-82;  he  derived  (50)  and  (51)  in  the  case  that  all  rrij  are 
equal.  Modular  Gray  codes  with  general  mixed  radices  were  implicitly  discussed  by 
Joseph  Rosenbaum  in  AMM  45  (1938),  694-696,  but  without  the  conversion  formulas; 
conversion  formulas  when  all  rrij  have  a common  value  m were  published  by  Martin 
Cohn,  Information  and  Control  6 (1963),  70-78.] 

79.  (a)  The  last  n-tuple  always  has  o„_i  = mn-i  - 1,  so  it  is  one  step  from  (0, . . . , 0) 
only  if  mn~  1 = 2.  And  this  condition  suffices  to  make  the  final  n-tuple  (1,0, ...  ,0). 
[Similarly,  the  final  subforest  output  by  Algorithm  K is  adjacent  to  the  initial  one  if 
and  only  if  the  leftmost  tree  is  an  isolated  vertex.] 

(b)  The  last  n-tuple  is  (mn-i  - 1, 0, . . . , 0)  if  and  only  if  m„_i  . . . mj+ 1 mod  mj  = 0 
for  0 < j < n — 1,  because  bj  = nij  — 1 and  Bj  = mn- 1 ...mj  — 1. 

80.  Run  through  p[l  . . . p“*  using  reflected  Gray  code  with  radices  mj  = + 1. 

81.  The  first  cycle  contains  the  edge  from  (x,  y)  to  (x,  (y  + 1)  modm)  if  and  only  if 
(x  + y)  mod  m ^ m — 1 if  and  only  if  the  second  cycle  contains  the  edge  from  ( x , y)  to 
((x  + 1)  mod  m,  y). 

82.  There  are  two  4-bit  Gray  cycles  (uo, . . . , u 15)  and  (vo, . . . , V15)  that  cover  all  edges 
of  the  4-cube.  (Indeed,  the  non-edges  of  classes  A,  B,  D,  H,  and  I in  exercise  56  form 
Gray  cycles,  in  the  same  classes  as  their  complements.)  Therefore  with  16-ary  modular 
Gray  code  we  can  form  the  four  desired  cycles  (u0u0,u0ui, ... , u0ui5,  nmis, . . . , ui5u0), 

(uoUo,UlUo,  . ■ ■ ,Ul5Uo,Ui5Ul,  . . . ,UoU  15),  (voVo,  ■ . . ,VisVo),  (V0V0,  ■ . . ,VoVi5). 

In  a similar  way  we  can  show  that  n/2  edge-disjoint  n-bit  Gray  cycles  exist  when 
n is  16,  32,  64,  etc.  [Abhandlungen  Math.  Sem.  Hamburg  20  (1956),  13-16.]  J.  Aubert 
and  B.  Schneider  [Discrete  Math.  38  (1982),  7-16]  have  proved  that  the  same  property 
holds  for  all  even  values  of  n > 4,  but  no  simple  construction  is  known. 

83.  Mark  Cooke  found  the  following  totally  unsymmetric  solution  in  December,  2002: 

(1)  2737465057320265612316546743610525106052042416314372145101421737 
2506246064173213107351607103156205713172463452102434643207054702 
4147356146737625047350745130620656415073123731427376432561240264 
3016735467532402524637475217640270736065105215106073575463253105; 

(2)  0616713417232175171671540460247164742473202531621673531632736052 
6710141503047313570615453627623241426465272021632075363710750740 
3157674761545652756510451024023107353424651230406545306213710537 
2620501752453406703437343531502602463045627674152752406021610434; 

(3)  3701063751507131236243765735103012042353747207410473621617247324 
6505132565057121565024570473247421427640231034362703262764130574 
0560620341745613151756314702721725205613212604053506260460173642 
6717641743513401245360241730636545061563027414535676432625745051; 
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(4)  6706546435672147236210405432054510737405170532145431636430504673 
4560621206416201320742373627204506473140171020514126107452343672 
1320452752353410515426370601363567307105420163151210535061731236 
4272537165617217542510760215462375452674257037346403647376271657. 


(Each  of  these  delta  sequences  should  start  from  the  same  vertex  of  the  cube.)  Is  there 
a symmetrical  way  to  do  the  job? 


84.  Calling  the  initial  position  (2,2),  the  8-step  solution  in  Fig.  A 1 shows  how  the 
sequence  progresses  down  to  (0,0).  In  the  first  move,  for  example,  the  front  half  of  the 
cord  passes  around  and  behind  the  right  comb,  then  through  the  large  right  loop.  The 
middle  line  should  be  read  from  right  to  left.  The  generalization  to  n pairs  of  loops 
would,  similarly,  take  3n  — 1 steps. 


[The  origin  of  this  delightful  puzzle  is  obscure.  The  Book  of  Ingenious  & Diabolical 
Puzzles  by  Jerry  Slocum  and  Jack  Botermans  (1994)  shows  a 2-loop  version  carved  from 
horn,  probably  made  in  China  about  1850  [page  101],  and  a modern  6-loop  version 
made  in  Malaysia  about  1988  [page  93].  Slocum  also  owns  a 4- loop  version  made  from 
bamboo  in  England  about  1884.  He  has  found  it  listed  in  Henry  Novra’s  Catalogue  of 
Conjuring  Tricks  and  Puzzles  (1858  or  1859)  and  W.  H.  Cremer’s  Games,  Amusements, 
Pastimes  and  Magic  (1867),  as  well  as  in  Hamley’s  catalog  of  1895,  under  the  name 
“Marvellous  Canoe  Puzzle.”  See  also  U.S.  Patents  2091191  (1937),  D172310  (1954), 
3758114  (1973),  D406866  (1999).  Dyckman  noted  its  connection  to  reflected  Gray 
ternary  in  a letter  to  Martin  Gardner,  dated  2 August  1972.] 

85.  By  (50),  element  [*’  bt, } of  T;T'  is  ctaa'a,  if  jQJ’  ])  = [“’  “,  ] in  the  reflected  Gray 
code  for  radices  We  can  now  show  that  element  [£’  J,’  b,,  ] of  both  (F  ( T')  l F" 
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and  T i (T'  1 r")  is  aaa>""  ifff([^i 
radices  See  exercise  4.1-10, 


/(»/])  = [“’  “/  ] in  the  reflected  Gray  code  for 

and  note  also  the  mixed-radix  law 


m i — 1 — xi , . . . , mn  — 1 — xn ' 

TTl\  , • • • 5 77in 

In  general,  the  reflected  Gray  code  for  radices  (mi , . . . , mn)  is  (0, . . . , mi  - 1)  ; • • • ; 
(0, . . . ,m„  - 1).  [ Information  Processing  Letters  22  (1986),  201-205.] 

86.  Let  Ymn  be  the  reflected  m- ary  Gray  code,  which  can  be  defined  by  Fmo  = e and 

(0,1,.,.,  171  1 ) 1 TmTl , Tl  > 0. 

This  path  runs  from  (0, 0, ... , 0)  to  (m—  1, 0, . . . , 0)  when  m is  even.  Consider  the  Gray 
path  nmn  defined  by  IImo  = 0 and 

- 171  ~ L)  > nmn;  mF(L  + l )n»  if  m is  odd; 

,m)inmn,  mr^„,  if  m is  even. 

This  path  traverses  all  of  the  (m  + l)n  — mn  nonnegative  integer  n-tuples  for  which 
max(di, . . . , an)  = m,  starting  with  (0, . . . , 0,  m)  and  ending  with  (m,  0, . . . , 0).  The 
desired  infinite  Gray  path  is  n0n  i in  n2n,  11^,,,  .... 

87.  This  is  impossible  when  n is  odd,  because  the  n-tuples  with  max(|ai|, . . . , |a„|)  = 1 
include  | (3n  + 1 ) with  odd  parity  and  | (3n  — 3)  with  even  parity.  When  n = 2 we 
can  use  a spiral  E0,  Ei,  E2,  . . . , where  Em  winds  counterclockwise  from  (m,  1 — m) 
to  (m,  — m)  when  m > 0.  For  even  values  of  n > 2,  if  Tm  is  a path  of  n-tuples  from 
(m,  1 — m,  m — 1,1  — m, . . . , m — 1,1  — m)  to  (m,  — m,  m,  — m, . . . , m,  — m),  we  can  use 
Em  l (To, . . . , Tm_i)  , (Eo, . . . , Em)s  l Tm  for  (n  + 2)-tuples  with  the  same  property, 
where  l is  the  dual  operation 

T l r'  = (a0Qo, . . • ,at_1a'0,at_1a1, . . . ,a0ai,a0a2l  • • • , at-i«2,  a*-i«3!  • • • )• 

[Infinite  n-dimensional  Gray  codes  without  the  magnitude  constraint  were  first  con- 
structed by  E.  Vazsonyi,  Acta  Litterarum  a c Scientiarum,  sectio  Scientiarum  Mathe- 
maticarum  9 (Szeged:  1938),  163-173.] 

88.  It  would  visit  all  the  subforests  again,  but  in  reverse  order,  ending  with  (0, . . . , 0) 
and  returning  to  the  state  it  had  after  the  initialization  step  Kl.  (This  reflection 
principle  is,  in  fact,  the  key  to  understanding  how  Algorithm  K works.) 

89.  (a)  Let  Mo  = t , M\  = •,  and  Mn+2  = ■ This  construction  works 

because  the  last  element  of  M„+1  is  the  first  element  of  M„+1,  namely  a dot  followed 
by  the  first  element  of  Mn- 

(b)  Given  a string  d\  . . .di  where  each  d}  is  • or  — , we  can  find  its  successor  by 
letting  k = l — [di  = •]  and  proceeding  as  follows:  If  k is  odd  and  dk  = •,  change  dkdk+i 
to  — ; if  k is  even  and  dk  = — , change  dk  to  • • ; otherwise  decrease  k by  1 and  repeat 
until  either  making  a change  or  reaching  k = 0.  The  successor  of  the  given  word  is 


n 


ra(n-j-l) 


(0,1,... 

(0,1,... 


Xu  . 

• ) xn 

.7711,  . 

• , rrin 

90.  A cycle  can  exist  only  when  the  number  of  code  words  is  even,  since  the  number 
of  dashes  changes  by  ±1  at  each  step.  Thus  we  must  have  n mod  3 = 2.  The  Gray 
paths  Mn  of  exercise  89  are  not  suitable;  they  begin  with  (•  — )Ln/3J  .nmod3  ancj  elKj 


with  (— .)Ln/3J  .[n  mod  3=1]  _[n  mod  3=2].  gut  M 
the  Morse  code  graph  when  n = 3k  + 2. 


3fc  + l •) 


M“k  — is  a Hamiltonian  cycle  in 
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91.  Equivalently,  the  n-tuples  01020304  . . . have  no  two  consecutive  Is.  Such  n-tuples 
correspond  to  Morse  code  sequences  of  length  n + 1,  if  we  append  0 and  then  represent 
• and  — respectively  by  0 and  10.  Under  this  correspondence  we  can  convert  the  path 
Mn+i  of  exercise  89  into  a procedure  like  Algorithm  K,  with  the  fringe  containing  the 
indices  where  each  dot  or  dash  begins  (except  for  a final  dot): 

Ul.  [Initialize.]  Set  a j 4-  [((j  - 1)  mod6)/3j  and  fj  4-  j for  1 < j < n.  Also  set 
fo  <-  0,  r0  4-  1,  h 4-  0,  rj  4—  j + ( j mod  3)  and  l]+i]  mod  3)  4-  j for  1 < j < n, 
except  if  j + (j  mod  3)  > n set  r,  4—  0 and  lo  4-  j.  (The  “fringe”  now  contains 
1.2,  4,  5,  7,8,....) 

U2.  [Visit.]  Visit  the  n-tuple  (01, . . . , a„). 

U3.  [Choose  p.}  Set  q 4-  lo,  p 4-  fq,  fq  4-  q. 

U4.  [Check  ap.]  Terminate  the  algorithm  if  p = 0.  Otherwise  set  ap  4-  1 — ap  and 
go  to  U6  if  ap  + p is  now  even. 

U5.  [Insert  p+1.]  If  p < n,  set  q 4-  rp,  lq  4-  p+1,  rp+1  4-  q,  rp  4-  p+ 1,  lp+1  4-  p. 
Go  to  U7. 

U6.  [Delete  p + 1.]  If  p < n,  set  q 4—  rp+i,  rp  4—  q,  lq  4-  p. 

U7.  [Make  p passive.]  Set  fp  4-  fip  and  flp  4-  lp.  Return  to  U2.  | 

This  algorithm  can  also  be  derived  as  a special  case  of  a considerably  more  general 
method  due  to  Gang  Li,  Frank  Ruskey,  and  D.  E.  Knuth,  which  extends  Algorithm  K 
by  allowing  the  user  to  specify  either  ap  > aq  or  ap  < aq  for  each  (parent,  child)  pair 
(p,  <?)•  [See  Knuth  and  Ruskey,  Lecture  Notes  in  Computer  Science  2635  (2004),  183- 
204.]  A generalization  in  another  direction,  which  produces  all  strings  of  length  n that 
do  not  contain  certain  substrings,  has  been  discovered  by  M.  B.  Squire,  Electronic  J. 
Combinatorics  3 (1996),  #R17,  1-29. 

Incidentally,  it  is  amusing  to  note  that  the  mapping  k g(k)/2  is  a one-to-one 
correspondence  between  all  binary  n-tuples  with  no  odd-length  runs  of  Is  and  all  binary 
n-tuples  with  no  two  consecutive  Is. 

92.  Yes,  because  the  digraph  of  all  (n-l)-tuples  (xi, . . . ,xn_i)  with  xi, . . .,x„-i  < m 
and  with  arcs  (xi, . . . , xn-i)  —t  (x2,...,xn)  whenever  max(xi, . . . , xn)  = m is  con- 
nected and  balanced;  see  Theorem  2.3.4.2G.  Indeed,  we  get  such  a sequence  from 
Algorithm  F if  we  note  that  the  final  kn  elements  of  the  prime  strings  of  length 
dividing  n,  when  subtracted  from  m — 1,  are  the  same  for  all  m>  k.  When  n = 4,  for 
example,  the  first  81  digits  of  the  sequence  $4  are  2 - aR  = 0 0001  01 0011  . . . , where 
a is  the  string  (62).  [There  also  are  infinite  m-ary  sequences  whose  first  mn  elements 
are  de  Bruijn  cycles  for  all  n,  given  any  fixed  m > 3.  See  L.  J.  Cummings  and  D. 
Wiedemann,  Cong.  Numerantium  53  (1986),  155-160.] 

93.  The  cycle  generated  by  /()  is  a cyclic  permutation  of  al,  where  a has  length  mn  — 1 
and  ends  with  ln_1.  The  cycle  generated  by  Algorithm  R is  a cyclic  permutation  of  7 = 
Co  . • . cmn+i_1,  where  ck  = (co  + 6o4 h&fc-i)  mod  m and  b0  . . . bmn+i_1  = f3  = am lm. 

If  x0  . . . x„  occurs  in  7,  say  Xj  = ck+j  for  0 < j < n,  then  y3  = bk+j  for  0 < j < n, 
where  yj  = (xJ+i  — Xj)  mod  m.  [This  is  the  connection  with  modular  m-ary  Gray  code; 
see  exercise  78.]  Now  if  y0  . ..yn-i  = ln  we  have  mn+1  - m - n < k < mn+1  - n; 
otherwise  there  is  an  index  k'  such  that  — n < k'  < mn  — n and  y0  . . ,yn- 1 occurs  in 
0 at  positions  k = (k'  + r(mn  - 1))  modm"+1  for  0 < r < m.  In  both  cases  the  m 
choices  of  k have  different  values  of  x0,  because  the  sum  of  all  elements  in  q is  m - 1 
(modulo  m)  when  n>  2.  [Algorithm  R is  valid  also  for  n = 1 if  m mod  4/2,  because 
m _L  ol  in  that  case.] 
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94.  0010203041121314223243344.  (The  underlined  digits  are  effectively  inserted 
into  the  interleaving  of  00112234  with  34.  Algorithm  D can  be  used  in  general  when 
n = 1 and  r = m — 2 > 0;  but  it  is  pointless  to  do  so,  in  view  of  (54).) 

95.  (a)  Let  C0C1C2  . . . have  period  r.  If  r is  odd  we  have  p = q = r,  so  r = pq  only  in 
the  trivial  case  when  p = q = 1 and  a0  = b0.  Otherwise  r/2  = lcm(p,  q)  = pq/gcd(p,  q) 
by  4.5.2-(io),  hence  gcd (p,q)  = 2.  In  the  latter  case  the  2n-tuples  c*c;+ 1 . . . cj+2n-i 
that  occur  are  ajbk  ■ ■ ■ flj+n-i&fc+n-i  for  0 < j < p,  0 < k < q,  j = k (modulo  2),  and 
b^d-j  • • ■ ^fc+n  — l&j+n  — 1 for  0 < j < p,  0 < k < q,  j ^ k (modulo  2). 

(b)  The  output  would  interleave  two  sequences  aodi  . . . and  60f>i  • • • whose  periods 
are  respectively  mn  + r and  mn  - r;  the  a’s  are  the  cycle  of  /()  with  xn  changed  to  xn+1 
and  the  6’s  are  the  cycle  of  /'()  with  xn  changed  to  a;"-1,  for  0 < x < r.  By  (58)  and 
part  (a),  the  period  length  is  m2n  — r2,  and  every  2n-tuple  occurs  with  the  exception 
of  ( xy)n  for  0 < x,  y < r. 

(c)  The  real  step  D6  alters  the  behavior  of  (b)  by  going  to  D3  when  t > n,  t'  = n, 
and  0 < x'  = x < r;  this  change  emits  an  extra  x at  the  time  when  x2n~ 1 has  just 
been  output  and  b is  about  to  be  emitted,  where  b is  the  digit  following  xn  in  the  cycle. 
D6  also  allows  control  to  pass  to  D7  and  then  D3  with  t!  = n in  the  case  that  t > n and 
x < x'  < r;  this  behavior  emits  an  extra  x' x at  the  time  when  (*®')n_1a;  has  just  been 
output  and  b will  be  next.  These  r2  extra  digits  provide  the  r2  missing  2n-tuples  of  (b). 

96.  (a)  For  example,  when  n = 5 the  top-level  coroutine  of  type  R invokes  a coroutine 
of  type  D for  n = 4,  which  invokes  two  of  type  S for  n = 2;  hence  R5  = D&  = 1 
and  S5  = 2.  The  recurrences  R2  = 0,  R2n+ 1 = 1 + R2n,  Rin  = 2 Rn,  D2  = 0, 
D2n+i  = D 2n  = 1 + 2Dn,  S 2 = 1,  S^n+i  = S2n  = 2 Sn  have  the  solution  Rn  = n — 2 Sn, 
Dn  = Sn  - 1,  Sn  = 2^g nJ_1.  Thus  Rn  + Dn  + S„  = n-  1. 

(b)  Each  top-level  output  usually  involves  [lg  nj  — 1 D-activations  and  v(n ) — 1 
R-activations,  plus  one  basic  activation  at  the  bottom  level.  But  there  are  exceptions: 
Algorithm  R might  invoke  its  /()  twice,  if  the  first  activation  completed  a sequence  In; 
and  sometimes  Algorithm  R doesn’t  need  to  invoke  /()  at  all.  Algorithm  D might 
invoke  its  /'()  twice,  if  the  first  activation  completed  a sequence  (x')n  for  x < r;  but 
sometimes  Algorithm  D doesn’t  need  to  invoke  either  /()  or  /'(). 

Algorithm  R completes  a sequence  xn+1  if  and  only  if  its  child  /()  has  just 
completed  a sequence  0n.  Algorithm  D completes  a sequence  x2n  for  x < r if  and 
only  if  it  has  just  jumped  from  D6  to  D3  without  invoking  any  child. 

From  these  observations  we  can  conclude  that  no  exceptions  arise  at  any  level 
when  the  coroutine  for  an  mn-cycle  produces  the  final  digit  of  a run  xn,  or  the  first 
digit  following  such  a run.  Hence  the  worst  case  occurs  when  the  top-level  coroutine 
activates  a subcoroutine  twice,  making  2[lgnJ  + 2u(n)  — 3 activations  altogether. 

97.  (a)  (0011),  (00011101),  (0000101001111011),  and  (00000110001011011111 
001110101001).  Thus  j2  = 2,  j3  = 3,  j4  = 9,  j5  = 15. 

(b)  We  obviously  have  f„+i{k)  = T,fn(k)  mod  2 for  0 < k < jn  + n.  The  next 
value,  /n+i (jn  + n),  depends  on  whether  step  R4  jumps  to  R2  after  computing  y = 
fn(jn  + n — 1).  If  it  does  (namely,  if  fn+i(jn  + n — 1)  ^ 0),  we  have  fn+i{k)  = 
1 + E/n(fc  + 1)  for  jn  +n  < k < 2 n+jn  +n;  otherwise  we  have  fn+i(k)  = 1 + E fn(k  — 1) 
for  those  values  of  k.  In  particular,  fn+i(k)  = 1 when  2n  < k + 5n  < 2n+n.  The  stated 
formula,  which  has  simpler  ranges  for  the  index  k,  holds  because  l+E/n(/c±l)  = E/n(fc) 
when  jn  < k < jn  + n or  2n  + jn  < k < 2 n+  jn  + n. 
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(c)  The  interleaved  cycle  has  cn(2k)  = f+  ( k ) and  c„(2fc  + 1)  = /“(fc),  where 

f+(k)={fn{k~l)'  if0  < k<Jr,  + M _ f /n(fc+l),  if  0 < fc  < jn; 

\/n(fc-2),  ifj„  + l<fc<2"  + 2;  \/„(Jfe  + 2),  if  jn  < fc  < 2"  — 2; 

fn(k)  = fn{k  mod  (2n+  2)),  /„  (fc)  = /„  (k  mod  (2n—  2)).  Therefore  the  subsequence 
l2n_1  begins  at  position  kn  = (2n_1  - 2)(2n+  2)  + 2 jn  + 2 in  the  cn  cycle;  this  will 
make  j2n  odd.  The  subsequence  (01)n-10  begins  at  position  ln  = (2n_1  + 1 )(jn  - 1)  if 
jn  mod  4 = 1,  at  ln  = (2n_1  + l)(2n  + jn  - 3)  if  mod  4 = 3.  Also  k2  = 6,  l2  = 2. 

(d)  Algorithm  D inserts  four  elements  into  the  cn  cycle;  hence 


when  jn  mod  4<3  ( ln<k„ ):  when  jn  mod  4 = 3 ( kn<ln ): 

f c„(k- 1),  if  0<k<ln  + 2\  f c„(k- 1),  ifO<fc<fcn  + l; 

/2n(&)=\  cn(k  — 3),  if  ln-\-2<k<kn  + Z\  = < cn(k  — 2),  if  fc„  + l<fc</n  + 3; 
l cn(fc  — 4),  if  fc„  + 3<fc<22n;  lc„(fc-4),  if  ln  + 3<k<22n. 

(e)  Consequently  j2n  = kn  + 1 + 2 [jn  mod  4 < 3].  Indeed,  the  elements  preceding 
l2n  consist  of  2n_2  - 1 complete  periods  of  /+()  interleaved  with  2"-2  complete 
periods  of  f~  (),  with  one  0 inserted  and  also  with  10  inserted  if  ln  < kn,  followed 
by  /n(l)/n(l)/n(2)/n(2) . . . fn(jn  — 1 )fn(jn  — 1).  The  sum  of  all  these  elements  is  odd, 
unless  ln  < kn;  therefore  S2n  = 1 - 2 [jn  mod  4 = 3]. 

Let  n = 2 *q,  where  q is  odd  and  n > 2.  The  recurrences  imply  that,  if  q — 1,  we 
have  jn  = 2n  1+  bt  where  bt  = 2*/ 3 — ( — 1)4/3.  And  if  q > 1 we  have  jn  = 2T1_1±  bt+ 2, 
where  the  + sign  is  chosen  if  and  only  if  [lggj  + [\Aq/2^sq^ \ =5]  is  even. 

98.  If  f(k)  = g(k)  when  k lies  in  a certain  range,  there’s  a constant  C such  that 
E/(fc)  = C + E g(k)  for  k in  that  range.  We  can  therefore  continue  almost  mindlessly 
to  derive  additional  recurrences:  If  n > 1 we  have 


T,f2n{k),  when  jn  mod  4 < 3 (l„  < kn):  when  jn  mod  4 = 3 {kn  < ln ): 

J-Ecnffe-l),  if  0 < fc  < ln+2;  f Ecn(fc—  1),  if  0 < k < fcn  + l; 

= < 1 + Ec„(fc  — 3),  if  + 2 < k < fcn  + 3;  = l l + Ecn(fc-2),  if  k„  +1  < k < /n  + 3; 
l Ecn(fc  — 4),  if  fcn  + 3<fc<22";  [ Ecn(fc-4),  if  ln  + 3 < k < 22n. 

^c„(k)  = E/+([fc/2])  + E/-(Lfc/2j). 

f E/n(fc  — 1),  if  0<fc<jn  + l; 

l l + E/„(fc-2),  if  jn  + l<fc<2n  + 2; 

L*/(2"  ± 2)J  + E f±(k  mod  (2n  ± 2)) 


S/+(*)E 


E f-(k)=^fn(k+1)' 


S/n(fc) 


if  0 <k<jn; 

\l  + E/„(fc  + 2),  if  in  <fc<2n  — 2; 

E/n(fc)  = E/n(fc  mod  2n). 


El/2n+l(&) 


EE /2„(fc),  if  0 < fc  < j2n  or  22n  +i2n  < k < 22n+1; 

1 + fc  + EE f2n(k  + <52n),  if  j2n  < k < 22n  + j2n- 


EE f2n(k),  when  jn  mod4<3  {l„<kn):  when  jn  mod 4=3  ( kn<ln ): 

J"  EEc„(fc-l),  if  0<fc<Zrj  + 2;  |'EEc„(fc-l),  if  0<fc<ifc„  + l; 

= < 1 + /c  + EEc„(fc  — 3),  in„  + 2<fc<fcn  + 3;  l + fc  + EEcn(fc-2),  if  fc„  + l<fc</„  + 3; 

l EEcn(fc— 4),  if  fcn  + 3<fc<22n;  { l+EEc„(fc-4),  if  ln+3</c<22’1. 

EE/2n(fc)  = [j„  mod  4 < 3]  |fc/22nJ  + EE/2n(fc  mod  22n). 

And  then,  aha,  there  is  closure: 


EEcn(2fc)  = T,fn(k), 


EEcn(2fc  + 1)  = Tif~(k). 
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If  n = 2fg  where  q is  odd,  the  running  time  to  evaluate  fn(k)  by  this  system  of 
recursive  formulas  is  0(t  + S(q)),  where  S(l)  = 1,  S(2k)  - l + 2S(fc),  and  S(2fc  + 1)  = 
1 + S(k).  Clearly  S(k)  < 2k,  so  the  evaluations  involve  at  most  0(n)  simple  operations 
on  n-bit  numbers.  In  fact,  the  method  is  often  significantly  faster:  If  we  average  S(k) 
over  all  k with  [lgfcj  = s we  get  (3S+1  - 2S+1)/2S,  which  is  less  than  3fclg(3/2)  < 3k0  b9. 

(Incidentally,  if  k = 2S+1  - 1 - (2s-ei  + 2s-®2  H b 2s_e‘)  where  0 < ex  < • • • < et, 

we  have  S(k)  ~ s + 1 -f-  et  + 2et~\  + 4et— 2 -f-  • • • 2*  1ei.) 

99.  A string  that  starts  at  position  k in  /„()  starts  at  position  k+  = k + 1 + [k  > j„]  in 
fn  ()  and  at  position  k~  = k - 1 - [fc  > jn]  in  /“(),  except  that  0n  and  ln  occur  twice 
in  fn()  but  not  at  all  in  fn(). 

To  find  7 = a0b0  ■ ■ .an~ibn-i  in  the  cycle  /2n(),  let  a = a0...a„- 1 and  p = 
bo  ■ ■ - bn- 1.  Suppose  a starts  at  position  j and  P at  position  k in  /„(),  and  assume 
that  neither  a nor  P is  0n  or  ln.  If  j+  = k+  (modulo  2),  let  1/2  be  a solution  to  the 
equation  j+  + (2n+2)x  = k~  + (2n  - 2 )y;  we  may  take  1/2  = k + (2"-  2)(2n~3(j  - k) 
mod  (2n~1  + 1))  if  j > k , otherwise  1/2  — j + (2n+  2)(2n_3(fc  - j)  mod  (2n_1  - 1)). 
Otherwise  let  (l  — l)/2  = k+  + (2n+  2)x  = j~  + (2n—  2)y.  Then  7 starts  at  position  l 
in  the  cycle  c„();  hence  it  starts  at  position  l + l + [l>kn]  + 2[l>ln]  in  the  cycle  /2n()- 
Similar  formulas  hold  when  a G {0",T1}  or  (3  € {0n,  ln}  (but  not  both).  Finally, 
02n,  l2n,  (01)n,  and  (10)n  start  respectively  in  positions  0,  j2n,  ln  + 1 + [kn  < ln],  and 

In  + 2 + \kn  < /„]. 

To  find  /3  = b0b\  . . . bn  in  /n+i()  when  n is  even,  suppose  that  the  n-bit  string 
(bo  ® 61)  • • • (b„- 1 ® bn)  starts  at  position  j in  /n().  Then  fh  starts  at  position  k = 
j — bn[j  >j„]  + 2n[j  = jn][Sn  = 1]  if  fn+i(k)  = b0,  otherwise  at  position  k + (2n—  Sn, 
5n,2n+8n)  according  as  (j<jn,j=jn,j>jn)- 

The  running  time  of  this  recursion  satisfies  T(n ) = 0(n)  + 2T([n/2j),  so  it  is 
0(n  log  n).  [Exercises  97-99  are  based  on  the  work  of  J.  Tuliani,  who  also  has  developed 
methods  for  certain  larger  values  of  m;  see  Discrete  Math.  226  (2001),  313-336.] 

100.  No  obvious  defects  are  apparent,  but  extensive  testing  should  be  done  before  any 
sequence  can  be  recommended.  By  contrast,  the  de  Bruijn  cycle  produced  implicitly 
by  Algorithm  F is  a terrible  source  of  supposedly  random  bits,  even  though  it  is  n- 
distributed  in  the  sense  of  Definition  3.5D,  because  0s  predominate  at  the  beginning. 
Indeed,  when  n is  prime,  bits  tn  + 1 of  that  sequence  are  zero  for  0 < t < (2n  — 2 )/n. 

101.  (a)  Let  P be  a proper  suffix  of  AA'  with  P < AA'.  Either  p is  a suffix  of  A',  whence 
A < A'  < P,  or  P = qA'  and  we  have  A < a < p. 

Now  A < P < AA'  implies  that  P — A7  for  some  7 < X' . But  7 is  a suffix  of  P with 
1 < ItI  = \P\  — | A|  < | A' | ; hence  7 is  a proper  suffix  of  A',  and  A'  < 7.  Contradiction. 

(b)  Any  string  of  length  1 is  prime.  Combine  adjacent  primes  by  (a),  in  any 
order,  until  no  further  combination  is  possible.  [See  the  more  general  results  of  M.  P. 
Schiitzenberger,  Proc.  Amer.  Math.  Soc.  16  (1965),  21-24.] 

(c)  If  t 0,  let  A be  the  smallest  suffix  of  Ax  ...  At.  Then  A is  prime  by  definition, 
and  it  has  the  form  P'y  where  p is  a nonempty  suffix  of  some  Xj . Therefore  At  < Xj  < 
P < P~1  = A < At,  so  we  must  have  A = At-  Remove  At  and  repeat  until  t = 0. 

(d)  True.  For  if  we  had  a = Xp  for  some  prime  A with  |A|  > |Ai|,  we  could  append 
the  factors  of  P to  obtain  another  factorization  of  a. 

(e)  3 • 1415926535897932384626433832795  ■ 02884197.  (An  efficient  algorithm 
appears  in  exercise  106.  Knowing  more  digits  of  7r  would  not  change  the  first  two 
factors.  The  infinite  decimal  expansion  of  any  number  that  is  “normal”  in  the  sense  of 
Borel  (see  Section  3.5)  factors  into  primes  of  finite  length.) 
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102.  We  must  have  1/(1  - mz)  = l/fl^iC1  ~ zn)Lm(n).  This  implies  (6o)  as  in 
exercise  4. 6. 2-4. 

103.  When  n = p is  prime,  (59)  tells  us  that  Lm(  1)  + pLm(p)  = mp , and  we  also 
have  Lm(  1)  = m.  [This  combinatorial  proof  provides  an  interesting  contrast  to  the 
traditional  algebraic  proof  of  Theorem  1.2.4F.] 

104.  The  4483  nonprimes  are  abaca,  agora,  ahead,  . . . ; the  1274  primes  are  ... , rusts, 
rusty,  rutty.  (Since  prime  isn’t  prime,  we  should  perhaps  call  prime  strings  lowly.) 

105.  (a)  Let  a'  be  a with  its  last  letter  increased,  and  suppose  a'  = /3y'  where  a = j3 7 
and  /?  / e,  7/e.  Let  9 be  the  prefix  of  a with  |0|  = l-yj . By  hypothesis  there  is  a string 
uj  such  that  aui  is  prime;  hence  9 < auj  < 70 j,  so  we  must  have  9 < 7.  Consequently 
9 < 7',  and  we  have  a'  < 7'. 

(b)  Let  a = Ai/3  = 01  ...an  where  Ai /3uj  is  prime.  The  condition  A i/3oo  < (3w 
implies  that  aj  < a,j+r  for  1 < j < n — r,  where  r = |Ai|.  But  we  cannot  have  <7,  < aJ+r; 
otherwise  a would  begin  with  a prime  longer  than  Ai,  contradicting  exercise  101(d). 

(c)  If  a is  the  n-extension  of  both  A and  A',  where  |A|  > |A'|,  we  must  have 
A = (A ')q9  where  9 is  a nonempty  prefix  of  A'.  But  then  9 < A'  < A < 9. 

106.  El.  [Initialize.]  Set  ai  «-•••■<-  a„  <-  m - 1,  an+1  « 1,  and  j <-  1. 

E2.  [Visit.]  Visit  (ai, . . . ,an)  with  index  j. 

E3.  [Subtract  one.]  Terminate  if  a3  = 0.  Otherwise  set  a3  -e-  a3  - 1,  and 
ak  <—  m — 1 for  j < k < n. 

E4.  [Prepare  to  factor.]  (According  to  exercise  105(b),  we  now  want  to  find  the 
first  prime  factor  Ai  of  m . . . a„.)  Set  j <-  1 and  k <—  2. 

E5.  [Find  the  new  j .]  (Now  a\  ...ak- 1 is  the  ( k — l)-extension  of  the  prime 
ai  . . .aj.)  If  ak-j  > ak,  return  to  E2.  Otherwise,  if  ak-j  < ak,  set  j t—  k. 
Then  increase  k by  1 and  repeat  this  step.  | 


The  efficient  factoring  algorithm  in  steps  E4  and  E5  is  due  to  J.  P.  Duval,  J.  Algorithms 
4 (1983),  363-381.  For  further  information,  see  Cattell,  Ruskey,  Sawada,  Serra,  and 
Miers,  J.  Algorithms  37  (2000),  267-282. 

107.  The  number  of  n-tuples  visited  is  Pm(n)  = J2j=i  Lm{j).  Since  Lm(n)  = \mn  + 
0{mn/2/n),  we  have  Pm(n)  = Q(m,n)  + 0(Q(v/m,n)),  where 


r\i  \ mn 

Q{m,  n)  = > — = R(m,  n); 

K Tl 

fc=l 


-n/ 2\ 


= ^E4Eff}rJ?Ti7+0(n"‘),  for  all  t. 

m - 1 ^ ^ U J m - 1 1 

]=0  1 v 


Thus  Pm(n ) ~ mn+1/((m  — l)n).  The  main  contributions  to  the  running  time  come 
from  the  loops  in  steps  F3  and  F5,  which  cost  n — j for  each  prime  of  length  j,  hence 
a total  of  nPm{n)  - £"=i  jLm{j)  = mn+1(l/((m  - l)2n)  + 0(l/(mn2))).  This  is  less 
than  the  time  needed  to  output  the  mn  individual  digits  of  the  de  Bruijn  cycle. 
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108.  (a)  If  a yf  9 ...  9,  we  have  \k+i  < /39^,  because  the  latter  is  prime. 

(b)  We  can  assume  that  /3  is  not  all  Os,  since  9J0n_3is  a substring  of  At-iAtA^  = 
89n0"l.  Let  k be  minimal  with  /3  < A*,;  then  A*,  < /3a,  so  f3  is  a prefix  of  \k ■ Since  f3 
is  a preprime,  it  is  the  |/3  [-extension  of  some  prime  3'  < f3.  The  preprime  visited  by 
Algorithm  F just  before  /31  is  (/31  — l)9n_^  : . by  exercise  106,  where  (3'  — 1 denotes  the 
decimal  number  that  is  one  less  than  /3' . Thus,  if  8'  is  not  At, _i,  the  hint  (which  also 
follows  from  exercise  106)  implies  that  A*,-i  ends  with  at  least  n — \fi'\  > n — |/3|  9s, 
and  a is  a suffix  of  Afc_i.  On  the  other  hand  if  B'  = \k-i,  a is  a suffix  of  Ait-2,  and  (3 
is  a prefix  of  A*,_  i A a,. 

(c)  If  a / 9... 9,  we  have  \k+i  < (/ 3a)d~1/39 because  the  latter  is  prime. 
Otherwise  Ajt  — i ends  with  at  least  (d  l)|/3a|  9s,  and  A^-i-i  < (/9a)d-19la“l,  so  ( a(3)d 
is  a substring  of  A*,_iAfcAfc+i. 

(d)  Within  the  primes  135899135914,  787899787979,  12999913131314,  09090911, 
089999  09  090911,  118999119  119122. 

(e)  Yes:  In  all  cases,  the  position  of  ai  . . . an  precedes  the  position  of  the  substring 
ai  . . . a„_i (an  + 1),  if  0 < an  < 9 (and  if  we  assume  that  strings  like  9J0n-J  occur  at 
the  beginning).  Furthermore  g^O"-3-1  occurs  only  after  93-10n-Ja  has  appeared  for 
1 < a < 9,  so  we  must  not  place  0 after  9J0n_3_1. 

109.  Suppose  we  want  to  locate  the  submatrix 

f (wn-l  ■ ■ ■ WlW0)2  (Xn-1  ■ ■ ■ XlX0)2 

V i,yn—i  • * • 2/12/0)2  (Zn-l  ...  Zl2o)2 

The  binary  case  n = 1 is  the  given  example,  and  if  n > 1 we  can  assume  by  induction 
that  we  only  need  to  determine  the  leading  bits  d2n-i,  «2n-2,  &2n-i>  and  b2n-2-  The 
case  n = 3 is  typical:  We  must  solve 


h = w2, 

f>4  = W2 , 

a5  ® 65  = w2 , 
a4  (B  b4  = w2 , 


64  = x2, 
K = x2, 
a4  © b4  — x2, 
© 65  = x2, 


U5  © 65  — y2, 
a4®b4  — y2 , 

h = 2/2, 

b4  = 2/2, 


a4  © 64  = z2,  if  ao  = 0,  bo  = 0; 

a 5 © b'5  — z2,  if  ao  = 0,  60  = 1; 

64  = z2,  if  a0  = 1,  b0  = 0; 

65  = z2,  if  do  — 1,  b0  = 1; 


here  b'5  — 65  © b4b2b2b\  takes  account  of  carrying  when  j becomes  j + 1. 

110.  Let  a0d1  . . . drn2_1  be  an  m-ary  de  Bruijn  cycle,  such  as  the  first  m2  elements  of 
(54).  If  m is  odd,  let  dij  = dj  when  i is  even,  dij  = a^+^i_1y2)  modm2  when  i is  odd. 
[The  first  of  many  people  to  discover  this  construction  seems  to  have  been  John  C. 
Cock,  who  also  constructed  de  Bruijn  toruses  of  other  shapes  and  sizes  in  Discrete 
Math.  70  (1988),  209-210.] 

If  m = m'm"  where  m'  J_  m",  we  use  the  Chinese  remainder  algorithm  to  define 

dij  = d'ij  (modulo  m ')  and  = d"j  (modulo  m") 

in  terms  of  matrices  that  solve  the  problem  for  m!  and  m" . Thus  the  previous  exercise 
leads  to  a solution  for  arbitrary  m. 

Another  interesting  solution  for  even  values  of  m was  found  by  Antal  Ivanyi  and 
Zoltan  Toth  [2nd  Conf.  Automata,  Languages,  and  Programming  Systems  (1988),  165- 
172;  see  also  Hurlbert  and  Isaak,  Contemp.  Math.  178  (1994),  153-160].  The  first  m2 
elements  dj  of  the  infinite  sequence 
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define  a de  Bruijn  cycle  with  the  property  that  the  distance  between  the  appearances 
of  ab  and  ba  is  always  even.  Then  we  can  let  di3  = a,  if  i + j is  even,  dtj  = a,  if  i + j 
is  odd.  For  example,  when  m = 4we  have 
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(exercise  109); 
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111.  (a)  Let  dj  — j and  0 < a3  < 3 for  1 < j < 9,  ag  / 0.  Form  sequences  Sj , tj  by  the 
rules  si  =0,t1=d1;  tj+1  = dJ+1  + 10 tj[aj  = 0]  for  1 < j < 9;  sJ+1  = Sj  + (0, 

for  cij  = (0, 1,2)  and  1 < j < 9.  Then  sio  is  a possible  result;  we  need  only  remember 
the  smallish  values  that  occur.  More  than  half  the  work  is  saved  by  disallowing  ak  = 2 
when  sk  = 0,  then  using  |s10|  instead  of  sxo.  Since  fewer  than  38  = 6561  possibilities 
need  to  be  tried,  brute  force  via  the  ternary  version  of  Algorithm  M works  well;  fewer 
than  24,000  mems  and  1600  multiplications  are  needed  to  deduce  that  all  integers  less 
than  211  are  representable,  but  211  is  not. 

Another  approach,  using  Gray  code  to  vary  the  signs  after  breaking  the  digits 
into  blocks  in  28  possible  ways,  reduces  the  number  of  multiplications  to  255,  but  at 
the  cost  of  about  500  additional  mems.  Therefore  Gray  code  is  not  advantageous  in 
this  application. 

(b)  Now  (with  / 3,000  mems  and  4900  multiplications)  we  can  reach  all  numbers 
less  than  241,  but  not  241.  There  are  46  ways  to  represent  100,  including  the  remarkable 
9 - 87  + 6 + 5 - 43  + 210. 

[H.  E.  Dudeney  introduced  his  “century”  problem  in  The  Weekly  Dispatch  (4  and 
18  June  1899).  See  also  The  Numerology  of  Dr.  Matrix  by  Martin  Gardner,  Chapter  6; 
Steven  Kahan,  J.  Recreational  Math.  23  (1991),  19-25;  and  exercise  7.2.1.6-122.] 

112.  The  method  of  exercise  111  now  needs  more  than  167  million  mems  and  10  million 
multiplications,  because  316  is  so  much  larger  than  38.  We  can  do  much  better  (10.4 
million  mems,  1100  mults)  by  first  tabulating  the  possibilities  obtainable  from  the  first 
k and  last  k digits,  for  1 < k < 9,  then  considering  all  blocks  of  digits  that  use  the  9. 
There  are  60,318  ways  to  represent  100,  and  the  first  unreachable  number  is  16,040. 

SECTION  7.2. 1.2 

1.  [J.  P.  N.  Phillips,  Comp.  J.  10  (1967),  311.]  Assuming  that  n > 3,  we  can  replace 
steps  L2-L4  by: 

L2  . [Easiest  case?]  Set  y 1—  ctn_i  and  z < — an.  If  y < z,  set  an  — i 4 — z,  an  < — y, 
and  return  to  LI. 
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L2.1'.  [Next  easiest  case?]  Set  x an- 2.  If  x > y,  go  on  to  step  L2.2'.  Otherwise 
set  ( an-2 , an-i,an)  <—  (z,  a:,  7/)  if  a;  < z,  ( y , z,  x)  if  x > z.  Return  to  LI. 

L2.2'.  [Find  j .]  Set  j «—  n — 3 and  y «—  c^-.  While  y > x,  set  j <—  j — 1,  x <—  y,  and 
y «—  aj.  Terminate  if  j = 0. 

L3'.  [Easy  increase?]  If  y < z,  set  aj  <—  z,  a-,+i  <—  y,  an  «—  x,  and  go  to  L4.1'. 

L3.l'.  [Increase  aj.]  Set  l n — 1;  if  y > a;,  repeatedly  decrease  l by  1 until  y < a;. 
Then  set  aj  <—  at  and  a;  «—  y. 

L4'.  [Begin  to  reverse.]  Set  an  «—  a^+i  and  aJ+ 1 <—  z. 

L4.l'.  [Reverse  07+2  • • .an-i]  Set  k «—  j + 2 and  l 4—  n — 1.  Then,  while  k < l, 
interchange  a*,  a;  and  set  k <—  k + 1,  Z 4—  l — 1.  Return  to  LI.  | 

The  program  might  run  still  faster  if  at  is  stored  in  memory  location  A [n  — t]  for 
0 < t < n,  or  if  reverse  colex  order  is  used  as  in  the  following  exercise. 

2.  Again  we  assume  that  a\  < <22  < • • • < a„  initially;  the  permutations  generated 
from  {1,2, 2, 3}  will,  however,  be  1223,  2123,  2213,  ...,  2321,  3221.  Let  an+ 1 be  an 
auxiliary  element,  larger  than  an. 

Ml.  [Visit.]  Visit  the  permutation  0102  . . . an. 

M2.  [Find  j.]  Set  j «—  2.  If  aj-i  > aj,  increase  j by  1 until  aj_  1 < aj.  Terminate 
if  j > n. 

M3.  [Decrease  aj.]  Set  l t—  1.  If  a;  > aj,  increase  l until  a;  < aj.  Then  swap 

CL  l Cl  j . 

M4.  [Reverse  ai  . . . aj- 1.]  Set  k 1—  1 and  l 1—  j — 1.  Then,  if  k < l,  swap  aj, 

set  k k + 1,  l «—  l — 1,  and  repeat  until  k > l.  Return  to  Ml.  | 


3.  Let  C\...Cn  = cai  . . . can  be  the  inversion  table,  as  in  exercise  5. 1.1-7.  Then 

rank(ai  . . . an ) is  the  mixed-radix  number  [®1’  C\1'  c”].  [See  H.  A.  Rothe,  Samm- 

lung  combinatorisch-analytischer  Abhandlungen  2 (1800),  263-264;  and  see  also  the 
pioneering  work  of  Sarngadeva  and  Narayana  cited  in  Section  7. 2. 1.7.]  For  example, 
314592687  has  rank  [ 9]  g]  7]  e]  5]  4]  3]  2]  ? ] = 2 • 8!  + 6!  + 5!  + 4 ■ 4!  + 1!  = 81577;  this  is 
the  factorial  number  system  featured  in  Eq.  4.1-(io). 

4.  Use  the  recurrence  rank(ai  . . . an)  = £ ]Cj=i  nAxi  < ai](ni  " „t)+rank(a2  • • • an). 
For  example,  rank(314159265)  is 


9 V 


2, 1,1, 1,2 


1,1)  +0+7  (1,1, 1,2, 1,1)  +0+5  (1,2, 1,1)  + 4 (1,1, 1,1)  +0+2  (1,1)  ~ 30991. 


5.  (a)  Step  L2  is  performed  n!  times.  The  probability  that  exactly  k comparisons  are 
made  is  —qk+ 1,  where  qt  is  the  probability  that  an-t+ 1 > • • ■ > an,  namely  [t  < n]/t\. 

Therefore  the  mean  is  ]+  k{qu  - qk+i)  = qi  + b qn  = [n!  ej/n!  - l«e-l«  1.718, 

and  the  variance  is 


Y.  k2(qk-qk+ i)-mean2  = qi-t-3q24 h(2n-l)g„-(q'i-| b qn)2  « e(3-e)  « 0.766. 

[For  higher  moments,  see  R.  Kemp,  Acta  Informatica  35  (1998),  17-89,  Theorem  4.] 
Incidentally,  the  average  number  of  interchange  operations  in  step  L4  is  therefore 
^2[k/2\  ( qk  — qk+ 1)  = <72  + <?4  + • • • ~ cosh  1 — 1 = (e  + e_1  — 2)/2  w 0.543,  a result  due 
to  R.  J.  Ord-Smith  [Comp.  J.  13  (1970),  152-155]. 

(b)  Step  L3  is  performed  only  n!  — 1 times,  but  we  will  assume  for  convenience 
that  it  occurs  once  more  (with  0 comparisons).  Then  the  probability  that  exactly  k 
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comparisons  are  made  is  Y^=k+ 1 1/o'  for  1 < k < n and  1/n!  for  k = 0.  Hence  the 
mean  is  \ E"=02  Vi-  ~ e/ 2 ~ 1.359;  exercise  1 reduces  this  number  by  |.  The  variance 


Ie2 

4C 


0.418. 


is  | E3n=o3  Vi!  + I E7=o2  Vi!  - mean2  « 

6.  (a)  Let  e„(«)  = Ek= o 2VV  then  the  number  of  different  prefixes  ai  . . . dj  is 
i-  [V]  eni  (2) . . . e„,  (2).  This  is  N = (nj times  the  probability  qn-j  that  at  least 
n-j  comparisons  are  made  in  step  L2.  Therefore  the  mean  is  jfW(eni  (2) . . . e„t  (2))  - 1, 
where  w(J2xkZk/k\)  = J2xk-  In  the  binary  case  the  mean  is  M/(")  — 1,  where  M = 

sue::?*'  0 = E?=0  r;:n  = ra  - 1 = 0(2 

(b)  If  {ai, . . . , dj}  = {ni  ■ x\, . . . , n't  ■ xt},  the  prefix  oi  . . . dj  contributes  altogether 
’52i<k<i<t(nk  ~ nk)[ni  >n'i]  to  the  total  number  of  comparisons  made  in  step  L3.  Thus 
the  mean  is  i Ei<fc<;<t  ™{fki(z)),  where 


fki{z)=(  e„m(z)\ - r)^-'je„l_1(2) 

^ l<m<f  / \r=0  ^ ' / 

m^k,  m^l 


= eni(z) . ..ent(z)(nk  - zrk{z))ri(z), 


1 / \ Crifc  — 1(^) 

where  rk(z ) = — . 

enk \z) 


In  the  two-valued  case  this  formula  reduces  to  jjw((ses(z)  - zes-i(z))en-s-1(z))  = 

MCZ)  - 1)  - UCtl)(s  - + 1)  = M-°  ~ 1 + (T))  = ^ 

7.  In  the  notation  of  the  previous  answer,  the  quantity  j^w{eni  (2) . . . en,  (2))  — 1 is 
ni  + ■ ’ • + nt  {nin2  + nin3  + ■ • ■ + nt-\nt)  + ni(ni  — 1)  + ■ ■ • + nt(nt- 1) 


n(n  — 1) 


+ • 


One  can  show  using  Eq.  1.2.9-(38)  that  the  limit  is  -1  + exp  Ejt>i  rk/k,  where  rk  = 
limi^oo(nJ  + • • • + nk)/ (ni  + • • • + nt)k . In  cases  (a)  and  (b)  we  have  rk  = [k  = 1], 
so  the  limit  is  e - 1 « 1.71828.  In  case  (c)  we  have  rk  = l/(2fc  - 1),  so  the  limit  is 
-1  + exp  J2k>i  l/(fc(2fc  - 1))  » 2.46275. 

8.  Assume  that  j is  initially  zero,  and  change  step  LI  to 

LI  . [Visit.]  Visit  the  variation  di  ...  dj.  If  j < n,  set  j t—  j + 1 and  repeat  this 
step.  | 

This  algorithm  is  due  to  L.  J.  Fischer  and  K.  C.  Krause,  Lehrbuch  der  Combinations- 
lehre  und  der  Arithmetik  (Dresden:  1812),  55-57. 

Incidentally,  the  total  number  of  variations  is  w(eni  (2)  . . . e„,  (2))  in  the  notation 
of  answer  6.  This  counting  problem  was  first  treated  by  James  Bernoulli  in  Ars 
Conjectandi  (1713),  Part  2,  Chapter  9. 

9.  Assume  that  r > 0 and  that  we  begin  with  a0  < ai  < a2  < ■ ■ ■ < an. 

Rl.  [Visit.]  Visit  the  variation  ax  . . . a.r.  (At  this  point  ar+ 1 < • • • < a„.) 

R2.  [Easy  case?]  If  ar  < a„,  interchange  ar  44  dj  where  j is  the  smallest  subscript 
such  that  j > r and  dj  > ar,  and  return  to  Rl. 

R3.  [Reverse.]  Set  (ar+i, . . . , an ) 4-  (an, . . . , ar+i)  as  in  step  L4. 

R4.  [Find  j.]  Set  j <—  r — 1.  If  aj  > dj+ 1,  decrease  j by  1 repeatedly  until 
dj  < dj+ 1.  Terminate  if  j = 0. 

R5.  [Increase  dj.]  Set  I 4-  n.  If  dj  > a;,  decrease  l by  1 repeatedly  until  dj  < a;. 
Then  interchange  dj  44  a;. 
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R6.  [Reverse  again.]  Set  (oj+i, . . . , an ) 4—  (an, . . . , aJ  + 1)  as  in  step  L4,  and  return 
to  Rl.  | 

The  number  of  outputs  is  r!  [zr]  eni  ( z ) . . . ent  (z);  this  is,  of  course,  n-  when  the  elements 
are  distinct. 


10.  a ia,2  ...  on  = 213  . . . n,  C1C2  . . . cn  = 010  ...  0,  01O2  . . . o„  = 1 ( — 1)1  ...  1,  if  n > 2. 

11.  Step  (PI,  ...,  P7)  is  performed  (1,  n!,  n!,  n!  + xn,  n\  - 1,  (xn  + 3)/2,  x„)  times, 
where  xn  = YlkZ  1 ^!,  because  P7  is  performed  (j  — 1)!  times  when  2 < j < n. 

12.  We  want  the  permutation  of  rank  999999.  The  answers  are  (a)  2783915460,  by 
exercise  3;  (b)  8750426319,  because  the  reflected  mixed-radix  number  corresponding 

+ „ r 0,  0,  1,  2,  3,  0,  2,  7,  0,  91  • TO,  0,  1,  3-2,  3,  5-0,  2,  7,  8-0,  9-91  . _ o . , , \ \ 

t0  L 1,  2,  3,  4,  5,  6,  7,  8,  9,  10  J 1S  U,  2,  3,  4,  5,  6,  7,  8,  9,  10  J by  7.2. 1.1  (50);  (c)  the 

product  (0  1 ...  9)9(0  1 . . . 8)°(0  1 . . . 7)7(0  1 ...  6)2  ...  (0  1 2)1,  namely  9703156248. 


13.  The  first  statement  is  true  for  all  n > 2.  But  when  2 crosses  1,  namely  when 

c2  changes  from  0 to  1,  we  have  c3  = 2,  c4  = 3,  c5  = ■ • • = c„  = 0,  and  the  next 

permutation  when  n > 5 is  432156 . . . n.  [See  Time  Travel  (1988),  page  74.] 

14.  True  at  the  beginning  of  steps  P4,  P5,  and  P6,  because  exactly  j — l—Cj+s  elements 
lie  to  the  left  of  Xj , namely  j - 1 - cj  from  {x\, . . . , Xj-i}  and  s from  {xj+ 1, . . . , xn}. 
(In  a sense,  this  formula  is  the  main  point  of  Algorithm  P.) 

15.  If  ■■■’  *]o]  corresponds  to  the  reflected  Gray  code  [tj1’  c"  ],  we  get  to  step  P6 

if  and  only  if  bn-k  = k — 1 for  j < k < n and  Bn-j+i  is  even,  by  7. 2. 1.1  (50).  But 
bn-k  = k — 1 for  j < k < n implies  that  Bn-k  is  odd  for  j < k < n.  Therefore 

s = lcj+ 1 =j]  + [°j+2  — j + 1]  = [oj+i  < 0]  + [oj+2  < 0]  in  step  P5.  [See  Math.  Comp. 

17  (1963),  282-285.] 


16.  Pi'.  [Initialize.]  Set  Cj  <—  j i and  Oj  < 1 for  1 < j < n;  also  set  z <—  an. 

P2'.  [Visit.]  Visit  a\  . . .an.  Then  go  to  P3.5'  if  di  = z. 

P3'.  [Hunt  down.]  For  j 4—  n — 1,  n — 2,  . . . , 1 (in  this  order),  set  a^+i  4—  aj, 
a,j  4—  2,  and  visit  ai  . . . an.  Then  set  j 4—  n — 1,  s 4—  1,  and  go  to  P4'. 

P3.5'.  [Hunt  up.]  For  j 4—  1,  2,  . . . , n — 1 (in  this  order),  set  aj  4—  aj+i,  aj+ 1 4—  z, 
and  visit  ai  . . . an.  Then  set  j 4—  n — 1,  s 4—  0. 


P 4'.  [Ready  to  change?]  Set  q 4-  Cj  + Oj.  If  q = 0,  go  to  P6';  if  q > j,  go  to  P7'. 

P5'.  [Change.]  Interchange  aCj+s  4->  aq+s.  Then  set  Cj  4-  q and  return  to  P2'. 

P6'.  [Increase  s.]  Terminate  if  j = 1;  otherwise  set  s 4—  s + 1. 

P7'.  [Switch  direction.]  Set  Oj  4 Oj,  j 4-  j - 1,  and  go  back  to  P4'.  | 

17.  Initially  a,  4—  a)  4—  j for  1 < j < n.  Step  P5  should  now  set  t 4—  j — Cj  + s, 
u<— j — q + s,v4-  au,  at  4—  v,  a'v  4—  t,  au  4—  j,  a'-  4—  u , Cj  4—  q.  (See  exercise  14.) 

But  with  the  inverse  required  and  available  we  can  actually  simplify  the  algorithm 
significantly,  avoiding  the  offset  variable  s and  letting  the  control  table  ci  . . . cn  count 
only  downwards,  as  noted  by  G.  Ehrlich  [JACM  20  (1973),  505-506]: 

Ql.  [Initialize.]  Set  a3  4-  o'-  4-  j,  Cj  4-  j - 1,  and  Oj  4 1 for  1 < j < n.  Also 

set  Co  = —1. 


Q2.  [Visit.]  Visit  the  permutation  a1  . . . an  and  its  inverse  a[  . . . a'n. 

Q3.  [Find  k.]  Set  k 4—  n.  Then,  while  ck  = 0,  set  ck  4-  k — 1,  ok  4 ok,  and 

k 4—  k — 1.  Terminate  if  k — 0. 


Q4.  [Change.]  Set  ck  4—  ck  — 1,  j 4—  a'k,  and  i = j + ok.  Then  set  t 4—  a{,  at  4—  k, 
Oj  4 — f,  o[  4 — j,  a'k  4—  i,  and  return  to  Q2.  | 


706  ANSWERS  TO  EXERCISES 


7.2.1.2 


18.  Set  an  <—  n,  and  use  (n  — l)!/2  iterations  of  Algorithm  P to  generate  all  permuta- 
tions of  {1, . . . ,n  — 1}  such  that  1 precedes  2.  [M.  K.  Roy,  CACM  16  (1973),  312-313; 
see  also  exercise  13.] 

19.  For  example,  we  can  use  the  idea  of  Algorithm  P,  with  the  n-tuples  ci  . . . cn 
changing  as  in  Algorithm  7.2.1.1H  with  respect  to  the  radices  (1,2,  ...,n).  That 
algorithm  maintains  the  directions  correctly,  although  it  numbers  subscripts  differently. 
The  offset  s needed  by  Algorithm  P can  be  computed  as  in  the  answer  to  exercise  15,  or 
the  inverse  permutation  can  be  maintained  as  in  exercise  17.  [See  G.  Ehrlich,  CACM 
16  (1973),  690-691.]  Other  algorithms,  like  that  of  Heap,  can  also  be  implemented 
looplessly. 

(Note:  In  most  applications  of  permutation  generation  we  are  interested  in  mini- 
mizing the  total  running  time,  not  the  maximum  time  between  successive  visits;  from 
this  standpoint  looplessness  is  usually  undesirable,  except  on  a parallel  computer.  Yet 
there’s  something  intellectually  satisfying  about  the  fact  that  a loopless  algorithm 
exists,  whether  practical  or  not.) 

20.  For  example,  when  n = 3 we  can  begin  123,  132,  312,  312,  132,  123,  213,  ..., 
213,  213,  ....  If  the  delta  sequence  for  n is  (<5i<$2  • • • ^"ni),  the  corresponding  sequence 
for  n + 1 is  (A„<5i  An<52  • • • An52"n!),  where  An  is  the  sequence  of  2n  + 1 operations 
n n— 1 ...  1 — 1 ...  n— 1 n;  here  5k  = j means  aj  +»  aj+\  and  5k  = — means 

CL\  i — — d\. 

(Signed  permutations  appear  in  another  guise  in  exercises  5.1.4-43  and  44.  The 
set  of  all  signed  permutations  is  called  the  octahedral  group.) 

21.  Clearly  M = 1,  hence  0 must  be  0 and  S must  be  6—1.  Then  N = E + 1,  R = 6—  2, 
and  D + E = b + Y.  This  leaves  exactly  max(0,  b — 7 — k)  choices  for  E when  Y = k > 2, 
hence  a total  of  2(^~  7 — k)  = ( b ~8)  solutions  when  6 > 8.  [Math.  Mag.  45  (1972), 
48-49.  Incidentally,  D.  Eppstein  has  proved  that  the  task  of  solving  alphametics  with 
a given  radix  is  NP-complete;  see  SIGACT  News  18,3  (1987),  38-40.] 

22.  (X);,  + (X)i,  = (XY)i,  is  solvable  only  when  6 = 2. 

23.  Almost  true,  because  the  number  of  solutions  will  be  even,  unless  [j  € F]  / [k  6 F]. 
(Consider  the  ternary  alphametic  X + (XX)3  + (YY)3  + (XZ)3  = (XYX)3.) 

24.  (a)  9283  + 7 + 473  + 1062  = 10825.  (b)  698392  + 3192  = 701584.  (c)  63952  + 
69275  = 133227.  (d)  653924  + 653924  = 1307848.  (e)  5718  + 3 + 98741  = 104462.  (f) 
127503  + 502351  + 3947539+46578  = 4623971.  (g)  67432  + 704  + 8046  + 97364  = 173546. 
(h)  59  + 577404251698  + 69342491650  + 49869442698  + 1504  + 40614  + 82591  + 344  + 
41  + 741425  = 5216367650  + 691400684974.  [All  solutions  are  unique.  References  for 
(b)-(g):  J.  Recreational  Math.  10  (1977),  115;  5 (1972),  296;  10  (1977),  41;  10  (1978), 
274;  12  (1979),  133-134;  9 (1977),  207.] 

(i)  In  this  case  there  are  yjjlO!  = 2903040  solutions,  because  every  permutation  of 
{0, 1, . . . , 9}  works  except  those  that  assign  H or  N to  0.  (A  well- written  general  additive 
alphametic  solver  will  be  careful  to  reduce  the  amount  of  output  in  such  cases.) 

25.  We  may  assume  that  < • • • < si0.  Let  i be  the  least  index  ^ F,  and  set 

Q.i  4 0 ; then  set  the  remaining  elements  aj  in  order  of  increasing  j.  A proof  like  that 

of  Theorem  6. IS  shows  that  this  procedure  maximizes  a ■ s.  A similar  procedure  yields 
the  minimum,  because  min(a  • s)  = — max(a  • ( — s)). 

26.  400739  + 63930  - 2379  - 1252630  + 53430  - 1390  + 738300. 

27.  Readers  can  probably  improve  upon  the  following  examples:  BLOOD  + SWEAT  + 
TEARS  = LATER:  EARTH  + WATER  + WRATH  = HELLO  + WORLD;  AWAIT  + ROBOT  + ERROR  = 
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SOBER  + WORDS;  CHILD  + THEME  + PEACE  + ETHIC  = IDEAL  + ALPHA  + METIC.  (This 
exercise  was  inspired  by  WHERE  + SEDGE  + GRASS  + GROWS  = MARSH  [A.  W.  Johnson, 
Jr.,  J.  Recr.  Math.  15  (1982),  51],  which  would  be  marvelously  pure  except  that  D 
and  0 have  the  same  signature.)  J.  A.  Brown,  J.  Szabo,  and  T.  J.  Trowbridge  suggest 

GREAT  + GREAT  = LARGE  and  GREAT  + GREAT  = SMALL. 

28.  (a)  11  = 3 + 3 + 2 + 2 + 1,  20  = 11  + 3 + 3 + 3,  20  = 11  + 3 + 3 + 2 + 1, 
20  = 11  + 3 + 3+1  + 1 + 1,  20  = 8 + 8 + 2+  1 + 1,  20  = 7 + 7 + 6,  20  = 7 + 7 + 2 + 2 + 2, 
20  = 7 + 7 + 2 + 1 + 1 + 1 + 1,  20  = 7 + 5 + 5 + 2 + 1,  20  = 7 + 5 + 2 + 2 + 2 + 1 + 1,  20  = 
7+5+2+2+1+1+1+1,  20  = 7+3+3+2+2+1+1  + 1,  20  = 7+3+3+ 1+ 1+ 1+ 1+ 1 + 1 + 1 , 
20  = 5 + 3 + 3 + 3 + 3 + 3.  [These  fourteen  solutions  were  first  computed  by  Roy  Childs 
in  1999.  The  next  doubly  partitionable  values  of  n are  30  (in  20  ways),  then  40  (in  94 
ways),  41  (in  67),  42  (in  57),  50  (in  190  ways,  including  50  = 2 + 2 + ■ ■ ■ + 2),  etc.] 

(b)  51  = 20  + 15  + 14  + 2,  51  = 15  + 14  + 10  + 9 + 3,  61  = 19  + 16  + 11  + 9 + 6, 
65  = 17+  16+  15  + 9 + 7+  1,  66  = 20  + 19  + 16  + 6 + 5,  69  = 18  + 17+  16  + 10  + 8, 
70  = 30  + 20  + 10  + 7 + 3,  70  = 20  + 16  + 12  + 9 + 7 + 6,  70  = 20  + 15  + 12  + 11  + 7 + 5, 
80  = 50  + 20  + 9+1,  90  = 50+12+11+9  + 5 + 2 + 1,  91  = 45  + 19+11  + 10  + 5 + 1.  [The 
two  51s  are  due  to  Steven  Kahan;  see  his  book  Have  Some  Sums  To  Solve  (Farmingdale, 
New  York:  Baywood,  1978),  36-37,  84,  112.  Amazing  examples  with  seventeen  distinct 
terms  in  Italian  and  fifty-eight  distinct  terms  in  Roman  numerals  have  been  found  by 
Giulio  Cesare,  J.  Recr.  Math.  30  (1999),  63.] 

Notes:  The  beautiful  example  THREE  = TW0  + 0NE+ZER0  [Richard  L.  Breisch,  Recre- 
ational Math.  Magazine  12  (December  1962),  24]  is  unfortunately  ruled  out  by  our  con- 
ventions. The  total  number  of  doubly  true  partitions  into  distinct  parts  is  probably  fi- 
nite, in  English,  although  nomenclature  for  arbitrarily  large  integers  is  not  standard.  Is 
there  an  example  bigger  than  NINETYNINEN0NILLIONNINETYNINESEXTILLI0NSIXTY0NE  = 
NINETYNINEN0NILLI0NNINETYNINESEXTILLI0NNINETEEN  + SIXTEEN +ELEVEN+NINE+ SIX 
(suggested  by  G.  Gonzalez-Morris)? 

29.  10  + 7+1  = 9 + 6 + 3,  11  + 10  = 8 + 7 + 6,  12  + 7 + 6 + 5 = 11+10  + 9, 

19  + 10  + 3 = 14  + 13  + 4 + 1 (31  examples  in  all). 

30.  (a)  5672  = 321489  , 8072  = 651249,  or  8542  = 729316.  (b)  9582  = 917764. 
(c)  96  x 72  = 4704.  (d)  51304/61904  = 7260/8760.  (e)  3285092  = 47613.  [Strand  78 
(1929),  91,  208;  J.  Recr.  Math  3 (1970),  43;  13  (1981),  212;  27  (1995),  137;  31  (2003), 
133.  The  solutions  to  (b),  (c),  (d),  and  (e)  are  unique.  With  a right-to-left  approach 
based  on  Algorithm  X,  the  answers  are  found  in  (14,  13,  11,  3423,  42)  kilomems, 
respectively.  Nob  also  noticed  that  NORTH/SOUTH  = WEST/EAST  has  the  unique  solution 
67104/27504  = 9320/3820.] 

31.  (a)  5/34  + 7/68  + 9/12(1).  One  can  verify  uniqueness  with  Algorithm  X using  the 
side  condition  A < D < G,  in  about  265  Kg.  [Quark  Visual  Science  Magazine,  No.  136 
(Tokyo:  Kodansha,  October  1993).]  Curiously,  a similar  puzzle  also  has  a unique  solu- 
tion: 1/(3x6)  + 5/(8x9)  + 7/(2x4)  = 1 [Scot  Morris,  Omni  17, 4 (January  1995),  97]. 

(b)  ABCDEFGHI  = 381654729,  via  Algorithm  X in  10  Kg. 

32.  There  are  eleven  ways,  of  which  the  most  surprising  is  3 + 69258/714.  [See  The 
Weekly  Dispatch  (9  and  23  June  1901);  Amusements  in  Mathematics  (1917),  158-159.] 

33.  (a)  1,  2,  3,  4,  15,  18,  118,  146.  (b)  6,  9,  16,  20,  27,  126,  127,  129,  136,  145.  [The 
Weekly  Dispatch  (11  and  30  November,  1902);  Amusements  in  Math.  (1917),  159.] 

In  this  case  one  suitable  strategy  is  to  find  all  variations  where  a*,  ...  a;-i/ a; ...  ag 
is  an  integer,  then  to  record  solutions  for  all  permutations  of  a\  . . . ak-i ■ There  are 
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exactly  164959  integers  with  a unique  solution,  the  largest  being  9876533.  There  are 
solutions  for  all  years  in  the  21st  century  except  2091.  The  most  solutions  (389)  occur 
when  n = 12221;  the  longest  stretch  of  representable  n’s  is  5109  <n<  7060.  Dudeney 
was  able  to  get  the  correct  answers  by  hand  for  small  n by  “casting  out  nines.” 

34.  (a)  x = 105,  7378  + 155  + 92467  = 7178  + 355  + 92467  = 1016  + 733  + 98251  = 
1014  + 255  + 98731  = 100000. 

(b)  x = 47,  3036  + 455  + 12893  = 16384  is  unique.  The  fastest  way  to  resolve  this 
problem  is  probably  to  start  with  a list  of  the  2529  primes  that  consist  of  five  distinct 
digits  (namely  10243,  10247,  . . . , 98731)  and  to  permute  the  five  remaining  digits. 

Incidentally,  the  unrestricted  alphametic  EVEN  + ODD  = PRIME  has  ten  solutions; 
both  ODD  and  PRIME  are  prime  in  just  one  of  them.  [See  M.  Arisawa,  J.  Recr.  Math  8 
(1975),  153.] 

35.  In  general,  if  sk  = |S/|  for  1 < k < n,  there  are  si  . . . sk-i  ways  to  choose  each  of 
the  nonidentity  elements  of  Sk.  Hence  the  answer  is  rifc=i(n*=i  sSfe_1),  which  in  this 
case  is  22  • 63  ■ 2415  = 436196692474023836123136. 

(But  if  the  vertices  are  renumbered,  the  sk  values  may  change.  For  example, 
if  vertices  (0,3,5)  of  (12)  are  interchanged  with  (e,d,c),  we  have  S14  = 1,  s13  = 6, 
si2  = 4,  sn  = 1,  and  45  • 2415  Sims  tables.) 

36.  Since  each  of  {0, 3,  5,  6, 9,  a,  c,  f } lies  on  three  lines,  but  every  other  element  lies 
on  only  two,  it  is  clear  that  we  may  let  5f  = {(),  cr,  cr2,  a3,  a,  acr,  acr2,  acr3},  where  cr  = 
(03fc)(17e8)(2bd4)(56a9)  is  a 90°  rotation  and  a = (05)(14)(27)(36)(8d)(9c)(af)(be) 
is  an  mside-out  twist.  Also  Se  = {(),/?,  7, /Sy},  where  = (l4)(28)(3c)(69)(7d)(be)  is  a 
transposition  and  7 = (12)(48)(5a)(69)(7b)(de)  is  another  twist;  Sd  = • • ■ = Si  = {()}. 
(There  are  47  — 1 alternative  answers.) 

37.  The  set  Sk  can  be  chosen  in  k\i:  ways  (see  exercise  35),  and  its  nonidentity  elements 
can  be  assigned  to  cr(k,  1),  ...,  a(k,k)  in  k\  further  ways.  So  the  answer  is  An  = 

HU  k-k+1  = 2 VriLi  2 For  example,  Ai0  « 1.148  x 10170.  We  have 

/ k\  1 fU  1 

2J  ( 2 J ln  k = 2 J x(x  ~ 1)lnxdx  + 0(n2^°Sn)  = ^n3  In  n + 0(n3) 

by  Euler’s  summation  formula;  thus  ln  An  = |n3  Inn  + 0(n3). 

38.  The  probability  that  <j>(k)  is  needed  in  step  G4  is  1/A:!  - l/(k  + 1)!,  for  1 < 

k < n-  the  probability  is  1/n!  that  we  don’t  get  to  step  G4  at  all.  Since  does 

[A;/2]  transpositions,  the  average  is  £%=?(  1/fe!  - l/(k  + l)!)[fc/2l  = ( rfc/21  - 

[(A;  - l)/2l )/fc!  - \(n  - l)/2l/n!  = Y,odd  1/fc!  + 0(l/(n  - 1)!). 

39.  (a)  0123,  1023,  2013,  0213,  1203,  2103,  3012,  0312,  1302,  3102,  0132,  1032,  2301, 
3201,  0231,  2031,  3021,  0321,  1230,  2130,  3120,  1320,  2310,  3210;  (b)  0123,  1023,  2013, 
0213,  1203,  2103,  3102,  1302,  0312,  3012,  1032,  0132,  0231,  2031,  3021,  0321,  2301 
3201,  3210,  2310,  1320,  3120,  2130,  1230. 

40.  By  induction  we  find  er(l,  1)  = (0  1),  a( 2,2)  = (0  1 2), 

k)  — / ^ h)(k—  1 k—2  ...  1),  if  k > 3 is  odd, 

1 (0  fc— 1 k—2  1 ...  A;— 3 k),  if  k > 4 is  even; 

also  uj{k)  = (0  k)  when  k is  even,  uj(k)  = (0  k-2  ...  1 k- 1 A:)  when  k > 3 is  odd. 

Thus  when  k > 3 is  odd,  a(k,  1)  = (A:  A:-l  0)  and  a(k,j)  takes  k t-A  j — 1 for  1 < j < A:; 
when  k > 4 is  even,  a(k,j)  = (0  k k-3  ...  1 k-2  k-l)j  for  1 < j < k. 
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Notes:  The  first  scheme  that  causes  Algorithm  G to  generate  all  permutations  by 
single  transpositions  was  devised  by  Mark  Wells  [Math.  Comp.  15  (1961),  192-195], 
but  it  was  considerably  more  complicated.  W.  Lipski,  Jr.,  studied  such  schemes  in 
general  and  found  a variety  of  additional  methods  [Computing  23  (1979),  357-365]. 

41.  We  may  assume  that  r < n.  Algorithm  G will  generate  r- variations  for  any  Sims 
table  if  we  simply  change  ‘fc  4—  1’  to  ‘fc  <—  n — r’  in  step  G3,  provided  that  we  redefine 
ui(k)  to  be  er(n  — r,n  — r)  . . . a(k,  k)  instead  of  using  (16). 

If  n — r is  odd,  the  method  of  (27)  is  still  valid,  although  the  formulas  in  answer  40 
need  to  be  revised  when  k < n — r + 2.  The  new  formulas  are  a(k,j)  = (k  j — 1 ...  10) 
and  io(k)  = (fc  ...  10)  when  k = n — r;  a(k,j)  = (k  ...  1 0)^  when  k = n — r + 1. 

If  n — r is  even,  we  can  use  (27)  with  even  and  odd  reversed,  if  r < 3.  But  when 
r > 4 a more  complex  scheme  is  needed,  because  a fixed  transposition  like  (fc  0)  can 
be  used  for  odd  k only  if  uj(k  — 1)  is  a fc-cycle,  which  means  that  u(k  — 1)  must  be  an 
even  permutation;  but  u>(k)  is  odd  for  fc  > n — r + 2. 

The  following  scheme  works  when  n - r is  even:  Let  r{k,j)uj(k  — 1)_  = (k  k—j) 
for  1 < j < k = n — r,  and  use  (27)  when  k > n-r.  Then,  when  fc  = n - r + 1,  we  have 
uj(k  - 1)  = (0  1 ...  k-l),  hence  a(k,j)  takes  k (2 j - 1)  mod  k for  1 < j < k , and 
a(k,k)  = ( k k-l  k- 3 ...  0 fc— 2 ...  1 ),ui(k)  = (fc  ...  10),  a(k  + l,j)  = (fc+1  ...  0)L 

42.  If  cr(fc,  j)  = (fc  j- 1)  we  have  r(fc,  1)  = (fc  0)  and  r(k,j)  = (fc  j-l)(fc  j- 2)  = 
(fc  j~  1 j— 2)  for  2 < j < k. 

43.  Of  course  w(l)  = <r(l,l)  = r(l,  1)  = (0  1).  The  following  construction  makes 
ui(k)  = (fc— 2 fc  — 1 fc)  for  all  fc  > 2:  Let  a(k,j)  = r(k,j)uj(k  — 1)_,  where  a(2, 1)  = (2  0), 
«(2,  2)  = (2  0 1),  a(3, 1)  = a(3,  3)  = (3  1),  a(3,  2)  = (3  1 0);  this  makes  <r(2, 2)  = (0  2), 
cr(3,3)  = (0  3 1).  Then  for  fc  > 4,  let  a(fc,  1)  = (fc  fc-4),  a(k,j)  = (fc  k-3-j  k-2-j) 
for  1 < j < fc  — 2,  and 


a(fc,  fc— 2) 
a(fc,  fc— 1) 
a(k,  fc) 

this  makes  cr(fc,  fc)  = 


fc  mod  3 = 0 
= (fc  fc— 2 0) 
= (fc  fc— 2 fc— 3) 
= (fc  fc— 2) 

(fc— 3 fc  fc— 2) 


fc  mod  3 = 1 
(fc  fc  — 3 0)  or 

(fc  fc— 3)  or 
(fc  fc— 3 fc— 2)  or 


fc  mod  3 = 2 
(fc  fc— 1 0), 
(fc  fc— 1 fc— 3), 
(fc  fc— 2); 


or 
or 
or 

as  required. 


44.  No,  because  r(fc,  j)  is  a (fc  + l)-cycle,  not  a transposition.  (See  (19)  and  (24).) 

45.  (a)  202280070,  since  uk  = max({0, 1, . . . , ak  - 1}  \ (ai, . . . ,afe_i}).  (Actually  un  is 
never  set  by  the  algorithm,  but  we  can  assume  that  it  is  zero.)  (b)  273914568. 

46.  True  (assuming  that  u„  = 0).  If  either  uk  > uk+ 1 or  ak  > ak+i  we  must  have 

CLk  ^ Uk  ^ 0*k- (-1  U/c-i-i. 


47.  Steps  (XI,  X2, . . . , X6)  are  performed  respectively  (1,  A,  B,  A—  1,  B — Nn,  A)  times, 

where  A = N0 f Nn-i  and  B = nN0  + (n  - l)Ni  H h lNn-i- 

48.  Steps  (X2, X3, X4, X5, X6)  are  performed  respectively  A„  + (l,n!, 0,0,1)  times, 
where  An  = ^2k—i  n~  = n-  SU=i  ~ n\  (e  — 1).  Assuming  that  they  cost  respec- 
tively (1, 1,3, 1,3)  rnems,  for  operations  involving  aj,  lj,  or  Uj,  the  total  cost  is  about 
9e  — 8 ~ 16.46  meins  per  permutation. 

Algorithm  L uses  approximately  (e,  2 + e/2, 2e  + 2e-1  - 4)  mems  per  permutation 
in  steps  (L2,  L3,  L4),  for  a total  of  3.5e  + 2e-1  — 2 « 8.25  (see  exercise  5). 

Algorithm  X could  be  tuned  up  for  this  case  by  streamlining  the  code  when  fc  is 
near  n.  But  so  can  Algorithm  L,  as  shown  in  exercise  1. 
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49.  Order  the  signatures  so  that  |s0|  > •••  > |s9|;  also  prepare  tables  w0...wg, 
X0...xg,  2/0...  2/9,  so  that  the  signatures  are  wXk  < •••  < Wyk.  For 

example,  when  SEND  + MORE  = MONEY  we  have  (so,...,s9)  = (—9000,1000,-900,91 
-90, 10, 1,  -1, 0,  0)  for  the  respective  letters  (M,  S,  0.  E,  N,  R,  D,  Y,  A,  B);  also  (w0,  ’. . . , Wg)  = 

(-9000,-900,-90,-1,0,0,1,10,91,1000),  and  x0...xg  = 0112233344,  y0...y9  = 
9988776554.  Yet  another  table  f0...f9  has  /,  = 1 if  the  digit  corresponding  to  w3 
cannot  be  zero;  in  this  case  /o  . . . /9  = 1000000001.  These  tables  make  it  easy  to 
compute  the  largest  and  smallest  values  of 


SkO'k  + b Sgag 


over  all  choices  ak...ag  of  the  remaining  digits,  using  the  method  of  exercise  25,  since 
the  links  l3  tell  us  those  digits  in  increasing  order. 

This  method  requires  a rather  expensive  computation  at  each  node  of  the  search 
tree,  but  it  often  succeeds  in  keeping  that  tree  small.  For  example,  it  solves  the  first 
eight  alphametics  of  exercise  24  with  costs  of  only  7,  13,  7,  9,  5,  343,  44,  and  89 
kilomems;  this  is  a substantial  improvement  in  cases  (a),  (b),  (e),  and  (h),  although 
case  (f)  comes  out  significantly  worse.  Another  bad  case  is  the  ‘CHILD’  example  of 
answer  27,  where  left-to-right  needs  2947  kilomems  compared  to  588  for  the  right-to- 
left  approach.  Left-to-right  does,  however,  fare  better  on  BLOOD  + SWEAT  + TEARS  (73 
versus  360)  and  HELLO  + WORLD  (340  versus  410). 

50.  If  a is  in  a permutation  group,  so  are  all  its  powers  a2,  a3,  . . . , including  a"1'1  = 
a , where  m is  the  order  of  a (the  least  common  multiple  of  its  cycle  lengths).  And 
(32)  is  equivalent  to  a-  = 03 og  ■ ■ ■ 

51.  False.  For  example,  a{k,i)~  and  cr(k,j)~  might  both  take  k 1-4  0. 

52.  r(k,j)  = (k—j  k—j+1)  is  an  adjacent  interchange,  and 


ui{k)  = (n-\  ...  0)  (n— 2 ...  0 ) . . . (k  ...  0)  = 0(n  - \)<j>{k  - 1 ) 

is  a fc-flip  followed  by  an  ra-flip.  The  permutation  corresponding  to  control  table 
c0  . . . cn — 1 in  Algorithm  H has  c3  elements  to  the  right  of  j that  are  less  than  j, 
for  0 < j < n;  so  it  is  the  same  as  the  permutation  corresponding  to  ci  . . . cn  in 
Algorithm  P,  except  that  subscripts  are  shifted  by  1. 

The  only  essential  difference  between  Algorithm  P and  this  version  of  Algorithm  H 
is  that  Algorithm  P uses  a reflected  Gray  code  to  run  through  all  possibilities  of  its 
control  table,  while  Algorithm  H runs  through  those  mixed-radix  numbers  in  ascending 
(lexicographic)  order. 

Indeed,  Gray  code  can  be  used  with  any  Sims  table,  by  modifying  either  Algo- 
rithm G or  Algorithm  H.  Then  all  transitions  are  by  r(k,j)  or  by  r(k,j)~ , and  the 
permutations  ui(k)  are  irrelevant. 


53.  The  text’s  proof  that  n\  - 1 transpositions  cannot  be  achieved  for  n = 4 also  shows 
that  we  can  reduce  the  problem  from  n to  n - 2 at  the  cost  of  a single  transposition 
(ra-1  n— 2),  which  was  called  ‘(3c)’  in  the  notation  of  that  proof. 

Thus  we  can  generate  all  permutations  by  making  the  following  transformation 
ln_steP  H4:  If  k = n - 1 or  k = n - 2,  transpose  a3  mod  „ <->  a(i_1)modn,  where 
j - c„_i-l.  If  k = n — 3or  k = n — 4,  transpose  a„_i  <->  on_2  and  also  uimod(n_2)  o 
aU- 1)  mod  (n- 2)j  where  j = cn_ 3 — 1.  And  in  general  if  k = n — 2t  — 1 or  k = n — 2t  — 2, 
transpose  an_2t+1  <4  a„_2i  for  1 < * < t and  also  ajmod(n_2t)  ^ a0_1}  mod  (n_2t)’ 
where  j = cn-2t-i  - 1.  [See  CACM  19  (1976),  68-72.] 
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The  corresponding  Sims  table  permutations  can  be  written  down  as  follows,  al- 
though they  don’t  appear  explicitly  in  the  algorithm  itself: 

( (0  1 . . . j— 1 k),  if  n — k is  odd; 
a (k,  j)  — < 

i (0  1 ...  k)3 , if  n — k is  even. 

The  value  of  aj  mod  (n_2 t)  will  be  n — 2t  — 1 after  the  interchange.  For  efficiency  we 
can  also  use  the  fact  that  k usually  equals  n — 1.  The  total  number  of  transpositions 

is  E,L=o2J  ( n - 2tY-  - K2J  - 1. 

54.  Yes;  the  transformation  can  be  any  fc-cycle  on  positions  {1, . . . , k}. 

55.  (a)  Since  p\{m)  = pi(mmodn!)  when  n > p\{m),  we  have  p\(n\  + m)  = p<(m) 
for  0 < m < n ■ n!  = (n  + 1)!  - n\.  Therefore  P„\+m  = crp,(n!+m)  . . . crp,(n\+i)Pn\  = 
& p\  (m)  • • • & p\  ( i )Pn\  = PmPni  for  0 < m < n • n!,  and  we  have  in  particular 

/^(n+l)!  ^n+l  /^(n+1)!  — 1 &n-\-l  finl  — lPn\  &n+l&n  fin\ 

Similarly  an!+m  = /3~,am/3n!an!  for  0 < m < n • n\. 

Since  f3n\  commutes  with  rn  and  rn+i  we  find  an\  = rnotn\- 1,  and 

^(n+l)!  ^"n+l  ^(n+1)!  — 1 fin\ ^(n+1)!  — 1 — n!  Pnl  ^n! 

^~n-\- l/^n!  ^-n!  — 1 (/^n!  ^n! ) 

= K<n~lrn+ir~  (/3n!ani)n+1 

rrn  Tnd_iTn  (/5niO:nj) 

(b)  In  this  case  crn+1<r“  = (n  n— 1 ...  1)  and  rn+1r“  = (n+1  n 0),  and  we  have 
P(n+\y.ai(n+i)'.  = (n+1  n ...  0)  by  induction.  Therefore  ajni+m  — f3~,3  am(n  ...  0)3 
for  0 < j < n and  0 < m < n\.  All  permutations  of  {0, . . . ,n}  are  achieved  because 
Pn\  am  fixes  n and  (n  . . . 0)-7  takes  n i-4  n — j. 

56.  If  we  set  er/t  = (k  — 1 fe— 2)(fc— 3 k— 4) ...  in  the  previous  exercise,  we  find  by  induc- 
tion that  /3ni a„i  is  the  (n  + l)-cycle  (0  n n— 1 n— 3 ...  (2  or  1)  (1  or  2)  ...  n— 4 n— 2). 

57.  Arguing  as  in  answer  5,  we  obtain  Y^kZ.2  odd]/fc!  — ([n/2j  — l)/n!  = sinh  1 — 1 — 
0(l/(n-l)l). 

58.  True.  By  the  formulas  of  exercise  55  we  have  anl_1  = (0  n)/3“,(n  ...  0),  and 
this  takes  04n-l  because  f3n  \ fixes  n.  (Consequently  Algorithm  E will  define  a 
Hamiltonian  cycle  on  the  graph  of  exercise  66  if  and  only  if  f3n\  = (n— 1 ...  21),  and 
this  holds  if  and  only  if  the  length  of  every  cycle  of  f3(n-\y.  is  a divisor  of  n.  The  latter 
is  true  for  n = 2,  3,  4,  6,  12,  20,  and  40,  but  for  no  other  n < 250,000.) 

59.  The  Cayley  graph  with  generators  (on, . . . , cu- ) in  the  text’s  definition  is  isomorphic 
to  the  Cayley  graph  with  generators  (erf , . . . , a'k  ) in  the  alternative  definition,  since 
7r  — ► c*j7r  in  the  former  if  and  only  if  7r_  — > aj  in  the  latter. 

60.  (a,  b)  There  are  88  delta  sequences,  which  reduce  to  four  classes:  P = (32131231)3 
(plain  changes,  represented  by  8 different  delta  sequences);  Q = (32121232)3  (a  doubly 
Gray  variant  of  plain  changes,  with  8 representatives);  R = (121232321232)2  (a  doubly 
Gray  code  with  24  representatives);  S = 2o:3aR,  a = 12321312121  (48  representatives). 
Classes  P and  Q are  cyclic  shifts  of  their  complements;  classes  P,  Q.  and  S are  shifts  of 
their  reversals;  class  R is  a shifted  reversal  of  its  complement.  [See  A.  L.  Leigh  Silver, 
Math.  Gazette  48  (1964),  1 16.] 
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61.  There  are  respectively  (26, 36,  20, 26,  28, 40, 40,  20,  26,  28,  28, 26)  such  paths  ending 
at  (1243, 1324, 1432, 2134,  2341, 2413, 3142,  3214, 3421, 4123, 4231, 4312). 

62.  There  are  only  two  paths  when  n = 3,  ending  respectively  at  132  and  213. 
But  when  n > 4 there  are  Gray  codes  leading  from  12 ...  n to  any  odd  permuta- 
tion cl\q,2  . . .On*  Exercise  61  establishes  this  when  n — 4,  and  we  can  prove  it  by 
induction  for  n > 4 as  follows. 

Let  A(j)  be  the  set  of  all  permutations  that  begin  with  j,  and  let  A(j,k)  be 
those  that  begin  with  jk.  If  (qo,  an, . . . , a„)  are  any  odd  permutations  such  that 
ai  G A(xj,Xj+ 1),  then  (12 )atj  is  an  even  permutation  in  A(xj+1,  xj).  Consequently,  if 
X\X2  • • • xn  is  a permutation  of  {1,2,...,  n},  there  is  at  least  one  Hamiltonian  path  of 
the  form 

(12)a0 ai  — (12)Ql a2 (12)an_1 an; 

the  subpath  from  (12)a3_j  to  aj  includes  all  elements  of  A(xj). 

This  construction  solves  the  problem  in  at  least  (n - 2)!"/2n_1  distinct  ways  when 
L 1,  because  we  can  take  < vo  — 2 1 . . . n and  olu  = q,\q,2  . . . ■ there  are  (n  — 2)!  ways 

to  choose  x2  . . .in-i,  and  ( n — 2)!/2  ways  to  choose  each  of  ai,  . . . , a„_i. 

Finally,  if  ai  = 1,  take  any  path  12  ...  n — • • • — a\a2  ...  an  that  runs  through 
all  of  A(l),  and  choose  any  step  a — a'  with  a E A(l,j)  and  a'  E A(l,j')  for  some 
j 7=  j' . Replace  that  step  by 

a (1  2)«i  • OL2 (1  2)a„_i an  — a , 

using  a construction  like  the  Hamiltonian  path  above  but  now  with  on  = a,  an  = 
(12)a  , xi  = 1,  x2  = j,  xn  = j' , and  xn+i  — 1.  (In  this  case  the  permutations  qi, 

■ ■ . , an  might  all  be  even.) 

63.  Monte  Carlo  estimates  using  the  techniques  of  Section  7.2.3  suggest  that  the  total 
number  of  equivalence  classes  will  be  roughly  1.2  X 1021;  most  of  those  classes  will 
contain  480  Gray  cycles. 

64.  Exactly  2,005,200  delta  sequences  have  the  doubly  Gray  property;  they  belong  to 
4206  equivalence  classes  under  cyclic  shift,  reversal,  and/or  complementation.  Nine 
classes,  such  as  the  code  2e*2afl  where 

a = 12343234321232121232321232121234343212123432123432121232321, 

are  shifts  of  their  reversal;  48  classes  are  composed  of  repeated  60-cycles.  One  of  the 
most  interesting  of  the  latter  type  is  aa  where 

a = /32/34/H34/34,  /3  = 32121232123. 

65.  Such  a path  exists  for  any  given  N < n\:  Let  the  Nth  permutation  bea  = a1...a„, 
and  let  j = a\.  Also  let  n*,  be  the  set  of  all  permutations  f3  = bi  ...  bn  for  which  b\  — k 
and  < ot.  By  induction  on  N there  is  a Gray  path  Pi  for  1 1 , . We  can  then  construct 
Gray  paths  Pa-  for  Ily  U IL  U • • ■ U 11a- i for  2 < k < j,  successively  combining  Pa_i 
with  a Gray  cycle  for  Ha_i.  (See  the  “absorption”  construction  of  answer  62.  In  fact, 
Pj  will  be  a Gray  cycle  when  A is  a multiple  of  6.) 

66.  Defining  the  delta  sequence  by  the  rule  7r(fc+1)  mod  = (1  i)k)nk,  we  find  exactly  36 
such  sequences,  all  of  which  are  cyclic  shifts  of  a pattern  like  (xyzyzyxzyzyz)2 . (The 
next  case,  n = 5,  probably  has  about  1018  solutions  that  are  inequivalent  with  respect 
to  cyclic  shifting,  reversal,  and  permutation  of  coordinates,  thus  about  6x  1021  different 
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delta  sequences.)  Incidentally,  Igor  Pak  has  shown  that  the  Cayley  graph  generated  by 
star  transpositions  is  an  (n  — 2)-dimensional  torus  in  general. 

67.  If  we  let  7r  be  equivalent  to  7t(12345),  we  get  a reduced  graph  on  24  vertices  that  has 
40768  Hamiltonian  cycles,  240  of  which  lead  to  delta  sequences  of  the  form  a5  in  which 
a uses  each  transposition  6 times  (for  example,  a = 354232534234532454352452).  The 
total  number  of  solutions  to  this  problem  is  probably  about  1016. 

68.  If  A isn’t  connected,  neither  is  G.  If  A is  connected,  we  can  assume  that  it  is  a free 
tree.  Moreover,  in  this  case  we  can  prove  a generalization  of  the  result  in  exercise  62: 
For  n > 4 there  is  a Hamiltonian  path  in  G from  the  identity  permutation  to  any  odd 
permutation.  For  we  can  assume  without  loss  of  generality  that  A contains  the  edge 
1 — 2 where  1 is  a leaf  of  the  tree,  and  a proof  like  that  of  exercise  62  applies. 

[This  elegant  construction  is  due  to  M.  Tchuente,  Ars  Combinatoria  14  (1982), 
115-122.  Extensive  generalizations  have  been  discussed  by  Ruskey  and  Savage  in  SIAM 
J.  Discrete  Math.  6 (1993),  152-166.  See  also  the  original  Russian  publication  in 
Kibernetika  11,3  (1975),  17-21;  English  translation,  Cybernetics  11  (1975),  362-366.] 

69.  Following  the  hint,  the  modified  algorithm  behaves  like  this  when  n = 5: 
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35412*— 31452 

51432 
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32451  *-35421 
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21435 

25431 
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21453— *25413 

23415 

13425 

15423— *12453 

52413 

53412 

13452 

21543 

51243 

53241 

23541 

23145 

25143 

15243 

13245 

13542 

53142 

52143 

12543 

34512 

34215 

14235 

14532 

54132 

34152 

34251 

54231 

24531 

24135 

34125 

34521 

32154— *35124 

15324— *12354 

52314 

32514*— 31524 

51324 

21354-*  25314 

35214— *31254 

45123*— 42153 

42351*- 

-45321 

41325 

41523-*  42513 

42315 

45312*— 41352 

41253*— 45213 

43215 

43512*— 41532 

41235 

45231— *43251 

43152— *45132 

42135 

42531*- 43521 

43125 

51234 

21534— *23514 

53214 

13254*- 15234 

25134*- 23154 

53124 

13524-*  12534 

52134 

4 

t 

4 

t 

4 

t 

4 

t 

4 

t 

4 

4 

Here  the  columns  represent  sets  of  permutations  that  are  cyclically  rotated  and/or 
reflected  in  all  2n  ways;  therefore  each  column  contains  exactly  one  “rosary  permuta- 
tion” (exercise  18).  We  can  use  Algorithm  P to  run  through  the  rosary  permutations 
systematically,  knowing  that  the  pair  xy  will  occur  before  yx  in  its  column,  at  which 
time  t'  instead  of  f>  will  move  us  to  the  right  or  to  the  left.  Step  Z2  omits  the 
interchange  ai  f>  02,  thereby  causing  the  permutations  (i\  . . . an_i  to  repeat  themselves 
going  backwards.  (We  implicitly  use  the  fact  that  t[fc]  = t[n\  — fc]  in  the  output  of 
Algorithm  T.) 

Now  if  we  replace  1 . . . n by  24  ...  31  and  change  Ai  . . . An  to  Ai  An  A2  An-i  . . . , 
we  get  the  unmodified  algorithm  whose  results  are  shown  in  Fig.  42(b). 

This  method  was  inspired  by  a (nonconstructive)  theorem  of  E.  S.  Rapaport, 
Scripta  Math.  24  (1959),  51-58.  It  illustrates  a more  general  fact  observed  by  Carla 
Savage  in  1989,  namely  that  the  Cayley  graph  for  any  group  generated  by  three 
involutions  p,  <7,  r has  a Hamiltonian  cycle  when  pr  = rp.  [See  I.  Pak  and  R.  Radoicic, 
Discrete  Math.  309  (2009),  5501-5508.] 

70.  No;  the  longest  cycle  in  that  digraph  has  length  358.  But  there  do  exist  pairs  of 
disjoint  180-cycles  from  which  a Hamiltonian  path  of  length  720  can  be  derived.  For 


714  ANSWERS  TO  EXERCISES 


7.2. 1.2 


example,  consider  the  cycles  acrftcr  and  7 aa  where 

a = Tcr5Ta5Tcr3Tcr2Tcr5Tcr3T<j2Tcr5TO'5T(72Ta3Ta1Tcr5T(T5Tcr5Tcr3T(T1TrT1Tcr3T<72TrT1Tcr1 ; 

£ = cr3Ta5T<j2Ta2T<j5TO2Tcr3To1Ta1T<j5Ta1T<j3T03Tcr3To3Ta2TCT1Tcr2T<j3T<jlTa1T<j3Tcr2Tcr4 ; 
7 = c7Tcr5rcr5rcr3rc71r<T1T(T3rcr2T(T5r<T2r(T3T(T5r<T1TiT5r(T3T(T2rc71Tcr2T(T3T(T1rcr1r(73Tcr2 

TcrJTcr3Tij5Ta3Tcr5TtT2T(T5Tcr2T<T3Tcr1Tcr1Tcr5T<j1T<73Tcr3TcrSTa5Tij1Tcr5Tcr2Tcr3Ta1Tij2 


If  we  start  with  134526  and  follow  aa(3r  we  reach  163452;  then  follow  yar  and  reach 
126345;  then  follow  ayr  and  reach  152634;  then  follow  /3<ja,  ending  at  415263. 

71*  Brendan  McKay  and  Frank  Ruskey  have  found  such  cycles  by  computer  when 
n = 7,  9,  and  11,  but  no  nice  structure  was  apparent. 

72.  Any  Hamiltonian  path  includes  (n  - 1)!  vertices  that  take  y x,  each  of  which  (if 
not  the  last)  is  followed  by  a vertex  that  takes  x x.  So  one  must  be  last;  otherwise 
(n  — 1)!  + 1 vertices  would  take  x 1-4  x. 

73.  (a)  Assume  first  that  (3  is  the  identity  permutation  ().  Then  every  cycle  of  ct  that 
contains  an  element  of  A lies  entirely  within  A.  Hence  the  cycles  of  a are  obtained  by 
omitting  all  cycles  of  cy.  that  contain  no  element  of  A.  All  remaining  cycles  have  odd 
length,  so  a is  an  even  permutation. 

If  P is  not  the  identity,  we  apply  this  argument  to  a'  = a/3~ , (3'  = (),  and  a'  = ap~ , 
concluding  that  a'  is  an  even  permutation;  thus  a and  f3  have  the  same  sign. 

Similarly,  a and  ot  have  the  same  sign,  because  (3oc  = ( ot/3  ) has  the  same  order 
as  aft- . 

(b)  Let  X be  the  vertices  of  the  Cayley  graph  in  Theorem  R,  and  let  a be  the 
permutation  of  X that  takes  a vertex  n into  crx;  this  permutation  has  g/a  cycles  of 
length  a.  Define  the  permutation  j3  similarly.  Then  d/3_  has  g/c  cycles  of  length  c. 
If  c is  odd,  any  Hamiltonian  cycle  in  the  graph  defines  a cycle  <r  that  contains  all  the 
vertices  and  satisfies  the  hypotheses  of  (a).  Therefore  a and  $ have  an  odd  number  of 
cycles,  because  the  sign  of  a permutation  on  n elements  with  r cycles  is  ( — l)n-r  (see 
exercise  5. 2. 2-2). 

[This  proof,  which  shows  that  X cannot  be  the  union  of  any  odd  number  of  cycles, 
was  presented  by  Rankin  in  Proc.  Cambridge  Phil.  Soc.  62  (1966),  15-16.] 

74.  The  representation  /3J yk  is  unique  if  we  require  0 < j < g/c  and  0 < k < c.  For 
if  we  had  (3j  = 7*  for  some  j with  0 < j < g/c,  the  group  would  have  at  most  jc 
elements.  It  follows  that  /39,c  = 7*  for  some  t. 

Let  it  be  a Hamiltonian  cycle,  as  in  the  previous  answer,  and  let  7 = a/3~ . If 
7TCT  = 7 rd  then  7T7<7  must  be  Tryd,  because  7T7/3  = 7rd.  And  if  n&  = 7 r/3  then  nya  cannot 
be  7T7  a,  because  that  would  imply  irfo  = nj2a,  nycv  = n^ca.  Thus  the  elements 
7r,  7T7,  777  , . . . , all  have  equivalent  behavior  with  respect  to  their  successors  in  it. 

„ . When  the  path  n — > na  — y ■ ■ ■ — > 7r<xJ  has  k(j)  steps  of  type  a,  we  have  tto]  = 
Trf33yk'3) . Thus  7nr9/c  = njt+k(-g^c\  and  we  have  k(j  + g/c)  = k(j)  for  j > 0.  The  path 
returns  to  7r  for  the  first  time  in  g steps  if  and  only  if  t + k(g/c)  is  relatively  prime  to  c. 

75.  Apply  the  previous  exercise  with  g = mn,  a = m,  b = n,  c = mn/d.  The  number  t 
satisfies  t = 0 (modulo  m),  t + d = 0 (modulo  n);  and  it  follows  that  k + t _L  c if  and 
only  if  (d  - k)m/d  _L  kn/d. 

Notes.  The  modular  Gray  code  of  exercise  7.2.1.1—78  is  a Hamiltonian  path  from 
(0, 0)  to  (m  - 1,  (-m)  mod  n),  so  it  is  a Hamiltonian  cycle  if  and  only  if  m is  a multiple 
of  n.  It  is  natural  to  conjecture  (falsely)  that  at  least  one  Hamiltonian  cycle  exists 
whenever  d > 1.  But  P.  Erdos  and  W.  T.  Trotter  have  observed  [J.  Graph  Theory  2 
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(1978),  137-142]  that  if  p and  2p+l  are  odd  prime  numbers,  no  suitable  k exists  when 
m = p(2p  + l)(3p+  1)  and  n=  (3p+  1)  n*=i  9[,i8primel[,#plI,5'2p+l1- 

See  J.  A.  Gallian,  Mathematical  Intelligencer  13,3  (Summer  1991),  40-43,  for 
interesting  facts  about  other  kinds  of  cycles  in  Cm  X Cn- 

76.  We  may  assume  that  the  tour  begins  in  the  lower  left  corner.  There  are  no  solutions 
when  m and  n are  both  divisible  by  3,  because  2/3  of  the  cells  are  unreachable  in  that 
case.  Otherwise,  letting  d = gcd(m,  n)  and  arguing  as  in  the  previous  exercise  but  with 
( x , y)a  = ((x  + 2)  mod  to,  (y  + 1)  mod  n)  and  ( x , y)/3  = ((x  + 1)  mod  m,  (y  + 2)  mod  n), 
we  find  the  answer 
d ^ 

^(fc)[gcd((2d  — k)m,  ( k+d)n ) = d or  ( mn  X 3 and  gcd((2 d—k)m,  (k+d)n)  = 3d)]. 


k= 0 

77.  01 

* Permutation  generator 

Va  la  Heap 

02 

N 

IS 

10 

The  value  of  n (3  or  more,  not  large) 

03 

t 

IS 

$255 

04 

j 

IS 

$0 

8 j 

05 

k 

IS 

$1 

8k 

06 

ak 

IS 

$2 

07 

aj 

IS 

$3 

08 

LOC 

Data_Segment 

09 

a 

GREG 

(3 

Base  address  for  oo  • • • an-i 

10 

AO 

IS 

a 

11 

A1 

IS 

a+8 

12 

A2 

IS 

a+16 

13 

LOC 

9+8*N 

Space  for  ao  ■ ■ ■ a„-i 

14 

c 

GREG 

9-8*3 

Location  of  8co 

15 

LOC 

9-8*3+8*N 

8C3  . . . 8cn-i,  initially  zero 

16 

OCTA 

-1 

8cn  = — 1,  a convenient  sentinel 

17 

u 

GREG 

0 

Contents  of  ao,  except  in  inner  loop 

18 

V 

GREG 

0 

Contents  of  ai,  except  in  inner  loop 

19 

w 

GREG 

0 

Contents  of  a 2,  except  in  inner  loop 

20 

21 

1H 

LOC 

STCO 

#100 

0,c,k 

B — A Cfc  i — 0. 

22 

INCL 

k ,8 

B — A fc  + 1. 

23 

OH 

LDO 

j ,c,k 

B j <-  Cfc . 

24 

CMP 

t,  j ,k 

B 

25 

BZ 

t , IB 

B Loop  if  Ck  = k. 

26 

BN 

j .Done 

A Terminate  if  c/t  < 0 (k  = n). 

27 

LDO 

ak ,a,k 

A — 1 Fetch  ak  ■ 

28 

ADD 

t,  j ,8 

A - 1 

29 

STO 

t ,c  ,k 

A — 1 ck  4-  j + 1. 

30 

AND 

t ,k,#8 

A - 1 

31 

CSZ 

j ,t,o 

A — 1 Set  j <—  0 if  k is  even. 

32 

LDO 

a j , a , j 

A — 1 Fetch  aj . 

33 

STO 

ak.a. j 

A — 1 Replace  it  by  a*,. 

34 

CSZ 

u, j , ak 

A — 1 Set  u <—  ak  if  j = 0. 

35 

SUB 

j.j.8 

A-  1 j *- j - 1. 

36 

CSZ 

v,  j ,ak 

A — 1 Set  v <—  ak  if  j = 0. 
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37 

SUB 

j.j.8 

A - 1 

j <r-  j ~ 1. 

38 

CSZ 

w, j ,ak 

A - 1 

Set  w 4—  ak  if  j = 0. 

39 

STO 

aj ,a,k 

A - l 

Replace  a*,  by  what  was  a 

40 

Inner 

PUSHJ 

0 .Visit 

A 

(See  (42)) 

55 

PUSHJ 

0, Visit 

A 

56 

SET 

t ,u 

A 

Swap  u w. 

57 

SET 

u.w 

A 

58 

SET 

w.t 

A 

59 

SET 

k,8*3 

A 

k <-  3. 

60 

JMP 

0B 

A 

61 

Main 

LDO 

u,  A0 

1 

62 

LDO 

v,  A1 

1 

63 

LDO 

w,A2 

1 

64 

JMP 

Inner 

1 

1 

78.  Lines  31-38  become  2r  — 1 instructions,  lines  61-63  become  r,  and  lines  56-58 
become  3+  (r  - 2 )[r  even]  instructions  (see  u(r  — 1)  in  answer  40).  The  total  running 
time  is  therefore  ((2r!  + 2)A+2B+r-5)/i+((2r!  + 2r+7+(r-2)[r  even]) A+7S-r-4)n, 
where  A = n\/r\  and  B = n!(l/r!  + • • • + 1/n!). 

79.  SLU  u, [#f]  ,t;  SLUt,a,4;  XORt.t.a;  AND  t ,t ,u;  SRUu,t,4;  OR  t,t,u; 
XOR  a,a,t;  here,  as  in  the  answer  to  exercise  1.3.T-34,  the  notation  ‘ [#f  ] ’ denotes  a 
register  that  contains  the  constant  value  #f . (See  the  similar  code  in  7.1.3-(6g).) 

80.  SLU  u,a,t ; MXOR  u,  [#8844221188442211]  ,u;  AND  u,u,  [#f  fOOOOOO]  ; SRUu,u,t; 
XOR  a,a,u.  This  cheats,  since  it  transforms  *12345678  to  *13245678  when  t = 4,  but 
(45)  still  works. 

Even  faster  and  trickier  would  be  a routine  analogous  to  (42):  Consider 


PUSHJ  0,  Visit;  MXORa.a.cl;  PUSHJ  0, Visit;  . . . MXOR  a,a,c5;  PUSHJO.Visit 

where  cl,  ...,  c5  are  constants  that  would  cause  *12345678  to  become  successively 
*12783456,  *12567834,  *12563478,  *12785634,  *12347856.  Other  instructions,  exe- 
cuted only  1/6  or  1/24  as  often,  can  take  care  of  shuffling  nybbles  within  and  between 
bytes.  Very  clever,  but  it  doesn’t  beat  (46)  in  view  of  the  PUSHJ /POP  overhead. 


k IS 

$0  ;kk 

IS  $1  ; c IS  $2  ;d  IS  $3 

SET 

k,  1 

k <—  1. 

3H  SRU 

d.a.60 

d leftmost  nybble. 

SLU 

a, a, 4 

a «—  16a  mod  1616. 

CMP 

c,d,k 

SLU 

kk , k , 2 

SLU 

d.d.kk 

OR 

t ,t  ,d 

t <-  t + 16kd. 

PBNZ 

c , IB 

Return  to  main  loop  if  d / 

INCL 

k.l 

k -t—  k + 1 . 

PBNZ 

a,3B 

Return  to  second  loop  if  k 

82.  /x  + (5n!  + HA  — (n  — l)!  + 6)t>  = ((5  + 10/n)u  + 0(n  2))n!,  plus  the  visiting  time, 
where  A = Y^k=i  *s  t^le  number  of  times  the  loop  at  3H  is  used. 

83.  With  suitable  initialization  and  a 13-octabyte  table,  only  about  a dozen  MMIX 
instructions  are  needed: 
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magic 

GREG  #8844221188442211 

OH 

(Visit  register  a) 

PBN  c, Sigma 

Tau 

MX0R  t, magic, a;  ANDNL  t,#ffff; 

JMP  IF 

Sigma 

SRU  t , a,20;  SLU  a, a, 4;  ANDNML 

a,#f00 

1H 

X0R  a,a,t;  SLU  c,c,l 

2H 

PBNZ  c,0B;  INCL  p,8 

3H 

LD0U  c,p,0;  PBNZ  c,0B 

84.  Assuming  that  the  processors  all  have  essentially  the  same  speed,  we  can  let  the 
kth  processor  generate  all  permutations  of  rank  r for  ( k — 1 )n\/p  < r < kn\/p,  using 
any  method  based  on  control  tables  ci  . ..cn.  The  starting  and  ending  control  tables 
are  easily  computed  by  converting  their  ranks  to  mixed-radix  notation  (exercise  12). 

85.  We  can  use  a technique  like  that  of  Algorithm  3.4.2P:  To  compute  k = r(a),  first 
set  a'a  . 4—  j for  1 < j < n (the  inverse  permutation).  Then  set  A-  •< — 0,  and  for  j = n, 
n - 1,  ...,  2 (in  this  order)  set  t 4-  a',  k 4-  kj  + t - 1,  at  4-  a 3,  a'a.  4 - t.  To 
compute  r1-li(fc),  start  with  ax  4-  1.  Then  for  j = 2,  . . . , n - 1,  n (in  this  order)  set 
t 4—  (k  mod  j)  + 1,  a,j  4-  at,  at  4—  j,  k 4—  \k/j\.  [See  S.  Pleszczynski,  Inf.  Proc.  Letters 
3 (1975),  180  183;  W.  Myrvold  and  F.  Ruskey,  Inf.  Proc.  Letters  79  (2001),  281-284.] 

Another  method  is  preferable  if  we  want  to  rank  and  unrank  only  the  n-  variations 
a\  ...  am  of  {1, ... , n}:  To  compute  k = r(a i . . . om),  start  with  bx  . . . bn  4—  bx  . . . b'n  4— 
1 . . .n;  then  for  j = 1,  .. .,  m (in  this  order)  set  t 4-  b'a. , bt  4-  bn+1-j,  and  b'if  4-  t; 
finally  set  k 4-  0 and  for  j = m,  . . . , 1 (in  this  order)  set  k 4—  k x (n  + 1 — j)  + b'a.  — 1. 
To  compute  rl_l1  (fc),  start  with  &i  . . . bn  4-  1 . . . n;  then  for  j = 1,  . . . , m (in  this  order) 
set  t 4-  (fc  mod  (n  + 1 - j))  + 1,  a3  4-  bt,  bt  4-  6„+i_j,  k 4-  [k/{n  + 1 - j) J.  (See 
exercise  3.4.2-15  for  cases  with  large  n and  small  m .) 

86.  If  a • < y and  y -<  z,  the  algorithm  will  never  move  y to  the  left  of  x,  nor  2 to  the 
left  of  y,  so  it  will  never  test  x versus  z. 

87.  They  appear  in  lexicographic  order;  Algorithm  P used  a reflected  Gray  order. 

88.  Generate  inverse  permutations  with  a'0  < a'x  < a'2,  a'3  < a'4  < a'5,  a'6  < a'r,  a'8  < a'g, 
a0  < a6  < Og  • 

89.  (a)  Let  dk  = maxjj  | 0 < j < k and  j is  nontrivial},  where  0 is  considered 
nontrivial.  This  table  is  easily  precomputed,  because  j is  trivial  if  and  only  if  it  must 
follow  {1, . . . ,j  — 1}.  Set  k 4—  dn  in  step  V2  and  k 4-  dk- i in  step  V5.  (Assume  dn  > 0.) 

(b)  Now  M = 537=1  tj[j  >s  nontrivial], 

(c)  There  are  at  least  two  topological  sorts  aj  . . . ak  of  the  set  {}, . . . , k},  and  either 
of  them  can  be  placed  after  any  topological  sort  ai  . . . Oj_i  of  {1, . . . ,j  — 1}. 

(d)  Algorithm  2.2.3T  repeatedly  outputs  minimal  elements  (elements  with  no 
predecessors),  removing  them  from  the  relation  graph.  We  use  it  in  reverse,  repeatedly 
removing  and  giving  the  highest  labels  to  maximal  elements  (elements  with  no  succes- 
sors). If  only  one  maximal  element  exists,  it  is  trivial.  If  k and  l are  both  maximal, 
they  both  are  output  before  any  element  x with  x -<  k or  x -<  l,  because  steps  T5 
and  T7  keep  maximal  elements  in  a queue  (not  a stack).  Thus  if  k is  nontrivial  and 
output  first,  element  l might  become  trivial,  but  the  next  nontrivial  element  j will  not 
be  output  before  /;  and  k is  unrelated  to  l. 

(e)  Let  the  nontrivial  t’s  be  si  < s2  < ■ ■ ■ < sr  = N.  Then  we  have  sj  > 2 s3_2, 
by  (c).  Consequently  M = s2  + - • - + S,.  < Sr(l  + 1 + i +. . . ) + Sr_1(i  + 1 + 1 + . . . ) < 4Sr. 
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(A  sharper  estimate  is  in  fact  true,  as  observed  by  M.  Peczarski:  Let  s0  = 1,  let 
the  nontrivial  indices  be  0 = kx  < k2  < ■ ■ ■ < kr,  and  let  fc'  = max{fc  | l < k < kj, 
k kj}  for  j > 1.  Then  k'J+1  > kj.  There  are  Sj  topological  sorts  of  {1, ... , kj+i}  that 
end  with  kj+1;  and  there  are  at  least  Sj- 1 that  end  with  fc'+1,  since  each  of  the  Sj_i 
topological  sorts  of  {1, . . . , kj  - 1}  can  be  extended.  Hence 

Sj+ 1 > Sj  + Sj- 1 for  1 < j < r. 

Now  let  y0  = 0,  yi  = F2  + ■ ■ ■ + Fr,  and  Vj  = yj-2  + y_,-i  - Fr+ 1 for  1 < j < r.  Then 

r — 1 

Fr+l  (Si  + ' • • + Sr)  + ^ \ Vj  (sr+l— j — Sr-j  — S, — l_j)  = (F2  + • ■ ■ + Fr+l)sr, 
i= i 

and  each  yj  = Fr+ 1 — 2 Fj  — (— l)j Fr+i-j  is  nonnegative.  Hence  «i  + • • • + sr  < 
((F2  + • • • + Fr  +i)/Fr+i)sr  « 2.6sr.  The  following  exercise  shows  that  this  bound  is 
best  possible.) 

90.  The  number  N of  such  permutations  is  Fn+i  by  exercise  5.2.1-25.  Therefore 

M = Fn+i  H |-F2  = Fn+ 3 — 2 « c/>2N.  Notice  incidentally  that  all  such  permutations 

satisfy  al...an  = a’i  . . . a’n . They  can  be  arranged  in  a Gray  path  (exercise  7.2.1.1-89). 

91.  Since  tj  = ( j - 1 )(j  - 3) ...  (2  or  1),  we  find  M = (l  + 2/y/wn  + 0(l/n))N. 

Note:  The  inversion  tables  ci  . . . c2n  for  permutations  satisfying  (49)  are  charac- 
terized by  the  conditions  ci  = 0,  0 < c2k  < c2k-i,  0 < c2k+ 1 < c2k-i  + 1. 

92.  The  total  number  of  pairs  (R,S),  where  R is  a partial  ordering  and  5 is  a linear 
ordering  that  includes  R,  is  equal  to  P„  times  the  expected  number  of  topological  sorts; 
it  is  also  Qn  times  n!.  So  the  answer  is  n!  Qn/Pn- 

We  will  discuss  the  computation  of  Pn  and  Qn  in  Section  7.2.3.  For  1 < n < 12 
the  expectation  turns  out  to  be  approximately 

(1,  1.33,  2.21,  4.38,  10.1,  26.7,  79.3,  262,  950,  3760,  16200,  74800). 

Asymptotic  values  as  n ->  00  have  been  deduced  by  Brightwell,  Promel,  and  Steger 
[J.  Combinatorial  Theory  A73  (1996),  193-206],  but  the  limiting  behavior  is  quite 
different  from  what  happens  when  n is  in  a practical  range.  The  values  of  Qn  were  first 
determined  for  n < 5 by  S.  P.  Avann  [^quationes  Math.  8 (1972),  95-102]. 

93.  The  basic  idea  is  to  introduce  dummy  elements  n + 1 and  n + 2 with  j -<  n + 1 
and  j A n + 2 for  1 < j < n,  and  to  find  all  topological  sorts  of  such  an  extended 
relation  via  adjacent  interchanges;  then  take  every  second  permutation,  suppressing 
the  dummy  elements.  An  algorithm  similar  to  Algorithm  V can  be  used,  but  with  a 
recursion  that  reduces  n to  n — 2 by  inserting  n — 1 and  n among  ai  . . . an_2  in  all 
possible  ways,  assuming  that  n - 1 7^  n,  occasionally  swapping  n + 1 with  n + 2.  [See 
G.  Pruesse  and  F.  Ruskey,  SICOMP  23  (1994),  373—386.  A loopless  implementation 
has  been  described  by  Canfield  and  Williamson,  Order  12  (1995),  57-75.] 

94.  The  case  n = 3 illustrates  the  general  idea  of  a pattern  that  begins  with  1 . . . (2 n) 
and  ends  with  1 (2n)2(2n-l) . . . n(n+l):  123456,  123546,  123645,  132645,  132546, 
132456,  142356,  142536,  142635,  152634,  152436,  152346,  162345,  162435,  162534. 

Perfect  matchings  can  also  be  regarded  as  involutions  of  (1, ... , 2n}  that  have  n 
cycles.  With  that  representation  this  pattern  involves  two  transpositions  per  step. 

Notice  that  the  C inversion  tables  of  the  permutations  just  listed  are  respectively 
000000,  000100,  000200,  010200,  010100,  010000,  020000,  020100,  020200,  030200, 
030100,  030000,  040000,  040100,  040200.  In  general,  Ci  = C3  = • • • = C2n-1  = 0 
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and  the  n-tuples  (C2,  C4, . . . , C2n)  run  through  a reflected  Gray  code  on  the  radices 
(2n  - 1,  2n  — 3, . . . , 1).  Thus  the  generation  process  can  easily  be  made  loopless  if 
desired.  [See  Timothy  Walsh,  J.  Combinatorial  Math,  and  Combinatorial  Computing 
36  (2001),  95-118,  Section  1.] 

Note:  Algorithms  to  generate  all  perfect  matchings  go  back  to  J.  F.  Pfaff  [Abhand- 
lungen  Akad.  Wissenschaften  (Berlin:  1814  1815),  124-125],  who  described  two  such 
procedures:  His  first  method  was  lexicographic,  which  also  corresponds  to  lexicographic 
order  of  the  C inversion  tables;  his  second  method  corresponds  to  colex  order  of  those 
tables.  Even  and  odd  permutations  alternate  in  both  cases. 

95.  Generate  inverse  permutations  with  a[  < a'n  > a'2  < a'n_1  > • • • , using  Algo- 
rithm V.  (See  exercise  5.1.4-23  for  the  number  of  solutions.) 

96.  For  example,  we  can  start  with  ax  . . ,an-1an  = 2...nl  and  M2  • • • bnbn+i  = 
12... nl,  and  use  Algorithm  P to  generate  the  (n  - 1)!  permutations  fe2  • • • bn  of 
(2, . . . ,n}.  Just  after  that  algorithm  swaps  bt  -H-  bi+1,  we  set  a/,i_1  <—  bi,  abi  4-  bi+ 1, 
abi+1  «—  bi+ 2,  and  visit  ai  ...  an. 

97.  Use  Algorithm  X,  with  tk(a\ , . . . , ak)  = lak  / k\ 

98.  Using  the  notation  of  exercise  47,  we  have  Nk  = ^ (J)(-l)J(n  - j)LJ  by  the 
method  of  inclusion  and  exclusion  (exercise  1.3.3-26).  If  k = O(logn)  then  Nn-k  = 
(rz! e—1/fc!) (1  + 0(logn)2/n);  hence  A/n\  « (e  - l)/e  and  B/n\  k 1.  The  number  of 
memory  references,  under  the  assumptions  of  answer  48,  is  therefore  « A + B + 3A  + 
B — Nn  + 3A  « n!  (9  — |)  ss  6.06n!,  about  16.5  per  derangement.  [See  S.  G.  Akl,  BIT 
20  (1980),  2-7,  for  a similar  method.] 

99.  Suppose  Ln  generates  D„UDn-i,  beginning  with  (12  ...  n),  then  (2  1 ...  n),  and 
ending  with  (1  . . . n— 1);  for  example,  L3  = (1  2 3),  (2  1 3),  (1  2).  Then  we  can  generate 
Dn+ 1 as  Knn,  . ..,  Kn2,  Kn  1,  where  Knk  = (12  ...  n)~k(n  n+l)Ln(l  2 ...  n)k ; for 
example,  D4  is 

(1  2 3 4),  (2  1 3 4),  (1  2)(3  4),  (3  1 2 4),  (1  3 2 4),  (3  1)(2  4),  (2  3 1 4),  (3  2 1 4),  (2  3)(1  4). 

Notice  that  Knk  begins  with  the  cycle  (fc+1  ...  n 1 ...  k n+1)  and  ends  with 
(k+ 1 ...  nl  ...  k—l)(k  n+1);  so  premultiplication  by  (k- 1 k ) takes  us  from  Knk 
to  Kn(k_ i).  Also,  premultiplication  by  (1  n)  will  return  from  the  last  element  of  Dn+ 1 
to  the  first.  Premultiplication  by  (1  2 n+1)  takes  us  from  the  last  element  of  Dn+ 1 to 
(2  13  ...  n),  from  which  we  can  return  to  (1  2 ...  n)  by  following  the  cycle  for  Dn 
backwards,  thereby  completing  the  list  Ln+ 1 as  desired. 

100.  Use  Algorithm  X,  with  ffc(ai, . . . , a*,)  = ‘p  > 0 or  l[q]  ^ k + 1’. 

Notes:  The  number  of  indecomposable  permutations  is  [2"](i-i/£r=o  fci**); 
see  L.  Comtet,  Comptes  Rendus  Acad.  Sci.  A275  (Paris,  1972),  569-572. 

A.  D.  King  [ Discrete  Math.  306  (2006),  508-516]  has  shown  that  indecomposable 
permutations  can  be  generated  efficiently  by  making  only  a single  transposition  at  each 
step.  In  fact,  adjacent  transpositions  may  well  suffice;  for  example,  when  n = 4 the 
indecomposable  permutations  are  3142,  3412,  3421,  3241,  2341,  2431,  4231,  4321,  4312, 
4132,  4123,  4213,  2413. 

101.  Here  is  a lexicographic  involution  generator  analogous  to  Algorithm  X. 

Yl.  [Initialize.]  Set  ak  4-  k and  lk-i  4~  k for  1 < k < n.  Then  set  ln  4-  0,  k 4-  1. 
Y2.  [Enter  level  k.]  If  k > n,  visit  ai  . . . an  and  go  to  Y3.  Otherwise  set  p 4—  lo, 
uk  4r-  p,  1 0 lp,  k <r-  k + 1,  and  repeat  this  step.  (We  have  decided  to  let 
ap  =P-) 
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Y3.  [Decrease  k.]  Set  k 4-  k — 1,  and  terminate  if  k = 0.  Otherwise  set  q <—  uk  and 
p 4—  aq.  If  p = q,  set  lo  4—  q,  q <—  0,  r 4—  lp , and  k 4—  k + 1 (preparing  to  make 

ap  > p)-  Otherwise  set  lUk_1  <—  q,  r 4—  lq  (preparing  to  make  ap  > q). 

Y4.  [Increase  ap .]  If  r = 0 go  to  Y5.  Otherwise  set  lq  <—  lr,  uk-i  4-  q,  uk  4-  r, 
ap  4—  r,  aq  4—  q,  ar  4—  p,  k 4—  k + l,  and  go  to  Y2. 

Y5.  [Restore  ap.]  Set  lo  <—  p,  ap  4-  p,  aq  t—  q,  k 4—  k — 1,  and  return  to  Y3.  | 

Let  fn+i  = tn  + Titn— i,  an+ 1 = 1 + dn  + ncLn— i,  to  = t\  — 1,  ao  = 0,  ax  = 1.  (See 
Eq.  5.1.4-(4o).)  Step  Y2  is  performed  tn  times  with  k > n and  an  times  with  k < n. 
Step  Y3  is  performed  an  times  with  p = q and  an  + tn  times  altogether.  Step  Y4  is 
performed  tn  — 1 times;  step  Y5,  an  times.  The  total  number  of  mems  for  all  tn  outputs 
is  therefore  approximately  lla„  + 12fn,  where  an  < 1.25331414tn.  (Optimizations  are 
clearly  possible  if  speed  is  essential.) 

102.  We  construct  a list  Ln  that  begins  with  ()  and  ends  with  (n— 1 n),  starting  with 
^3  = (),  (1  2),  (1  3),  (2  3).  If  n is  odd,  Ln+1  is  Ln,  K*x,  Kn2,  ...,  K*n,  where 
Knk  = (k  ...  n)~  Ln-\(k  . . . n)(k  n+1).  For  example, 

L4  = (),  (1  2),  (1  3),  (2  3),  (2  3) ( 1 4),  (1  4),  (2  4),  (1  3)(2  4),  (1  2)(3  4),  (3  4). 

If  n is  even,  Ln+1  is  Ln,  Kn(n_1},  K*{n_a),  ...,  KnU  (1  n-2)L«_1(l  n-2)(n  n+1). 
For  further  developments,  see  the  article  by  Walsh  cited  in  answer  94. 

103.  The  following  elegant  solution  by  Carla  Savage  needs  only  n - 2 different  op- 

erations pj,  for  1 < j < n,  where  pj  replaces  aj-Xajaj+x  by  aj+1aj^1aj  when  j is 
even,  ajctj+idj-i  when  j is  odd.  We  may  assume  that  n > 4;  let  A4  = (P3P2P2P3)3 ■ 
In  general  A„  will  begin  and  end  with  pn~i,  and  it  will  contain  2n  — 2 occurrences 
of  pn- 1 altogether.  To  get  An+ 1,  replace  the  fcth  of  An  by  pnA'npn,  where 

k = 1,  2,  4,  . . . , 2n  — 2 if  n is  even  and  k = 1,3,  . . . , 2n  — 3,  2n  — 2 if  n is 
odd,  and  where  A'n  is  +„  with  its  first  or  last  element  deleted.  Then,  if  we  begin 
with  a\...an  = 1 . . . n,  the  operations  pn_i  of  An  will  cause  position  an  to  run 
through  the  successive  values  n — > px  -A  n -+  p2  -+  •••  -4  pn-i  ->  n,  where 
Pi  ■ ■ ■ pn-i  = (n—1  — [neven]) ...  4213 ...  (n—1  — [nodd]);  the  final  permutation  will 
again  be  1 . . . n. 

104.  (a)  A well-balanced  permutation  has  Ylk=i  = n(n  + l)2/4!  an  integer. 

(b)  Replace  k by  ak  when  summing  over  k. 

(c)  A fairly  fast  way  to  count,  when  n is  not  too  large,  can  be  based  on  the 
streamlined  plain-change  algorithm  of  exercise  16,  because  the  quantity  kak  changes 
in  a simple  way  with  each  adjacent  interchange,  and  because  n - 1 of  every  n steps 
are  “hunts”  that  can  be  done  rapidly.  We  can  save  half  the  work  by  considering  only 
permutations  in  which  1 precedes  2.  The  values  for  1 < n < 15  are  0,  0,  0,  2,  6,  0,  184, 
936,  6688,  0,  420480,  4298664,  44405142,  0,  6732621476. 

105.  (a)  For  each  permutation  a4  . . .an,  insert  + between  a,j  and  aj+ 1 if  a3  > aj+1\ 
insert  either  = or  + between  them  if  a3  < Oj+i.  (A  permutation  with  k “ascents” 
therefore  yields  2k  weak  orders.  Weak  orders  are  sometimes  called  “preferential  arrange- 
ments”; exercise  5.3. 1-4  shows  that  there  are  approximately  n!/(2(ln2)’1+1)  of  them. 
A Gray  code  for  weak  orders,  in  which  each  step  changes  -<  44  = and/or  a3  44  a]+l, 
can  be  obtained  by  combining  Algorithm  P with  Gray  binary  code  at  the  ascents. 

(b)  Start  with  a\  . . . anan4-i  = 0 ...  00  and  ao  = —1.  Perform  Algorithm  L until 
it  stops  with  j = 0.  Find  k such  that  ax  > • • • > ak  = a^+i,  and  terminate  if 
k = n.  Otherwise  set  a;  ak+ 1 + 1 for  1 < l < k and  go  to  step  L4.  [See  M.  Mor 
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and  A.  S.  Fraenkel,  Discrete  Math.  48  (1984),  101—112.  Weak  ordering  sequences  are 
characterized  by  the  property  that,  if  k appears  and  k > 0,  then  k - 1 also  appears.] 

106.  All  weak  ordering  sequences  can  be  obtained  by  a sequence  of  elementary  oper- 
ations a,  a,j  or  a*  <—  aj.  (Perhaps  one  could  actually  restrict  the  transformations 
further,  allowing  only  a j <4  aJ+1  or  aj  <-  aj+ 1 for  1 < j < n.) 

107.  Every  step  increases  the  quantity  YH=x  2fc[a*  = *],  as  noted  by  H.  S.  Wilf,  so  the 
game  must  terminate.  At  least  three  approaches  to  the  solution  are  plausible:  one  bad, 
one  good,  and  one  better. 

The  bad  one  is  to  play  the  game  on  all  13!  shuffles  and  to  record  the  longest.  This 
method  does  produce  the  correct  answer;  but  13!  is  6,227,020,800,  and  the  average  game 
lasts  ss  8.728  steps. 

The  good  one  [A.  Pepperdine,  Math.  Gazette  73  (1989),  131-133]  is  to  play 
backwards,  starting  with  the  final  position  1*  ...  * where  * denotes  a card  that  is  face 
down;  we  will  turn  a card  up  only  when  its  value  becomes  relevant.  To  move  backward 
from  a given  position  ai  . . . an , consider  all  k > 1 such  that  either  ak  = k or  ak  = * and 
k has  not  yet  turned  up.  Thus  the  next-to-last  positions  are  21*  . . . *,  3*1*  . . . *,  . . . , 
n*  . . . *1.  Some  positions  (like  6**213  for  n = 6)  have  no  predecessors,  even  though  we 
haven’t  turned  all  the  cards  up.  It  is  easy  to  explore  the  tree  of  potential  backwards 
games  systematically,  and  one  can  in  fact  show  that  the  number  of  nodes  with  t *’s  is 
exactly  (n  — l)!/f!.  Hence  the  total  number  of  nodes  considered  is  exactly  |_(ra  — 1)!  ej. 
When  n = 13  this  is  1,302,061,345. 

The  better  one  is  to  play  forwards,  starting  with  initial  position  * ...  * and  turning 
over  the  top  card  when  it  is  face  down,  running  through  all  (n  — 1)!  permutations  of 
{2, . . . , n}  as  cards  are  turned.  If  the  bottom  n — m cards  are  known  to  be  equal 
to  (m+l)(m+2) ...  n,  in  that  order,  at  most  f(m)  further  moves  are  possible;  thus  we 
need  not  pursue  a line  of  play  any  further  if  it  cannot  last  long  enough  to  be  interesting. 
A permutation  generator  like  Algorithm  X allows  us  to  share  the  computation  for  all 
permutations  with  the  same  prefix  and  to  reject  unimportant  prefixes.  The  card  in  posi- 
tion j need  not  take  the  value  j when  it  is  turned.  When  n = 13  this  method  needs  to 
consider  only  respectively  (1,11,940,6960,44745,245083,1118216,4112676,11798207, 
26541611,44380227,37417359)  branches  at  levels  (1,2,...,  12)  and  to  make  a total  of 
only  482,663,902  forward  moves.  Although  it  repeats  some  lines  of  play,  the  early  cutoffs 
of  unprofitable  branches  make  it  run  more  than  11  times  faster  than  the  backward 
method  when  n = 13. 

The  unique  way  to  attain  length  80  is  to  start  with  29451112  10  18  13  36  7. 

108.  This  result  holds  for  any  game  in  which 

di  . . .an  -4  a*,ap(fci2)  . . . ap^ktk_1^aiak+i  . . . a„ 

when  ai  = k,  where  p(k , 2) . . .p(k,  k — 1)  is  an  arbitrary  permutation  of  (2, . . . , k — 1}. 
Suppose  ai  takes  on  exactly  m distinct  values  d(l)  < ■ • • < d(m)  during  a play  of 
the  game;  we  will  prove  that  at  most  Fm+i  permutations  occur,  including  the  initial 
shuffle.  This  assertion  is  obvious  when  m = 1. 

Let  d(j)  be  the  initial  value  of  ad(m),  where  j < m,  and  suppose  ad(m)  changes  on 
step  r.  If  d(j)  = 1,  the  number  of  permutations  is  r + 1 < Fm  + 1 < Fm+ 1.  Otherwise 
r < Fm- 1,  and  at  most  Fm  further  permutations  follow  step  r.  [SIAM  Review  19 
(1977),  739-741.] 

The  values  of  f(n)  for  1 < n < 16  are  (0,  1,  2,  4,  7,  10,  16,  22,  30,  38,  51,  65,  80, 
101,  113,  139),  and  they  are  attainable  in  respectively  (1,  1,  2,  2,  1,  5,  2,  1,  1,  1,  1,  1, 
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1,  4,  6,  1)  ways.  The  unique  longest-winded  permutation  for  n = 16  is 
9 12  6 7 2 14  8 1 11  13  5 4 15  16  10  3. 

109.  An  ingenious  construction  by  I.  H.  Sudborough  and  L.  Morales  [Theoretical 
Comp.  Sci.  411  (2010),  3965-3970]  proves  that  f(n)  > -L^n2  + 0(1). 

110.  For  0 < j < 9 construct  the  bit  vectors  Aj  = [a3  € Si] . . . [ cij  € Sm]  and  Bj  = 
[jeSi]...[je  Sm  ].  Then  the  number  of  j such  that  Aj  = v must  equal  the  number 
of  k such  that  Bk  = v,  for  all  bit  vectors  v.  And  if  so,  the  values  {aj  | Aj  = u}  should 
be  assigned  to  permutations  of  {k  \ Bk  = u}  in  all  possible  ways. 

For  example,  the  bit  vectors  in  the  given  problem  are 

(A0, . . . , Ag)  - (9, 6, 8, b, 5, 4,0, a, 2,0),  (B0,...,B9)  = (5, 0, 8,  6,  2,  a.  4,  b,  9,  0), 

in  hexadecimal  notation;  hence  a0  . . . ag  = 8327061549  or  8327069541. 

In  a larger  problem  we  would  keep  the  bit  vectors  in  a hash  table.  It  would  be 
better  to  give  the  answer  in  terms  of  equivalence  classes,  not  permutations;  indeed,  this 
problem  has  comparatively  little  to  do  with  permutations. 

111.  In  the  directed  graph  with  n\/ 2 vertices  oi...o„_2  and  n\  arcs  a1...an-2  -A 
a2  ...  a„_i  (one  for  each  permutation  a i . . . a„),  each  vertex  has  in-degree  2 and  out- 
degree  2.  Furthermore,  from  paths  like  ai...a„_2  -A  a2...a„_i  -4  a3...a„  -a 
a 4 • • • OjnCL2  as  .. . ana,2ai  — > • • • — y d2d\d 3 . . . an_2,  we  can  see  that  any  vertex  is 
reachable  from  any  other.  Therefore  an  Eulerian  trail  exists  by  Theorem  2.3.4.2G,  and 
such  a trail  clearly  is  equivalent  to  a universal  cycle  of  permutations.  The  lexicograph- 
ically smallest  example  when  n = 4 is  (123124132134214324314234). 

[G.  Hurlbert  and  G.  Isaak,  in  Discrete  Math.  149  (1996),  123-129,  have  suggested 
another  appealing  approach:  Let’s  say  that  a modular  universal  cycle  of  permutations 
is  a cycle  of  n!  digits  {0, . . . , n}  with  the  property  that  each  permutation  a\...an  of 
{1, ....  n}  arises  from  consecutive  digits  m ...  un  by  letting  aj  = (u3  - c)  mod  (n  + 1), 
where  c is  the  “missing”  digit  in  {ui, . . . , un}.  For  example,  the  modular  universal  cycle 
(012032)  is  essentially  unique  for  n = 3;  and  the  lexicographically  smallest  for  n = 4 is 
(012301420132014321430243).  If  vertices  0l...a„_2  and  a\  ...a'n_2  in  the  digraph 
of  the  previous  paragraph  are  considered  equivalent  when  - a'x  = • ■ • = an^2  ~ a'n_2 
(modulo  n),  we  get  a digraph  of  ( n — l)!/2  vertices  whose  Eulerian  trails  correspond 
to  the  modular  universal  cycles  of  permutations  for  {1, . . . , n — 1}.] 

112.  (a)  If  the  cycle  is  a\a2  . . . , use  a at  step  j if  the  subsequence  a^oJ+i  . . . oJ+n_i  is 
a permutation;  otherwise  use  p. 

(b)  This  statement  follows  immediately  from  exercise  72. 

(c)  Let  1?2  = u-2,  and  obtain  from  12n  by  substituting  o ha  o 2pn~1  and  p ha 

ct  p o.  For  example,  1?3  = (cr2p)2  and  J?4  = ((o~p2)2cr2po)2 . Generate  permutations 
by  starting  with  n ...  21  and  applying  the  successive  elements  of  Qn ; for  example,  the 
sequence  when  n = 4 is 

4321,  3214,  2143, 1423, 4213,  2134, 1342, 3412, 4132, 1324, 3241, 2431, 

4312, 3124, 1243, 2413, 4123, 1234, 2341, 3421, 4231,  2314, 3142, 1432, 

and  the  corresponding  universal  cycle  is  (432142134132431241234231).  Notice  that  n 
moves  cyclically  in  this  sequence  of  permutations;  and  the  permutations  that  begin 
with  n correspond  to  the  sequence  obtained  from  Qn-i  ■ 

[See  F.  Ruskey  and  A.  Williams,  ACM  Trans,  on  Algorithms  6 (2010),  45:1- 
45:12.  Similar  methods  are  said  to  be  known  to  bell-ringers.  Universal  cycles  can 
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also  be  constructed  explicitly  for  permutations  of  an  arbitrary  multiset,  with  a method 
analogous  to  7.2.1. M62);  see  A.  Williams,  Ph.D.  thesis  (Univ.  of  Victoria,  2009).] 
113.  By  exercise  2.3.4.2-22  it  suffices  to  count  the  oriented  trees  rooted  at  12  . . . (n— 2), 
in  the  digraph  of  the  preceding  answer;  and  those  trees  can  be  counted  by  exercise 
2.3.4.2-19.  For  n < 6 the  numbers  Un  turn  out  to  be  tantalizingly  simple:  U2  = 1, 
U3  = 3,  t/4  = 27  • 3,  U5  = 233  • 38  • 53,  Ue  = 2190  • 349  • 533.  (Here  we  consider  (121323) 
to  be  the  same  cycle  as  (213231),  but  different  from  (131232).) 

Mark  Cooke  has  discovered  the  following  instructive  way  to  compute  these  values 
efficiently:  Consider  the  n!  x n\  matrix  M = 21  - R — S,  where  = [tt'  = 7rp]  and 
— - - ' — I71'  =7rcr].  There  is  a matrix  H such  that  H RH  and  H~SH  each  have  block 
diagonal  form  consisting  of  kx  copies  of  kx  x kx  matrices  Rx  and  Sx,  for  each  partition  A 
of  n,  where  kx  is  n!  divided  by  the  product  of  the  hook  lengths  of  shape  A (Theorem 
5.1.4H),  and  where  Rx  and  Sx  are  matrix  representations  of  p and  a based  on  Young 
tableaux.  [A  proof  can  be  found  in  Bruce  Sagan,  The  Symmetric  Group  (Pacific  Grove, 
Calif.:  Wadsworth  & Brooks/Cole,  1991).]  For  example,  when  n = 3 we  have 
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when  rows  and  columns  are  indexed  by  the  respective  permutations  1,  a,  a2,  p,  pa, 
pa  ; here  k3  = fcm  = 1 and  fc2 1 = 2.  Therefore  the  eigenvalues  of  M are  the  union, 
over  A,  of  &A-fold  repeated  eigenvalues  of  the  kx  x kx  matrices  27  - Rx  - Sx.  In  the 
example,  the  eigenvalues  of  (0),  (2),  and  (_22  °)  twice  are  {0},  {2},  and  {2,3}  twice. 

The  eigenvalues  of  M are  directly  related  to  those  of  the  matrix  A in  exercise 
2.3.4.2-19.  Indeed,  each  eigenvector  of  A yields  an  eigenvector  of  M,  if  we  equate  the 
components  for  permutations  7r  and  -Kpa~ , because  rows  77  and  ixpo~  of  R + S are 
equal.  For  example, 


A = 


(2  -1-1 
-1  2 -1 
V-l  -1  2 


has  eigenvectors 


( 1 V 

I 0 1 for  eigenvalues  0,  3,  3, 


yielding  the  eigenvectors  (1, 1, 1, 1, 1, 1)T,  (1,-1,  0, 0,  -1, 1)T,  (1, 0,  -1,-1, 0, 1)T  of  M 
for  the  same  eigenvalues.  And  M has  n!/2  additional  eigenvectors,  with  all  components 
zero  except  those  indexed  by  7r  and  7r a~p  for  some  7r,  because  only  rows  np~  and  na~ 
of  R + S have  nonzero  entries  in  columns  n and  7nr“p;  such  vectors  yield  n!/2  additional 
eigenvalues,  all  equal  to  2. 

Therefore  Un,  which  is  2/n!  times  the  product  of  the  nonzero  eigenvalues  of  A,  is 
21  n'^2/n\  times  the  product  of  the  nonzero  eigenvalues  of  M. 

Unfortunately  the  small-prime-factor  phenomenon  does  not  continue;  U7  equals 

212173123511975112843357320792110935)  and  Ug  jg  divisible  by  59229013196333168. 
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SECTION  7. 2. 1.3 

1.  Given  a multiset,  form  the  sequence  ef...e2ei  from  right  to  left  by  listing  the 
distinct  elements  first,  then  those  that  appear  twice,  then  those  that  appear  thrice, 
etc.  Let  us  set  e~j  t—  s — j for  0 < j < s = n — f,  so  that  every  element  e3  for  1 < j < t 
is  equal  to  some  element  to  its  right  in  the  sequence  et  ■ ■ ■ eieo  . . . e_s.  If  the  first  such 
element  is  eCj-s,  we  obtain  a solution  to  (3).  Conversely,  every  solution  to  (3)  yields  a 
unique  multiset  {ei , . . . , et},  because  Cj  < s + j for  1 < j < t. 

[A  similar  correspondence  was  proposed  by  E.  Catalan:  If  0 < ei  < • • • < et  < s,  let 

{ci,...,ct}  = {ei, . . . ,et}  U {s  + j | 1 < j < t and  ej  = ej+i}. 


See  Memoires  de  la  Soc.  roy.  des  Sciences  de  Liege  (2)  12  (1885),  Melanges  Math.,  3.] 
2.  Start  at  the  bottom  left  corner;  then  go  up  for  each  0,  go  right  for  each  1.  The 

result  is 


. Conversely,  we  can  easily  “read  off”  the  representations  a,,  b,.  a, 
di,  pi,  or  qi  of  (2)-(n)  from  any  given  path  from  (0,0)  to  (s,t). 

3.  In  this  algorithm,  variable  r is  the  least  positive  index  such  that  qr  > 0. 

Nl.  [Initialize.]  Set  < jj  4—  0 for  1 < j < t,  and  q0  4—  s.  (We  assume  that  st  > 0.) 
N2.  [Visit.]  Visit  the  composition  qt  ■ ■ ■ qo-  Go  to  N4  if  qo  = 0. 

N3.  [Easy  case.]  Set  qo  4—  qo  — 1,  r 4—  1,  and  go  to  N5. 

N4.  [Tricky  case.]  Terminate  if  r = t.  Otherwise  set  go  4—  qr  — \,qr  4—  0,  r 4—  r + 1. 
N5.  [Increase  qr.\  Set  qr  4—  qr  + 1 and  return  to  N2.  | 

[See  CACM  11  (1968),  430;  12  (1969),  187.  The  task  of  generating  such  compositions 
in  decreasing  lexicographic  order  is  more  difficult.] 

4.  We  can  reverse  the  roles  of  0 and  1 in  (14),  so  that  09'  lO9'*1 1 . . . 10<'1 10?n  = 
l’'301ra-10...01ri01r°.  This  gives  0110°10210210410010o10o10o10o10110010110°  = 
lo01201o01101o01101o01°01o016012011 . Lexicographic  order  of  an-\...a\ao  corre- 
sponds to  lexicographic  order  of  rs  ■ ■ ■ r\ro- 

Incidentally,  there’s  also  a multiset  connection:  {dt, . . . . di } = {rs  ■ s, . . . ,ro  ■ 0}. 
For  example,  {10,10,8,6,2,2,2,2,2,2,1,1,0}  = {0  • 11, 2 • 10, 0 • 9, 1 • 8, 0 • 7, 1 ■ 6, 0 • 5, 
0-4, 0-3, 6-2, 2-1, 10}. 

5.  (a)  Set  Xj  = Cj  — [(}  — 1)/2J  in  each  t-combination  of  n+ [t/2j . (b)  Set  xj  =Cj+j+ 1 
in  each  f-combination  of  n — t — 2. 

(A  similar  approach  finds  all  solutions  ( xt , . . . , xi)  to  the  inequalities  Xj+ 1 > Xj  + Sj 
for  0 < j < t,  given  the  values  of  xt+i,  (St,  ■ ■ ■ ,<5o),  and  xq.) 

6.  Assume  that  t > 0.  We  get  to  T3  when  ci  > 0;  to  T5  when  C2  = ci  + 1 > 1;  to  T4 

for  2 < j < f + 1 when  Cj  = Ci+j  — 1 > j.  So  the  counts  are:  Tl,  1;  T2,  (");  T3,  1) ; 

T4,  (7"a)  + (r23)  + ■ ■ ■ + rr1)  = (n1);  T5,  (n);  T6,  (H)  + C:2)  - 1. 

7.  A procedure  slightly  simpler  than  Algorithm  T suffices:  Assume  that  s < n. 

51.  [Initialize.]  Set  bj  4—  j + n — s — 1 for  1 < j < s;  then  set  j 4—  1. 

52.  [Visit.]  Visit  the  combination  b,  . . .62hi.  Terminate  if  j > s. 


S3.  [Decrease  bj.}  Set  bj 


1.  If  bj  < j,  set  j 4—  j + 1 and  return  to  S2. 


S4.  [Reset  bj- 1 . . . 61.]  While  j > 1,  set  bj 


bj  — 1 and  j 4—  j — 1.  Go  to  S2.  | 
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(See  S.  Dvorak,  Comp.  J.  33  (1990),  188.  Notice  that  if  xk  = n - bk  for  1 < k < s, 
this  algorithm  runs  through  all  combinations  xs  ...  x2x\  of  {1,2,...,  tt,}  with  1 < xs  < 
■ ■ ■ < x2  < x\  < n,  in  increasing  lexicographic  order.) 

8.  Al.  [Initialize.]  Set  an  ...  ao  <—  03+1lt,  q t,  r <—  0.  (We  assume  that  0 < t < n.) 
A2.  [Visit.]  Visit  the  combination  a„_i  . . . aia0.  Go  to  A4  if  q = 0. 

A3.  [Replace  . . . 01"  by  ...  1019-1.]  Set  aq  ©-  1,  a,_ i 0,  q <-  q - 1;  then  if 
q = 0,  set  r <—  1.  Return  to  A2. 

A4.  [Shift  block  of  Is.]  Set  ar  <-  0 and  r <-  r + 1 . Then  if  ar  = 1,  set  aq  «-  1, 
q t—  q + 1,  and  repeat  step  A4. 

A5.  [Carry  to  left.]  Terminate  if  r = n;  otherwise  set  ar  1. 

A6.  [Odd?]  If  q > 0,  set  r «-  0.  Return  to  A2.  | 

In  step  A2,  q and  r point  respectively  to  the  rightmost  0 and  1 in  an-4  ...a0.  Steps 
Al,  . . . , A6  are  executed  with  frequency  1,  (”),  (”“*),  (")  - 1,  (n{1),  (n“1)  - 1. 

9.  (a)  The  first  ( ( ) strings  begin  with  0 and  have  2A(s_j)t  bit  changes;  the  other 
("lx1)  begin  with  1 and  have  2As(t_1).  And  !/(01f0s_1  © 10slt_1)  = 2min(s,t). 

(b)  Solution  1 (direct):  Let  Bst  = Ast  + min(s,t)  + 1.  Then 

Bst  = S(s-i)t  + Ss(t- 1)  + [s  = t]  when  st  > 0;  Bst  = 1 when  st  = 0. 

Consequently  Bst  = £“in0<3’t)  (s+‘_-2fc) . If  s < t this  is  < (s+{t':)  = (s+t+1)  = 

(T)^<2  (T).  ~ " 

Solution  2 (indirect):  The  algorithm  in  answer  8 makes  2 (x  + y)  bit  changes  when 
steps  (A3,  A4)  are  executed  ( x,y ) times.  Thus  Ast  < ("ll)  + (")  - 1 < 2(”). 

[The  comment  in  answer  7. 2. 1.1-3  therefore  applies  to  combinations  as  well.] 

10.  Each  scenario  corresponds  to  a (4, 4)-combination  bAb3b2bi  or  c4c3c2c i in  which 
A wins  games  {8  - b4 , 8 — b3 , 8 — b2 , 8 — b\ } and  N wins  games  {8  — c4 , 8 — c3 , 8 - c2 , 8 — ci } , 
because  we  can  assume  that  the  losing  team  wins  the  remaining  games  in  a series  of  8. 
(Equivalently,  we  can  generate  all  permutations  of  { A,  A,  A,  A,  N,  N,  N,  N}  and  omit 
the  trailing  run  of  As  or  Ns.)  The  American  League  wins  if  and  only  if  b\  ^ 0,  if  and 
only  if  ci  = 0.  The  formula  (c{)  + (c33)  + (c22)  © (cl1)  assigns  a unique  integer  between 
0 and  69  to  each  scenario. 

For  example,  ANANAA  4=>  a7...aia0  = 01010011  <^=>  b4b3b2b4  = 7532  <©=> 
C4C3C2C1  = 6410,  and  this  is  the  scenario  of  rank  (®)  + + Q)  © = 19  in  lexicographic 

order.  (The  term  (c?)  will  be  zero  if  and  only  if  it  corresponds  to  a trailing  N.) 

11.  AAAA  (9  times),  NNNN  (8),  and  AN  AAA  (7)  were  most  common.  Exactly  27 
of  the  70  failed  to  occur,  including  all  four  beginning  with  NNNA.  (We  disregard  the 
games  that  were  tied  because  of  darkness,  in  1907,  1912,  and  1922.  The  case  ANNAAA 
should  perhaps  be  excluded  too,  because  it  occurred  only  in  1920  as  part  of  ANNAAA  A 
in  a best-of-nine  series.  The  scenario  NNAAANN  occurred  for  the  first  time  in  2001.) 

12.  (a)  Let  Vj  be  the  subspace  {an-i...a0  € V \ ak  = 0 for  0 < k < j},  so  that 
{0  . . . 0}  = Vn  C Vn-!  C ■ • • C Vo  = V . Then  {ci, . . . , ct)  = {c  | Vc  / Vc+i},  and  a*,  is 
the  unique  element  an_i  ...  a0  of  V with  aCj  = [j  = k]  for  1 < j < t. 

Incidentally,  the  f x n matrix  corresponding  to  a canonical  basis  is  said  to  be  in 
reduced  row-echelon  form.  It  can  be  found  by  a standard  “triangulation”  algorithm 
(see  exercise  4.6.1-19  and  Algorithm  4.6.2N). 
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(b)  The  2-nomial  coefficient  (")2  = 2'  (n“1)2  + (ntZ{)2  of  exercise  1.2.6-58  has  the 
right  properties,  because  2 ‘ (n“1)2  binary  vector  spaces  have  ct  < n-  1 and  ("“*)  have 
ct  = n — 1.  [In  general  the  number  of  canonical  bases  with  r asterisks  is  the  number  of 
partitions  of  r into  at  most  t parts,  with  no  part  exceeding  n — t,  and  this  is  [zT]  (") 
by  Eq.  7.2.1.4-(5i).  See  D.  E.  Knuth,  J.  Combinatorial  Theory  A10  (1971),  178-180.] 

(c)  The  following  algorithm  assumes  that  n > t > 0 and  that  a((+ ^ = 0 for 

t < j < n. 

VI.  [Initialize.]  Set  akj  4-  [j  = k - 1]  for  1 < k < t and  0 < j < n.  Also  set  q 4-  t, 
r 4—  0. 


V2.  [Visit.]  (At  this  point  we  have  afc(fc_i)  = 1 for  1 < k < q,  a(,+1),  = 0,  and 
ui  r = 1-)  Visit  the  canonical  basis  . . . audio, . . . , i)  . . . atiCLto). 

Go  to  V4  if  q > 0. 

V3.  [Find  block  of  Is.]  Set  q 4-  1,  2,  . . . , until  a(9+1)(9+r)  = 0.  Terminate  if 
q + r = n. 

V4.  [Add  1 to  column  q + r.]  Set  k 4-  1.  While  ak(q+r)  = 1,  set  afc(g+r)  0 

and  k 4—  k + 1.  Then  if  k < q,  set  ak(q-\-r)  4-  1;  otherwise  set  aq(q+r~)  4-  1, 

aq(q+r-l ) 4-  0,  q 4—  q — 1. 

V5.  [Shift  block  right.]  If  q = 0,  set  r y-  r + 1.  Otherwise,  if  r > 0,  set  ak(k_1)  <-  1 

and  afc(r+/t_1)  <—  0 for  1 < k < q,  then  set  r 4—  0.  Go  to  V2.  | 

Step  V2  finds  q > 0 with  probability  1 - (2n_<  - l)/(2n  — 1)  « 1 — 2 * , so  we  could 
save  time  by  treating  this  case  separately. 

(d)  Since  999999  = 4(:)2  + 16(47)2  + 5©2  + 5Q2  + 8(^)2  + 0®2+4g)2  + l(;)2  + 
2 (°)2,  the  millionth  output  has  binary  columns  4,  16/2,  5,  5,  8/2,  0,  4/2,  1,  2/2,  namely 


ai  = 001100011, 

a2  = 00000010  0, 
q3  = 1 0 1 1 1 0 0 0 0, 
a4  = 01000000  0. 


[Reference:  E.  Calabi  and  H.  S.  Wilf,  J.  Combinatorial  Theory  A22  (1977),  107-109.] 

13.  Let  n = s + t.  There  are  ([•(rl1J/2'|)  ([(r-l)/2j)  configurations  beginning  with  0 
and  ([(rli)/2j)  (f(r-"i)/2])  beginning  with  1,  because  an  Ising  configuration  that  begins 
with  0 corresponds  to  a composition  of  s 0s  into  |"(r  + l)/2]  parts  and  a composition  of 
t Is  into  [(r  + 1)/2J  parts.  We  can  generate  all  such  pairs  of  compositions  and  weave 
them  into  configurations.  [See  E.  Ising,  Zeitschrift  fiir  Physik  31  (1925),  253-258; 
J.  M.  S.  Simoes  Pereira,  CACM  12  (1969),  562.] 

14.  Start  with  l[j]  «-  j - 1 and  r[j  - 1]  «-  j for  1 < j < n;  /[0]  4-  n,  r[n]  4-  0.  To  get 
the  next  combination,  assuming  that  t > 0,  set  p 4-  s if  Z[0]  > s,  otherwise  p <-  r[n]  - 1. 
Terminate  if  p < 0;  otherwise  set  q 4-  r\p],  l[q]  <r-  l\p\,  and  r[l\p ]]  4-  q.  Then  if  r[q]  > s 
and  p < s,  set  r[p]  4-  r[n ],  l[r[n] ] 4-  p,  r[s]  4-  r[q ],  Z[r[q]]  4-  s,  r[n\  4-  0,  l[ 0]  4-  n; 
otherwise  set  r\p]  4-  r[q],  Z[r[q]]  4-  p.  Finally  set  r[gj  4-  p and  l\p]  4-  q. 

[See  Korsh  and  Lipschutz,  J.  Algorithms  25  (1997),  321-335,  where  the  idea  is 
extended  to  a loopless  algorithm  for  multiset  permutations.  Caution:  This  exercise, 
like  exercise  7.2.1.1-16,  is  more  academic  than  practical,  because  the  routine  that  visits 
the  linked  list  might  need  a loop  that  nullifies  any  advantage  of  loopless  generation.] 
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15.  (The  stated  fact  is  true  because  lexicographic  order  of  ct...c\  corresponds  to 
lexicographic  order  of  an  _ i . . . ao,  which  is  reverse  lexicographic  order  of  the  comple- 
mentary sequence  1 ...  1 0 a„_ i . . . ao.)  By  Theorem  L,  the  combination  ct . . . ci  is 
visited  before  exactly  (b’'j  + • • • + (622)  + (?)  others  have  been  visited,  and  we  must  have 

C;)-+ (?) +()')++(:) 

This  general  identity  can  be  written 


"-1  f 


2=0 
when  each  x 


xo  H b x 


) + £%( 


2=0 


Xo  + • • • + X 


)-( 


XO  H b x. 


n — 1 ^ 


j is  0 or  1,  and  Xj  = 1 


Cn  ( 


n 

x0  + • • • + x, 


T Xji  ^ 


xo  H x 


Xj-  it  follows  also  from  the  equation 
n + 1 

xo  -I b x 


)=(  U+1  W 

nJ  \Xo  H h XnJ  \ 


Xo  “H  * • • “I-  Xfi— 


16.  Since  999999  = (14214 ) + (10°8)  = (T)  + (T)  + C“)  = (?)  + (“)  + (326)  + (V)  = 
(7)  + (7)  + (7)  + (7 ) + (i)’  the  answers  are  (a)  1414  1008;  (b)  182  153  111;  (c)  71 
56  36  14;  (d)  43  32  21  15  6;  (e)  1000000  999999  ...  2 0. 

17.  By  Theorem  L,  nt  is  the  largest  integer  such  that  N > ("*);  the  remaining  terms 
are  the  degree-(f  — 1)  representation  of  N — ("‘). 

A simple  sequential  method  for  t > 1 starts  with  x = 1,  c = t,  and  sets  ct-c+1, 
x 4-  xc/(c  - t ) zero  or  more  times  until  x > N\  then  we  complete  the  first  phase  by 
setting  x 4—  x(c  — t)/c,  c 4—  c — 1,  at  which  point  we  have  x = (J)  < N < (c|1).  Set 
nt  4-  c,  IV  4-  N — x;  terminate  with  m 4-  if  t = 2;  otherwise  set  x 4-  xt/c,  t 4-  t-  1, 
1;  while  x > N set  x 4—  x(c  — t)/c,  c 4—  c — 1;  repeat.  This  method  requires 
0(n ) arithmetic  operations  if  IV  < (”),  so  it  is  suitable  unless  t is  small  and  N is  large. 

When  t = 2,  exercise  1.2.4-41  tells  us  that  n2  = |_V2Ar  + 2 + |J.  In  general, 
nt  is  [xj  where  x is  the  largest  root  of  xt  = t\ N;  this  root  can  be  approximated 

by  reverting  the  series  y = (x*-)1^  = x - - 1)  + ±(t2  - l)x_1  H to  get  x = 

y + ^(t  — 1)  + -^(t~  — 1) /y  + 0(y  3).  Setting  y = (t\  N)1^  in  this  formula  gives  a good 
approximation,  after  which  we  can  check  that  (7J)  < N < (L;cJt+1)  or  make  a final 
adjustment.  [See  A.  S.  Fraenkel  and  M.  Mor,  Comp.  J.  26  (1983),  336-343.] 

18.  A complete  binary  tree  of  2"  — 1 nodes  is  obtained,  with  an  extra  node  at  the 
top,  like  the  “tree  of  losers”  in  replacement  selection  sorting  (Fig.  63  in  Section  5.4.1). 
Therefore  explicit  links  aren’t  necessary;  the  right  child  of  node  k is  node  2k  + 1,  and 
the  left  sibling  is  node  2k,  for  1 < k < 2n~1 . 

This  representation  of  a binomial  tree  has  the  curious  property  that  node  k = 
(0ala)2  corresponds  to  the  combination  whose  binary  string  is  0“laH. 

19.  It  is  11110100001001000100,  the  binary  representation  of  post(1000000),  where 
post(2fc+1  — 1)  = 2k,  and  post(n)  = 2k  + post(n- 2k  + 1)  if  2 k <n<  2fc+1-l,  for  k > 0. 
[Incidentally,  the  left-child/right-sibling  representation  of  is  the  sideways  heap.] 

20.  f(z)  = (1  + zw"-') . . . (1  + zw')/(  1 - z),  g(z ) = (1  + zwo)f(z),  h(z)  = zw°f{z). 

21.  The  rank  of  ct  . . . c2ci  is  (ct(M)  — 1 minus  the  rank  of  ct-\  . . . c2c\.  [Page  40  of 
Miller’s  thesis;  see  also  H.  Liineburg,  Abh.  Math.  Sem.  Hamburg  52  (1982),  208-227.] 

22.  Since  999999  = (1415)  - (4f ) = (*“)  - (“)  + (V)  = (?)  - (SJ)  + (322)  - (?)  = 
(7)  ~ (4°)  + (333)  _ C2)  + (3)>  the  answers  are  (a)  1414  405;  (b)  182  97  21;  (c)  71  56 
31  26;  (d)  43  39  32  12  3;  (e)  1000000  999999  999998  999996  ...  0. 
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23.  There  are  (”_^)  combinations  with  j > r,  for  r = 1,  2,  . . . , t.  (If  r — 1 we  have 
C2  = ci  + 1;  if  r = 2 we  have  ci  = 0,  c2  = 1;  if  r = 3 we  have  ci  = 0,  c2  = 1,  c4  = c3  + 1; 
etc.)  Thus  the  mean  is  ((?)  + M + • ■ ■+ (n~‘))/C?)  = (T)/(?)  = (n+l)/(n+ 1-t). 
The  average  running  time  per  step  is  approximately  proportional  to  this  quantity;  thus 
the  algorithm  is  quite  fast  when  t is  small,  but  slow  if  t is  near  n. 


24.  In  fact  jk  — 2 < jk+i  < jk  + 1 when  jk  = t (modulo  2)  and  jk  — 1 < jjt+i  < jk  + 2 
when  j*,  ^ t,  because  R5  is  performed  only  when  c*  = i — 1 for  1 < i < j. 

Thus  we  could  say,  “If  j > 4,  set  j <r-  j - 1 - [j  odd]  and  go  to  R5”  at  the  end  of  R2, 
if  t is  odd;  “If  j > 3,  set  j «—  j — 1 — [j  even]  and  go  to  R5”  if  t is  even.  The  algorithm 
will  then  be  loopless,  since  R4  and  R5  will  be  performed  at  most  twice  per  visit. 

25.  Assume  that  N > N'  and  N — N'  is  minimum;  furthermore  let  t and  ct  be 
minimum,  subject  to  those  assumptions.  Then  ct  > c't. 

If  there  is  an  element  x^CUC"  with  0 < x < ct,  map  each  t-combination  of 
C U C"  by  changing  j t-4  j - 1 for  j > x;  or,  if  there  is  an  element  x € C n C' , map  each 
t-combination  that  contains  x into  a (t  — 1 (-combination  by  omitting  x and  changing 
j '-t  x — j for  j < x.  In  either  case  the  mapping  preserves  alternating  lexicographic 
order;  hence  N — N'  must  exceed  the  number  of  combinations  between  the  images 
of  C and  C' . But  ct  is  minimum,  so  no  such  x can  exist.  Consequently  t = m and 
ct  = 2m  — 1. 

Now  if  <4,  < cm  — 1,  we  could  decrease  N — N'  by  increasing  c'm.  Therefore  c'm  = 
2m— 2,  and  the  problem  has  been  reduced  to  finding  the  maximum  of  rank(cm_x  . . . ci)- 
rank(c(n_1  . . . ci),  where  rank  is  calculated  as  in  (30). 

Let  f{s,t)  = max  (rank  (&s  . . . bx)  - rank(ct . . .ci))  over  all  {bs, . . . , 61,  ct, . . . , ci}  = 
{0, . . . , s + t — 1}.  Then  /(s,  f)  satisfies  the  curious  recurrence 


/(«,0)  = /(0,t)=0;  /(M)  = t; 

/(M)  = (s+‘_1)  + max(/(f  - l,s  - l),/(s  - 2 ,t))  if  st  > 0 and  s > I. 
When  s T t = 2u  T 2 the  solution  turns  out  to  be 


/(-,.) = (2;_+;) + £ (2“+(  ■ ' 2l) + e(27  ‘).  - = 

j=i  j= 0 J 

with  the  maximum  occurring  at  /(f  — 1,  s— 1)  when  s < t and  at  f(s— 
Therefore  the  minimum  N — N'  occurs  for 


min(s  — 2,  t — 1), 

2,  t)  when  s > t+ 2. 


C = {2m  — 1}  U {2m  — 2 — x|l<x<  2m  — 2,  x mod  4 < 1}, 

C'  = {2m  - 2}  U {2m  - 2-  a;|l<a;<2m-2,  x mod  4 > 2}; 

and  it  equals  (^j1)  - T,T=o  (2\+1)  = 1 + tSee  A-  J-  van  Zanten,  IEEE 

Trans.  IT-37  (1991),  1229-1233.] 

26.  (a)  Yes:  The  first  is  0"-b/2l  i«  mod  22L«/2J  and  the  last  is  2L*/2J  mod  2Qn-\t/i\ . 
transitions  are  substrings  of  the  forms  02°1  <->  12a0,  02“2  o 12“  1 , 10al  -H-  20“0, 
10“2  20°  1. 

(b)  No:  If  s = 0 there  is  a big  jump  from  02*0r_1  to  20r2t~1. 

27.  The  following  procedure  extracts  all  combinations  ci  . . . Ck  of  T„  that  have  weight 
< t:  Begin  with  k <—  0 and  Co  •<—  n.  Visit  c\  . . .Ck-  If  k is  even  and  Ck  = 0,  set 
k k - 1;  if  k is  even  and  ck  > 0,  set  ck  <-  ck  - 1 if  k = t,  otherwise  k <—  k + 1 
and  ck  0.  On  the  other  hand  if  k is  odd  and  ck  + 1 = ck- 1,  set  k «-  k - 1 and 
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Ck  <-  ck+1  (but  terminate  if  k = 0);  if  k is  odd  and  ck  + 1 < Cfc_i,  set  ck  <-  ck  + 1 if 
k = t,  otherwise  k <-  k + 1,  ck  <-  ck-U  ck <- ck  + 1.  Repeat. 

(This  loopless  algorithm  reduces  to  that  of  exercise  7.2.1.1-12(b)  when  t = n,  with 
slight  changes  of  notation.) 

28.  True.  Bit  strings  an_1  . . . a0  = a/3  and  a'n_i  . . . a'0  = a/31  correspond  to  index  lists 

(6„  . . . = 6x,  ct  ...  Ci  = <f>tp)  and  (b's  . . . b[  = 0\  , c't  . . ,c\  = <jnp')  such  that  everything 

between  a/3  and  a/3'  begins  with  a if  and  only  if  everything  between  0\  and  Ox!  begins 
with  0 and  everything  between  tfnp  and  t/n/j'  begins  with  </>.  For  example,  if  n = 10,  the 
prefix  a = 01101  corresponds  to  prefixes  0 = 96  and  <j>  = 875. 

(But  just  having  ct  . . . a in  genlex  order  is  a much  weaker  condition.  For  example, 
every  such  sequence  is  genlex  when  t = 1.) 

29.  (a)  - V+1  or  -*o,+1+±m  or  ±\  for  k,l,m>  0. 

(b)  No;  the  successor  is  always  smaller  in  balanced  ternary  notation. 

(c)  For  all  a and  all  k,l,m  > Owe  have  a0-fc+V+±m  ->  a-+fc0!+1-±m  and 

a+_*0;  + l+±m  Q0+fc  + l0Z_±m.  also  Q0_fc  + l0Z  a.+k0l  + 1 and  a+.k0l+ 1 ^ a0+k+l 0i 

(d)  Let  the  jth  sign  of  a,  be  (-1)°«,  and  let  it  be  in  position  bt].  Then  we  have 

= (_i)a(i+1)i+6(i+1)u_1)  for  o < i < k and  1 < j < a,  if  we  let  bi0  = 0. 

(e)  By  parts  (a),  (b),  and  (c),  a belongs  to  some  chain  a0-> >U),,  where  ak 

is  final  (has  no  successor)  and  ao  is  initial  (has  no  predecessor).  By  part  (d),  every 
such  chain  has  at  most  (s^<)  elements.  But  there  are  2s  final  strings,  by  (a),  and  there 
are  2s(s|f)  strings  with  s signs  and  t zeros;  so  k must  be  (s|‘)  - 1. 

Reference:  SICOMP  2 (1973),  128-133. 

30.  Assume  that  t > 0.  Initial  strings  are  the  negatives  of  final  strings.  Let  cr,  be  the 

initial  string  0t-TJ  for  0 < j < 2s  1 , where  the  fcth  character  of  Tj  for  1 < k < s is  the 

sign  of  (-1)“*  when  j is  the  binary  number  (os_i  . . . ai)2;  thus  c r0  = o‘-++ . . . +,  <n  = 
0 + • • +’  ■ • • i cr2a~i_1  = O'  . . . Let  pj  be  the  final  string  obtained  by  inserting 

-0  after  the  first  (possibly  empty)  run  of  minus  signs  in  Tj\  thus  po  = -0*++...+, 
P1  = 0 +...+,  . . . , p2.*~i- 1 = ...  -o'.  We  also  let  a2’-1  = and  p2a-i  — po- 
Then  we  can  prove  by  induction  that  the  chain  beginning  with  <jj  ends  with  pj  when  t 
is  even,  with  Pj—i  when  t is  odd,  for  1 < j < 2s  1 . Therefore  the  chain  beginning  with 
— pj  ends  with  —aj  or  — crj+i. 

Let  Aj(s,t)  be  the  sequence  of  (s,  ^-combinations  derived  by  mapping  the  chain 
that  starts  with  try,  and  let  Bj(s,t)  be  the  analogous  sequence  derived  from  —pj.  Then, 
for  1 < j < 2s-1,  the  reverse  sequence  Aj(s,t)R  is  Bj(s,t ) when  t is  even,  Bj_i(s,t) 
when  t is  odd.  The  corresponding  recurrences  when  st.  > 0 are 

Aj/s ,t)  = { 1Ai(8,t  ~ 1)’  °"4L(2a-1-i-t)/2j(s  - M)fl,  it  j + t is  even; 

\ I Aj(s,  t — 1),  QA\j/2\  (s  — 1,  t),  if  j + t is  odd; 

and  when  st  > 0 all  2s  1 of  these  sequences  are  distinct. 

Chase  s sequence  Cst  is  j4[2a/3J  (si  t),  and  Cst  is  Ay2,-i/3^  (s,  t)R . Incidentally,  the 
homogeneous  sequence  Kst  of  (31)  is  A2s-i_[teven](s,t)R. 

31.  (a)  2(  * )_1  solves  the  recurrence  f(s,t)  = 2 f(s-  1 ,t)f(s,t-  1)  when  f(s,  0)  = 
/(0,  t)  = 1.  (b)  Now  f(s,t)  = (s  + 1)!  f(s,  t — 1) ...  f(0,t  — 1)  has  the  solution 

(s  + l)!‘S!(2)(s-l)!(tt1)...2!(s+r2)  = I^(r+l)!(S+tt=^r)+h=sl. 
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32.  (a)  No  simple  formula  seems  to  exist,  but  the  listings  can  be  counted  for  small  s 
and  t by  systematically  computing  the  number  of  genlex  paths  that  run  through  all 
weight-t  strings  from  a given  starting  point  to  a given  ending  point  via  revolving-door 
moves.  The  totals  for  s + t < 6 are 

1 

1 1 

1 2 1 
14  4 1 

1 8 20  8 1 
1 16  160  160  16  1 
1 32  2264  17152  2264  32  1 

and  /(4,4)  = 95,304,112,865,280;  /(5,  5)  « 5.92  6 46  x 1048.  [This  class  of  combination 
generators  was  first  studied  by  G.  Ehrlich,  JACM  20  (1973),  500-513,  but  he  did  not 
attempt  to  enumerate  them.] 

(b)  By  extending  the  proof  of  Theorem  N,  one  can  show  that  all  such  listings  or 
their  reversals  must  run  from  l40s  to  oaitos-a  for  some  a,  1 < a < s.  Moreover,  the 
number  nsta  of  possibilities,  given  s,  t,  and  a with  st  > 0,  satisfies  n\ti  = 1 and 

f ^s(t— 1)1  ^(s— i)t(a— l)  j if  a > 1; 

T^sta  = ^ *r  i 

l ns(t~l)2n(s~l)tl  + ' ' ' + ns(t-l)sn(s-l)t(s-l)>  it  a = 1 < S. 


This  recurrence  has  the  remarkable  solution  nsta  = where 


m(s,  t,  a) 


rr3)  + (^5)+---  + (V).  iff  is  even; 

(S+t-3)  4- f (3)  + s — a — [a  < s]>  if  t is  odd. 


33.  Consider  first  the  case  t = 1:  The  number  of  near-perfect  paths  from  i to  j > i is 
f(j  — i — [i  > 0]  — [j  < n — 1]),  where  JT  /(j)zJ  = 1/(1  — z — z3).  (By  coincidence,  the 
same  sequence  f(j)  arises  in  Caron’s  polyphase  merge  on  6 tapes,  Table  5. 4. 2-2.)  The 
sum  over  0 < i < j < n is  3/(n)  + f(n  — 1)  + f(n— 2)  + 2 — n;  and  we  must  double  this, 
to  cover  cases  with  j > i. 

When  t > 1 we  can  construct  (")  x (")  matrices  that  tell  how  many  genlex  listings 
begin  and  end  with  particular  combinations.  The  entries  of  these  matrices  are  sums  of 
products  of  matrices  for  the  case  t — 1,  summed  over  all  paths  of  the  type  considered 
for  t = 1.  The  totals  for  s + t < 6 turn  out  to  be 


1 

1 1 
1 2 1 

16  2 1 

1 12  10  2 1 

1 20  44  10  2 1 

1 34  238  68  10  2 1 


1 

1 1 
12  1 
12  0 1 
1 2 2 0 1 
1 2 0 0 0 1 
1 2 6 0 0 0 1 


where  the  right-hand  triangle  shows  the  number  of  cycles,  g(s,  t).  Further  values  include 
/(4, 4)  = 17736;  /( 5,5)  = 9,900,888,879,984;  <?(4,4)  = 96;  g( 5,5)  = 30,961,456,320. 

There  are  exactly  10  such  schemes  when  s — 2 and  n > 4.  For  example,  when 
n = 7 they  run  from  43210  to  65431  or  65432,  or  from  54321  to  65420  or  65430  or 
65432,  or  the  reverse. 
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34.  The  minimum  can  be  computed  as  in  the  previous  answer,  but  using  min-plus 
matrix  multiplication  Cij  = minfc(a,*:  + bkj)  instead  of  ordinary  matrix  multiplication 
Cij  = aikbkj-  (When  s = t = 5,  the  genlex  path  in  Fig.  46(e)  with  only  49  imperfect 
transitions  is  essentially  unique.  There  is  a genlex  cycle  for  s = t = 5 that  has  only  55 
imperfections.) 

35.  From  the  recurrences  (35)  we  have  ast  = bs(t-i)  + [s  > l][t>0]  + 0(s_ i)t,  bst  = 
as(t-i)  + a(s-i)t;  consequently  ast  = bst  + [s  > l][todd]  and  ast  = as(t_!)  + a(s_1)t  + 
[s  > l][f  odd].  The  solution  is 


O'St 


t/2 

-u 


s + t-2-2k 
s-2 


^ — [s  > l][teven]; 


this  sum  is  approximately  s/(s  + 2t)  times  (s+4). 

36.  Consider  the  binary  tree  with  root  node  (s,  t ) and  with  recursively  defined  subtrees 
rooted  at  (s—  1,  t)  and  (s,  t — 1)  whenever  st  > 0;  the  node  ( s,t ) is  a leaf  if  st  = 0.  Then 
the  subtree  rooted  at  (s,t)  has  (s^<)  leaves,  corresponding  to  all  (s,  ^-combinations 
an_i...aiao.  Nodes  on  level  l correspond  to  prefixes  a„_i  . . . a„_;,  and  leaves  on 
level  l are  combinations  with  r = n — l. 

Any  genlex  algorithm  for  combinations  an- 1 • . . ajai)  corresponds  to  preorder  tra- 
versal of  such  a tree,  after  the  children  of  the  — 1 branch  nodes  have  been 

\ t / /s  + t\ 

ordered  in  any  desired  way;  that,  in  fact,  is  why  there  are  2v  * l-1  such  genlex  schemes 
(exercise  31(a)).  And  the  operation  j 4—  j + 1 is  performed  exactly  once  per  branch 
node,  namely  after  both  children  have  been  processed. 

Incidentally,  exercise  7.2.1.2-6(a)  implies  that  the  average  value  of  r is  s/(t  + 1)  + 
t/(s  + l),  which  can  be  S2(n);  thus  the  extra  time  needed  to  keep  track  of  r is  worthwhile. 

37.  (a)  In  the  lexicographic  case  we  needn’t  maintain  the  Wj  table,  since  a3  is  active 
for  j > r if  and  only  if  a3  = 0.  After  setting  aj  «—  1 and  aj_i  4-  0 there  are  two  cases 
to  consider  if  j > 1:  If  r — j,  set  r 4—  j — 1;  otherwise  set  o3-2  . . . ao  4—  0rlJ-1-r  and 
r 4—  j — 1 — r (or  r 4—  j if  r was  j — 1). 

(b)  Now  the  transitions  to  be  handled  when  j > 1 are  to  change  aj  . . . ao  as  follows: 

or  ->  nor-2,  oior  -4  ior+1,  oio°r  ->  noa+1r-\  ior  ->  oior_1,  nor  -» oior_1i, 

10“  lr  —4  0°1'  , these  six  cases  are  easily  distinguished.  The  value  of  r should  change 

appropriately. 

(c)  Again  the  case  j = 1 is  trivial.  Otherwise  01“0r  — > 101o_10r;  0“lr  ->  10“lr_1; 
101“0r  —4  01a+10r;  10“lr  —4  0“lr+1;  and  there  is  also  an  ambiguous  case,  which  can 
occur  only  if  on_i  . . .aj+i  contains  at  least  one  0:  Let  k > j be  minimal  with  a*,  = 0. 
Then  10r  — > 010r_1  if  k is  odd,  10r  — > 0rl  if  k is  even. 

38.  The  same  algorithm  works,  except  that  (i)  step  Cl  sets  a„_i  . . ,a0  4—  01t0s_1  if 
n is  odd  or  s = 1,  an_i  . . . ao  4—  00r0s-2  if  n is  even  and  s > 1,  with  an  appropriate 
value  of  r;  (ii)  step  C3  interchanges  the  roles  of  even  and  odd;  (iii)  step  C5  goes  to  C4 
also  if  j = 1 . 


39.  In  general,  start  with  r 
st  = 0: 

r 4—  r + [ Wj  = 0]  ( ^ 


)■ 


0,  j 4—  s + t — 1,  and  repeat  the  following  steps  until 
s 4-  s - [aj  =0],  t 4—  t — [aj  = 1],  j <- j - 1. 


Then  r is  the  rank  of  an-i  ■ ■ .a\ao.  So  the  rank  of  11001001000011111101101010  is 

(S)  + (ii)  + (V)  + ( s)  + (?)  + Cs1)  + (s3)  + (?)  + Ca1)  + (a0)  + (a)  + (a)  + (a)  + (?)  + (l)  = 
2390131. 
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40.  We  start  with  N «—  999999,  v <—  0,  and  repeat  the  following  steps  until  st  = 0:  If 
v = 0,  set  t <-  t — 1 and  as+t  <—  1 if  N < (s+*_1),  otherwise  set  IV  <-  IV  - (s+‘_1), 
v <—  (s  + t)  mod  2,  s t—  s — 1,  as+t  -<—0.  If  v = 1,  set  v <—  (s  + t)  mod  2,  s <—  s — 1, 
and  as+t  <—  0 if  N < (s+‘_1),  otherwise  set  N <—  IV  — (s+(_1),  t <—  t — 1,  as+t  <—  1. 
Finally  if  s = 0,  set  at_i  . . . a0  <-  l‘;  if  t = 0,  set  as-i  ■ ■ ■ a0  <-  0s.  The  answer  is 
a25...a0  = 11101001111110101001000001. 

41.  Let  c(0),  ...,  c(2"  — 1)  = Cn  where  C2n  = 0C2n-i , 102n-i;  C2n+i  = 0C2n, 
1 C2n\  C2n  = lC2n-i,  0C2n-i;  C2n+i  = 1 C2n,  0C2n;  Co  = Co  = e.  Then  aj  ® bj  = 
bj+i&i(bj+2  | (bj+3k(bj+4  !•••)))  if  3 is  even,  bJ+1  \ (bj+2k(bJ+3  \ (6>+4&-  • • )))  if j is  odd. 
Curiously  we  also  have  the  inverse  relation  c(( . . . 04030201610)2)  = ( . . . bib3b2b\bo)2- 


42.  Equation  (40)  shows  that  the  left  context  an- 1 . . . Oi+i  does  not  affect  the  behavior 
of  the  algorithm  on  a;_i...ao  if  o;  =0  and  l > r.  Therefore  we  can  analyze 
Algorithm  C by  counting  combinations  that  end  with  certain  bit  patterns,  and  it 
follows  that  the  number  of  times  each  operation  is  performed  can  be  represented  as 
[ws z1}  p(w,  z)/(l  — w2)2(l  — z2)2(  1 — w — z)  for  an  appropriate  polynomial  p(w,z). 

For  example,  the  algorithm  goes  from  C5  to  C4  once  for  each  combination  that  ends 
with  oi2a+1012i>+1  or  has  the  form  la+1012i,+1,  for  integers  a,b  > 0;  the  corresponding 
generating  functions  are  w2z2/(  1 — z2)2(l  — w — z)  and  w(z2  + z3)/(l  — z2)2. 

Here  are  the  polynomials  p(w,  z)  for  key  operations.  Let  W = 1 — w2,  Z = 1 — z2 . 


C3 

C3 

C3 

C3 


C4: 

C5: 

C6: 

C7 : 

C4  (j  = 1): 
C4(r  t-  j- 1): 

~ j)- 

C4: 

-j~  2): 


C4(r 
C5  -t 
C5(r 


wzW(l  + wz)(l  —w  — z2); 
wzW(w  + z)(l  — wz  — z2); 
w2z2W(w  + z ); 

«I2zlf(l  + ffiz)i 

wzW2Z(l-w-z2); 

w3zWZ(l-w-z2); 


C5 (r  <-  1): 
C5(r  <-  j- 1): 
C6  (j  = 1): 


wz2W2(l  + z- 


2wz  — z—z3 
,2\. 


C6(r 
C6(r 
C7 
C7  (r 
C7  (r 


w2zW2Z(l—wz  — z2); 

w2z3W2(l  — wz  — z2); 

w2zW2Z- 

w2z3W2\ 

w3z2WZ- 

w2zW2; 

w4zWZ: 


~ i- f): 

^i): 

C6: 

“ j)- 

■-  j- 2):  w3z2W2 


w4zWZ(l  — wz  — z2); 


The  asymptotic  value  is  (s|*)(p(l  — x,x)/(2x  - x2)2(l  - x2)2  + 0(n-1)),  for  fixed 
0 < x < 1,  if  t = in  + 0(1)  as  n —¥  00.  Thus  we  find,  for  example,  that  the  four-way 
branching  in  step  C3  takes  place  with  relative  frequencies  x + x2  — x3  : 1 : x : 1 + x — x2. 

Incidentally,  the  number  of  cases  with  j odd  exceeds  the  number  of  cases  with 
j even  by 

's  + t-2k-2l' 


?(' 


k,l>  1 


s — 2k 


^ [2k  + 21  < s + t]  + [sodd][t  odd], 


in  any  genlex  scheme  that  uses  (39).  This  quantity  has  the  interesting  generating 
function  wz/(l  + w)(l  + z)(l  — w — z). 

43.  The  identity  is  true  for  all  nonnegative  integers  x,  except  when  x = 1.  (Inciden- 
tally, s(x)  = f(x)  © 1 and  p(x)  = f(x  © 1),  where  f(x)  = (x  — 1)  + ((x  & 1)  <g  1).) 

44.  In  fact,  Ct(n)  - 1 = Ct(n  - l)fl,  and  Ct(n)  - 1 = Ct(n  - l)fl.  (Hence  Ct{n)  - 2 = 
Ct(n  - 2),  etc.) 

45.  In  the  following  algorithm,  r is  the  least  subscript  with  cr  > r. 

CC1.  [Initialize.]  Set  Cj  <-  n — t — 1 + j and  Zj  «—  0 for  1 < j < t + 1.  Also  set 
r t—  1.  (We  assume  that  0 < t < n.) 

CC2.  [Visit.]  Visit  the  combination  ct  . . . c2Ci . Then  set  j «—  r. 

CC3.  [Branch.]  Go  to  CC5  if  Zj  / 0. 
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CC4.  [Try  to  decrease  Cj.}  Set  x 4-  Cj  + ( cj  mod  2)  - 2.  If  x > j,  set  Cj  4-  x, 
r <-  1;  Otherwise  if  cj  = j,  set  Cj  4-  j - 1,  Zj  4-  cj+1  - {{cj+1  + 1)  mod  2), 
r 4-  j\  otherwise  if  cj  < j,  set  Cj  4-  j,  zj  4-  cj+1  - {(cj+1  + 1)  mod  2), 
r 4-  max(l,  j — 1);  otherwise  set  Cj  4-  x,  r 4-  j.  Return  to  CC2. 

CC5.  [Try  to  increase  Cj.]  Set  x 4—  Cj  + 2.  If  x < Zj,  set  Cj  4—  x\  otherwise  if 
x = Zj  and  Zj+\  ^ 0,  set  Cj  4-  x — (cj+i  mod  2);  otherwise  set  Zj  4—  0, 
j <-  j + 1,  and  go  to  CC3  (but  terminate  if  j > t).  If  a > 0,  set  r 4-  1; 
otherwise  set  r 4-  j — 1.  Return  to  CC2.  | 

46.  Equation  (40)  implies  that  Uk  — (bj  + k + 1)  mod  2 when  j is  minimal  with  bj  > k. 
Then  (37)  and  (38)  yield  the  following  algorithm,  where  we  assume  for  convenience 
that  3 < s < n. 

CB1.  [Initialize.]  Set  bj  4-  j - 1 for  1 < j < s;  also  set  z 4-  s + 1,  bz  4-  1.  (When 
subsequent  steps  examine  the  value  of  z,  it  is  the  smallest  index  such  that 
bz  ^ z - 1.) 

CB2.  [Visit.]  Visit  the  dual  combination  bs  . . . 6261 . 

CB3.  [Branch.]  If  62  is  odd:  Go  to  CB4  if  i>2  7^  61  + 1,  otherwise  to  CBS  if  bi  > 0, 
otherwise  to  CB6  if  bz  is  odd.  Go  to  CB9  if  b2  is  even  and  61  > 0.  Otherwise 
go  to  CB8  if  bz+i  = 6.  + 1,  otherwise  to  CB7. 

CB4.  [Increase  hi.]  Set  hi  <—  61  + 1 and  return  to  CB2. 

CB5.  [Slide  hi  and  h2 -]  If  ?>3  is  odd,  set  hi  4—  hi  + 1 and  b2  4-  b2  + 1;  otherwise 
set  hi  <—  hi  — 1,  I>2  4—  b2  — 1,  z 4—  3.  Go  to  CB2. 

CB6.  [Slide  left.]  If  z is  odd,  set  z 4—  z — 2,  h.+i  4—  z + 1,  bz  4—  z;  otherwise  set 

z 4—  z — 1,  bz  4—  z.  Go  to  CB2. 

CB7.  [Slide  bz.]  If  h2+i  is  odd,  set  bz  4—  bz  + 1 and  terminate  if  bz  > n;  otherwise 
set  bz  4r-  bz  — 1,  then  if  bz  < z set  z 4-  z + 1.  Go  to  CB2. 

CB8.  [Slide  bz  and  h-+i.]  If  bz+2  is  odd,  set  bz  4-  b*+i,  hz+i  4-  bz  + 1,  and 
terminate  if  bz+ 1 > n.  Otherwise  set  bz+i  4-  bz,  bz  4-  bz  - 1,  then  if  bz  < z 
set  z 4—  z + 2.  Go  to  CB2. 

CB9.  [Decrease  hi.]  Set  hi  4—  hi  — 1,  z 4—  2,  and  return  to  CB2.  | 

Notice  that  this  algorithm  is  loopless.  Chase  gave  a similar  procedure  for  the  sequence 
C[t  in  Cong.  Num.  69  (1989),  233-237.  It  is  truly  amazing  that  this  algorithm  defines 

precisely  the  complements  of  the  indices  ct  . . . Ci  produced  by  the  algorithm  in  the 

previous  exercise. 

47.  We  can,  for  example,  use  Algorithm  C and  its  reverse  (exercise  38),  with  Wj 
replaced  by  a d-bit  number  whose  bits  represent  activity  at  different  levels  of  the 
recursion.  Separate  pointers  ro,  r 1 , . . . , r,i-\  are  needed  to  keep  track  of  the  r- values 
on  each  level.  (Many  other  solutions  are  possible.) 

48.  There  are  permutations  7ri,  . . . , 7 tm  such  that  the  A:th  element  of  Aj  is  TTkQj^Pk-i- 
And  7 vkaj  runs  through  all  permutations  of  {sx  • 1, . . . , sd-d)  as  j varies  from  0 to  N-  1. 

Historical  note:  The  first  publication  of  a homogeneous  revolving-door  scheme 
for  (s, ^-combinations  was  by  Eva  Torok,  Matematikai  Lapok  19  (1968),  143-146, 
who  was  motivated  by  the  generation  of  multiset  permutations.  Many  authors  have 
subsequently  relied  on  the  homogeneity  condition  for  similar  constructions,  but  this 
exercise  shows  that  homogeneity  is  not  necessary. 
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49.  We  have  lim Z^q(zkm+r  — 1 )/(zlm+r  — 1)  = 1 when  0 < r < m,  and  the  limit 
is  lim2_»9(fcm,zfcm_1)/(Imzim_1)  = k/l  when  r = 0.  So  we  can  pair  up  factors  of  the 
numerator  Yln-k<a<n(za  ~ 1)  with  factors  of  the  denominator  rio<6<fc(zi>  — 1)  when 
a = b (modulo  m). 

Notes:  This  formula  was  discovered  by  G.  Olive,  AMM  72  (1965),  619.  In  the 
special  case  m = 2,  q = — 1,  the  second  factor  vanishes  only  when  n is  even  and  k is  odd. 

The  formula  (2)q  — („"*),  holds  for  all  n > 0,  but  is  not  always  equal  to 

i\(n-k)/m\) • The  reason  is  that  the  second  factor  is  zero  unless  n mod  m > k mod  m, 
and  in  that  case  we  do  have  [ k/m\  + [(n  — k)/m\  = [n/m\. 

50.  The  stated  coefficient  is  zero  when  ni  mod  m + • • • + nt  mod  m > m.  Otherwise  it 
equals 

/ L(ni  H InJ/raJW  (m  + hnt)modm  \ 

V [ni  /m\ , . . . , [nt  /m\  J \n  i mod  m, . . . ,nt  mod  m )q  ’ 
by  Eq.  1 .2.6  -(43);  here  each  upper  index  is  the  sum  of  the  lower  indices. 

51.  All  paths  clearly  run  between  000111  and  111000,  since  those  vertices  have  de- 
gree 1.  Fourteen  total  paths  reduce  to  four  under  the  stated  equivalences.  The  path 
in  (50),  which  is  equivalent  to  itself  under  reflection-and-reversal,  can  be  described 
by  the  delta  sequence  A = 3452132523414354123;  the  other  three  classes  are  B = 
3452541453414512543,  C = 3452541453252154123,  D = 3452134145341432543.  D.  H. 
Lehmer  found  path  C [AMM  72  (1965),  Part  II,  36-46];  I)  is  essentially  the  path 
constructed  by  Eades,  Hickey,  and  Read. 

(Incidentally,  perfect  schemes  aren’t  really  rare,  although  they  seem  to  be  difficult 
to  construct  systematically.  The  case  ( s,t ) = (3,5)  has  4,050,046  of  them.) 

52.  We  may  assume  that  each  Sj  is  nonzero  and  that  d > 1.  Then  the  difference 
between  permutations  with  an  even  and  odd  number  of  inversions  is  ([io/2J "'+L*d/2j)  — 
2,  by  exercise  50,  unless  at  least  two  of  the  multiplicities  Sj  are  odd. 

Conversely,  if  at  least  two  multiplicities  are  odd,  a general  construction  by  G.  Sta- 
chowiak  [SIAM  J.  Discrete  Math.  5 (1992),  199-206]  shows  that  a perfect  scheme 
exists.  Indeed,  his  construction  applies  to  a variety  of  topological  sorting  problems;  in 
the  special  case  of  multisets  it  gives  a Hamiltonian  cycle  in  all  cases  with  d > 1 and 
so«i  odd,  except  when  d = 2,  so  = si  = 1,  and  S2  is  even. 

53.  See  AMM  72  (1965),  Part  II,  36-46. 

54.  Assuming  that  st  ^ 0,  a Hamiltonian  path  exists  if  and  only  if  s and  t are  not 

both  even;  a Hamiltonian  cycle  exists  if  and  only  if,  in  addition,  (s  / 2 and  t / 2)  or 

n = 5.  [T.  C.  Enns,  Discrete  Math.  122  (1993),  153-165.] 

55.  (a)  [Solution  by  Aaron  Williams.]  The  sequence  0s  lf,  Wat  has  the  right  properties  if 

Wst  = 0H/'(s_1)t,  lW£(t_i),  10s  1*  1 , for  st  > 0;  Wot  = Ws0  = 0. 

And  there  is  an  amazingly  efficient,  loopless  implementation:  Assume  that  t > 0. 

Wl.  [Initialize.]  Set  n «—  s -(- 1,  o,j  t—  1 for  0 < j < t,  and  aj  <—  0 for  t < j < n. 

Then  set  j t—  k «—  t — 1.  (This  is  tricky,  but  it  works.) 

W2.  [Visit.]  Visit  the  (s,  t)-combination  a„_  1 ..  .aia0. 

W3.  [Zero  out  aj.]  Set  a,j  «—  0 and  j 4—  j + 1. 

W4.  [Easy  case?]  If  aj  = 1,  set  o/t  4—  1,  k 4—  k + 1,  and  return  to  W2. 

W5.  [Wrap  around.]  Terminate  if  j — n.  Otherwise  set  a:i  <—  1.  Then  if  k > 0, 
set  ak  1,  ao  <—  0,  j 4-  1,  and  k 4—  0.  Return  to  W2.  | 
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After  the  second  visit,  j is  the  smallest  index  with  ajdj  — i — 10,  and  k is  smallest 
with  ak  = 0.  The  easy  case  occurs  exactly  (s+‘_1)  - 1 times;  the  condition  k = 0 
occurs  in  step  W5  exactly  (s+it  “)  + <5ti  times.  Curiously,  if  N has  the  combinatorial 
representation  (57),  the  combination  of  rank  N in  Algorithm  L has  rank  N — t+  ( J7  ) + 
v - 1 in  Algorithm  W.  [Lecture  Notes  in  Comp.  Sci.  3595  (2005),  570-576;  see  also 
A.  Williams,  SODA  20  (2009),  987-996,  for  a significant  generalization  by  which  the 
permutations  of  an  arbitrary  multiset  can  be  generated  looplessly  by  prefix  rotations.] 


(b)  SET  bits,  (l«t)-l  (This  program  assumes  that  s > 0 and  t > 0. 


1H  PUSHJ  $0, Visit 

ADDU  $0 .bits , 1 ; AND  $0,$0,bits 
SUBU  $1 , $0 , 1 ; X0R  $1,$0,$1 
ADDU  $0 , $1 , 1 ; AND  $l,$l,bits 
AND  $0,$0,bits ; 0DIF  $0,$0,1 
SUBU  $1 , $1 , $0 ; ADDU  bits,bits,$l 
SRU  $0,bits , s+t ; PBZ  $0,1B 


Visit  bits  = (as+i_x  . . . aia0)2- 
Set  $0  <—  bits  & (bits  + 1). 

Set  $1  <-  $0©  ($0  - 1). 

Set  $0  <-  $1  + 1,  $1  <-  $1  & bits. 

Set  $0  <—  ($0  & bits)  — 1. 

Set  bits  <—  bits  + $1  — $0. 
Repeat  unless  as+t  = 1-  I 


56.  [Discrete  Math.  48  (1984),  163-171.]  This  problem  is  equivalent  to  the  “middle 
levels  conjecture,  which  states  that  there  is  a Gray  path  through  all  binary  strings 
of  length  2 1 — 1 and  weights  {t  — l,f}.  In  fact,  such  strings  can  almost  certainly  be 
generated  by  a delta  sequence  of  the  special  form  aoan  . . . 0:21-2  where  the  elements  of 
a*,  are  those  of  oo  shifted  by  k,  modulo  2f  — 1.  For  example,  when  t = 3 we  can  start 
with  a5a403a2a1ao  = 000111  and  repeatedly  swap  a0  <4  as,  where  S runs  through  the 
cycle  (4134  5245  1351  2412  3523).  The  middle  levels  conjecture  is  known  to  be  true  for 
t < 15  [see  1.  Shields  and  C.  D.  Savage,  Cong.  Num.  140  (1999),  161-178], 

57.  Yes;  there  is  a near-perfect  genlex  solution  for  all  m,  n,  and  t when  n > m > t. 
One  such  scheme,  in  bitstring  notation,  is  1 A(m_<)(t_1)0n_m,  01A(m_()(t_1)0n“m'1 


0" 


Ast  of  (35). 


l^(m-i)(«-i),  0n  m+1l  . . . , 0"  tlA0(t_i),  using  the  sequences 


58.  Solve  the  previous  problem  with  m and  n reduced  by  t - 1,  then  add  j - 1 to 
each  Cj.  (Case  (a),  which  is  particularly  simple,  was  probably  known  to  Czerny.) 

59.  The  generating  function  Gmnt(z)  = J29mutkzk  for  the  number  gmntk  of  chords 

reachablehiksteps  from  0n-‘l(  satisfies  Gmmt(z)  = (7),  and  Gm(n+1)t{z)  = Gmnt{z)  + 
z " m(7-i)z’  because  the  latter  term  accounts  for  cases  with  ct  = n and  ci  > 

n - in.  A perfect  scheme  is  possible  only  if  |Gmrlt(-l)|  < 1.  But  if  n > m > t > 2, 
this  condition  holds  only  when  m = t + 1 or  (n  — t)t  is  odd,  by  (49).  So  there  is  no 
perfect  solution  when  t = 4 and  m > 5.  (Many  chords  have  only  two  neighbors  when 
n = t + 2,  so  one  can  easily  rule  out  that  case.  All  cases  with  n > m > 5 and  t = 3 
apparently  do  have  perfect  paths  when  n is  even.) 

60.  The  following  solution  uses  lexicographic  order,  taking  care  to  ensure  that  the  aver- 
age amount  of  computation  per  visit  is  bounded.  We  may  assume  that  stms  . . . m0  / 0 
and  t < ms  © • * • + m\  + mo. 

Ql.  [Initialize.]  Set  qj  <-  0 for  s > j > 1,  and  x t. 

Q2.  [Distribute.]  Set  j <-  0.  Then  while  x > mj,  set  qj  <—  mj,  x <-  x — mj,  and 
j «—  j + 1.  Finally  set  qj  «—  x. 

Q3.  [Visit.]  Visit  the  bounded  composition  qs  + ■ ■ ■ + qi  + qo- 

Q4.  [Pick  up  the  rightmost  units.]  If  j = 0,  set  x «-  q0  — 1,  j <-  1.  Otherwise  if 
qo  = 0,  set  x <—  q3  — 1,  q}  <—  0,  and  j <—  j + 1.  Otherwise  go  to  Q7. 
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Q5.  [Full?]  Terminate  if  j > s.  Otherwise  if  qj  = nij,  set  x <—  x + rrij,  qj  <—  0, 
j <—  j + 1,  and  repeat  this  step. 

Q6.  [Increase  qj.]  Set  qj  <—  qj  + 1.  Then  if  x = 0,  set  go  <—  0 and  return  to  Q3. 

(In  that  case  g,_i  = • • • = q0  = 0.)  Otherwise  go  to  Q2. 

Q7.  [Increase  and  decrease.]  (Now  qi  = rrij  for  j > i > 0.)  While  g,  = set 
j j + 1 and  repeat  until  qj  < rrij  (but  terminate  if  j > s).  Then  set 
q j <—  + 1 , j j — 1,  qj  «—  qj  — 1.  If  go  = 0,  set  j <—  1.  Return  to  Q3.  | 

For  example,  if  ms  = ■ ■ ■ = mo  = 9,  the  successors  of  the  composition  3+9+9+7+0+0 
are  4 + 0 + 0 + 6 + 9 + 9,  4 + 0 + 0 + 7 + 8 + 9,  4 + 0 + 0 + 7 + 9 + 8,  4 + 0 + 0 + 8 + 7 + 9,  . . . . 

61.  Let  Fs(t)  = 0 if  £ < 0 or  t > ms  + b mo;  otherwise  let  Fo{t)  = t,  and 

Fs{t)  = 0+Fs-i(t),  l+Fs-i(t  — l)fl,  2+Fs_1(t  — 2),  ...,  ms+Fs-i(t  - ms)RTns 

when  s > 0.  This  sequence  can  be  shown  to  have  the  required  properties;  it  is,  in 
fact,  equivalent  to  the  compositions  defined  by  the  homogeneous  sequence  Kst  of  (31) 
under  the  correspondence  of  exercise  4,  when  restricted  to  the  subsequence  defined  by 
the  bounds  ms,  ...,  mo.  [See  T.  Walsh,  J.  Combinatorial  Math,  and  Combinatorial 
Computing  33  (2000),  323-345,  who  has  implemented  it  looplessly.] 

62.  (a)  A 2 x n contingency  table  with  row  sums  r and  ci  + ■ • • + cn  — r is  equivalent 

to  solving  r = ai  + b an  with  0 < ai  < ci,  . . . , 0 < a„  < cn. 

(b)  We  can  compute  it  sequentially  by  setting  aij  <—  min(ri  — atl  — • • • — a^j-u, 
°j  — a\j  — •••  — a(,_!)j)  for  j = 1,  . . . , n,  for  i = 1,  . . . , m.  Alternatively,  if  rq  < c 1,  set 
an  +-  r 1,  ai2  +-•••+-  ai„  4-  0,  and  do  the  remaining  rows  with  ci  decreased  by  iq;  if 
rq  > ci,  set  an  t—  Ci,  021  «—•••<—  ami  0,  and  do  the  remaining  columns  with  rq 
decreased  by  ci . The  second  approach  shows  that  at  most  m + n — 1 of  the  entries  are 


nonzero.  We  can  also  write  down  the  explicit  formula 

aij  = max(0,min(ri,Cj,ri  H b r<  - ci  Cj_ i,ci  H b Cj  - n rq- 1)). 

(c)  The  same  matrix  is  obtained  as  in  (b). 

(d)  Reverse  left  and  right  in  (b)  and  (c);  in  both  cases  the  answer  is 

aij  = max(0,min(ri,cq,7q  4 bn  - ci+ 1 c„, c,  H b cn  - n ^-i))- 

(e)  Here  we  choose,  say,  row-wise  order:  Generate  the  first  row  just  as  for  bounded 
compositions  of  rq,  with  bounds  (ci,...,c„);  and  for  each  row  (an, . . . , aln),  gen- 


erate the  remaining  rows  recursively  in  the  same  way,  but  with  the  column  sums 
(ci  — an,...,c„  — ain).  Most  of  the  action  takes  place  on  the  bottom  two  rows, 
but  when  a change  is  made  to  an  earlier  row  the  later  rows  must  be  re-initialized. 

63.  If  aij  and  aki  are  positive,  we  obtain  another  contingency  table  by  setting  aij  e- 
atj  — 1,  an  <—  an  + 1,  akj  <—  akj  + 1,  au  -f-  au  — 1-  We  want  to  show  that  the  graph  G 
whose  vertices  are  the  contingency  tables  for  (rq, . . . , rm;  ci , . . . , cn),  adjacent  if  they 
can  be  obtained  from  each  other  by  such  a transformation,  has  a Hamiltonian  path. 

When  m = n = 2,  G is  a simple  path.  When  m = 2 and  n = 3,  G has  a two- 
dimensional  structure  from  which  we  can  see  that  every  vertex  is  the  starting  point  of  at 
least  two  Hamiltonian  paths,  having  distinct  endpoints.  When  m = 2 and  n > 4 we  can 
show,  inductively,  that  G actually  has  Hamiltonian  paths  from  any  vertex  to  any  other. 

When  m > 3 and  n > 3,  we  can  reduce  the  problem  from  m to  m — 1 as  in  answer 
62(e),  if  we  are  careful  not  to  “paint  ourselves  into  a corner.”  Namely,  we  must  avoid 
reaching  a state  where  the  nonzero  entries  of  the  bottom  two  rows  have  the  form  ( J “ ° ) 
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for  some  a,  b,  c > 0 and  a change  to  row  m — 2 forces  this  to  become  (°  “ *).  The 
previous  round  of  changes  to  rows  m — 1 and  m can  avoid  such  a trap  unless  c — 1 and 
it  begins  with  °)  or  ( J J).  But  that  situation  can  be  avoided  too. 

(A  genlex  method  based  on  exercise  61  would  be  considerably  simpler,  and  it 
almost  always  would  make  only  four  changes  per  step.  But  it  would  occasionally  need 
to  update  2min(m,  n)  entries  at  a time.) 

64.  When  x\  . . . xs  is  a binary  string  and  A is  a list  of  subcubes,  let  A ® x\  . . . xs 
denote  replacing  the  digits  (ai, . . . , as)  in  each  subcube  of  A by  (ai  0 xlt . . . , as  © xs), 
from  left  to  right.  For  example,  0*1**10  01010  = 1*1**00.  Then  the  following  mutual 
recursions  define  a Gray  cycle,  because  Ast  gives  a Gray  path  from  0s**  to  10s-1**  and 
Bst  gives  a Gray  path  from  0s**  to  *01s_1*‘_1,  when  st  > 0: 

Ast  = 0B(s-i)t,  *As(t-i)  0 001s  2,  lB(l_1)t; 

Bst  = 0A(s_1)t,  lB(s-i)(  0 010  , *AS((_!)  0 Is. 

The  strings  001s-2  and  010s-2  are  simply  0s  when  s < 2;  Aso  is  Gray  binary  code; 
Aot  = Bot  = **.  (Incidentally,  the  somewhat  simpler  construction 


Gst  — *Gs(t_1),  a(G(s_i)t,  <Jt_i a<  = ( mod  2, 
defines  a pleasant  Gray  path  from  **0S  to  at_i**0s_1.) 

65.  If  a path  P is  considered  equivalent  to  PR  and  to  P 0 x\  . . . xs,  the  total  number 
can  be  computed  systematically  as  in  exercise  33,  with  the  following  results  for  s+t  < 6: 

paths  cycles 


1 

1 1 
1 2 1 

13  3 1 

1 5 10  4 1 

1 6 36  35  5 1 

1 9 310  4630  218  6 1 


1 

1 1 
1 1 1 
1111 
12  111 
12  3 111 
1 3 46  4 1 1 1 


In  general  there  are  t + 1 paths  when  s = 1 and  (^/:^+2)  — (s  moc*  2)  when  t = 1.  The 
cycles  for  s < 2 are  unique.  When  s = t = 5 there  are  approximately  6.869  x 10170 
paths  and  2.495  x 107°  cycles. 

66.  Let  G(n,  0)  = e;  G(n,  t)  = 0 when  n < t;  and  for  1 < t < n,  let  G(n,  t ) be 

g(0)G(n  — 1,  t),  g(l)G(n-l,t)R,  ....  ff(2*  - l)G(n  - 1,  t)R,  g(2*  - l)G(n  - 1,  t - 1), 

where  g(k)  is  a f-bit  column  containing  the  Gray  binary  number  g(k)  with  its  least 
significant  bit  at  the  top.  In  this  general  formula  we  implicitly  add  a row  of  zeros 
below  the  bases  of  G(n  — 1,  t — 1). 

This  remarkable  rule  gives  ordinary  Gray  binary  code  when  t = 1,  omitting  0 ...  00. 
A cyclic  Gray  code  is  impossible  because  (")  is  odd. 

67 . A Gray  path  for  compositions  corresponding  to  Algorithm  C implies  that  there  is 
a path  in  which  all  transitions  are  0*1*  O l'0fc  with  min(fc,  l ) < 2.  Perhaps  there  is,  in 
fact,  a cycle  with  min (k,l)  = 1 in  each  transition. 

68.  (a)  {0};  (b)  0. 

69.  The  least  N with  KtN  < N is  (2tt_1)  + H f (J)  + 1 = |((2/)  + + 

• • • + (o)  + l),  because  (t^1)  < (”)  if  and  only  if  n > 2t  — 1. 
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70.  Using  the  facts  that  t > 3 implies 

Kt((2v3)+iv')-((2tr3)+iv')  = K,((2i-2)+^)-((2tr2)+iv')  = (“r^^r+Kt-xJv'-iv' 

when  N'  < (2*t  3) , we  conclude  that  the  maximum  is  (2t  ~2)  + (2tJ"14)  j~  + ' ' ' + (2)  f > 

and  it  occurs  at  2i_1  values  of  N when  t > 1. 

71.  Let  Ct  be  the  t-cliques.  The  first  (1414)  + (j00}1)  f-combinations  visited  by  Al- 
gorithm L define  a graph  on  1415  vertices  with  1000000  edges.  If  \Ct\  were  larger, 
If/  2Ct|  would  exceed  1000000.  Thus  the  single  graph  defined  by  P(  1000000)2  has  the 
maximum  number  of  t-cliques  for  all  t > 2. 

72.  M = (™s)  + • • ■ + (™u)  for  ma  > ■ ■ • > mu  > u > 1,  where  {ma, . . . ,mu}  = 
{s+f-1, . . . , n„}\{nt, . . . , n„+i}.  (Compare  with  exercise  15,  which  gives  (s+‘)  — 1 — N.) 

If  a = an_i  . . . a0  is  the  bit  string  corresponding  to  the  combination  nt . . . m,  then 
w is  1 plus  the  number  of  trailing  is  in  a,  and  u is  the  length  of  the  rightmost  run 
of  0s.  For  example,  when  a = 1010001111  we  have  s = 4,  t = 6,  M = (s)  + (7)  u = 3 

N=  (e)  + (D’v  = 5-  4 

73.  A and  B are  cross-intersecting  -£=)>  a £ U \ (5  for  all  a G A and  /3  e B <=> 

Andn-s~fB~  = 0,  where  B~  = {U  \ p \ /3  e B}  is  a set  of  (n  - ^-combinations.  Since 
QNnt  = PN(n-t),  we  have  \dn~s~1B-\  > \dn~s-tPN(n^t)\,  and  d71-3-* PN(n_t)  = PN,S 
where  N = Ks+i  . . . Kn-t  N.  Thus  if  A and  B are  cross-intersecting  we  have  M + N'  < 
\A\  + < ("),  and  QMns  D Pn,s  = 0. 

Conversely,  if  QMns  C PN/S  / 0 we  have  (”)  < M + N'  < \A\  + \dn~s~tB~\.  so  A 
and  B cannot  be  cross-intersecting. 

74.  \qQ Nnt\  = Kn-tN  (see  exercise  94).  Also,  arguing  as  in  (58)  and  (59),  we  find 
qPns  = (n—  1)P]V5  U • • ■ U IOP1V5  U {543210, . . . ,987654}  in  that  particular  case;  and 
\epNt\  = (n  + 1 — nt)N  + (n/4f11)  in  general. 

75.  The  identity  ( £ ) = (£)  + (£_J)  + ■■■  + ("0l),  Eq.  1.2.6-(io),  gives  another 
representation  if  nv  > v.  But  (60)  is  unaffected,  since  we  have  ({J+1)  = (fc " ) 4-  ("-1)  + 

■••  + (B_o*+1)- 

76.  Represent  N + 1 by  adding  (”_J)  to  (57);  then  use  the  previous  exercise  to  deduce 
that  Kt(N  + 1)  - Kt N = (J|~4)  = v - 1. 

77.  [D.  E.  Daykin,  JVanta  Math.  8,2  (1975),  78-83.]  We  work  with  extended  repre- 
sentations M = (”}*)  + • • ■ + ("(“)  and  N = ("*)  + ■ ■ • + (””)  as  in  exercise  75,  calling 
them  improper  if  the  final  index  u or  v is  zero.  Call  N flexible  if  it  has  both  proper 
and  improper  representations,  that  is,  if  nv  > v > 0. 

(a)  Given  an  integer  S,  find  M + N such  that  M + N = S and  ntM  + ntN  is 
minimum,  with  M as  large  as  possible.  If  N = 0,  we’re  done.  Otherwise  the  max-min 
operation  preserves  both  M + N and  kjM  + Kt  N , so  we  can  assume  that  v > u > 1 in 
the  proper  representations  of  M and  N.  If  N is  inflexible,  nt(M  + 1)  + Kt(N  - 1)  = 
(k(  M + u - 1)  + ( KtN  -v)  < KtM  + KtN,  by  exercise  76;  therefore  N must  be  flexible. 
But  then  we  can  apply  the  max-min  operation  to  M and  the  improper  representation 
of  N,  increasing  M:  Contradiction. 

This  proof  shows  that  equality  holds  if  and  only  if  MN  = 0,  a fact  that  was  noted 
in  1927  by  F.  S.  Macaulay. 

(b)  Now  we  try  to  minimize  max(/t tM,N)  + nt-iN  when  M + N = S,  this  time 
representing  N as  ("Cj1)  + •■•  + ("”)•  The  max-min  operation  can  still  be  used  if 
nt- 1 < mt ; leaving  mt  unchanged,  it  preserves  M + N and  KtM  + Kt-\N  as  well  as  the 
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relation  KtM  > N.  We  arrive  at  a contradiction  as  in  (a)  if  TV  ^ 0,  so  we  can  assume 
that  Tit- 1 > rrit- 

If  nt-i  > mt  we  have  TV  > KtM  and  also  A tN  > M;  hence  M + TV  < A tN  + TV  = 
(r*'7+1)  + • • • + (n;++11),  and  we  have  nt(M  + N)  < *t(A tN  + TV)  = TV  + Kt-iN. 

Finally  if  nt-i  = mt  = a,  let  M = (“)+M'  and  TV  = (t“x)+TV'.  Then  Kt(M+N)  = 
(“^)  + itt-i {M'  + TV'),  KtM  = (t“j)  + Kt-iM',  and  Kt-iN  = (t“2)  + «t_2TV';  the 
result  follows  by  induction  on  t. 

78.  [J.  Eckhoff  and  G.  Wegner,  Periodica  Math.  Hung.  6 (1975),  137-142;  A.  J.  W. 
Hilton,  Periodica  Math.  Hung.  10  (1979),  25-30.]  Let  M = |Ai|  and  TV  = |A0|;  we  can 
assume  that  t > 0 and  TV  > 0.  Then  |cL4|  = |cL4i  U Ao|  + |tMo|  > max(|#Ai|,  |A0|)  + 
|cL4o|  > max(KtM,  TV)  + Kt-iN  > Kt(M  + N)  = |P|^|t|,  by  induction  onm  + n + 1. 

Conversely,  let  Ai  = P\tt  + 1 and  Ao  = Pnh- i)  + 1;  this  notation  means,  for 
example,  that  {210,320}  + 1 = {321,431}.  Then  Kt(M  + N)  < \dA\  = | dAr  U A0|  + 
|(9A0)0|  = max(«tM,  TV)  + Kt-i N,  because  dA\  = P(KtM)(t- 1)  + 1.  [Schiitzenberger 
observed  in  1959  that  Kt(M  + N)  < k tM  + Kt-i N if  and  only  if  KtM  > TV.] 

For  the  first  inequality,  let  A and  B be  disjoint  sets  of  t-combinations  with  \A\  = M, 
\dA\  = KtM , |B|  = TV,  \dB\  = Kt TV.  Then  Kt(M  + N)  = Kt\A\JB\  < |9(AUB)|  = 
| dA  U dB | = |5A|  + \8B\  = ntM  + KtN. 

79.  In  fact,  Ht(M  + At_i M)  = M,  and  /rtTV  + \t-\HtN  = TV  + (n2  — n\)[v  = 1]  when 
TV  is  given  by  (57). 

80.  If  TV  > 0 and  t > 1,  represent  TV  as  in  (57)  and  let  TV  = No  + Ni,  where 


Let  No  = ( yt ) and  N\  = (t*j).  Then,  by  induction  on  t and  [xj,  we  have  (*)  = 
TVo  + KtNo  > (?)  + (t!x)  = Ni  = C)  - (?)  > (?)  - (X7x)  = (::i);  and 

KtN  = N1  + Kt-rNr  > + (t'_2)  = (}+()  > (,*,). 

[Lovasz  actually  proved  a stronger  result;  see  exercise  1.2.6-66.  We  have,  similarly, 
Mt  TV  > (*“j);  see  Bjorner,  Frankl,  and  Stanley,  Combinatorica  7 (1987),  27-28.] 

81.  For  example,  if  the  largest  element  of  Pjvs  is  66433,  we  have 

Pn 5 = {00000, . . . , 55555}(J{60000, . . . , 65555}U{66000, . . . , 66333}U{66400, . . . , 66433} 

so  TV  = (15°)  + (?)  + (?)  + (?).  Its  lower  shadow  is 

dPN 5 = {0000, . . . , 5555}  U {6000, . . . , 6555}  U {6600, . . . , 6633}  U {6640, . . . , 6643}, 

of  size  0 + G)  + G)  + (J). 

If  the  smallest  element  of  Qn9s  is  66433,  we  have 

QN95  = {99999, . . . , 70000}  U {66666, . . . , 66500}  U {66444, . . . , 66440}  U {66433} 

so  TV  = ((193)  + (182)  + (171))  + (Q  + Q)  + (?)  + (?).  Its  upper  shadow  is 

qQn95  = {999999, ....  700000}  U {666666, . . . , 665000} 

U {664444, . . . , 664400}  U {664333, . . . , 664330}, 

of  size  ((194)  + (1g3)  + (172))  + (0 + (5))  + (4)  + (3)  = N + K9N-  The  size,  t,  of  each 
combination  is  essentially  irrelevant,  as  long  as  TV  < ('}f) ; for  example,  the  smallest 
element  of  Qn98  is  99966433  in  the  case  we  have  considered. 
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82.  (a)  The  derivative  would  have  to  be  J2k>0  rJc(x),  but  that  series  diverges. 

[Informally,  the  graph  of  r(x)  shows  “pits”  of  relative  magnitude  2~k  at  all  odd 
multiples  of  2~k.  Takagi’s  original  publication,  in  Proc.  Physico-Math.  Soc.  Japan  (2) 
1 (1903),  176-177,  has  been  translated  into  English  in  his  Collected  Papers  (Iwanami 
Shoten,  1973).] 


(b)  Since  rk(l-t)  = (-1)  T2**1  when  k > 0,  we  have  f*  xrk(t)dt  = f\k(l-u)du  = 
- fx  rk(u)  du  = fx  rk(u)  du.  The  second  equation  follows  from  the  fact  that  rk(\t)  = 
rk— i(t).  Part  (d)  shows  that  these  two  equations  suffice  to  define  t(x)  when  x is 


rational. 

(c)  Since  r( 2~ax)  = a2~ax  + 2~ar(x)  for  0 < x < 1,  we  have  r(e)  = ae  + O(e) 
when  2”“_1  < e < 2~a.  Therefore  r(e)  = elg  I + O(e)  for  0 < e < 1. 

(d)  Suppose  0 < p/q  < 1.  If  p/q  < 1/2  we  have  r(p/q ) — p/q  -f  r(2p/g)/2; 
otherwise  r(p/<j)  = (q  — p)/q  + r(2(q  — p)/q)/2.  Therefore  we  can  assume  that  q is 
odd.  When  q is  odd,  let  p = p/2  when  p is  even,  p = (q  - p)/2  when  p is  odd.  Then 
T(p/<l)  = 2 r(p'/q)  - 2 p’/q  for  0 < p < q;  this  system  of  q — 1 equations  has  a unique 
solution.  For  example,  the  values  for  q = 3,  4,  5,  6,  7 are  2/3,  2/3;  1/2,  1/2,  1/2;  8/15, 
2/3,  2/3,  8/15;  1/2,  2/3,  1/2,  2/3,  1/2;  22/49,  30/49,  32/49,  32/49,  30/49,  22/49. 

/ . \ mi  i i • .1  11  ii  i i i i 


(e)  The  solutions  < 1 are  x 

(f)  The  value  | is  achieved  for  x 


1 i_  l 

4 16  ’ 4 

= 1 ± 1 ± 
2 ^ 8 ^ 


_1 1_  1 _ 

16  64  ’ 4 

- ± — ± • • • 

1 -1-  128  -c- 


2 i i_  i 

16  64  256  ’ ' ’ ' ’ 6 ' 

an  uncountable  set. 


83.  Given  any  integers  q > p > 0,  consider  paths  starting  from  0 in  the  digraph 


0 <-  1 


t 

1 


t 

2 


2 


t 

3 


3 


t 

4 


4 f-  5 


$ t 

-t  5 4 6 4 


Compute  an  associated  value  v,  starting  with  v < p;  horizontal  moves  change  v <—  2v, 

vertical  moves  from  node  a change  v <-  2 (qa  - v).  The  path  stops  if  we  reach  a node 
twice  with  the  same  value  v.  Transitions  are  not  allowed  to  upper  node  a if  v < —q  or 
v > qa  at  that  node;  they  are  not  allowed  to  lower  node  a with  v < 0 or  v > q(a  + 1). 
These  restrictions  force  most  steps  of  the  path.  (Node  a in  the  upper  row  means,  “Solve 
t(x)  = ax  - v/q"\  in  the  lower  row  it  means,  “Solve  r(x)  = v/q  - ax.")  Empirical 
tests  suggest  that  all  such  paths  are  finite.  The  equation  t ( x ) = p j q then  has  solutions 
x = xo  defined  by  the  sequence  xo,  X\,  X2,  ■ ■ ■ where  xk  = ^xk+\  on  a horizontal  step 
and  xk  = 1 - ixfc+i  on  a vertical  step;  eventually  xk  = x3  for  some  j < k.  If  ] > 0 and 
if  q is  not  a power  of  2,  these  are  all  the  solutions  to  t(x)  - p/q  when  x > 1/2. 

For  example,  this  procedure  establishes  that  t(x)  = 1/5  and  x > 1/2  only  when 
x is  83581/87040;  the  only  path  yields  x0  = 1 - \xx,  Xl  = \x2,  . . . , ii8  = |i19,  and 
*19  = *ii-  There  are,  similarly,  just  two  values  x > 1/2  with  t(x)  = 3/5,  having 
denominator  246(256  — l)/3. 

Moreover,  it  appears  that  all  cycles  in  the  digraph  that  pass  through  node  0 define 
values  of  p and  q such  that  r(x)  = p/q  has  uncountably  many  solutions.  Such  values 
are,  for  example,  2/3,  8/15,  8/21,  corresponding  to  the  cycles  (01),  (0121),  (012321). 
The  value  32/63  corresponds  to  (012121)  and  also  to  (012101234545454321),  as  well  as 
to  two  other  paths  that  do  not  return  to  0. 

84.  [Frankl,  Matsumoto,  Ruzsa,  and  Tokushige,  J.  Combinatorial  Theory  A69  (1995), 
125-148.]  If  a < b we  have 

C i-a  b)/T  = tS{t  ~ V~/(2t  - !)-  = 2“6(1  + /(a.h)*-1  + 0(b4/t2)), 
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where  f(a,  6)  — a(l  + 6)  — a2  — 6(l  + b)/4  = /(a+l,6)  — 6 + 2a.  Therefore  if  N has 
the  combinatorial  representation  (57),  and  if  we  set  rij  =21-1-6.,,  we  have 


^(kiN-n) 


bt  bt-i  - 2 6(^2  — 4 O(logf)3 

2*>t  + 26t-i  + 2bt~2  + " ' \ 


the  terms  being  negligible  when  bj  exceeds  2 lg  t.  And  one  can  show  that 


T{iz2  ej)  = x>i-2J)2-ei- 

3=0  j— 0 

85.  N-\t-iN  has  the  same  asymptotic  formas  ntN-N,  by  (63),  since  r(  x)  = r(l-x). 
So  does  2y.tN-N , up  to  0(T(log t)3/t2),  because  (2t~_1~b)  = 2(2t~_2"6)  (l  + 0(logt)/f) 
when  6 < 2 lg  t. 

86.  x G X°~  <=>■  x £ X°  -t==i>  x £ X or  x 0 X + ei  or  • • • or  x £ X + en  ■$=>  x G X~ 
or  x 6 X~  - ei  or  • • • or  x £ X~  - e„  x G X~+. 

87.  All  three  are  true,  using  the  fact  that  X C 7°  if  and  only  if  X+  C Y:  (a)  X C 7° 
^ O 7°~  = T~+  *=*  Y~  C X~°.  (b)  X+  C X+  =»  X C X+°;  hence 
X°  C X°+°.  Also  1°  C 1°  =>  X°+  C A';  hence  A°+°  C A0,  (c)  aM  < N 

C S°n  M < /31V. 

88.  If  i/x  < i/j/  then  i/(x  — e*)  < v(y  — ej),  so  we  can  assume  that  j/x  = i/y  and  that 
x > y in  lexicographic  order.  We  must  have  yj  >0;  otherwise  u(y  - ej)  would  exceed 
v(x  - efc).  If  Xi  = yi  for  1 < i < j,  clearly  k > j and  x - ek  <y-ej.  Otherwise  Xi  > yt 
for  some  i < j;  again  we  have  x - ek  -<  y - ej,  unless  x - ek  = y — ej. 

89.  From  the  table 


j = 

0 

1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

11 

ej  + ei  = 

ei 

eo 

e4 

es 

e.2 

e3 

eg 

eg 

ee 

e7 

eu 

eio 

ej  + e2  = 

62 

e4 

eo 

ee 

ei 

eg 

e3 

eio 

es 

en 

e? 

eg 

ej  + e3  = 

e3 

es 

ee 

e7 

eg 

eg 

eio 

eo 

en 

ei 

e2 

e4 

we  find  (a0,al,...,al2)  = (0,4,6,7,8,9,10,11,11,12,12,12,12);  (/30, 01, . . . , /312)  = 
(0,0,0,0,1,1,2,3,4,5,6,8,12). 

90.  Let  Y = A+  and  Z = CkX,  and  let  Na  = |Ajt(a)|  for  0 < a < m*,.  Then 

mfc~ 1 mfc  — 1 

1*1  = ^ |n(a)|=  ]T  |(Xfc(a-l)  + efc)U(Xfc(a)  + JEfc(0))| 

a=0  a=0 

mk-l 

> ^ max(JV0_i,aJV0), 

a=0 

where  a — 1 stands  for  (a  — 1)  mod  mk  and  the  a function  comes  from  the  (n  — 1)- 
dimensional  torus,  because  |X*(a)  + Ek(0)\  > aNa  by  induction.  Also 

mk -1  mk-l 

lz+l  = \Zt^)\=  ]T  |(Zfc(o-l)  + eik)U(Z*(a)  + £;fc(0))| 

a=0  a=0 

mfc-l 

= ^2  max(Na-i,aNa), 

a=0 

because  both  Zk(a  — 1)  + ek  and  Zk(a ) + Ek( 0)  are  standard  in  n — 1 dimensions. 
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91.  Let  there  be  Na  points  in  row  a of  a totally  compressed  array,  where  row  0 is 
at  the  bottom;  thus  l = N-1  > N0  > ■■  ■ > Nm- i > Nm  = 0.  We  show  first, 
that  there  is  an  optimum  X for  which  the  “bad”  condition  Na  = Na+\  never  occurs 
except  when  Na  = 0 or  Na  = l.  For  if  a is  the  smallest  bad  subscript,  suppose 
Na-l  > Na  = Na+1  = ■■■  = Na+k  > Na+k+1-  Then  we  can  always  decrease  Na. 
by  1 and  add  1 to  some  Nb  for  b < a without  increasing  ( A" + 1 , except  in  cases  where 
k = 1 and  Na+2  = Na+i  — 1 and  Nb  = Na  + a — b < l for  0 < b < a.  Exploring  such 
cases  further,  if  Nc+ i < Nc  = Nc-i  for  some  c > a + 1,  we  can  set  Nc  •<—  Nc  — 1 and 
Na  <—  Na  + 1,  thereby  either  decreasing  a or  increasing  N0.  Otherwise  we  can  find 
a subscript  d such  that  Nc  = Na+i  + a+  l-  c>0  for  a < c < d,  and  either  Nd  = 0 or 
Nd  < Nd- 1 - 1.  Then  it  is  OK  to  decrease  Nc  by  1 for  a < c < d and  subsequently  to 
increase  Nb  by  1 for  0 < b < d — a — 1.  (It  is  important  to  note  that  if  Nd  = 0 we  have 
No  > d — 1;  hence  d = m implies  l = m.) 

Repeating  such  transformations  until  Na  > Na+i  whenever  Na  / l and  Na+ i / 0, 
we  reach  situation  (86),  and  the  proof  can  be  completed  as  in  the  text. 

92.  Let  x + k denote  the  lexicographically  smallest  element  of  T(mi, . . . , m„_ i)  that 
exceeds  x and  has  weight  ux  + k,  if  any  such  element  exists.  For  example,  if  mi  = m2  = 
m3  = 4 and  x = 211,  we  have  x+1  = 212,  x + 2 = 213,  x + 3 = 223,  x + 4 = 233,  x + 5 = 
333,  and  x + 6 does  not  exist;  in  general,  x + k + 1 is  obtained  from  x + k by  increasing 
the  rightmost  component  that  can  be  increased.  If  x + k = (m i - 1, ... , m„_i  — 1),  let 
us  set  x + k + 1 = x + k.  Then  if  S(k)  is  the  set  of  all  elements  of  T(m\, . . . ,mn_i) 
that  are  ■<  x + k,  we  have  S(k  + 1)  = S(k)+ . Furthermore,  the  elements  of  5 that  end 
in  a are  those  whose  first  n — 1 components  are  in  S(m-l-a). 

The  result  of  this  exercise  can  be  stated  more  intuitively:  As  we  generate  n- 
dimensional  standard  sets  Si,  S2,  . . . , the  (n  — l)-dimensional  standard  sets  on  each 
layer  become  spreads  of  each  other  just  after  each  point  is  added  to  layer  m — 1. 
Similarly,  they  become  cores  of  each  other  just  before  each  point  is  added  to  layer  0. 

93.  (a)  Suppose  the  parameters  are  2 < mi  < m'2  < ■ ■ ■ < m'n  when  sorted  properly, 
and  let  k be  minimal  with  mk  ^ m'k.  Then  take  N = l + rank(0, . . . , 0,  m'k  — 1, 0, . . . , 0). 
(We  must  assume  that  min(mi, . . . , mn)  > 2,  since  parameters  equal  to  1 can  be  placed 
anywhere.) 

(b)  Only  in  the  proof  for  n = 2,  buried  inside  the  answer  to  exercise  91.  That 
proof  is  incorporated  by  induction  when  n is  larger. 

94.  Complementation  reverses  lexicographic  order  and  changes  q to  d. 

95.  For  Theorem  K,  let  d = n - 1 and  s0  = • • • = sd  — 1.  For  Theorem  M,  let  d = s 
and  s0  = • • • = sd  = t + 1. 

96.  In  such  a representation,  N is  the  number  of  t-multicombinations  of  (s0  • 0,  si  ■ 1, 
S2  • 2, . . . } that  precede  ntnt- 1 . . . n 1 in  lexicographic  order,  because  the  generalized 
coefficient  (S[n^)  counts  the  multicombinations  whose  leftmost  component  is  < n. 

If  we  truncate  the  representation  by  stopping  at  the  rightmost  nonzero  term 
(S(n„)),  we  0jjtajn  a njce  generalization  of  (60): 


[See  G.  F.  Clements,  J.  Combinatorial  Theory  A37  (1984),  91-97.  The  inequalities 
so  > .si  > • ■ ■ > sd  are  needed  for  the  validity  of  Corollary  C,  but  not  for  the  calculation 
of  |<LPjvt|.  Some  terms  ( sl-™k *)  for  t > k > v may  be  zero.  For  example,  when  N = 1, 
t = 4,  so  = 3,  and  si  = 2,  we  have  N = (S41))  + (S(31))  = 0 + 1 .] 
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97.  (a)  The  tetrahedron  has  four  vertices,  six  edges,  four  faces:  (No,. . . , IV4)  = 
(1,4,  6, 4,1).  The  octahedron,  similarly,  has  (N0,...,N6)  = (1, 6,  8, 8, 0, 0, 0),  and 
the  icosahedron  has  (No, . . . , Nn)  = (1, 12,  30,  20, 0, . . . , 0).  The  hexahedron,  aka  the 
3-cube,  has  eight  vertices,  12  edges,  and  six  square  faces;  perturbation  breaks  each 
square  face  into  two  triangles  and  introduces  new  edges,  so  we  have  (No,  ■ ■ ■ ,Ng)  = 
(1,  8, 18, 12,  0, ... , 0).  Finally,  the  perturbed  pentagonal  faces  of  the  dodecahedron  lead 
to  (AT0, , N20)  = (1,  20, 54,  36, 0, . . . , 0). 

(b)  {210,310}  U {10,  20,  21, 30, 31}  U {0,1, 2, 3}  U {e}. 

(c)  0 < Nt  < (")  for  0 < t < n and  Nt- 1 > ntNt  for  1 < t < n.  The  second 

condition  is  equivalent  to  \t-iNt-i  > Nt  for  1 < t < n,  if  we  define  A01  = 00.  These 

conditions  are  necessary  for  Theorem  K,  and  sufficient  if  A = |J  PNtt- 

(d)  The  complements  of  the  elements  not  in  a simplicial  complex,  namely  the  sets 
{ {0, . . . , n — 1}  \ a | a £ C},  form  a simplicial  complex.  (We  can  also  verify  that 
the  necessary  and  sufficient  condition  holds:  Nt~i  > ntNt  4=4>  \t-\Nt-\  > Nt  4=4- 
Kn-t+iNn~t+i  < N„-t,  because  Kn-tNn-t+ 1 = (")  — M-iNt-i  by  exercise  94.) 

(e)  00000  o 14641;  10000  14640;  11000  <->  14630;  12000  <->  14620;  13000 

14610;  14000  o 14600;  12100  14520;  13100  <->  14510;  14100  -H-  14500;  13200  ++ 

14410;  14200  <->  14400;  13300  *->  14310;  and  the  self-dual  cases  14300,  13310. 

98.  The  following  procedure  by  S.  Linusson  [Combinatorica  19  (1999),  255-266],  who 
considered  also  the  more  general  problem  for  multisets,  is  considerably  faster  than  a 
more  obvious  approach.  Let  L(n , h , l)  count  feasible  vectors  with  Nt  = (")  for  0 < t < l, 
Nt+i  < (("i),  and  Nt  = 0 for  t,  > h.  Then  L(n,h,l)  = 0 unless  —1  < l < h < n; 
also  L(n,h,h)  = L(n,h,—  1)  = 1,  and  L(n,n,l)  = L(n,n  — 1,/)  for  l < n.  When 
n > h > l > 0 we  can  compute  L(n,h,l)  = J2j=i  L(n  ~ 1,  h,j)L(n  — 1 ,j  — 1,/  - 1),  a 
recurrence  that  follows  from  Theorem  K.  (Each  size  vector  corresponds  to  the  complex 
U PNtt,  with  L(n  — 1,  h,  j)  representing  combinations  that  do  not  contain  the  maximum 
element  n — 1 and  L(n  — 1 ,j  — 1,1  — 1)  representing  those  that  do.)  Finally  the  grand 
total  is  L(n ) = ]T)"=1  L(n,n,l). 

We  have  L(0),  L(l),  L( 2),  ...  = 2,  3,  5,  10,  26,  96,  553,  5461,  100709,  3718354, 
289725509,  . . . ; L(IOO)  « 3.22  99  x 101842. 

99.  The  maximal  elements  of  a simplicial  complex  form  a clutter;  conversely,  the 
combinations  contained  in  elements  of  a clutter  form  a simplicial  complex.  Thus  the 
two  concepts  are  essentially  equivalent. 

(a)  If  (M0,  Mi, . . . , Mn)  is  the  size  vector  of  a clutter,  then  (N0,  Ni, . . . , Nn)  is 
the  size  vector  of  a simplicial  complex  if  Nn  = Mn  and  Nt  = Mt  + Kt+iNt+i  for 
0 < t < n.  Conversely,  every  such  (No, . . . , Nn)  yields  an  (Mo, . . . ,Mn)  if  we  use 
the  lexicographically  first  Nt  t-combinations.  [G.  F.  Clements  extended  this  result  to 
general  multisets  in  Discrete  Math.  4 (1973),  123-128.] 

(b)  In  the  order  of  answer  97(e)  they  are  00000,  00001,  10000,  00040,  01000,  00030, 
02000,  00120,  03000,  00310,  04000,  00600,  00100,  00020,  01100,  00210,  02100,  00500, 
00200,  00110,  01200,  00400,  00300,  01010,  01300,  00010.  Notice  that  (M0,...,Mn)  is 
feasible  if  and  only  if  (Mn, . . . , Mo)  is  feasible,  so  we  have  a different  sort  of  duality  in 
this  interpretation. 

100.  Represent  A as  a subset  of  T(m\, . . . , m„)  as  in  the  proof  of  Corollary  C.  Then  the 
maximum  value  of  vA  is  obtained  when  A consists  of  the  N lexicographically  smallest 
points  xi  ...  xn. 

The  proof  starts  by  reducing  to  the  case  that  A is  compressed,  in  the  sense  that 
its  t-multicombinations  are  P\Anrt\t  for  each  t.  Then  if  y is  the  largest  element  € A 
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and  if  x is  the  smallest  element  £ A,  we  prove  that  x < y implies  vx  > uy,  hence 
v{A  \ y U x)  > uA.  For  if  vx  — uy  — k we  could  find  an  element  of  dky  that  is  greater 
than  x,  contradicting  the  assumption  that  A is  compressed. 

101.  (a)  In  general,  F(p)  = N0pn  + Nipn~1(l-p)  + - ■ - + Nn(l-p)n  when  f(x  i,.. . ,xn) 
is  satisfied  by  exactly  Nt  binary  strings  xi  ..  ,xn  of  weight  t.  Thus  we  find  G(p)  = 
p4  + 3p3(l  -p)+p2(  1 - p)2;  H(p)  = p4  + p3(  1 - p)  + p\  1 - p)\ 

(b)  A monotone  formula  / is  equivalent  to  a simplicial  complex  C under  the  corre- 
spondence f(x i,  = 1 <=>  {j  - 1 | Xj  = 0}  € C.  Therefore  the  functions  f(p)  of 

monotone  Boolean  functions  are  those  that  satisfy  the  condition  of  exercise  97(c),  and 
we  obtain  a suitable  function  by  choosing  the  lexicographically  last  Nn-t  t-combinations 
(which  are  complements  of  the  first  N„  s-combinations) : {3210},  {321,320,310},  {32} 
gives  f(w,x,y,z)  = wxyz  V xyz  V wyz  V wxz  V yz  = wxz  V yz. 

M.  P.  Schiitzenberger  observed  that  we  can  find  the  parameters  Nt  easily  from 
f(p)  by  noting  that  /(l/(l  + «))  = (No  + N\u  + • • • + N„un)/(  1 + u)n . One  can  show 
that  H (p)  is  not  equivalent  to  a monotone  formula  in  any  number  of  variables,  because 
(1  + u + u2)/(l  + u)4  = (No  + Niu  + • • • + Nnun)/(  1 + u)n  implies  that  N\  = n — 3, 
N2  = (nj3)  + 1,  and  k2N2  = n — 2. 

But  the  task  of  deciding  this  question  is  not  so  simple  in  general.  For  example, 
the  function  (1  + 5u  + 5u2  + 5u3)/(l  + u)5  does  not  match  any  monotone  formula  in 
five  variables,  because  k35  = 7;  but  it  equals  (1  + 6u  + 10u2  + 10u3  + 5u4)/(l  + u)6, 
which  works  fine  with  six. 

102.  (a)  Choose  Nt  linearly  independent  polynomials  of  degree  t in  7;  order  their  terms 
lexicographically,  and  take  linear  combinations  so  that  the  lexicographically  smallest 
terms  are  distinct  monomials.  Let  I'  consist  of  all  multiples  of  those  monomials. 

(b)  Each  monomial  of  degree  t in  I'  is  essentially  a t-multicombination;  for 
example,  ■>'  \ x2 ./'{  corresponds  to  55552111.  If  A / , is  the  set  of  independent  monomials 
for  degree  t,  the  ideal  property  is  equivalent  to  saying  that  Mt+i  2 gMt. 

In  the  given  example,  M3  = {x0x?};  M4  = qM3  U{x0x1xl};  Ms  = eM4U{x1xJ}, 
since  x\(xqx\  — 2x3x2)  — x1(xoX1x%)  = — 2x^x\\  and  Mt+i  = gMt  thereafter. 

(c)  By  Theorem  M we  can  assume  that  Mt  = Qmsi ■ Let  Nt  = ("*')  + ■ ■ ■ + 
(T)  + (T)>  where  s + t > nts  > ■ ■ ■ > nt2  > nn  > 0;  then  nts  = s + t if  and  only  if 
nt(s-i)  = s — 2,  . . . , nt  1 = 0.  Furthermore  we  have 

Nt+ 1 > Nt  + KsNt  = (nts  + + . . . + (nt2  + [”f2  > 2]  j + + [j*«  > !]  y 

Therefore  the  sequence  (n(s-f-oo[nts  < s], . . . , nt2-t-oo[nt2  < 2],  ntl -t-oo[nn  < 1]) 
is  lexicographically  nondecreasing  as  t increases,  where  we  insert  ‘-00’  in  components 
that  have  ntj  = j — 1.  Such  a sequence  cannot  increase  infinitely  many  times  without 
exceeding  the  maximum  value  (s,  -00, . . . , -00),  by  exercise  1.2. 1 15(d) . 

103.  Let  7-Vst  be  the  first  N elements  of  a sequence  determined  as  follows:  For  each 
binary  string  x = xs+t-i  ...x0,  in  lexicographic  order,  write  down  (Y)  subcubes  by 
changing  t of  the  Is  to  *s  in  all  possible  ways,  in  lexicographic  order  (considering  1 < *). 
For  example,  if  x = 0101101  and  t — 2,  we  generate  the  subcubes  0101*0*,  010*10*, 
010**01,  0*0110*,  0*01*01,  0*0*101. 

[See  B.  Lindstrom,  Arkiv  for  Mat.  8 (1971),  245-257;  a generalization  analogous 
to  Corollary  C appears  in  K.  Engel,  Sperner  Theory  (Cambridge  Univ.  Press,  1997), 
Theorem  8.1.1.] 
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104.  The  first  N strings  in  cross  order  have  the  desired  property.  [T.  N.  Danh  and 
D.  E.  Daykin,  J.  London  Math.  Soc.  (2)  55  (1997),  417-426.] 

Notes:  Beginning  with  the  observation  that  the  “1-shadow”  of  the  N lexico- 
graphically first  strings  of  weight  t (namely  the  strings  obtained  by  deleting  1 bits 
only)  consists  of  the  first  ptN  strings  of  weight  t,  R.  Ahlswede  and  N.  Cai  extended 
the  Danh-Daykin  theorem  to  allow  insertion,  deletion,  and/or  transposition  of  bits 
[Combinatorica  17  (1997),  11-29;  Applied  Math.  Letters  11,5  (1998),  121-126],  Uwe 
Leek  has  proved  that  no  total  ordering  of  ternary  strings  has  the  analogous  minimum- 
shadow  property  [Preprint  98/6  (Univ.  Rostock,  1998),  6 pages], 

105.  Every  number  must  occur  the  same  number  of  times  in  the  cycle.  Equivalently, 
i ) must  be  a multiple  of  t.  This  necessary  condition  appears  to  be  sufficient  as 

well,  provided  that  n is  not  too  small  with  respect  to  t:  but  such  a result  may  well  be 
true  yet  impossible  to  prove.  [See  Chung,  Graham,  and  Diaconis,  Discrete  Math.  110 
(1992),  55-57.] 

The  next  few  exercises  consider  the  cases  t — 2 and  t = 3,  for  which  elegant 
results  are  known.  Similar  but  more  complicated  results  have  been  derived  for  t = 4 
and  t = 5,  and  the  case  t — 6 has  been  partially  resolved.  The  case  (n,t)  = (12,6)  is 
currently  the  smallest  for  which  the  existence  of  a universal  cycle  is  unknown. 

106.  Let  the  differences  mod  (2m+l)  be  1,  2,  . . . , m,  1,  2,  . . . , m,  . . . , repeated  2m+l 
times;  for  example,  the  cycle  for  m = 3 is  (013602561450346235124).  This  works 

because  1 -| b m = (m2hl)  is  relatively  prime  to  2m  + 1.  [J.  Ecole  Polytechnique  4, 

Cahier  10  (1810),  16-48.] 

107.  The  seven  doubles  HWm  can  be  inserted  in  37  ways  into  any 

universal  cycle  of  2-combinations  for  {0, 1, 2, 3, 4,  5, 6}.  The  number  of  such  universal 
cycles  is  the  number  of  Eulerian  trails  of  the  complete  graph  K7,  which  can  be  shown 
to  be  129,976,320  if  we  regard  (a0ai  . . .a20)  as  equivalent  to  (eq  . . .a2oa0)  but  not  to 
the  reverse-order  cycle  ( a20  ■ ■ ■ aia0).  So  the  answer  is  284,258,211,840. 

[This  problem  was  first  solved  in  1859  by  M.  Reiss,  whose  method  was  so  com- 
plicated that  people  doubted  the  result;  see  Nouvelles  Annales  de  Mathematiques  8 
(1849),  74;  11  (1852),  115;  Annali  di  Matematica  Pura  ed  Applicata  (2)  5 (1871- 
1873),  63-120.  A considerably  simpler  solution,  confirming  Reiss’s  claim,  was  found  by 
P.  Jolivald  and  G.  Tarry,  who  also  enumerated  the  Eulerian  trails  of  K<y.  see  Comptes 
Rendus  Association  Frangaise  pour  LAvancement  des  Sciences  15,  part  2 (1886),  49- 
53;  E.  Lucas,  Recreations  Mathematiques  4 (1894),  123-151.  Brendan  D.  McKay  and 
Robert  W.  Robinson  found  an  approach  that  is  better  still,  enabling  them  to  continue 
the  enumeration  through  K2 1 by  using  the  fact  that  the  number  of  trails  is 

\m  ■*■)•  1*0  Z1  • • • z2m  J det (djk)  H (Zj  + Zk), 

l<j<fc<2m 

where  a]k  = -l/(z]  + z2k)  when  j / fc;  a3j  = -1/(2 z])  + Eo<k<2m  V(^  + *2)i  see 
Combinatorics,  Probability,  and  Computing  7 (1998),  437-449.] 

C.  Flye  Sainte-Marie,  in  L’ Intermediate  des  Mathematiciens  1 (1894),  164-165, 
noted  that  the  Eulerian  trails  of  K7  include  2 x 720  that  have  7-fold  symmetry  under 
permutation  of  {0,1,..., 6}  (namely  Poinsot’s  cycle  and  its  reverse),  plus  32  x 1680 
with  3-fold  symmetry,  plus  25778  x 5040  cycles  that  are  asymmetric. 

108.  No  solution  is  possible  for  n < 7,  except  in  the  trivial  case  n = 4.  When  n = 
7 there  are  12,255,208  x 7!  universal  cycles,  not  considering  (aoai  ...a34)  to  be  the 
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same  as  (ai  . . . a34a0),  including  cases  with  5- fold  symmetry  like  the  example  cycle  in 
exercise  105. 

When  n > 8 we  can  proceed  systematically  as  suggested  by  B.  Jackson  in  Discrete 
Math.  117  (1993),  141-150;  see  also  G.  Hurlbert,  SIAM  J.  Disc.  Math.  7 (1994), 
598-604:  Put  each  3-combination  into  the  “standard  cyclic  order”  cic2c3  where  c2  = 
(ci  + S)  mod  n,  c3  = (c2  + S')  mod  n,  0 < S,  S'  < n/2,  and  either  <5  = S'  or  max(<5,  S')  < 
n - S - S'  / (n  - l)/2  or  (1  < 5 < n/4  and  S'  = (n  - l)/2)  or  (8  = (n  — l)/2  and 
1 < <5'  < n/4).  For  example,  when  n = 8 the  allowable  values  of  (S,S')  are  (1,1), 
(1,2),  (1,3),  (2,1),  (2,2),  (3,1),  (3,3);  when  n = 11  they  are  (1,1),  (1,2),  (1,3), 

(1,4),  (2,1),  (2,2),  (2,3),  (2,5),  (3,1),  (3,2),  (3,3),  (4,1),  (4,4),  (5,2),  (5,5).  Then 

construct  the  digraph  with  vertices  (c,<5)  for  0 < c < n and  1 < 5 < n/2,  and  with  arcs 
(ci,  J)  ->  (c2,S')  for  every  combination  cic2c3  in  standard  cyclic  order.  This  digraph  is 
connected  and  balanced,  so  it  has  an  Eulerian  trail  by  Theorem  2. 3. 4. 2D.  (The  peculiar 
rules  about  (n  - l)/2  make  the  digraph  connected  when  n is  odd.  The  Eulerian  trail 
can  be  chosen  to  have  n-fold  symmetry  when  n = 8,  but  not  when  n = 12.) 

109.  When  n = 1 the  cycle  (000)  is  trivial;  when  n = 2 there  is  no  cycle;  and  there 
are  essentially  only  two  when  n = 4,  namely 

(00011122233302021313)  and(0001 1 1 20203332221313). 

When  n > 5,  let  the  multicombination  did2d3  be  in  standard  cyclic  order  if  d2  = 
(di  + 5 - 1)  mod  n,  d3  = (d2  + S'  - 1)  mod  n,  and  (<5,  J')  is  allowable  for  n + 3 in 

the  previous  answer.  Construct  the  digraph  with  vertices  (d,  S)  for  0 < d < n and 

1 < S < (n  + 3)/2,  and  with  arcs  (di,<5)  — > (d2,<5')  for  every  multicombination  did2d3 
in  standard  cyclic  order;  then  find  an  Eulerian  trail. 

Perhaps  a universal  cycle  of  f-multicombinations  exists  for  {0, 1, . . . , n - 1}  if  and 
only  if  a universal  cycle  of  f-combinations  exists  for  {0, 1, . . . , n + t — 1}. 

110.  A nice  way  to  check  for  runs  is  to  compute  the  numbers 

b(s)  = ]T{2p(c)  Ices} 

where  (p(A), . . . ,p(K))  = (1, . . . , 13);  then  set  l <-  6(5)&-6(S)  and  check  that  b(S)+l  = 
l « s,  and  also  that  (( l « s)  \ (l  » 1))  & a = 0,  where  a = 2p(ci)  | | 2P<C5).  The 

values  of  6(5)  and  ^{u(c)  | c 6 5}  are  easily  maintained  as  S runs  through  all  31 
nonempty  subsets  in  Gray-code  order.  The  answers  are  (1009008,  99792,  2813796, 
505008,  2855676,  697508,  1800268,  751324,  1137236,  361224,  388740,  51680,  317340, 
19656,  90100,  9168,  58248,  11196,  2708,  0,  8068,  2496,  444,  356,  3680,  0,  0,  0,  76,  4) 
for  x = (0, ... , 29);  thus  the  mean  score  is  « 4.769  and  the  variance  is  « 9.768. 

Hands  without  points  are  sometimes  facetiously  called  nineteen, 
as  that  number  cannot  be  made  by  the  cards. 

— G.  H.  DAVIDSON,  Dee's  Hand-Book  of  Cribbage  (1839) 

Note:  A four-card  flush  is  not  allowed  in  the  “crib.”  Then  the  distribution  is  a bit 
easier  to  compute,  and  it  turns  out  to  be  (1022208,  99792,  2839800,  508908,  2868960, 
703496,  1787176,  755320,  1118336,  358368,  378240,  43880,  310956,  16548,  88132,  9072, 
57288,  11196,  2264,  0,  7828,  2472,  444,  356,  3680,  0,  0,  0,  76,  4);  the  mean  and  variance 
decrease  to  approximately  4.735  and  9.667. 

111.  dn  2rB  is  the  set  of  all  r-subsets  of  J5;  these  subsets  must  not  be  in  A.  If  |A|  = 
|B|  = (n^r)  for  some  real  a:  > ra-1,  we  would  have  (")  > \A\-\-\dn~2r B\  > J + (x)  > 

("Ir)  + (n/X)  = ("),  by  exercise  80.  [See  Quart.  J.  Math.  Oxford  12  (1961),  313-320.] 
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2.  In  general,  given  any  integers  x\  > > xrn,  we  obtain  all  integer  m-tuples 

a i ..  .am  such  that  ai  > • • • > am , ai  4 1-  am  = %i  4 hxm , and  am  . . . a i > xm  ■ ■ ■ Xi 

by  initializing  ai  . . . am  4—  x\  . . . xm  and  am+ 1 4—  a;m  — 2.  In  particular,  if  c is  any 
integer  constant,  we  obtain  all  integer  m-tuples  such  that  a\  > • • • > am  > c and 
ai  + • • • + am  = n by  initializing  ai  4—  n — me  + c,  aj  4—  c for  1 < j < m,  and 
am+  i 4—  c — 2,  assuming  that  n > cm. 

3.  Oj  = [ (n  + m - j)/m\  = [(n  + 1 — j)/m\,  for  1 < j < m;  see  CMath  §3.4. 

4.  Assume  that  1 < r < n.  We  must  have  am  > a\  — 1;  therefore  aj  = |_(n+m— j)/m\ 
for  1 < j < m,  where  m is  the  largest  integer  with  [n/mj  > r,  namely  m = [n/r J. 

5.  [See  Eugene  M.  Klimko,  BIT  13  (1973),  38-49.] 

Cl.  [Initialize.]  Set  c0  «-  1,  Ci  «-  n,  c2...c„  4-  0...0,  l0  4-  1,  Ii  4-  0.  (We 
assume  that  n > 0.) 

C2.  [Visit.]  Visit  the  partition  represented  by  part  counts  c\...cn  and  links 
loll  ...  In  • 

C3.  [Branch.]  Set  j 4—  lo  and  k 4—  lj.  If  Cj  = 1,  go  to  C6;  otherwise,  if  j > 1,  go 
to  C5. 

C4.  [Change  1 + 1 to  2.]  Set  a 4-  ci  - 2,  c2  4-  c2  + 1,  and  /[Cl>o]  4-2.  If  k / 2, 
also  set  /2  4—  fc.  Return  to  C2. 

C5.  [Change  j ■ Cj  to  (j+1)  + 1 + • • • + 1.]  Set  ci  4-  j ( Cj  — 1)  - 1 and  go  to  C7. 

C6.  [Change  k ■ c*,  + j to  (k+ 1)  + 1 + • • • + 1.]  Terminate  if  k = 0.  Otherwise  set 
Cj  4—  0;  then  set  ci  4—  k(ck  — 1)  + j — 1,  j 4—  k,  and  k 4—  Ik- 

C7.  [Adjust  links.]  If  ci  > 0,  set  lo  4—  1,  l\  4—  j + 1;  otherwise  set  l0  4—  j + 1. 
Then  set  Cj  4—  0 and  Cj+\  4—  Cj+i  + 1.  If  k ^ j + 1,  set  2j+i  4—  k.  Return 
to  C2.  | 

Notice  that  this  algorithm  is  loopless-,  but  it  isn’t  really  faster  than  Algorithm  P.  Steps 
C4,  C5,  and  C6  are  performed  respectively  pin  — 2),  2 p{n)  — p(n  + 1)  — p(n  — 2),  and 
p(n  + 1)  — p(n)  times;  thus  step  C4  is  most  important  when  n is  large.  (See  exercise 
45  and  the  detailed  analysis  by  Fenner  and  Loizou  in  Acta  Inf.  16  (1981),  237-252.) 

6.  Assume  that  each  partition  is  followed  by  0.  Set  j 4—  0,  k 4—  ai,  and  bk+i  4—  0. 
Then,  while  k > 0,  set  j 4—  j + 1 and,  while  k > aJ+ 1,  set  bk  4—  j and  k 4—  k — 1. 
(We  have  used  (n)  in  the  dual  form  aj  — aJ+1  = dj , where  d\  . . ,dn  is  the  part-count 
representation  of  &i&2  • • • • This  algorithm  essentially  walks  along  the  rim  of  the  Ferrers 
diagram;  so  its  running  time  is  roughly  proportional  to  ai  + 6i , the  number  of  parts  in 
the  output  plus  the  number  of  parts  in  the  input.) 

7.  We  have  bi  . . . bn  = nan  (n—  l)0"-1  ~a"  . . . i°i-a2on-“i?  by  the  dual  of  (n). 

8.  Transposing  the  Ferrers  diagram  corresponds  to  reflecting  and  complementing  the 

bit  string  (15).  So  we  simply  interchange  and  reverse  the  p’s  and  q' s,  getting  the 
partition  (aia2  ...  )T  = {qt  H 1 qi)pi  (qt  ~\ \- q2)P2  . . . {qt)pt  ■ 
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9.  By  induction:  If  a,k  = l — 1 and  b ; = A;  — 1 , increasing  a*,  and  bi  preserves  equality. 

10.  (a)  The  left  child  of  each  node  is  obtained  by  appending  ‘I’.  The  right  child  is 
obtained  by  increasing  the  rightmost  digit;  this  child  exists  if  and  only  if  the  parent  node 
ends  with  unequal  digits.  All  partitions  of  n appear  on  level  n in  lexicographic  order. 

(b)  The  left  child  is  obtained  by  changing  ‘IT  to  ‘2’;  it  exists  if  and  only  if  the 
parent  node  contains  at  least  two  Is.  The  right  child  is  obtained  by  deleting  a 1 and 
increasing  the  smallest  part  that  exceeds  1;  it  exists  if  and  only  if  there  is  at  least  one  1 
and  the  smallest  larger  part  appears  exactly  once.  All  partitions  of  n into  m parts  ap- 
pear on  level  n-min  lexicographic  order;  preorder  of  the  entire  tree  gives  lexicographic 
order  of  the  whole.  [T.  I.  Fenner  and  G.  Loizou,  Comp.  J.  23  (1980),  332-337.] 

11.  [z100]  1/((1  - z)(  1 - z2)(  1 - z5)(  1 - z10)(l  - z20)(  1 - z50)(l  - z100))  = 45  63;  and 
[z100]  (l  + z + z2)(l  + z2+z4)...(l+z100  + z200)  = 7.  [See  G.  Polya,  AMM  63  (1956), 
689-697.]  In  the  infinite  product  El/tx)  nre{io'=,2  io'',5  io^}(1  + ^ + ~2r),  the  coefficient 
of  z10"  is  2n+1  — 1,  and  the  coefficient  of  z10"-1  is  2". 

12.  To  prove  that  (1  + z)(l  + z2)(l  + z3) . . . = 1/((1  — z)(l  — z3)(l  — z5) . . . ),  write 
the  left-hand  side  as 

(l-z2)(l-z4)(l-z6) 

(1-z)  (1  — z2)  (i  — z3) 

and  cancel  common  factors  from  numerator  and  denominator.  Alternatively,  replace  z 
by  z1,  z3,  z5,  ...  in  the  identity  (1  + z)(l  + z2)(l  + z4)(l  + z8) . . . = 1/(1  — z)  and 
multiply  the  results  together.  [Novi  Comment.  Acad.  Sci.  Pet.  3 (1750),  125-169,  §47.] 

13.  Map  the  partition  ci -l  + c2-2  + C3-3-| into  r-i  - 1+  [ci/2] -2  + r3 -3+  Lc2/2j-4+r5-5  + 

[c3/2j-6-| , whererm  = (cm  mod2)  + 2(c2m  mod  2)  + 4(c4m  mod  2)+8(c8m  mod2)  + 

• • • ; 433222211  i-a  64421111  ha  8332211.  [Johns  Hopkins  Univ.  Circular  2 (1882),  72.] 

14.  Sylvester’s  correspondence  is  best  understood  as  a diagram  in  which  the  dots  of 
the  odd  parts  are  centered  and  the  partition  is  divided  into  disjoint  hooks.  For  example, 
the  partition  17+15  + 15  + 9 + 9-1-9-1-9-1-5-1-5-1-3-1-3,  having  five  different  odd  parts, 
corresponds  via  the  diagram 


to  the  all-distinct  partition  19+18+16  + 13+12  + 9 + 5 + 4 + 3 with  four  gaps. 

In  general,  when  the  “Durfee  rectangle”  (shown  in  the  diagram)  has  t rows,  suppose 
there  are  a\,  . . . , at  extra  dots  at  the  right  and  bt , . . . , hi,  . . . , bt  extra  dots  below, 
where  ai  > • • • > at  > 0 and  f>i  > • • • > bt  > 0.  Then  the  distinct  parts  obtained  are 
2t  — 1 + ai  + &i,  2t  — 2 + ai  + fe2,  • • • , 2 + at~ i + bt,  1 + at  + bt,  and  (if  it’s  nonzero) 

0 + at-  Conversely,  any  partition  with  2 1 distinct  nonnegative  parts  can  uniquely  be 
written  in  this  form. 

The  relevant  odd-parts  partitions  when  n = 10  are  9 + 1,  7 + 3,  7 + 1 + 1 + 1,  5 + 5, 
5 + 3 + 1 + 1,  5 + 1 + 1 + 1 + 1 + 1,  3 + 3 + 3+1,  3 + 3+ 1 + 1 + 1 + 1,  3+1  + ---  + 1, 

1 + ■ • • + 1,  corresponding  respectively  to  the  distinct-parts  partitions  6 + 4,  5 + 4+1, 
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7 + 3,  4 + 3 + 2 + 1,  6 + 3 + 1,  8 + 2,  5 + 3 + 2,  7 + 2+1,  9 + 1,  10.  [See  Sylvester’s 
remarkable  paper  in  Amer.  J.  Math.  5 (1882),  251-330;  6 (1883),  334-336.] 

15.  Every  self-conjugate  partition  of  trace  k corresponds  to  a partition  of  n into  k 
distinct  odd  parts  (“hooks”).  Therefore  we  can  write  the  generating  function  either  as 
the  product  (l  + z)(l  + z3)(l  + z5) ...  or  as  the  sum  1 + z1 / (l-z2)  + z4/ ((l-z2)(l-z4))  + 
-2;9/ ( ( 1 — -s2 ) ( 1 — 24)(1— z6))  + • • • • [Johns  Hopkins  Univ.  Circular  3 (1883),  42-43.] 

16.  The  Durfee  square  contains  k 2 dots,  and  the  remaining  dots  correspond  to  two 
independent  partitions  with  largest  part  < k.  Thus,  if  we  use  w to  count  parts  and 
z to  count  dots,  we  find 


n 


i 

1 — wzm 


E 


~^L  (1  — z)(  1 — z2) ...  (1  — zk)(l  — wz)(  1 — wz2) ...  (1  — wzk ) ’ 


[This  impressive-looking  formula  turns  out  to  be  just  the  special  case  x = y = 0 of  the 
even  more  impressive  identity  of  exercise  19.] 

17.  (a)  ((1  + uvz)(  1 + uvz2)(  1 + uvz3)  . . . )/((l  - uz)(  1 - «z2)(  1 - «z3) . . . ). 

(b)  A joint  partition  can  be  represented  by  a generalized  Ferrers 

diagram  in  which  all  of  the  parts  are  merged  together,  with  a,  above  •••••••• 

bj  if  a,  > bj , and  with  a mark  on  the  rightmost  dot  of  each  bj.  ••••••« 

For  example,  the  joint  partition  (8,8,5;  9, 7, 5,2)  has  the  diagram  * * * * * 
illustrated  here,  with  marked  dots  shown  as  ‘ V.  Marks  appear  only  • + 
in  corners;  thus  the  transposed  diagram  corresponds  to  another  joint  partition,  which 
in  this  case  is  (7, 6,  6,4,  3;  7,6,4, 1).  [See  J.  T.  Joichi  and  D.  Stanton,  Pacific  J.  Math. 
127  (1987),  103-120;  S.  Corteel  and  J.  Lovejoy,  Trans.  Amer.  Math.  Soc.  356  (2004), 
1623-1635;  Igor  Pak,  The  Ramanujan  Journal  12  (2006),  5-75.] 

Every  joint  partition  with  t > 0 parts  corresponds  in  this  way  to  a “conjugate” 
in  which  the  largest  part  is  t.  And  the  generating  function  for  such  joint  partitions  is 
((1  + vz) . . . (1  + vzt~1))/ ((1  - z) . . . (1  - z*))  times  (vz*  + z*),  where  vz * corresponds 
to  the  case  that  61  = t,  and  z*  corresponds  to  the  case  that  s = 0 or  61  < t. 

(c)  Thus  we  obtain  a form  of  the  general  z-nomial  theorem  in  answer  1.2.6-58: 


(1  + uvz)  (1  + uvz2)  (1  + uvz3)  _ (1  + v)  (1  + vz)  (l  + vzt_1)  t t 

(1  - uz)  ( 1 UZ2 ) (1  — UZ3)  “5  (1  - 2)  (1  - 22)  ' ' (1-2*)  U “ ' 

18.  The  equations  obviously  determine  the  a’s  and  b's  when  the  c’s  and  d’s  are  given, 
so  we  want  to  show  that  the  c’s  and  d’s  are  uniquely  determined  from  the  a’s  and  b's. 
The  following  algorithm  determines  the  c’s  and  d’s  from  right  to  left: 

Al.  [Initialize.]  Set  * +-  r,  j <—  s,  k +-  0,  and  ao  ho  «-  00. 

A2.  [Branch.]  Stop  if  i + j = 0.  Otherwise  go  to  A4  if  a*  > bj  — k. 

A3.  [Absorb  a, .]  Set  Ci+j  <—  a*,  di+j  <—  0,  i*—  i — 1,  k <—  fc  + 1,  and  return  to  A2. 

A4.  [Absorb  bj.]  Set  Ci+j  4—  bj  — k,  di+j  1,  j <—  j — 1,  k t—  k + 1,  and  return 
to  A2.  | 

There’s  also  a left-to-right  method: 

Bl.  [Initialize.]  Set  i +-  1,  j 4-  1,  k +-  r + s,  and  ar+ 1 +-  bs+1  < 00. 

B2.  [Branch.]  Stop  if  k = 0.  Otherwise  set  k <—  k — 1,  then  go  to  B4  if  a,  < bj  — k. 
B3.  [Absorb  a;.]  Set  ct+j-i  t—  a^,  di+j-i  •< — 0,  i •*—  i + 1,  and  return  to  B2. 

B4.  [Absorb  6j.]  Set  Cj+j_i  «—  bj  — k,  di+j-i  1,  j j + 1,  and  return  to  B2.  | 
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In  both  cases  the  branching  is  forced  and  the  resulting  sequence  satisfies  ci  > • • • > cr+s. 
Notice  that  cr+s  = m'm(ar,bs)  and  ci  — max(ai, f>i  — r— s+1). 

We  have  thereby  proved  the  identity  of  exercise  17(c)  in  a different  way.  Extensions 
of  this  idea  lead  to  a combinatorial  proof  of  Ramanujan’s  “remarkable  formula  with 
many  parameters,” 

V'  re"  TT  1 - bzk+n  = TT  zk+1)(\-awzk){\-zk+1) 

H 1 — azk+n  H (1— a~1bw~1  zk)(l— a_1zfc+1)(l  — azk)(l  — wzk) 

n=  — oo  k= 0 k=0  v ' v /v  7 v 7 

[References:  G.  H.  Hardy,  Ramanujan  (1940),  Eq.  (12.12.2);  D.  Zeilberger,  Europ.  J. 
Combinatorics  8 (1987),  461-463;  A.  J.  Yee,  J.  Comb.  Theory  A105  (2004),  63-77.] 

19.  [Crelle  34  (1847),  285-328.]  By  exercise  17(c),  the  hinted  sum  over  k is 

i (z  — bz)  . . . (z  - bzl ) (1  — uz)  . . . (1  — uzl)  \ -pj-  1 — auzm 
(1  — z)  . . . (1  — zl)  (1  — auz) . . . (1  — auz1) ) ^ 1 — uzm  ’ 

and  the  sum  over  l is  similar  but  with  u -H-  v,  a O b,  k l.  Furthermore  the  sum  over 
both  k and  l reduces  to 

ft  (1  - uvzm+1)(l  - auzm) 

(1  - uzm)(l  - vzm) 

when  b = auz.  Now  let  u = wxy,  v — 1 /(yz),  a = 1/x,  and  b = wyz\  equate  this 
infinite  product  to  the  sum  over  l. 

20.  To  get  p(n)  we  need  to  add  or  subtract  approximately  ^/8n/3  of  the  previous 
entries,  and  most  of  those  entries  are  Q(y/n)  bits  long.  Therefore  p(n)  is  computed  in 
0(n)  steps  and  the  total  time  is  0(n2). 

(A  straightforward  use  of  (17)  would  take  0(n5^2)  steps.) 

21.  Since  = (1  + z)(  1 + z2) . . . is  equal  to  (1  — z2)(l  — z4) . . . P(z ) = 

(1  — z2  — z4  + z10  + z14  — z24  — ■ ■ ■ )P(z ),  we  have 

q(n ) = p(n)  — p(n  - 2)  - p(n  — 4)  + p(n  - 10)  + p(n  — 14)  - p(n  - 24)  - • • • . 

[There  is  also  a “pure  recurrence”  in  the  r/’s  alone,  analogous  to  the  recurrence  for  cr(n) 
in  the  next  exercise.] 

22.  From  (21)  we  have  &(n)zn  = J2m,n>  1mzrnn  = z£lnP(z)  = (z+2z2-5z5- 

7z7  + • • • )/(l  — z — z2  + z5  + z7  + • • • ).  [Bibliotheque  Impartiale  3 (1751),  10-31.] 

23.  (Solution  by  Marc  van  Leeuwen.)  Divide  (19)  by  1 — v,  to  get 

OO  OO 

n(l-uV-1)(l-uV)(l-«V+1)  = ^(-l)nu("21) 

k—1  n= 0 

00  2 n 

= xx-ir-^E*’*; 

n=0  k= 0 

now  set  u = z and  v = 1. 

[See  §57  of  Sylvester’s  paper  cited  in  answer  14.  Jacobi’s  proof  is  in  §66  of  his 
monograph  Fundamenta  Nova  Theoriae  Functionum  EUipticarum  (1829).] 
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24.  (a)  By  (18)  and  exercise  23,  [zn]  A(z)  = ^(-l)J+'c(2fc+  1)  [3j2  + j + k2  +k  = 2n], 
summed  over  all  integers  j and  all  nonnegative  integers  k.  When  n mod  5 = 4,  the 
contributions  all  have  j mod  5 = 4 and  k mod  5 = 2;  but  then  (2k  + 1)  mod  5 = 0. 

(b)  B(z)p  = B(zp)  (modulo  p)  when  p is  prime,  by  Eq.  4.6.2-(5). 

(c)  Take  B(z)  = P(z),  since  A(z)  = P(z)~4.  [Proc.  Cambridge  Philos.  Soc.  19 
(1919),  207-210.  A similar  proof  shows  that  p(n)  is  a multiple  of  7 when  n mod  7 = 5. 
Ramanujan  went  on  to  obtain  the  beautiful  formulas  p(5n  + 4)/5  = [ zn ] P(z)6/ P(z5)5; 
p(7n  + 5)/7  = [zn]  (P(z)4/P(z7)3  + 7 zP(z)8/ P(z7)7).  Atkin  and  Swinnerton-Dyer,  in 
Proc.  London  Math.  Soc.  (3)  4 (1953),  84-106,  showed  that  the  partitions  of  5n  + 4 
and  7n  + 5 can  be  divided  into  equal-size  classes  according  to  the  respective  values 
of  (largest  part  — number  of  parts)  mod  5 or  mod  7,  as  conjectured  by  F.  Dyson. 
A slightly  more  complicated  combinatorial  statistic  proves  also  that  p(n)  mod  11  = 0 
when  n mod  11  = 6;  see  F.  G.  Garvan,  Trans.  Amer.  Math.  Soc.  305  (1988),  47-77.] 

25.  [The  hint  can  be  proved  by  differentiating  both  sides  of  the  stated  identity.  It  is 
the  special  case  y = 1 — x of  a beautiful  formula  discovered  by  N.  H.  Abel  in  1826: 

u2(x)  + Li2(v)  = + Lb (^)  - Md-Z-y))  “ ln(1~s) ln(1“y)' 


See  Abel’s  (Euvres  Completes  2 (Christiania:  Grpndahl,  1881),  189-193.] 

(a)  Let  f(x)  = ln(l/(l  - e~xt)).  Then  /*  f(x)  dx  = - Li2 (e"^ )/*  and  /<">(*)  = 
(-t)netx  ^2k  (n k1)  ektx/(etx  — l)n,  so  Euler’s  summation  formula  gives  Li2(e_t)/f + 
l ln(l/(l  - e"*))  + 0(1)  = (C(2)  + t ln(l  - e “*)  - Lia(l  - e"4))/!  - ± Inf  + 0(1)  = 
C(2 )/t  + \ lnt  + 0(1),  as  t -»  0. 


to 


(b)  We  have  Em,„>i  e mnt/n  = 
^ti-;~«z  + mz)t-zr(z)dz. 


Em,„>  1 fi-^ (mnt)  zr(2)  dz/n,  which  sums 
The  pole  at  2 = 1 gives  the  double 


pole  at  z = 0 gives  -((O)lnt  + C'(0)  = 5 Inf  - |ln27r;  the  pole  at  2 = -1  gives 
— C(  — l)C(0)f  = B2LM  = —t/ 24.  Zeros  of  £(z  + l)C(-s:)  cancel  the  other  poles  of  T(2), 
so  the  result  is  In  P(e~* ) = ((2)/t  + \ ln(f/27r)  - f/24  + 0(tM)  for  arbitrarily  large  M. 


26.  Let  F(n)  — EfcLi  e k /n.  We  can  use  (25)  either  with  f(x)  = e x2^n[x  > Oj  + I^o, 

or  with  /( x)  = e~x2/n  for  all  * because  2 F(n)  + 1 = Y.kL-oc  e~k2/n-  Let’s  choose  the 
latter  alternative;  then  the  right-hand  side  of  (25),  for  9 = 0,  is  the  rapidly  convergent 


lim 

M —too 


2Trmiy  — y^/n 


dy 


e~u2/n 


du 


if  we  substitute  u = y + nmni;  and  the  integral  is  s/wn.  [This  result  is  formula  (15)  on 
page  420  of  Poisson’s  original  paper.] 

27.  First,  f_ao  e a(y+t)  +2cly  dy  = e~c  /a~2bcl  e~au  du,  by  the  substitution  u = 
V + b — ci/a.  And  J_ooe  au  du  = /0°°  e 1 dt/sfat  = T(\)/\/a  = by  the 

substitution  t = au 2 and  exercises  1.2.5-20,  1.2.6-43. 

Now  (30)  follows  from  (29)  because  we  have,  for  all  integers  to, 

3(3to  + 1)  +g(-3m)  = l)me_67r2(m+5)2/‘;  g(3m  + 2)  + g(-3m  - 1)  = 0. 

[See  M.  I.  Knopp,  Modular  Functions  in  Analytic  Number  Theory  (1970),  Chapter  3.] 
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28.  (a,  b,  c,  d)  See  Trans.  Amer.  Math.  Soc.  43  (1938),  271-295.  In  fact,  Lehmer  found 
explicit  formulas  for  Ape(n),  in  terms  of  the  Jacobi  symbol  of  exercise  4.5.4-23: 


A2e(n) 

= ( 

iy(~l)r'2sin4™, 

’ \ m ) 2e+3  ’ 

if 

(3m)2  = 1 - 

24 n (modulo  2e+3); 

A3e  (n) 

= ( 

ir+f)  ^ 3^  sin4™, 

’ V 3 / 3e+!  ’ 

if 

(8m)2  = 1 - 

24n  (modulo  3e+1); 

3 \ e/2  47rm 

^ P cos  , 

V 71®  / 71® 

if 

III 

CS 

? 

- 24n  (modulo  pe),  p > 5, 

Ape.(n) 

= < 

\P  / P 

and  24n 

1 mod  1; 

(^)pC/2[e  = 1]’ 

if 

24n  mod  p = 

: 1 and  p > 5. 

(e)  If  A:  — 2a36pj1  . . ,pet{  for  3 < pi  <■■■<  pt  and  ei  . . . et  / 0,  the  probability 
that  Ak(n)  # 0 is  2_t(l  + (-l)[ei>1|/Pi)  •••(!  + (-l)(et>1|/Pt)- 

29.  2122  . . .2m/((l  - 2l)(l  - 2122)  ...  (1  - 2i22  . . . Zm)) . 

30.  (a)  |"+1|  and  (b)  |m+%  by  (39). 

31.  First  solution  [Marshall  Hall,  Jr.,  Combinatorial  Theory  (1967),  §4.1]:  From  the 
recurrence  (39),  we  can  show  directly  that,  for  0 < r < kl,  there  is  a polynomial 
fk,r(n)  = nk~1/(k\(k- 1)!)  + 0(nk~2)  such  that  |"|  = mod  k\  (n). 

Second  solution:  Since  (1  — 2) . . . (1  — zm)  = ± (1  — e2ntp^q z)^m/q\ , where 

the  product  is  over  all  reduced  fractions  p/q  with  0 < p < q,  the  coefficient  of  zn 
in  (41)  can  be  expressed  as  a sum  of  roots  of  unity  times  polynomials  in  n,  namely  as 
e27r*pn'/?/p,ij(rl)  where  fP,q(n)  is  a polynomial  of  degree  less  than  \_m/q\.  Thus 
there  exist  constants  such  that  |”|  = ain  + a2  + (-l)na3;  |"|  = bin2  + b2n  + 63  + 
(— l)n&4  + cuni>5  + u nbe,  where  w = e2lr'^3;  etc.  The  constants  are  determined  by  the 
values  for  small  n,  and  the  flrst  two  cases  are 


n 

2 


— Tl  — — ( — 1 ) - 1~  — 0J  -I-  — CjJ 

12  72  8V  ' 9 9 


It  follows  that  1 3 1 is  the  nearest  integer  to  n2/12.  Similarly,  |"|  is  the  nearest  integer 
to  (n3  + 3n2  — 9n  [nodd])/144. 

[Exact  formulas  for  |!]|,  1 3 1 , and  |™|,  without  the  simpliflcation  of  floor  functions, 
were  first  found  by  G.  F.  Malfatti,  Memorie  di  Mat.  e Fis.  Societ a Italiana  3 (1786), 
571-663.  W.  J.  A.  Colman,  in  Fibonacci  Quarterly  21  (1983),  272-284,  showed  that 
| s|  is  the  nearest  integer  to  (n4  + 10n3  + 10n2  - 75n  -45n(-l)")/2880,  and  gave  similar 
formulas  for  |g|  and  |”|.] 

32.  Since  < p(n),  with  equality  if  and  only  if  m > n,  we  have  |^|  < p(n  - m) 

with  equality  if  and  only  if  2m  > n. 

33.  A partition  into  m parts  corresponds  to  at  most  ml  compositions;  hence  (^Z\)  < 
ml  |”|.  Consequently  p(n)  > (n  — l)!/((n  - m)lml  (m  - 1)!),  and  when  m = [i/n\ 
Stirling’s  approximation  proves  that  In p(n)  > 2 y/n  — Inn  — | — In 27r. 

34.  ai  > a2  > • • • > am  > 0 if  and  only  if  ui  — m+1  > a2  — m + 2 > • ■ • > am  > 1.  And 
partitions  into  m distinct  parts  correspond  to  ml  compositions.  Thus,  by  the  previous 
answer,  we  have 


< 

n 

< -1 

( n + m(m  — l)/2  \ 

Vm  — 1 / 

m 

- mV 

1 m — 1 / 
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[See  H.  Gupta,  Proc.  Indian  Acad.  Sci.  A16  (1942),  101—102.  A detailed  asymptotic 
formula  for  |^|  when  n = 0(m3)  appears  in  exercise  3.3.2-30.] 

35.  (a)  x = ^ In  T « -0.194. 

(b)  x = ~ In  ~ — i In  In  2 « 0.092;  in  general  we  have  x — ^(ln  T - lnln  j^j)- 

(c)  f^OBxdF(x)  = f0°° (Cu)~2 (In uje-W^du  = -A  /0°°(lnC  + lnv)e^dv  = 
(7  — In  C)/C  k,  0.256. 

(d)  Similarly,  f™oox2e~Cx  exp(-e~Cx/C)dx  = (72+C(2)-27hiC  + (hiC')2)/C2  « 
1.0656.  So  the  variance  is  C(2)/C2  = 1,  exactly(!). 

[The  probability  distribution  js  C0mm0nly  called  the  Fisher-Tippett 

distribution;  see  Proc.  Cambridge  Phil.  Soc.  24  (1928),  180-190.] 

36.  The  sum  over  jr  - (m  + r - 1)  > • • • > j2  - (m  + 1)  > jx  - m > 1 gives 


t — rm  — r(r  — l)/2 
r 
2 


p(n  - t) 


a 


a 


1 — a 1 — a2  1 — 
n~1'2  n-1/2 

a-1  — 1 a-2  — 1 


-a 


cr 


p(n) 

a(l  + 0(; 


-l/2+2e 


))  + £ 


,-i/2 


— exp(— Cra:  + 0(rn  ^2+2£))  + £, 


where  E is  an  error  term  that  accounts  for  the  cases  t > n1/2+c.  The  leading  factor 
n“1/2/(a“J  -l)  is  ±(l+0(jn~1/2)).  And  it  is  easy  to  verify  that  E = O(nlogne“cne), 
even  if  we  use  the  crude  upper  bound  |‘-'-m-r(r-i)/2  1 < fr , y,ecause 

53  fre~t/N  = °(  I™  tre~t/Ndt)  = 0(Nr+1xre~x/(l  - r/x)), 

t>xN  J 


where  N = 0(v fn),  x = 0(n£),  r = O(logn). 

37.  Such  a partition  is  counted  once  in  E0,  q times  in  Ei,  (’)  times  in  E2,  ...;  so 

it  is  counted  exactly  ( — l)2  (?)  = ( — l)r(g~1)  times  in  the  partial  sum  that  ends 

with  (—1)  Er.  This  count  is  at  most  8q 0 when  r is  odd,  at  least  8q 0 when  r is  even. 
[A  similar  argument  shows  that  the  generalized  principle  of  exercise  1.3.3-26  also  has 
this  bracketing  property.  Reference:  C.  Bonferroni,  Pubblicazioni  del  Reale  Istituto 
Superiore  di  Scienze  Economiche  e Commerciali  di  Firenze  8 (1936),  3-62.] 

38.  2i+m“1  ( 1+™_~2)Z  = 2,+m“1(  1 - zl) . . . (1  - zl+m~2)/((  1 - z) ...  (1  - zm_1)). 

39.  [xm]  (1  + 2a;) (1  + z2a:) . . . (1  + zl~xx)  = /2m(m+1)/2  , by  exercise  1.2.6-58;  this 

is  (z  — zl)(z2  — zl) . . . (zm  —z1)/ ((1  — 2)(1  — 22)...  (1  — 2m)).  The  answer  also  follows  from 
Theorem  C:  Replacing  01  . . . am  by  (a  1 — m ) . . . (am  — 1)  gives  an  equivalent  partition 
of  n — m(m  + l)/2  into  at  most  m parts,  not  exceeding  l — 1 — m. 

40.  If  a = ai  . . . am  is  a partition  with  at  most  m parts,  let  f(a)  = 00  if  an  < l, 

otherwise  /(a)  = min{j  | ai  > l + aj+i}.  Let  gk  be  the  generating  function  for 
partitions  with  f(a)  > k.  Partitions  with  /(a)  = k < 00  are  characterized  by  the 
inequalities 

ai  > a.2  >■■•>  a,k  > ax  — l > ak+i  > • • • > am+ 1 = 0. 

Thus  aia2...am  = (6fc+l+l)(6i  + l) . . . (6fc_1+l)6fc+1  . . . where  f(bx...bm)  > k; 
and  the  converse  is  also  true.  It  follows  that  gk  = gk-i  — zl+kgk- 1. 

[See  American  J.  Math.  5 (1882),  254-257.] 
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41.  See  G.  Almkvist  and  G.  E.  Andrews,  J.  Number  Theory  38  (1991),  135-144. 

42.  A.  Vershik  [Functional  Anal.  Applic.  30  (1996),  90-105,  Theorem  4.7]  has  stated 

-c$ 


the  formula 


1 — e 


— ck/\/n  , 

:e  ' v + 


1 


-CQfc/v/n  , 


1, 


1 _ g-c(0+v>)  1 _ e~c(6+v) 

where  the  constant  c must  be  chosen  as  a function  of  9 and  p so  that  the  area  of  the 
shape  is  n.  This  constant  c is  negative  if  dp  < 2,  positive  if  dp  > 2;  the  shape  reduces 
to  a straight  line 

k flfc  ^ 1 

dsfn  p\fn 

when  dp  = 2.  If  p = oo  we  have  c = \/Li2 (t)  where  t satisfies  9 = (In  ^zy)/VF bW- 

43.  p(n  — k(k  — 1 )/2) . (Change  aia2  . . . a*,  to  (ai  — k + l)(a2  — k + 2) . . . a*,  to  get  an 
equivalent  partition  of  n — k(k  — l)/2.) 

44.  Assume  that  n > 0.  The  number  with  smallest  parts  unequal  (or  with  only  one 
part)  is  p(n  + 1)  — p(n),  the  number  of  partitions  of  n + 1 that  don’t  end  in  1,  because 
we  get  the  former  from  the  latter  by  changing  the  smallest  part.  Therefore  the  answer 
is  2 p(n)  — p(n  + 1).  [See  R.  J.  Boscovich,  Giornale  de’  Letterati  (Rome,  1748),  15.  The 
number  of  partitions  whose  smallest  three  parts  are  equal  is  3p(n)  — p(n+l)  — 2p(n  + 2)  + 
p(n  + 3);  similar  formulas  can  be  derived  for  other  constraints  on  the  smallest  parts.] 

45.  By  Eq.  (37)  we  have  p(n  — j)/p(n)  = 1 — Cjn~ l^2  + ( C2j 2 + 2j)/(2n)  — (8 C3j3  + 
60  Cj2  + Cj  + 12C~xj ) / (48n3/2 ) + 0(j4n~2). 

46.  If  n > 1,  T2(n)  = p(n  — 1)  — p(n  — 2)  < p(n)  — p(n  — 1)  = T2'(n),  because 
p(n)  —p(n  — 1)  is  the  number  of  partitions  of  n that  don’t  end  in  1;  every  such  partition 
of  n — 1 yields  one  for  n if  we  increase  the  largest  part.  But  the  difference  is  rather 
small:  (T2'(n)  - T2(n))/p(n)  = C2/n  + 0(n~3/2). 

47.  The  identity  in  the  hint  follows  by  differentiating  (21);  see  exercise  22.  The 
probability  of  obtaining  the  part-counts  C\  . . ,c„  when  ci  + 2c2  + ■ • • + nc„  = n is 

Pr(ci  ■ ■ .Cn)  = V'  J!  kVlyn  Pr(Cl  ' • ■Cfc-l(Cfc-j)Cfc  + i . . .c„) 

k= 1 j= 1 


n Cfc 

J ^ np(n 
k= 1 j= 1 


np(n ) 
k 


1 

p(n)  ’ 


by  induction  on  n.  [ Combinatorial  Algorithms  (Academic  Press,  1975),  Chapter  10.] 

48.  The  probability  that  j has  a particular  fixed  value  in  step  N5  is  b/(ir2j2)  + 
0(n-1/2),  and  the  average  value  of  jk  is  order  y7 n . The  average  time  spent  in  step  N4 
is  0(n),  so  the  average  running  time  is  of  order  n3^2.  (A  more  precise  analysis  would 
be  desirable.) 

49.  (a)  We  have  F(z)  = ^(z)>  where  Fk(z)  is  the  generating  function  for  all 

partitions  whose  smallest  part  is  > k,  namely  1/ ( ( 1 — zk)(  1 — zk+1 ) . . . ) — 1. 

(b)  Let  fk(n)  = [zn]  Fk(z)/p(n).  Then  /i(n)  = 1;  /2(n)  = 1 - p(n-l)/p(n)  = 
Cn~1/2  + 0(n_1);  /a(n)  = (p{n)  - p(n  - 1)  - p(n  - 2)  + p(n  - 3 ))/p(n)  = 2C2n~ 1 + 
0(n~ 3^2);  and  fr{n)  — 6 C3n~3^2  + 0(n-2).  (See  exercise  45.)  It  turns  out  that 
fk+i{n)  = k\  Ckn~k/2  + 0(n~(k+1^2)-,  in  particular,  f$( n ) = 0(n~ 2).  Hence  /s(n)  + 

1-  fn{n)  = 0(n_1),  because  fk+i{n)  < fk{n). 

Adding  everything  up  yields  [ zn ] F(z)  = p(n)(l  + C /\/n  + (^(rW1)). 
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50.  (a)  cm(m  + k)  — cm_i(m  — 1 + k)  + cm(k)  = m — 1 — k + c(k)  + 1 by  induction 
when  0 < k < m. 

(b)  Because  \m+k\  = p(k)  for  0 < k < m. 

(c)  When  n = 2m,  Algorithm  H essentially  generates  the  partitions  of  m,  and 
we  know  that  j — 1 is  the  second-smallest  part  in  the  conjugate  of  the  partition  just 
generated  — except  when  j — 1 = m,  just  after  the  partition  1 ...  1 whose  conjugate  has 
only  one  part. 

(d)  If  all  parts  of  a exceed  k,  let  akq+1j  correspond  to  a (k+ 1). 

(e)  Continuing  the  previous  exercise  and  its  answer,  the  generating  function  Gk{z) 
for  all  partitions  whose  second-smallest  part  is  > k is  Fk+i{z)/{\  — z ) by  (d).  Conse- 
quently C(z)  = ( F(z ) - Fi(z))/(1  - z)  + z/(l  - z)2. 

(f)  We  can  show  as  in  the  previous  exercise  that  [zn\  Gk(n)/p(n)  = 0(n~k /2)  for 
k < 5;  hence  c(m)/p(m)  = 1 + 0(m~1/2).  The  ratios  (c(m)  + l)/p{m),  which  are 
readily  computed  for  small  m,  reach  a maximum  of  2.6  at  m = 7 and  decrease  steadily 
thereafter.  So  a rigorous  attention  to  asymptotic  error  bounds  will  complete  the  proof. 

Note:  B.  Fristedt  [Trans.  Amer.  Math.  Soc.  337  (1993),  703-735]  has  proved, 
among  other  things,  that  the  number  of  fc’s  in  a random  partition  of  n is  greater  than 
Cxy/n  with  asymptotic  probability  e~x . 

52.  In  lexicographic  order,  |64j^13|  partitions  of  64  have  ai  < 13;  |50+10|  of  them  have 
a i = 14  and  a2  < 10;  etc.  Therefore,  by  the  hint,  the  partition  14  11  9 6 4 3 2 l15  is 
preceded  by  exactly  p(64)  — 1000000  partitions  in  lexicographic  order,  making  it  the 
millionth  in  reverse  lexicographic  order. 

53.  As  in  the  previous  answer,  |®°|  partitions  of  100  have  ax  = 32  and  a2  < 12,  etc.; 
so  the  lexicographically  millionth  partition  in  which  ai  = 32  is  32  13  12  8 7 6 5 5 l12. 
Algorithm  H produces  its  conjugate,  namely  20  8888654333321 19 . 

54.  (a)  Obviously  true.  This  question  was  just  a warmup. 

(b)  True,  but  not  so  obvious.  If  aT  = a\a'2  ...  we  have 

0.1  + ■ ■ • + ok  + a1  + • • • + a,\  < n + kl  for  1 < k < a\ 

by  considering  the  Ferrers  diagram,  with  equality  when  k = a\ . Thus  if  a > f3  and 
«!  + ■••  + a)  > b'i  + • ■ • + b'i  for  some  /,  with  l minimum,  we  have  n + kl  = b±  + ■ ■ ■ + bk  + 
bi  + • • ■ + b'i  < a,i  + • • • + ak  + a'i  + ■ • • + a\  < n + kl  when  k = b\,  a contradiction. 

(c)  The  recurrence  ck  = min(ai  -| b ak,  bi  -\ \-bk)  - (ci  H \-ck-i)  clearly 

defines  a greatest  lower  bound,  if  cic2  . . . is  a partition.  And  it  is;  for  if  ci  -| 1-  ck  = 

ui-l b ak  we  have  0<min(afc+1, 6fc+i)  <min(afc+1, bk+i  + &i -1 \-bk-ai ak)  = 

ck+ 1 £ ok+ 1 < ak  = ck  + (ci  + ■ • • + Cfc_i)  — (ai  + • • • + ak- 1)  < ck. 

(d)  a V (3  = ( aT  A /3T)T.  (Double  conjugation  is  needed  because  a max-oriented 
recurrence  analogous  to  the  one  in  part  (c)  can  fail.) 

(e)  a A P has  ma x(l,m)  parts  and  a V /?  has  min (l,m)  parts.  (Consider  the  first 
components  of  their  conjugates.) 

(f)  True  for  a A /?,  by  the  derivation  in  part  (c).  False  for  a V /3;  for  example, 
6321  V 543  = 633  in  Fig.  52. 

Reference:  T.  Brylawski,  Discrete  Mathematics  6 (1973),  201-219. 

55.  (a)  If  a f-  /3  and  ay  7^/3,  where  7 = C!C2  . . . , we  have  ai  + ■ • ■ + ak  = 

ci  + ■■■  + ck  = bi  + •■•  + bk  for  all  k except  k = l and  k = l + 1;  thus  a covers  (3. 

Therefore  f3T  covers  aT. 

Conversely,  if  a X f3  and  a / j3  we  can  find  7^/3  such  that  a b 7 or  7T  y aT , 

as  follows:  Find  the  smallest  k with  ak  > bk,  the  smallest  l with  ak  > oj+i,  and  the 
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smallest  m with  ak  - 1 > am+ 1-  (Note  that  bk  > 0.)  If  am  > am+i  + 1,  define  7 = 
cic2  ...  by  Cfc  = ak  - [k  = m]  + [k  = m+ 1],  Otherwise  let  ck  = ak-[k  = l]  + [k  — m+1], 

(b)  Consider  a and  ft  to  be  strings  of  n Os  and  n Is,  as  in  (15).  Then  a t>-  ft  if 
and  only  if  a — > ft,  and  ft'1  [>-  aT  if  and  only  if  a =>  ft,  where  denotes  replacing  a 
substring  of  the  form  Oil'' 10  by  101 "Ol  and  '=>’  denotes  replacing  a substring  of  the 
form  OlOTO  by  lOO’Ol,  for  some  q > 0. 

(c)  A partition  covers  at  most  [ai  > a2]  + • ■ • + [am-i  > am]  + [am  > 2]  others. 
The  partition  a = (n2+ni  — l)(n2  — 2)(ra2  — 3) ...  21  maximizes  this  quantity  in  the  case 
am  = 1;  cases  with  am  > 2 give  no  improvement.  (The  conjugate  partition,  namely 
(n-2  — l)(n2  — 2) . . . 21"1+1,  is  just  as  good.  Therefore  both  a and  aT  are  also  covered  by 
the  maximum  number  of  others.) 

(d)  Equivalently,  consecutive  parts  of  //  differ  by  at  most  1,  and  the  smallest  part 
is  1;  the  rim  representation  has  no  consecutive  Is. 

(e)  Use  rim  representations  and  replace  t-  by  the  relation  If  a -a  a,  and 
a -A  a i we  can  easily  show  the  existence  of  a string  ft  such  that  ax  -A  ft  and  ai  ->  ft; 
for  example, 

„ 10T!011TT0  , 

Z \ 

ounoirio  lomoiroi. 

\ 

* onmoiroi  x 

Let  ft  = ft2  >-  ■ ■ ■ >-  ftm  where  ftm  is  minimal.  Then,  by  induction  on  ma x(k,k'),  we 
have  k = m and  ak  = ftm\  also  k’  = m and  a'k,  = ftm. 

(f)  Set  ft  •«-  aT;  then  repeatedly  set  ft  <-  ft'  until  ft  is  minimal,  using  any 
convenient  partition  ft'  such  that  ft  ^ ft' . The  desired  partition  is  ftT . 

Proof:  Let  fi(a)  be  the  common  value  ak  = a'k,  in  part  (e);  we  must  prove  that 
ay  ft  implies  n(a)  ft  fi(ft).  There  is  a sequence  a = a0,  . . . , ak  = ft  where  aj  — ► aj+i 
or  aj  =>  aj+i  for  0 < j < k.  If  ao  — > ai  we  have  /r(a)  = /r(ai);  thus  it  suffices  to  prove 
that  a =$•  ft  and  a — > a'  implies  a'  ft  fi(ft).  But  we  have,  for  example, 


oionoirio 


\ 


10090111r10 

oiomoroi  oio,_1ioonroi 


\ 


10091011r01 


because  we  may  assume  that  q > 0;  and  the  other  cases  are  similar. 

(g)  The  parts  of  A„  are  ak  = n2  + [k  < m]  — k for  1 < k < n2;  the  parts  of  are 
bk  = n2  — k + [ri2  — k < m]  for  1 < k < n2.  The  algorithm  of  (f)  reaches  from  n1 
after  (n23+1)  — ("22ni)  steps,  because  each  step  increases  kbk  = ^ (afe2+1)  by  1. 

(h)  The  path  n,  (n-l)l,  (n- 2)2,  (n-2)ll,  (n-3)21,  . . . , 321n_5,  SI”-3,  221"“4, 
21n_2,  1",  of  length  2n  — 4 when  n > 3,  is  shortest. 

It  can  be  shown  that  the  longest  path  has  m = 2(”2)  +n,  (rt2-l)  steps.  One  such 
path  has  the  form  a0,  . . . , ak,  . . . , at,  am  where  a0  = n1;  ak  = An;  a,  = Aj; 
Qj  for  0 < j < l;  and  aj+1  >-  aj  for  k < j < m. 

Reference:  C.  Greene  and  D.  J.  Kleitman,  Europ.  J.  Combinatorics  7 (1986),  1-10. 

56.  Suppose  A = u\  ...um  and  //  = vi...vm.  The  following  (unoptimized)  algo- 
rithm applies  the  theory  of  exercise  54  to  generate  the  partitions  in  colex  order, 
maintaining  a — 01/12  . . . ft  fi  as  well  as  oT  — 61 62  ...  6/  ft  A^  . To  find  the 
successor  of  a,  we  first  find  the  largest  j such  that  h3  can  be  increased.  Then  we  have 
ft  — bi  ■ ■ . bj- 1 (fej  + 1)  1 . . . 1 ft  AT,  hence  the  desired  successor  is  ftT  A p.  The  algorithm 
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maintains  auxiliary  tables  ry  = bj  4 bbp  Sj  = v H h Vj,  and  tj  = Wj  + Wj+ H , 

where  AT  = W1W2  .... 

Ml.  [Initialize.]  Set  q <—  0,  k <—  u\.  For  j = 1,  . . . , m,  while  Uj+ 1 < k set 

tk  <7  q + j and  k k — 1.  Then  set  q <—  0 again,  and  for  j = 1,  . . . , m 

set  <Xj  Vj,  Sj  q + Oj . Then  set  g <—  0 yet  again,  and  k +-  l t—  oi. 

For  j = 1,  . . . , m,  while  am  < k set  bk  j,  rk  <-  q <r-  q+j,  and  k <-  k - 1. 

Finally,  set  fi  <—  0,  ho  <—  0,  6_i  < 1. 

M2.  [Visit.]  Visit  the  partition  oi  . . . am  and/or  its  conjugate  b\  . . . bi. 

M3.  [Find  j.]  Let  j be  the  largest  integer  < l such  that  rj+ 1 > tj+1  and  bj  / bj-i. 
Terminate  the  algorithm  if  j = 0. 

M4.  [Increase  bj.]  Set  x <-  rJ  + 1 - 1,  k <-  bj,  bj  <-  k + 1,  and  a*,+i  <-  j.  (The 
previous  value  of  ak+i  was  j - 1.  Now  we’re  going  to  update  oi  . . . ak  using 
essentially  the  method  of  exercise  54(c)  to  distribute  x dots  into  columns 
3 + 1>  3 + 2,  • • . .) 

M5.  [Majorize.]  Set  z 4—  0 and  then  do  the  following  for  i = 1,  . . . , k:  Set 
^ ^ j ■ y * min(x,Si),  a,  i y z,  z i y,  if  i — 1,  set  l { — p < — a i and 

q ■«-  0;  if  i > 1,  while  p > a,  set  bv  i - 1,  rp  -s-  q <-  q + i - 1,  p <-  p - 1. 

Finally,  while  p>  j set  bp  <-  k,  rp  <-  q <-  q + k,  p <-  p- 1.  Return  to  M2.  | 

57.  If  A = \i  there  obviously  is  only  one  such  matrix,  essentially  the  Ferrers  diagram 
of  A.  And  the  condition  A < p,T  is  necessary,  for  if  p.T  = b1fe2  • • . we  have  6i  + • • • +bk  = 

min(ci,  k)  + min(c2,  k)  H , and  this  quantity  must  not  be  less  than  the  number  of  Is 

in  the  first  k rows.  Finally,  if  there  is  a matrix  for  A and  p,  and  if  A covers  a,  we  can 
readily  construct  a matrix  for  a and  p by  moving  a 1 from  any  specified  row  to  another 
that  has  fewer  Is. 

Notes:  This  result  is  often  called  the  Gale-Ryser  theorem,  because  of  well-known 
papers  by  D.  Gale  [Pacific  J.  Math.  7 (1957),  1073-1082]  and  H.  J.  Ryser  [Canadian 
J.  Math.  9 (1957),  371-377].  But  the  number  of  0-1  matrices  with  row  sums  A and 
column  sums  p is  the  coefficient  of  the  monomial  symmetric  function  ...  in 

the  product  of  elementary  symmetric  functions  eri  er2  ... , where 

er  = [z1]  (1  + xxz)(\  + x2z)(l  + x3z) 


In  this  context  the  result  has  been  known  at  least  since  the  1930s;  see  D.  E.  Littlewood’s 
formula  for  rim.n^ol1  + XmVn)  in  Proc.  London  Math.  Soc.  (2)  40  (1936),  49-70. 
[Cayley  had  shown  much  earlier,  in  Philosophical  Trans.  147  (1857),  489-499,  that  the 
lexicographic  condition  A < pT  is  necessary.]  See  also  the  algorithm  in  exercise  7-108. 

58.  [R.  F.  Muirhead,  Proc.  Edinburgh  Math.  Soc.  21  (1903),  144-157.]  The  condition 
a > P is  necessary,  because  we  can  set  x\  = • • • = xk  — x and  xk+\  = • • • = xn  = 1 
and  let  x — t oo.  It  is  sufficient  because  we  need  only  prove  it  when  a covers  /3.  Then 
if,  say,  parts  (01,02)  become  (oi  — l,a2  + 1),  the  left-hand  side  is  the  right-hand  side 
plus  the  nonnegative  quantity 


1 

2?n! 


Ea2  a2  Om/Tni-«2-l 

■i-pi  J-P2  • • • xpm  \xpi 


ai  — a2  — 1 
U'P2 


)(%Pl 


Xp2  )* 


[Historical  notes:  Muirhead’s  paper  is  the  earliest  known  appearance  of  the  concept 
now  known  as  majorization;  shortly  afterward,  an  equivalent  definition  was  given 
by  M.  O.  Lorenz,  Quarterly  Publ.  Amer.  Stat.  Assoc.  9 (1905),  209-219,  who  was 
interested  in  measuring  nonuniform  distribution  of  wealth.  Yet  another  equivalent 
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concept  was  formulated  by  I.  Schur  in  Sitzungsberichte  Berliner  Math.  Gesellschaft 
22  (1923),  9-20.  “Majorization”  was  named  by  Hardy,  Littlewood,  and  Polya,  who 
established  its  most  basic  properties  in  Messenger  of  Math.  58  (1929),  145-152;  see 
exercise  2.3.4.5-17.  An  excellent  book,  Inequalities  by  A.  W.  Marshall  and  I.  Olkin 
(Academic  Press,  1979),  is  entirely  devoted  to  the  subject.] 

59.  The  unique  paths  for  n = 0,  1,  2,  3,  4,  and  6 must  have  the  stated  symmetry. 
There  is  one  such  path  for  n = 5,  namely  11111,  2111,  221,  311,  32,  41,  5.  And  there 
are  four  for  n = 7: 

1111111,  211111,  22111,  2221,  322,  3211,  31111,  4111,  511,  421,  331,  43,  52,  61,  7; 
1111111,  211111,  22111,  2221,  322,  421,  511,  4111,  31111,  3211,  331,  43,  52,  61,  7; 
1111111,  211111,  31111,  221 11,  2221,  322,  3211,  4111,  421,  331,  43,  52,  511,  61,  7; 
1111111,  211111,  31111,  22111,  2221,  322,  421,  4111,  3211,  331,  43,  52,  511, 61,  7. 

There  are  no  others,  because  at  least  two  self-conjugate  partitions  exist  for  all  n > 8 
(see  exercise  15). 

60.  For  L(6, 6),  use  (59);  otherwise  use  L'(4,6)  and  L'( 3,5)  everywhere. 

In  M{ 4, 18),  insert  444222,  4442211  between  443322  and  4432221. 

In  M(5, 11),  insert  52211,  5222  between  62111  and  6221. 

In  M( 5,20),  insert  5542211,  554222  between  5552111  and  555221. 

In  M( 6, 13),  insert  72211,  7222  between  62221  and  6322. 

In  L(4, 14),  insert  44222,  442211  between  43322  and  432221. 

In  L(5,15),  insert  542211,  54222  between  552111  and  55221. 

In  L(7, 12),  insert  62211,  6222  between  72111  and  7221. 

62.  The  statement  holds  for  n = 7,  8,  and  9,  except  in  two  cases:  n = 8,  m = 3, 
a = 3221;  n = 9,  m = 4,  a = 432. 

64.  If  n = 2 kq  where  q is  odd,  let  tvn  denote  the  partition  ( 2k)q , namely  q parts  equal 
to  2k . The  recursive  rule 

B{n)  = B(n  — l)fll,  2 x B(n/2) 

for  n > 0,  where  2xB(n/2)  denotes  doubling  all  parts  of  B(n/2)  (or  the  empty  sequence 
if  n is  odd),  defines  a pleasant  Gray  path  that  begins  with  u)„_il  and  ends  with  t vn.  if 
we  let  J5(0)  be  the  unique  partition  of  0.  Thus, 

B{  1)  = 1;  B( 2)  = 11,2;  5(3)  = 21,  111;  5(4)  = 1111, 211, 22, 4. 

Among  the  remarkable  properties  satisfied  by  this  sequence  is  the  fact  that 

B{n)  = (2  x 5(0))ln,  (2  x B(l))ln~2,  (2  x 5(2))ln-4,  ...,  (2  x 5(n/2))l°, 

when  n is  even;  for  example, 

5(8)  = 11111111,2111111,221111,41111,4211,22211,2222,422,44,8. 

The  following  algorithm  generates  B(n)  looplessly  when  n > 2: 

Kl.  [Initialize.]  Set  Co  <—  po  4—  0,  pi  4—1.  If  n is  even,  set  Ci  4—  n,  t 4—  1;  other- 
wise let  n — 1 = 2 kq  where  q is  odd  and  set  c\  4—  1,  c2  4—  </,  p2  4—  2k , t 4—  2. 

K2.  [Even  visit.]  Visit  the  partition  pct‘  . . .p^1 . (Now  ct  + 1-  ci  is  even.) 

K3.  [Change  the  largest  part.]  If  ct  = 1,  split  the  largest  part:  If  pt  ^ 2p(_i,  set 
ct  «-  2,  pt  <—  pt/ 2,  otherwise  set  ct_i  ct- 1 + 2,  t <-  t — 1.  But  if  ct  > 1, 
merge  two  of  the  largest  parts:  If  ct  = 2,  set  ct  <—  1,  pt  «—  2 pt,  otherwise  set 
Ct  Ct  — 2,  ct+i  1,  pt+i  t—  2 pt,  t t + 1. 
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K4.  [Odd  visit.]  Visit  the  partition  pct‘  . . .pj1.  (Now  ct  + h c\  is  odd.) 

K5.  [Change  the  next-largest  part.]  Now  we  wish  to  apply  the  following  transfor- 
mation: “Remove  ct  — [t  is  even]  of  the  largest  parts  temporarily,  then  apply 
step  K3,  then  restore  the  removed  parts.”  More  precisely,  there  are  nine 
cases:  (la)  If  ct  is  odd  and  t = 1,  terminate,  (lbl)  If  ct  is  odd,  c(_x  = 1,  and 
Pt-i  = 2pt_2,  set  ct—2  4—  cf_2  + 2,  ct- 1 4—  ct,  pt-i  4—  pt,  t 4—  t — 1.  (Ib2)  If 
ct  is  odd,  ct- 1 = 1,  and  pt-i  / 2 pt-2,  set  ct- i 4-  2,  pt- 1 4-  pt-i/2.  (lcl)  If 
ct  is  odd,  ct_i  = 2,  and  pt  = 2pt_i,  set  ct-\  4-  ct  + 1,  pt- i 4-  pt,  t 4-  t - 1. 
(Ic2)  If  c(  is  odd,  c(_i  = 2,  and  pt  / 2pt_i,  set  ct_i  4-  1,  pt-i  4-  2pt_i. 
(ldl)  If  c(  is  odd,  ct_i  > 2,  and  pt  = 2pf_i,  set  c(_i  4-  C(_i  - 2,  ct  4-  ct  + 1. 

(Id2)  If  ct  is  odd,  ct-\  > 2,  and  pt  yf  2pt_i,  set  ct+i  4-  ct,  pt+i  4-  pt,  c(  4-  1, 

Pt  4—  2pt_i,  Ct  — i 4—  ct_i  — 2,  t 4—  t + 1.  (2a)  If  ct  is  even  and  pt  = 2pt_i, 

set  c;  4—  ct  — 1,  ct_i  4—  Cf_i  + 2.  (2b)  If  ct  is  even  and  pt  ^ 2pt~\,  set 

Ct+i  4—  ct  — 1,  pt+i  4—  pt,  ct  4—2 , pt  4—  pt/2,  t 4—  t + 1.  Return  to  K2.  | 

[The  transformations  in  K3  and  K5  undo  themselves  when  performed  twice  in  a row. 
This  construction  is  due  to  T.  Colthurst  and  M.  Kleber,  “A  Gray  path  on  binary 
partitions,’  http://arxiv.org/abs/0907.3873.  Euler  considered  the  number  of  such 
partitions  in  §50  of  his  paper  in  1750.] 

65.  If  pj1  . . ,pxr  is  the  prime  factorization  of  m,  the  number  of  such  factorizations  is 
p(d) . . . p(er),  and  we  can  let  n = max(ei, . . . ,er).  Indeed,  for  each  r-tuple  (aq, . . . ,xr) 
with  0 < Xk  < p{ek ) we  can  let  rrij  = p“lj’  . . ,pxrj , where  aki  . . .akn  is  the  ( xk  + l)st 
partition  of  ek . Thus  we  can  use  a reflected  Gray  code  for  r-tuples  together  with  a 
Gray  code  for  partitions. 

66.  Let  ai  . . . am  be  an  m-tuple  that  satisfies  the  specified  inequalities.  We  can  sort  it 
into  nonincreasing  order  axi  > • • • > aXm , where  the  permutation  x\  . . .a :m  is  uniquely 
determined  if  we  require  the  sorting  to  be  stable-,  see  Eq.  5-(2). 

If  j -<  k,  we  have  Oj  > a^,  hence  j appears  to  the  left  of  k in  the  permutation 
xi  . . . xm.  Therefore  x\  . . . xm  is  one  of  the  permutations  output  by  Algorithm  7.2.1.2V. 
Moreover,  j will  be  left  of  k also  when  a-j  = ak  and  j < k,  by  stability.  Hence  aXi  is 
strictly  greater  than  aXi+1  when  xi  > xt+i  is  a “descent.” 

To  generate  all  the  relevant  partitions  of  n,  take  each  topological  permutation 
X\  . . . Xm  and  generate  the  partitions  y\  ■ ■ . ym  of  n — t where  t is  the  index  of  x\  . . . xm 
(see  Section  5.1.1).  For  1 < j < m set  aXj  4—  where  tj  is  the  number  of  descents 

to  the  right  of  Xj  in  x\  . . . xm- 

For  example,  if  aq  . . . xm  = 314592687  we  want  to  generate  all  cases  with  <13  > 
ai  > a4  > as  > a9  > a2  > ae  > a8  > 07.  In  this  case  t = 1 + 5 + 8 = 14;  so  we  set 
“1  4—  2/2  + 2,  02  4—  j/6  + 1,  a3  4—  yi  +3,  a4  4—  y3  + 2,  a5  4-  2/4  + 2,  06  4—  y-j  + 1,  a2  4—  yg, 
a 8 4—  ys  + 1,  and  09  4—  y3  + 2.  The  generalized  generating  function  z\ 1 • • • z99  *n  the 
sense  of  exercise  29  is 

2 322  2 

Z1  Z2Z3  z4  z~>  Zg  Zg 

(1  - 2:3 )(1  - Z3Zl)(l  - 232:12:4) (1  - 23212425)  ...  (1  - 23 2l24 25 29 22 26 28 27)  ' 

When  -<  is  any  given  partial  ordering,  the  ordinary  generating  function  for  the 
number  of  all  such  partitions  of  n is  therefore  2lnd“/((l  — 2)(1  — 2 2) ...  (1  — 2m)), 
where  the  sum  is  over  all  outputs  a of  Algorithm  7.2.1.2V. 

[See  R.  P.  Stanley,  Memoirs  Amer.  Math.  Soc.  119  (1972),  for  significant  extensions 
and  applications  of  these  ideas.  See  also  L.  Carlitz,  Studies  in  Foundations  and 
Combinatorics  (New  York:  Academic  Press,  1978),  101-129,  for  information  about 
up-down  partitions.] 
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67.  If  n + 1 = qi  . . . qr,  where  the  factors  qi,  . . . , qr  are  all  > 2,  we  get  a perfect 
partition  {(<?i  — 1)  • 1,  (<72  — 1)  • <?i , (93  — 1)  • 9192,  • • . , (gr  — 1)  • 91  • • ■ 9r-i}  that  corresponds 
in  an  obvious  way  to  mixed  radix  notation.  (The  order  of  the  factors  <7,  is  significant.) 

Conversely,  all  perfect  partitions  arise  in  this  way.  Suppose  the  multiset  M = 
{ki  ■ pi, . . . , km  ■ pm}  is  a perfect  partition,  where  pi  < • • • < pm-  then  we  must  have 
Pj  = (&1  + 1)  • • • (kj- l + l)  for  1 < j < rn,  because  pj  is  the  smallest  sum  of  a submultiset 
of  M that  is  not  a submultiset  of  (fci  • pi, . . . , kj-\  • 

The  perfect  partitions  of  n with  fewest  elements  occur  if  and  only  if  the  <7,  are  all 
prime,  because  pq  - 1 > (p— 1)  + (9-I)  whenever  p > 1 and  q > 1.  Thus,  for  example, 
the  minimal  perfect  partitions  of  11  correspond  to  the  ordered  factorizations  2-2-3, 
2-3-2,  and  3-2-2.  Reference:  Quarterly  Journal  of  Mathematics  21  (1886),  367-373. 

68.  (a)  If  cii  + 1 < aj  — 1 for  some  i and  j we  can  change  {ai,a,j}  to  {a;  + l, aj-  1}, 
thereby  increasing  the  product  by  a,  — a;  — 1 > 0.  Thus  the  optimum  occurs  only  in 
the  optimally  balanced  partition  of  exercise  3.  [L.  Oettinger  and  J.  Derbes,  Nouv.  Ann. 
Math.  18  (1859),  442;  19  (1860),  117-118.] 

(b)  Assume  that  n > 1.  Then  no  part  is  1;  and  if  aj  > 4 we  can  change  it 
to  2 + (aj- 2)  without  decreasing  the  product.  Thus  we  can  assume  that  all  parts 
are  2 or  3.  We  get  an  improvement  by  changing  2 + 2 + 2 to  3 + 3,  hence  there  are 
at  most  two  2s.  The  optimum  therefore  is  3n/3  when  n mod  3 is  0;  4 • 3(n-4)/3  = 
3(n~4)/3  . 2 • 2 = (4/34/3)3n/3  when  n mod  3 is  1;  3 (n~2)/3  • 2 = (2/32/3)3n/3  when 
n mod  3 is  2.  [O.  Meifiner,  Mathematisch-naturwissenschaftliche  Blatter  4 (1907),  85.] 

69.  All  n > 2 have  the  solution  (n,  2, 1, . . . , 1).  We  can  “sieve  out”  the  other  cases  < N 
by  starting  with  S2  • • • sjv  1 ...  1 and  then  setting  sak-b  <—  0 whenever  ak  — b < N, 
where  a = xi . . . xt  — 1,  b = x\  + • • • + xt  — t — 1,  k > x\  > • • • > xt,  and  a > 1,  because 
k + x\  + • • • + Xt  + (ak  — b — t — 1)  = kx  1 . . . xt-  The  sequence  (x\ , . . . , xt)  needs  to  be 

considered  only  when  (xi  . . ,xt-  l)xi  — (xi  H \-xt)  < N - 1;  we  can  also  continue  to 

decrease  N so  that  sN  = 1.  In  this  way  only  (32766, 1486539,254887, 1511,937,478,4) 
sequences  (x\, . . . ,xt)  need  to  be  tried  when  N is  initially  230,  and  the  only  survivors 
turn  out  to  be  2,  3,  4,  6,  24,  114,  174,  and  444.  [See  E.  Trost,  Elemente  der  Math.  11 
(1956),  135;  M.  Misiurewicz,  Elemente  der  Math.  21  (1966),  90.] 

Notes:  No  new  survivors  are  likely  as  N -*  00,  but  a new  idea  will  be  needed  to  rule 
them  out.  The  simplest  sequences  (xi,...,xt)  = (3)  and  (2,2)  already  exclude  all  n > 5 
with  n mod  6/0;  this  fact  can  be  used  to  speed  up  the  computation  by  a factor  of  6. 
The  sequences  (6)  and  (3,  2)  exclude  40%  of  the  remainder  (namely  all  n of  the  forms 
5 A:  — 4 and  5 k - 2);  the  sequences  (8),  (4,  2),  and  (2,  2,  2)  exclude  3/7  of  the  remainder; 
the  sequences  with  t = 1 imply  that  n — 1 must  be  prime;  the  sequences  in  which 
xi  . . . xt  = 2r  exclude  about  p(r)  residues  of  n mod  (2r  — 1);  sequences  in  which  x\  . . ,xt 
is  the  product  of  r distinct  primes  will  exclude  about  -cor  residues  of  n mod  (xi  . . . xt  — 1). 

70.  Each  step  takes  one  partition  of  n into  another,  so  we  must  eventually  reach  a 
repeating  cycle.  Many  partitions  simply  perform  a cyclic  shift  on  each  northeast-to- 
southwest  diagonal  of  the  Ferrers  diagram,  changing  it 
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in  other  words,  they  apply  the  permutation  p = (1)(2 3)(4 5 6)(78 9 10) ...  to  the  cells. 

Exceptions  occur  only  when  p introduces  an  empty  cell  above  a dot;  for  example,  xw 
might  be  empty  when  xXi  isn’t.  But  we  can  get  the  correct  new  diagram  by  moving 
the  top  row  down,  sorting  it  into  its  proper  place  after  applying  p in  such  cases.  Such  a 
move  always  reduces  the  number  of  occupied  diagonals,  so  it  cannot  be  part  of  a cycle. 

Thus  every  cycle  consists  entirely  of  permutations  by  p. 

If  any  element  of  a diagonal  is  empty  in  a cyclic  partition,  all  elements  of  the  next 
diagonal  must  be  empty.  For  if,  say,  x5  is  empty,  repeated  application  of  p will  make 
x5  adjacent  to  each  of  the  cells  X7,  x8,  xg,  xi0  of  the  next  diagonal.  Therefore  if  n = 

(n22 ) + ("1)  with  n2  > m > 0 the  cyclic  states  are  precisely  those  with  n2  — 1 completely 
filled  diagonals  and  m dots  in  the  next.  [This  result  is  due  to  J.  Brandt,  Proc.  Amer. 

Math.  Soc.  85  (1982),  483-486.  The  problem  reportedly  stems  from  Russia  via  Bulgaria 
and  Sweden.  See  also  Martin  Gardner,  The  Last  Recreations  (1997),  Chapter  2.] 

71.  When  n = 1 + ■ ■ ■ + m > 1,  the  starting  partition  (m—  l)(m—  l)(m— 2) . . . 2 1 1 

has  distance  m(m  — 1)  from  the  cyclic  state,  and  this  is  maximum.  [K.  Igusa,  Math. 

Magazine  58  (1985),  259-271;  G.  Etienne,  J.  Combin.  Theory  A58  (1991),  181-197.]  ,d 

In  the  general  case,  Griggs  and  Ho  [Advances  in  Appl.  Math.  21  (1998),  205-227]  have 

conjectured  that  the  maximum  distance  to  a cycle  is  max(2n  + 2-m(n2  + 1),  n + n2  + 1,  ir 

m(n2  + l))  — 2n2  for  all  n > 1;  their  conjecture  has  been  verified  for  n < 100.  Moreover, 

the  worst-case  starting  partition  appears  to  be  unique  when  n2  = 2m  + {-1,0, 2}. 

72.  Equivalently,  m < m-  1 [B.  Hopkins  and  J.  A.  Sellers,  Integers  7,  2 (2007),  #A19[.  g 

73.  (a)  [R.  Stanley,  1972  (unpublished).]  Swap  the  jth  occurrence  of  k in  the  partition  I 

n = j ■ k + a with  the  fcth  occurrence  of  j in  k ■ j + a,  for  every  partition  a of  n - jk. 

For  example,  when  n = 6 the  swaps  are 

6,  51,  42,  411,  33,  321,  3111,  222,  2211,  21111,  111111. 

a bl  fg  clg  hi  jkl  dlkh  n2i  m21n  elmjf  ledcba 

(b)  p(n-k)  +p(n-2k)  + p(n-3k)  H . [A.  H.  M.  Hoare,  AMM  93  (1986),  475-476.] 

SECTION  7. 2. 1.5 

1.  Whenever  m is  set  equal  to  r in  step  H6,  change  it  back  to  r - 1. 

2.  LI.  [Initialize.]  Set  lj  4—  j — 1 and  aj  4—  0 for  1 < j < n.  Also  set  hi  4—  n,  t 4—  1? 

and  set  l0  to  any  convenient  nonzero  value. 

L2.  [Visit.]  Visit  the  f-block  partition  represented  by  R . . . ln  and  hi  . . . ht.  (The 
restricted  growth  string  corresponding  to  this  partition  is  m . . .an.) 

L3.  [Find  j .]  Set  j 4-  n;  then,  while  lj  = 0,  set  j 4-  j - 1 and  t 4-  t - 1.  re 

L4.  [Move  j to  the  next  block.]  Terminate  if  j = 0.  Otherwise  set  k 4-  aj  + 1, 
hk  4—  lj,  aj  4 k.  If  k = t,  set  t 4—  t + 1 and  lj  4—  0;  otherwise  set  lj  4—  hk+i- 
Finally  set  hk+i  4—  j. 

L5.  [Move  j + 1,  . . . , n to  block  1.]  While  j < n,  set  j 4—  j + 1,  lj  4—  hi,  aj  4—  0, 
and  hi  4—  j.  Return  to  L2.  | 

3.  Let  r(k,  n ) be  the  number  of  strings  ax...an  that  satisfy  the  condition  0 < aj  < 
l + max(fc— l,m, . . ,,aj-i)  for  1 < j < n;  thus  r(k,  0)  = 1,  r(0,n)  = zvn,  and  r(k,n)  = 
kT(k,n  — l)  + T(k  + l,n—l).  [S.  G.  Williamson  has  called  r(k,  n ) a “tail  coefficient”;  see 
SICOMP  5 (1976),  602-617.]  The  number  of  strings  that  are  generated  by  Algorithm  H 
before  a given  restricted  growth  string  m . . . an  is  Y,j=iajT(bj,n  - j),  where  bj  = 
l + max(a1, . . . ,o2_  1).  Working  backwards  with  the  help  of  a precomputed  table  of  the 
tail  coefficients,  we  find  that  this  formula  yields  999999  when  01  . . . oi2  = 010220345041. 
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4.  The  most  common  representatives  of  each  type,  subscripted  by  the  number  of 
corresponding  occurrences  in  the  GraphBase,  are  zzzzzo,  ooooh0,  xxxix0,  xxxiio, 
ooops0,  Hullo,  llala0,  eelero,  iittio,  xxiiio,  ccxxv0,  eeriei,  llamai,  xxvii0, 
oozed5,  uhuuuo,  mammai,  puppy2g,  anana0,  hehee0,  vividis,  rarer3,  etexti,  amass2, 
againi37,  ahhaa0,  essesi,  teeth25,  yaaay0,  ahhhh2,  pssst2,  seems7,  added6,  lxxii0, 
booksis4,  swiss3,  senseio,  ended3,  checki6o,  levels,  tepee4,  slyly5,  neveri54, 
sells6,  motto2i,  whooo2,  trees384,  going307,  whichi5i,  therei74,  threeioo,  their3834. 
(See  S.  Golomb,  Math.  Mag.  53  (1980),  219-221.  Words  with  only  two  distinct  letters 
are,  of  course,  rare.  The  18  representatives  listed  here  with  subscript  0 can  be  found 
in  larger  dictionaries  or  in  English-language  pages  of  the  Internet.) 

5.  (a)  112  = p(0225).  The  sequence  is  r(0),  r(l),  r(4),  r(9),  r(16),  . . . , where  r(n)  is 
obtained  by  expressing  n in  decimal  notation  (with  one  or  more  leading  zeros),  applying 
the  p function  of  exercise  4,  then  deleting  the  leading  zeros.  Notice  that  n/9  < r(n)  < n. 

(b)  1012  = r(452).  The  sequence  is  the  same  as  (a),  but  sorted  into  order  and  with 
duplicates  removed.  (Who  knew  that  882  = 7744,  2122  = 44944,  and  2642  = 69696?) 

6.  Use  the  topological  sorting  approach  of  Algorithm  7.2.1.2V,  with  an  appropriate 
partial  ordering:  Include  Cj  chains  of  length  j,  with  their  least  elements  ordered.  For 
example,  if  n = 20,  c2  = 3,  and  c3  = C4  = 2,  we  use  that  algorithm  to  find  all 
permutations  ai...a2o  of  { 1, . . . , 20}  such  that  1 -<  2,  3 -<  4,  5 -<  6,  1 -<  3 -<  5, 
7 X 8 9,  10  -<  11  -<  12,  7 -<  10,  13  ^ 14  X 15  ^ 16,  17  ^ 18  ^ 19  X 20, 
13  -<  17,  forming  the  restricted  growth  strings  p(f{ax) . . . /(a2 0)),  where  p is  defined 
in  exercise  4 and  (/(l), . . . , /(20))  = (1, 1,  2, 2, 3, 3, 4, 4,  4, 5,  5,  5, 6, 6, 6, 6,  7,  7,  7,  7).  The 
total  number  of  outputs  is,  of  course,  given  by  (48). 

7.  Exactly  vjn.  They  are  the  permutations  we  get  by  reversing  the  left-right  order  of 
the  blocks  in  (2)  and  dropping  the  ‘ | ’ symbols:  1234,  4123,  3124,  3412,  . . . , 4321.  [See 
A.  Claesson,  European  J.  Combinatorics  22  (2001),  961-971.  S.  Kit.aev,  in  Discrete 
Math.  298  (2005),  212-229,  has  discovered  a far-reaching  generalization:  Let  7r  be  a 
permutation  of  {0, . . . , r},  let  gn  be  the  number  of  permutations  ai  . . . an  of  {1, . . . , n} 
such  that  afc_o,r  > ak-\ w > • • • > dk-r-rr  > aj  implies  j > fc,  and  let  fn  be  the  number 
of  permutations  ai  . . . an  for  which  the  pattern  ak-o-K  > afc-i*-  > • • • > ajt_r * is  avoided 
altogether  for  r < k < n.  Then  E„>0  9nZn/n\  = exp(X[n>1  /n_izn/n!).] 

8.  For  each  partition  of  {l,...,n}  into  m blocks,  arrange  the  blocks  in  decreasing 
order  of  their  smallest  elements,  and  permute  the  non-smallest  block  elements  in  all 
possible  ways.  If  n = 9 and  m = 3,  for  example,  the  partition  126|38|4579  would  yield 
457938126  and  eleven  other  cases  obtained  by  permuting  {5,7,9}  and  {2,6}  among 
themselves.  (Essentially  the  same  method  generates  all  permutations  that  have  exactly 
k cycles;  see  the  “unusual  correspondence”  of  Section  1.3.3.) 

9.  Among  the  permutations  of  the  multiset  {k0  ■ 0,  fci  ■ 1, . . . , kn-i  ■ (n—  1)},  exactly 

(ko  + k\  + • • • + kn—i  \ ko  kx  kn-i 

ko , kx , . . . , k„- 1 J ( ko  k 1 -(-•••-(-  kn~  1 ) (&i  -F  ■ • • T kn~  4 ) kn—\ 

have  restricted  growth,  since  kj/(kj  + ■ • • + kn-i)  is  the  probability  that  j precedes 
{j  + 1,  ...,  n-  1}. 

The  average  number  of  0s,  if  n > 0,  is  1 + (n  — l)rzrn-\/wn  = 0(logn),  because 
the  total  number  of  0s  among  all  wn  cases  is  k {^zD^n-i t = + (n  — l)tun_i. 

10.  Given  a partition  of  {1, ... , n},  construct  an  oriented  tree  on  {0, 1, ... , n } by  letting 
j - 1 be  the  parent  of  all  members  of  a block  whose  least  member  is  j.  Then  relabel 
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the  leaves,  preserving  order,  and  erase  the  other  labels.  For  example,  the  15  partitions 
in  (2)  correspond  respectively  to 


I 


To  reverse  the  process,  take  a semilabeled  tree  and  assign  new  numbers  to  its  nodes 
by  considering  the  nodes  first  encountered  on  the  path  from  the  root  to  the  smallest 
leaf,  then  on  the  path  from  the  root  to  the  second-smallest  leaf,  etc.  The  number  of 
leaves  is  n + 1 minus  the  number  of  blocks.  [This  construction  is  closely  related  to 
exercise  2.3.4.4-18  and  to  many  enumerations  in  that  section.  See  P.  L.  Erdos  and 
L.  A.  Szekely,  Advances  in  Applied  Math.  10  (1989),  488-496.] 

11.  We  get  pure  alphametics  from  900  of  the  64855  set  partitions  into  at  most  10 
blocks  for  which  p(a\  . . . 013)  = p(a5  . . . a$ai  . . . 0,40,9  . . . 013),  and  from  563,527  of  the 
13,788,536  for  which  p(a  1 . . . am)  < p(a5  . . . asai  . . . 0409  . . . am).  The  first  examples 
are  aaaa  + aaaa  = baaac,  aaaa  + aaaa  = bbbbc,  and  aaaa  + aaab  = baaac;  the 
last  are  abed  + efgd  = deeab  (goat  + newt  = tango)  and  abed  + efgd  = deeaf 
(clad  + nerd  = dance).  [The  idea  of  hooking  a partition  generator  to  an  alphametic 
solver  is  due  to  Alan  Sutcliffe.] 

12.  (a)  Form  p[(aia'i)  . . . (ana'n)),  where  p is  defined  in  exercise  4,  since  we  have  x = y 
(modulo  77  V 77')  if  and  only  if  x = y (modulo  77)  and  x = y (modulo  77'). 

(b)  Represent  77  by  links  as  in  exercise  2;  represent  77'  as  in  Algorithm  2.3.3E; 
and  use  that  algorithm  to  make  j = lj  whenever  lj  / 0.  (For  efficiency,  we  can  assume 
that  77  has  at  least  as  many  blocks  as  77'.) 

(c)  When  one  block  of  77  has  been  split  into  two  parts;  that  is,  when  two  blocks 
of  77'  have  been  merged  together. 

(d)  (‘);  (e)  (2Sl_1  — 1)  + • • • + (2St_1  — 1). 

(f)  True:  Let  77V77'  have  blocks  73i 1 732 1 • • • \Bt,  where  77  = J3i 7?2 17?3 1 • • • \Bt.  Then 
77'  is  essentially  a partition  of  {Bi, . . . , Bt}  with  B\  ^ B2,  and  77  A 77'  is  obtained  by 
merging  the  block  of  IT  that  contains  B\  with  the  block  that  contains  Bi-  [A  finite 
lattice  that  satisfies  this  condition  is  called  lower  semimodulav,  see  G.  Birkhoff,  Lattice 
Theory  (1940),  §1.8.  The  majorization  lattice  of  exercise  7.2.1.4-54  does  not  have  this 
property  when,  for  example,  a = 4111  and  a1  = 331.] 

(g)  False:  For  example,  let  IT  = 0011,  77'  = 0101. 

(h)  The  blocks  of  77  and  77'  are  unions  of  the  blocks  of  77  V 77',  so  we  can  assume 
that  77  V 77'  = {1, . . . , t).  As  in  part  (b),  merge  j with  lj  to  get  77  in  r steps,  when  77 
has  t — r blocks.  These  merges  applied  to  77'  will  each  reduce  the  number  of  blocks  by 
0 or  1.  Hence  6(77')  — 6(77  A 77')  < r = b(II  V 77')  — 6(77). 

[In  Algebra  Universalis  10  (1980),  74-95,  P.  Pudlak  and  J.  Tuma  proved  that  every 
finite  lattice  is  a sublattice  of  the  partition  lattice  of  {1, . . . , n},  for  suitably  large  n.] 

13.  [See  Advances  in  Math.  26  (1977),  290-305.]  If  the  j largest  elements  of  a f-block 
partition  appear  in  singleton  blocks,  but  the  next  element  n — j does  not,  let  us  say 
that  the  partition  has  order  t — j.  Define  the  “Stirling  string”  Ent  to  be  the  sequence 
of  orders  of  the  f-block  partitions  77i,  772,  ...;  for  example,  E43  = 122333.  Then 
Ett  = 0,  and  we  get  i7(n+1)t  from  Ent  by  replacing  each  digit  d in  the  latter  by  the 
string  dd(d+l)d+1  . . . f*  of  length  (‘j1)  — (2);  f°r  example, 


Dia  = 1223332233322333333333333. 
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The  basic  idea  is  to  consider  the  lexicographic  generation  process  of  Algorithm  H. 
Suppose  77  = ai . . . an  is  a f-block  partition  of  order  j ; then  it  is  the  lexicographically 
smallest  t-block  partition  whose  restricted  growth  string  begins  with  ai  . . . a„-t+j.  The 
partitions  covered  by  77  are,  in  lexicographic  order,  77i2,  77i3,  7723,  7714,  7724,  7734, 
•••>  77(t-i)t,  where  77rs  means  “coalesce  blocks  r and  s of  77”  (that  is,  “change  all 
occurrences  of  s — 1 to  r — 1 and  then  apply  p to  get  a restricted  growth  string”).  If 
77'  is  any  of  the  last  (*)  — ( J2 ) of  these,  from  771(:)+1)  onwards,  then  77  is  the  smallest 
t-block  partition  following  77'.  For  example,  if  77  = 001012034,  then  n = 9,  t = 5, 
j = 3,  and  the  relevant  partitions  77'  are  p(001012004),  p(001012014),  p(001012024), 
p(001012030),  p(001012031),  p(001012032),  p(001012033). 

Therefore  fnt(N)  = fnt(N  — 1)  + (*)  — ( }2 ),  where  j is  the  TVth  digit  of  Ent. 

14.  El.  [Initialize.]  Set  aj  <—  0 and  foj  <—  <7j  1 for  1 < j < n. 

E2.  [Visit.]  Visit  the  restricted  growth  string  a\  . . ,an. 

E3.  [Find  j .]  Set  j t—  n;  then,  while  aj  = dj,  set  dj  «-  1 - dj  and  j <-  j — 1. 

E4.  [Done?]  Terminate  if  j = 1.  Otherwise  go  to  E6  if  dj  = 0. 

E5.  [Move  down.]  If  aj  = 0,  set  a3  <-  bj , rn  «-  a,  + 1 , and  go  to  E7.  Otherwise  if 
cij  = bj,  set  aj  «—  bj  — 1,  m «—  bj,  and  go  to  E7.  Otherwise  set  aj  <—  aj  — 1 
and  return  to  E2. 

E6.  [Move  up.]  If  aj  = bj  — 1,  set  a3  < — h} . rn  < — a3  + 1,  and  go  to  E7.  Otherwise 
if  aj  = bj,  set  aj  <—  0,  m «-  bj , and  go  to  E7.  Otherwise  set  aj  «—  a3  + 1 and 
return  to  E2. 

E7.  [Fix  bj+ 1 . . ,bn.]  Set  bk  <—  m for  k = j + 1,  . . . , n.  Return  to  E2.  | 

[This  algorithm  can  be  extensively  optimized  because,  as  in  Algorithm  H,  j is  almost 
always  equal  to  n.] 

15.  It  corresponds  to  the  first  n digits  of  the  infinite  binary  string  01011011011  . . . , 
because  zan-i  is  even  if  and  only  if  n mod  3 = 0 (see  exercise  23). 

16.  00012,  01012,  01112,  00112,  00102,  01102,  01002,  01202,  01212,  01222,  01022, 
01122,  00122,  00121,  01121,  01021,  01221,  01211,  01201,  01200,  01210,  01220.  01020, 
01120,  00120. 

17.  The  following  solution  uses  two  mutually  recursive  procedures,  f(p,i/,cr)  and 
6(p,i/,  cr),  for  “forward”  and  “backward”  generation  of  when  a = 0 and  of  A'^ 
when  a = 1.  To  start  the  process,  assuming  that  1 < m < n,  first  set  a3  <-  0 for 
1 5:  j ^ n ~ m and  an-m+j  j — 1 for  1 < j < m,  then  call  f(m,  n,  0). 

Procedure  f(p,i/,cr):  If  p = 2,  visit  oi...o„;  otherwise  call  /(p  — 1, i j — 1, 
(p+cr)  mod  2).  Then,  if  v = p + 1,  do  the  following:  Change  aM  from  0 to  p - 1, 
and  visit  ax  ...an;  repeatedly  set  av  <-  av  - 1 and  visit  ax  until  a„  = 0.  But  if 

u > p + 1,  change  a„_4  (if  p+<r  is  odd)  or  aM  (if  p+cr  is  even)  from  0 to  p — 1;  then 
call  6(p,i/— 1,0)  if  av  + a is  odd,  /(p,iz— 1,0)  if  av  + a is  even;  and  while  au  > 0,  set 
a„  4-  a„  — 1 and  call  b(p,v— 1,0)  or  /(p,  v— 1,0)  again  in  the  same  way  until  av  = 0. 

Procedure  6(p,  u,a):  If  u = p + 1,  first  do  the  following:  Repeatedly  visit  ax  . . ,an 
and  set  av  <—  a„  + 1,  until  a v — p — 1;  then  visit  ax  ...  a„  and  change  a ^ from  p — 1 
to  0.  But  if  v > p + 1,  call  /(p,  v—  1, 0)  if  av  + cr  is  odd,  6(p,  v—  1, 0)  if  av  + a is  even; 
then  while  av  < p — 1,  set  a„  <—  a„  + 1 and  call  /(p,  i/— 1,0)  or  &(p,  v— 1,0)  again  in 
the  same  way  until  av  = p - 1;  finally  change  a„- 1 (if  p+cr  is  odd)  or  (if  p+cr  is 
even)  from  p - 1 to  0.  And  finally,  in  both  cases,  if  p = 2 visit  ax  . . . an,  otherwise  call 
6(p  — l,i/  — 1,  (p+cr)  mod  2). 
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Most  of  the  running  time  is  actually  spent  handing  the  case  /i  = 2;  faster  routines 
based  on  Gray  binary  code  (and  deviating  from  Ruskey’s  actual  sequences)  could  be 
substituted  for  this  case.  A streamlined  procedure  could  also  be  used  when  p = v — 1. 

18.  The  sequence  must  begin  (or  end)  with  01 . . . (n—  1).  By  exercise  32,  no  such  Gray 
code  can  exist  when  0 / 5n  ^ (l)0+1+'"+(n_1),  namely  when  n mod  12  is  4,  6,  7,  or  9. 

The  cases  n = 1,  2,  3,  are  easily  solved;  and  1,927,683,326  solutions  exist  when 
n = 5.  Thus  there  probably  are  zillions  of  solutions  for  all  n > 8 except  for  the  cases 
already  excluded.  Indeed,  we  can  probably  find  such  a Gray  path  through  all  mnk  of  the 
strings  considered  in  answer  28(e)  below,  except  when  n = 2k  + (2, 4, 5,  7)  (modulo  12). 

Note:  The  generalized  Stirling  number  {^}_j  in  exercise  30  exceeds  1 for  2 < 
m < n,  so  there  can  be  no  such  Gray  code  for  the  partitions  of  {l,...,n}  into  m 
blocks. 


19.  (a)  Change  (6)  to  the  pattern  0,  2,  . . . , m,  . . . , 3,  1 or  its  reverse,  as  in  endo-order 

(7.2.1.3-(45)). 

(b)  We  can  generalize  (8)  and  (9)  to  obtain  sequences  Amna  and  A'mna  that 
begin  with  0"~m01 . . . (m—  1)  and  end  with  01 . . . (m— l)a  and  0n-m-101 . . . (m— l)o, 
respectively,  where  0 < a < m — 2 and  a is  any  string  ai  . . . an_m  with  0 < a,j  < m — 2. 
When  2 < m < n the  new  rules  are 


‘m(n+l)(aa) 


a : 


m(n+l)a 


^(m  — l)n(b/3)  1 i ^■mna‘^‘2  5 • • • 5 ArnnaXrn , 

— AfnnQ/X  1 , ATnnaX2 , . . . , AinnaXm , 

^(m  — l)nbxl  ’ 5 mn0 j ■ * ■ 5 Arnn^Xrn , 

■^(m  — l)n(b/3)'^'l>  T , • • • 1 ArnnpXm , 


if  m is  even; 
if  m is  odd; 

if  m is  even; 
if  m is  odd; 


here  b = m — 3,  /3  = bn  m,  and  (aq, . . . , xm)  is  a path  from  aq  = m — 1 to  xm  = a. 

20.  012323212122;  in  general  (ai  . . . an)T  = p(an  . . . ai),  in  the  notation  of  exercise  4. 

21.  The  numbers  (s0,  Si,  s2, . . . ) = (1, 1, 2, 3,  7, 12, 31,  59, 164, 339, 999, . . . ) satisfy  the 

recurrences  s2n+i  = Xj,  (^)s2n-2/fc,  s2n+2  = X/t  (aSK2*  + l)s2n-2fc,  because  of  the  way 
the  middle  elements  relate  to  the  others.  Therefore  s2n  = n!  [ zn]  exp((e2z  — l)/2+ez  — l) 
and  .S2ti+i  = n!  [ zn ] exp((e2z  — l)/2  + ez  + z — l).  By  considering  set  partitions  on  the 
first  half  we  also  have  s2n  = J2k{k}Xk  and  s2n+i  = Xjt  where  xn  = 

2xn-i  + (n  — l)a:n_2  = n!  [zn]  exp(2z  + z2/ 2).  [T.  S.  Motzkin  considered  the  sequence 
(s2n)  in  Proc.  Symp.  Pure  Math.  19  (1971),  173.] 

22-  (a)Xr=o^Pr(^  = fc)  = e-1Xr=ofcnA!  = ^nby(i6).  (b)J2^=0knPi(X  = k)  = 

Xr=o^nXr=o(D(-irVj!,  and  we  can  extend  the  inner  sum  to  j = oo  because 

X*  (D(~1)*fcn  = 0 when  3 > n-  Thus  we  get  XkLo(fc7fc!)  X£X (-1)1/1'  = [See 

J.  O.  Irwin,  J.  Royal  St  at.  Soc.  A118  (1955),  389-404;  J.  Pitman,  AMM  104  (1997), 
201-209.] 

23.  (a)  The  formula  holds  whenever  f(x)  = xn , by  (14),  so  it  holds  in  general.  (Thus 
we  also  have  XtLo  /(&)/&!  = ef(tv),  by  (16).) 

(b)  Suppose  we  have  proved  the  relation  for  k,  and  let  h(x)  = (x  — 1 )- f(x),  g(x)  = 
f(x+  1).  Then  f(zv  + k + 1)  = g(zv  + k)  = zv-g(zv)  = h(w+  1)  = wh(w)  = zcr^Al f (tv) . 
[See  J.  Touchard,  Ann.  Soc.  Sci.  Bruxelles  53  (1933),  21-31.  This  symbolic  “umbral 
calculus,”  invented  by  John  Blissard  in  Quart.  J.  Pure  and  Applied  Math.  4 (1861), 
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279-305,  is  quite  useful;  but  it  must  be  handled  carefully  because  }(w)  = g(zu ) does 
not  imply  that  f(w)h(w)  = g(zu)h(m).] 

(c)  The  hint  is  a special  case  of  exercise  4.6.2-16(c).  Setting  f(x)  = xn  and  k = p 
in  (b)  then  yields  wn  = zup+n  — Wi+n- 

(d)  Modulo  p,  the  polynomial  xN  - I is  divisible  by  g(x)  = xp  - x - 1.  because 

xp1  = x + k and  xN  = xp  = x?  = xp  — x = 1 (modulo  g(x)  and  p).  Thus  if  h(x)  = 
(xN  - 1 )xn/g{x)  we  have  h(w)  = h(w  + p)  = = (wp  - w)h{w)\  and  0 = 

g(w)h(w)  = zuN+n  — von  (modulo  p). 

24.  The  hint  follows  by  induction  on  e,  because  x ^ = niCot*  ~ fcpe_1)^— -.  We  can 
also  prove  by  induction  on  n that  xn  = rn(x)  (modulo  gi{x)  and  p)  implies 

xp  n=rn{x)p  (modulo  ge(x),  pge-i(x),  . . . , pe~1g1(x),  and  pe). 

Hence  xp  N = 1 + h0(x)ge(x)  +phi  (x)ge-i  {x)  -\ 1 - pe~1he-i(x)g1(yx)  + pehe{x)  for 

certain  polynomials  hk(x ) with  integer  coefficients.  Modulo  pe  we  have  h0(w)wn  = 
ho(w  + pe)(w  + pe)n  = w^-ho('co)wn  = (pe(rt7)  + l)ho(w)wn\  hence 

WP  N+n  _ _|_  il0(w')ge(w}wn  + phi(w)ge-i(w)wn  + • • ■ = ro". 

[A  similar  derivation  applies  when  p = 2,  but  we  let  gj+ i(x)  = gj(x)2  + 2 [j  = 2],  and 
we  obtain  wn  = zon+3-2B  (modulo  2e).  These  results  are  due  to  Marshall  Hall;  see  Bull. 
Amer.  Math.  Soc.  40  (1934),  387;  Amer.  J.  Math.  70  (1948),  387-388.  For  further 
information  see  W.  F.  Lunnon,  P.  A.  B.  Pleasants,  and  N.  M.  Stephens,  Acta  Arith. 
35  (1979),  1-16.] 

25.  The  first  inequality  follows  by  applying  a much  more  general  principle  to  the  tree 

of  restricted  growth  strings:  In  any  tree  for  which  deg(p)  > d eg  (parent  (p))  for  all  non- 
root nodes  p,  we  have  Wk/wk-i  < Wk+i/wk  when  wk  is  the  total  number  of  nodes  on 
level  k.  For  if  the  m = wk-i  nodes  on  level  k — 1 have  respectively  ai,  . . . , om  children, 
they  have  at  least  a?  H haj,  grandchildren;  hence  w^-iWk+i  > m(a\  H f«J,)  > 

(ai  + P am)2  = w2k. 

For  the  second  inequality,  note  that  wn+i  - vxn  - Y,k=o((l)  ~ (fc-i))C7"-fci  thus 


”^n+l 


U"V)^r  ^(nV) 


L \ ttn—k— 1 


^ n — 1 


n 

— 1 


because,  for  example,  wn-3/zun  = {wn-3/wn-2){vJn-2/^n-\){^n-ilT^n)  is  less  than 
Or  equal  to  (^ri  — 4/rr^n  — 3)(^n  — 3/r^n  — 2)(r*7n  — 2/'CDn  — 1)  = ^n  — 4/r^n  — 1- 

26.  There  are  (™Zl)  rightward  paths  from  © to  (ft );  we  can  represent  them  by  Os  and 
Is,  where  0 means  “go  right,”  1 means  “go  up,”  and  the  positions  of  the  Is  tell  us  which 
n-t  of  the  elements  are  in  the  block  with  1.  The  next  step,  if  t > 1,  is  to  another  vertex 
at  the  far  left;  so  we  continue  with  a path  that  defines  a partition  on  the  remaining  t — 1 
elements.  For  example,  the  partition  14|2|3  corresponds  to  the  path  0010  under  these 
conventions,  where  the  respective  bits  mean  that  1 ^ 2,  1 ^ 3,  1 E 4,  2 ^ 3.  [Many 
other  interpretations  are  possible.  The  convention  suggested  here  shows  that  zunk 
enumerates  partitions  with  1 ^ 2,  . . . , 1 ^ k,  a combinatorial  property  discovered  by 
H.  W.  Becker;  see  AMM  51  (1944),  47,  and  Mathematics  Magazine  22  (1948),  23-26.] 
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27.  (a)  In  general,  Ao  = Ai  = A2n-i  = A2 n = 0.  The  following  list  shows  also  the 
restricted  growth  strings  that  correspond  to  each  loop  via  the  algorithm  of  part  (b): 


0,0, 1,0, 0,0, 0,0,0  0012 

0,0, 1,0, 0,0, 1,0,0  0011 

0,0, 1,0, 1,0, 0,0,0  0001 

0,0, 1,0, 1,0, 1,0,0  0000 

0,0, 1,0, 1,1, 1,0,0  0010 


0,0, 1,1, 1,0, 0,0,0  0102 

0,0, 1,1, 1,0, 1,0,0  0100 

0,0, 1,1, 1,1, 1,0,0  0120 

0,0,1,1,11,1,1,0,0  0101 

0,0,1, 1,2,1, 1,0,0  0110 


0,0, 0,0, 0,0, 0,0,0  0123 
0,0, 0,0, 0,0, 1,0,0  0122 
0,0, 0,0,1, 0,0, 0,0  0112 
0,0, 0,0,1, 0,1, 0,0  0111 
0,0, 0,0, 1,1, 1,0,0  0121 

(b)  The  name  “tableau”  suggests  a connection  to  Section  5.1.4,  and  indeed  the 
theory  developed  there  leads  to  an  interesting  one-to-one  correspondence.  We  can 
represent  set  partitions  on  a triangular  chessboard  by  putting 
a rook  in  column  lj  of  row  n + 1 — j whenever  lj  ^ 0 in  the 
linked  list  representation  of  exercise  2 (see  the  answer  to  exercise 
5.1.3-19).  For  example,  the  rook  representation  of  135|27|489|6 
is  shown  here.  Equivalently,  the  nonzero  links  can  be  specified  in 
a two-line  array,  such  as  (3753®);  see  5.1.4-(n). 

Consider  the  path  of  length  2 n that  begins  at  the  lower  left 
corner  of  this  triangular  diagram  and  follows  the  right  boundary 
edges,  ending  at  the  upper  right  corner:  The  points  of  this  path 
are  Zk  = {\_k/2\,  \k/ 2])  for  0 < k < 2 n.  Moreover,  the  rectangle  above  and  to  the  left 
of  Zk  contains  precisely  the  rooks  that  contribute  coordinate  pairs  ) to  the  two-line 
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array  when  i < [k/ 2J 
when  9 < k < 12 


and 


J > \k/ 21;  in  our  example,  there  are  just  two  such  rooks 
namely  (^g).  Theorem  5.1.4A  tells  us  that  such  two-line  arrays 
are  equivalent  to  tableaux  (Pk,Qk),  where  the  elements  of  Pk  come  from  the  lower  line 
and  the  elements  of  Qk  come  from  the  upper  line,  and  where  both  Pk  and  Qk  have  the 
same  shape.  It  is  advantageous  to  use  decreasing  order  in  the  P tableaux  but  increasing 
order  in  the  Q tableaux,  so  that  in  our  example  they  are  respectively 


k Pk  Qk 

2 [3]  a 

3 n □ 


4 

7 

1 

3_ 

X 

5 0 d] 


6 [7\5\  dT3 


k Pk  Qk 
7 0E  @1 


8 

8 

I] 

2 

x 

9 

~2 

± 

10 

8 

]_ 

~2 

4_ 

11 

"8 

7 

~2 

4 

k Pk  Qk 


12 

8 

2 

7 

4 

13  a s 

14  ® 0 


15  • 

16  [9]  m 


while  Pk  and  Qk  are  empty  for  k = 0,  1,  17,  and  18. 

In  this  way  every  set  partition  leads  to  a vacillating  tableau  loop  Ao,  Ai,  . . . , A2n, 
if  we  let  Xk  be  the  integer  partition  that  specifies  the  common  shape  of  i\  and  Qk- 
(The  loop  is  0,  0,  1,  1,  11,  1,  2,  2,  21,  11,  11,  11,  11,  1,  1,  0,  1,  0,  0 in  our  example.) 
Moreover,  t2k-i  = 0 if  and  only  if  row  n + 1 — k contains  no  rook,  if  and  only  if  k is 
smallest  in  its  block. 

Conversely,  the  elements  of  Pk  and  Qk  can  be  uniquely  reconstructed  from  the 
sequence  of  shapes  Xk.  Namely,  Qk  = Qk- 1 if  tk  = 0.  Otherwise,  if  k is  even,  Qk  is  Qk- 1 
with  the  number  k/2  placed  in  a new  cell  at  the  right  of  row  tk',  if  k is  odd,  Qk  is  obtained 
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from  Qk-i  by  using  Algorithm  5.1.4D  to  delete  the  rightmost  entry  of  row  tk . A similar 
procedure  defines  Pk  from  the  values  of  Pk+i  and  tk+i,  so  we  can  work  back  from  Pm 
to  P0.  Thus  the  sequence  of  shapes  \k  is  enough  to  tell  us  where  to  place  the  rooks. 

Vacillating  tableau  loops  were  introduced  in  a paper  by  W.  Y.  C.  Chen,  E.  Y.  P. 
Deng,  R.  R.  X.  Du,  R.  P.  Stanley,  and  C.  H.  Yan  [Transactions  of  the  Amer.  Math. 
Soc.  359  (2007),  1555-1575],  who  showed  that  the  construction  has  significant  (and 
surprising)  consequences.  For  example,  if  the  set  partition  If  corresponds  to  the 
vacillating  tableau  loop  A0,  Ai,  . . . , A2n,  let’s  say  that  its  dual  IJD  is  the  set  partition 
that  corresponds  to  the  sequence  of  transposed  shapes  Ao,  A)r,  . ..,  A^„.  Then,  by 
exercise  5. 1.4-7,  If  contains  a “fc-crossing  at  IP  namely  a sequence  of  indices  with 
ii  < < ik  < l < j i < •••  < jk  and  i\  = j i,  ...,  ik  = jk  (modulo  77), 

if  and  only  if  / / ? contains  a “fc-nesting  at  /,”  which  is  a sequence  of  indices  with 
i'i  < ■ ■ ■ < i'k  < l < j'k  < • • • < j[  and  = j[,  . . . , i'k  = j'k  (modulo  17°).  Notice  also 
that  an  involution  is  essentially  a set  partition  in  which  all  blocks  have  size  1 or  2;  the 
dual  of  an  involution  is  an  involution  having  the  same  singleton  sets.  In  particular,  the 
dual  of  a perfect  matching  (when  there  are  no  singleton  sets)  is  a perfect  matching. 

Furthermore,  an  analogous  construction  applies  to  rook  placements  in  any  Ferrers 
diagram,  not  only  in  the  stairstep  shapes  that  correspond  to  set  partitions.  Given  a 
Ferrers  diagram  that  has  at  most  m parts,  all  of  size  < n,  we  simply  consider  the  path 
z0  = (0,0),  2i  , . . . , Zm-\-n  — ( Tl,  m)  that  hugs  the  right  edge  of  the  diagram,  and  stipulate 
that  A k = A k l + effc  when  zk  = zk-i  + (1,  0),  \k  = \k-i  — etk  when  zk  = Zk-i  + (0. 1). 
The  proof  we  gave  for  stairstep  shapes  shows  also  that  every  placement  of  rooks  in  the 
Ferrers  diagram,  with  at  most  one  rook  in  each  row  and  at  most  one  in  each  column, 
corresponds  to  a unique  tableau  loop  of  this  kind. 

[And  much  more  is  true,  besides!  See  A.  Berele,  J.  Combinatorial  Theory  A43 
(1986),  320-328;  S.  Fomin,  J.  Combinatorial  Theory  A72  (1995),  277-292;  M.  van 
Leeuwen,  Electronic  J.  Combinatorics  3,2  (1996),  paper  #R15.] 

28.  (a)  Define  a one-to-one  correspondence  between  rook  placements,  by  interchanging 
the  positions  of  rooks  in  rows  j and  j + 1 if  and  only  if  there’s  a rook  in  the  “panhandle” 
of  the  longer  row: 


o 

o 

o 

0 

o 

o] O 1 • 1 

o 

o 

• 

o 

o 

• 
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o 

o 
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o 

1 i 1 1 ^ 

o 

• 

o 

o 

o 

o 

• 

o 

o 

o 

• 

1 1 i 

(b)  This  relation  is  obvious  from  the  definition,  by  transposing  all  the  rooks. 

(c)  Suppose  Or  > a2  > • • • and  ak  > ak+i  . Then  we  have 

R(ai,  02, ... ) = xR(a\  — 1, . . . ,ak~ i — 1,  ak+ 1,  • • . ) + yR(ai, . . . , ak-i,  ak—  1,  Ufc+i, . . . ) 

because  the  first  term  counts  cases  where  a rook  is  in  row  k and  column  a*,.  Also 
77(0)  = 1 because  of  the  empty  placement.  From  these  recurrences  we  find 

77(1)  = x + y,  77(2)  = 77(1, 1)  = x + xy  + y2;  77(3)  = 77(1, 1,1)  = ® + xy  + xy 2 + y3\ 
77(2, 1)  = x2  + 2 xy  + xy2  + y3\ 

R(  3>  1)  = 77(2, 2)  = 77(2, 1, 1)  = x2  + x2y  + xy  + 2 xy2  + xy3  + j/4; 

77(3, 1, 1)  = 77(3,  2)  = 77(2,  2, 1)  = x2  + 2 x2y  + x2 y2  + 2 xy2  + 2 xy3  + xy 4 + y5; 

77(3,  2, 1)  = ®3  + 3 x2y  + 3 x2y2  + x2y3  + 3 xy3  + 2 xy4  + xy5  + y6 . 
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(d)  For  example,  the  formula  n?73 {x,y)  — xzue3{x,y)  + yw7^(x,y)  is  equivalent 
to  R(5, 4,  4, 3,  2, 1)  = xR(4, 3,  3,  2, 1)  + yR(5, 4, 3, 3,  2, 1),  a special  case  of  (c);  and 
mnn(x,y)  = R(n  - 2, ...  ,0)  is  obviously  equal  to  zu(n_1)1  (x,  y)  = R(n  - 2, . . . , 1). 

(e)  In  fact  y •ajnic(x,y)  is  the  stated  sum  over  all  restricted  growth  strings 
ai  ...  an  for  which  a2  > 0,  . . . , ak  > 0. 

29.  (a)  If  the  rooks  are  respectively  in  columns  (cj, . . . , c„),  the  number  of  free  cells 
is  the  number  of  inversions  of  the  permutation  (ra+l-ci) . . . (n  + l-c„).  [Rotate  the 
right-hand  example  of  Fig.  56  by  180°  and  compare  the  result  to  the  illustration  that 
follows  Eq.  5.1. 1— (5) -] 

(b)  Each  r x r configuration  can  be  placed  in,  say,  rows  i\  < • • • < ir  and  columns 
j i<”-<  Jr,  yielding  ( m — r)(n  — r ) free  cells  in  the  unchosen  rows  and  columns;  there 
are  (i2  — ii  + l)  + 2(73  — i2  — 1)  + • • ■ + (r— l)(ir—  ir-\  — 1)  + r(m—ir)  in  the  unchosen  rows 
and  chosen  columns,  and  a similar  number  in  the  chosen  rows  and  unchosen  columns. 
Furthermore 


E 


y(i2~  il  + 1)+2(*3—  *2-1)4 h(r-l)(ir-iT._1 


l<il  <"-<ir<m 


— 1 )+r(m  — ir) 


may  be  regarded  as  the  sum  of  2/ai+'»2+"  +am_r  over  aU  part;tjons  r > ai  > a2  > • • • > 
Qm  — r ^ 0,  so  it  is  (™)w  by  Theorem  C.  The  polynomial  r\y  generates  free  cells  for  the 
chosen  rows  and  columns,  by  (a).  Therefore  the  answer  is  y(m~r)(n~r)  (™)  _ 

y(m-rHn-r)m\yn\y/((m  - r)\v(n  - r)\yr\y). 

(c)  The  left-hand  side  is  the  generating  function  Rm(t  + ai, . . . , t + am)  for  the 
Ferrers  diagram  with  t additional  columns  of  height  m.  For  there  are  t -(-  am  ways  to 

put  a rook  in  row  m,  yielding  1+j/H lj/‘+a’"-1  = (1  - J/t+“m)/(l  - y)  free  cells  with 

respect  to  those  choices;  then  there  are  t + a rn  . 1 — 1 available  cells  in  row  m — 1,  etc. 

The  right-hand  side,  likewise,  equals  Rm(t  + ai, . . . , t + am).  For  if  m-k  rooks 
are  placed  into  columns  > t,  we  must  put  k rooks  into  columns  < t of  the  k unused 
rows;  and  we  have  seen  that  t\y/(t  — k)\y  is  the  generating  function  for  free  cells  when 
k rooks  are  placed  on  a k x t board. 

Notes:  The  formula  proved  here  can  be  regarded  as  a polynomial  identity  in  the 
variables  y and  j/;  therefore  it  is  valid  for  arbitrary  t,  although  our  proof  assumed  that 
f is  a nonnegative  integer.  This  result  was  discovered  in  the  case  y = I by  J.  Goldman, 
J.  Joichi,  and  D.  White,  Proc.  Amei.  Math.  Soc.  52  (1975),  485-492.  The  general  case 
was  established  by  A.  M.  Garsia  and  J.  B.  Rernmel,  J.  Combinatorial  Theory  A41 
(1986),  246-275,  who  used  a similar  argument  to  prove  the  additional  formula 


E^n 


y 


,dj  -\-m—j+t 


3 = 1 


1 - y 


n 


(d)  This  statement,  which  follows  immediately  from  (c),  also  implies  that  we  have 
R{au  • ■ • > am)  = R(a,i , ... , a'm)  if  and  only  if  equality  holds  for  all  x and  for  any  nonzero 
value  of  y.  The  Peirce  polynomial  wnk(x,  y)  of  exercise  28(d)  is  the  rook  polynomial  for 
(fcli)  different  Ferrers  diagrams;  for  example,  zu63(x,y)  enumerates  rook  placements 
for  the  shapes  43321,  44221,  44311,  4432,  53221,  53311,  5332,  54211,  5422,  and  5431. 

30.  (a)  We  have  mn(x,  y)  = £m  a?"””1  Amn,  where  Amn  = R„_m(n-  1, . . . , 1)  satisfies 
a simple  law:  If  we  don’t  place  a rook  in  row  1 of  the  shape  (n  — 1, . . . , 1),  that  row 
has  m — 1 free  cells  because  of  the  n — m rooks  in  other  rows.  But  if  we  do  put  a rook 


id 


r 


? 

I 


re 
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there,  we  leave  0 or  1 or  • • • or  m - 1 of  its  cells  free.  Hence  Amn  = yrn  ~1  + 

(1  + V H b yrn-1)Am(n- i),  and  it  follows  by  induction  that  Amn  = 2/T7X(Trl— 1)/2 ^ . 

(b)  The  formula  U7n+1(x,y)  = (l)xn~kykwk(x,y)  yields 

Am(n+ 1)  = y ( /,  ) V ^4(m-l)fc' 

k 

(c)  From  (a)  and  (b)  we  have 


(1  - z)(l  - (1  + q)z)  . . . (1  - (1  + q 4 + q™-1)*) 


E(l) 


[The  second  formula  is  proved  by  induction  on  n,  because  both  sides  satisfy  the 
differential  equation  G'n+1(z)  = (!  + <?  + •••  + qn)ezGn(qz ) ; exercise  1.2.6-58  proves 
equality  when  z = 0.] 

Historical  note:  Leonard  Carlitz  introduced  q-Stirling  numbers  in  Transactions  of 
the  Amer.  Math.  Soc.  33  (1933),  127-129.  Then  in  Duke  Math.  J.  15  (1948),  987-1000, 
he  derived  (among  other  things)  an  appropriate  generalization  of  Eq.  1.2.6  (45): 


(!  + <?  + •••  + qm~l)n 


m\q 

( m - k)\q  ' 


31.  exp(e”'+2  + w - 1);  therefore  wnk  = (to  + 1 )n~kzuk~1  = zon+1~k(w  - l)fc— 1 in  the 
umbral  notation  of  exercise  23.  [L.  Moser  and  M.  Wyman,  Trans.  Royal  Soc.  Canada 
(3)  43  (1954),  Section  3,  31-37.]  In  fact,  the  numbers  wnk{x,  1)  of  exercise  28(d)  are 
generated  by  exp^e*^12  - l)/x  + xw). 

32.  We  have  Sn  = zun(l,  —1),  and  a simple  pattern  is  easily  perceived  in  the  generalized 
Peirce  triangle  of  exercise  28(d)  when  x = 1 and  y = -1:  We  have  |roni:(l,-l)|  < 1 
and  a7„(fc+1)(l,  — 1)  = wnk(l,—l)  + ( — l)n  (modulo  3)  for  1 < k < n.  [In  JACM  20 
(1973),  512-513,  Gideon  Ehrlich  gave  a combinatorial  proof  of  an  equivalent  result.] 

33.  Representing  set  partitions  by  rook  placements  as  in  answer  27  leads  to  the  answer 
zUnk>  by  setting  x = y — l in  exercise  28(d).  [The  case  k = n was  discovered  by 
H.  Prodinger,  Fibonacci  Quarterly  19  (1981),  463-465.] 

34.  (a)  Guittone’s  Sonetti  included  149  of  scheme  01010101232323,  64  of  scheme 
01010101234234,  two  of  scheme  01010101234342,  seven  with  schemes  used  only  once 
(like  01100110234432),  and  29  poems  that  we  would  no  longer  consider  to  be  sonnets 
because  they  do  not  have  14  lines. 

(b)  Petrarch’s  Canzoniere  included  115  sonnets  of  scheme  01100110234234,  109  of 
scheme  01100110232323,  66  of  scheme  01100110234324,  7 of  scheme  01100110232232, 
and  20  others  of  schemes  like  01010101232323  used  at  most  four  times  each. 

(c)  In  Spenser’s  Amoretti,  88  of  89  sonnets  used  the  scheme  01011212232344;  the 
exception  (number  8)  was  “Shakespearean.” 

(d)  Shakespeare’s  154  sonnets  all  used  coalescences  of  the  rather  easy  scheme 
01012323454566,  except  that  two  of  them  (99  and  126)  didn’t  have  14  lines. 

(e)  Browning’s  44  Sonnets  From  the  Portuguese  obeyed  the  Petrarchan  scheme 
01100110232323. 
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Sometimes  the  lines  would  rhyme  (by  chance?)  even  when  they  didn’t  need  to;  for 
example.  Browning’s  final  sonnet  actually  had  the  scheme  01100110121212. 

Incidentally,  the  lengthy  cantos  in  Dante’s  Divine  Comedy  used  an  interlocking 
scheme  of  rhymes  in  which  1 = 3 and  3n  - 1 = 3n  + 1 = 3n  + 3 for  n = 1,  2, 

35.  Every  incomplete  n-line  rhyme  scheme  17  corresponds  to  a singleton- free  partition 
of  {1, . . . ,n+l}  in  which  (n+1)  is  grouped  with  all  of  17’s  singletons.  [H.  W.  Becker 
gave  an  algebraic  proof  in  AMM  48  (1941),  702.  Notice  that  w'n  = £*  (^(-l)"-*^, 
by  the  principle  of  inclusion  and  exclusion,  and  wn  = YJk{n^)/co'k\  we  can  in  fact  write 
w = ro  — 1 in  the  umbral  notation  of  exercise  23.  J.  O.  Shallit  has  suggested  extending 
Peirce’s  triangle  by  setting  tun(n+1)  = w'n\  see  exercises  38(e)  and  33.  In  fact,  wnk 
is  the  number  of  partitions  of  {1, . . . , n}  with  the  property  that  1,  . . . , k - 1 are  not 
singletons;  see  H.  W.  Becker,  Bull.  Amer.  Math.  Soc.  58  (1952),  63.] 

36.  exp(e2  — 1 — z).  (In  general,  if  is  the  number  of  partitions  of  {l,...,n} 
into  subsets  of  allowable  sizes  ,si  < s2  < • • • , the  exponential  generating  function 

E„  $nZn/n\  is  exp(zsySl!  + z82 / s2\  + •••),  because  (zSl/Sl!  + z82 / s2!  H )k  is  the 

exponential  generating  function  for  partitions  into  exactly  k parts.) 

37.  There  are  T.ki’kWk^h-k  possibilities  of  length  n,  hence  784,071,966  when  n = 14. 
(But  Pushkin’s  scheme  is  hard  to  beat.) 

38.  (a)  Imagine  starting  with  icia:2  . . . xn  = 01 . . . (n— 1),  then  successively  removing 
some  element  bj  and  placing  it  at  the  left,  for  j — 1,  2,  . . . , n.  Then  xk  will  be  the 
kth  most  recently  moved  element,  for  1 < k < ]{&i, . . . , 6n}|;  see  exercise  5.2.3-36. 
Consequently  the  array  x\  . . . xn  will  return  to  its  original  state  if  and  only  if  bn  ■ ■ ■ &i 
is  a restricted  growth  string.  [Robbins  and  Bolker,  JEquat.  Math.  22  (1981),  281-282.] 

In  other  words,  let  a\  . . ,an  be  a restricted  growth  string.  Set  b-j  <—  j and 
bj+ 1 an-j  for  0 < j < n.  Then  for  1 < j < n,  define  kj  by  the  rule  that  bj  is  the  kjth 
distinct  element  of  the  sequence  bj-i,  bj-2,  • • ■ • For  example,  the  string  ai  . . .a i6  = 
0123032303456745  corresponds  in  this  way  to  the  cr-cycle  6688448628232384. 

(b)  Such  paths  correspond  to  restricted  growth  strings  with  max(aj,.  ,.,a„)  < m, 
so  the  answer  is  {[[}  + {"}  + • • • + {^}. 

(c)  We  may  assume  that  i = 1,  because  the  sequence  fc2  . . . knki  is  a a- cycle 
whenever  k\k2  . . . kn  is.  Thus  the  answer  is  the  number  of  restricted  growth  strings 
with  an  = j — 1,  namely  {"+*}  + {"J1}  + {”-*}  + • • • . 

(d)  If  the  answer  is  fn  we  must  have  ^2k  (^ )fk  = wn,  since  cri  is  the  identity 
permutation.  Therefore  fn  = vj'n , the  number  of  set  partitions  without  singletons 
(exercise  35). 

(e)  Again  w'n,  by  (a)  and  (d).  [Consequently  w'p  modp  = 1 when  p is  prime.] 

39.  Set  u = tp+1  to  obtain  f°°  e-t‘u(?“p)/(p+1)  du  = -Trflil) 

P+l  P+1  kp+l/‘ 

40.  We  have  g(z)  = cz  — n In  z,  so  the  saddle  point  occurs  at  n/c.  The  rectangular  path 

now  has  corners  at  ±n/c  ± mi/c;  and  exp g(n/c  + it)  = (encn/nn)  exp(— f2c2/(2n)  + 
it3c3/ (3n2)  -I ).  The  final  result  is  en {c/n)n~1/ \/ 2nn  times  1 + n/12  + 0{n~2). 

(Of  course  we  could  have  obtained  this  result  more  quickly  by  letting  w — cz  in 
the  integral.  But  the  answer  given  here  applies  the  saddle  point  method  mechanically, 
without  attempting  to  be  clever.) 

41.  Again  the  net  result  is  just  to  multiply  (21)  by  cn_1;  but  in  this  case  the  left  edge 
of  the  rectangular  path  is  significant  instead  of  the  right  edge.  (Incidentally,  when 
c = — 1 we  cannot  derive  an  analog  of  (22)  using  Hankel’s  contour  when  x is  real  and 
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positive,  because  the  integral  on  that  path  diverges.  But  with  the  usual  definition  of  zx , 
a suitable  path  of  integration  does  yield  the  formula  -(cos  nx)/T(x)  when  n = x > 0.) 

42.  We  have  § ez  dz/z n = 0 when  n is  even.  Otherwise  both  left  and  right  edges  of 
the  rectangle  with  corners  ± \Jn/2  ± in  contribute  approximately 


e 


n/2 


27r(n/2)n/2 


(-it)3  23'2  (it)4 

3 n1/2  + n 


dt, 


when  n is  large.  We  can  restrict  |f|  < n'  to  show  that  this  integral  is  70  + (h  - § h)/n 
with  relative  error  0(n9e~3/2),  where  Ik  = e~2tV  dt.  As  before,  the  relative  error 
is  actually  0(n~2);  we  deduce  the  answer 


((n-l)/2)!  v/2^(n/2)"/2(1+12n+°(n2))’  n °dd' 

(The  analog  of  (22)  is  (sin  ^)2/T((a;  - l)/2)  when  n = x > 0.) 

43.  Let  f(z)  = ee  / zn . When  z = -n  + it  we  have  \f(z)\  < en~n\  when  z = t + 2nin  + 
J7t/2  we  have  |/(z)|  = |z|  n < (27rn)_n.  So  the  integral  is  negligible  except  on  a path 
z £ "f  ft;  and  on  that  path  |/|  decreases  as  |t|  increases  from  0 to  7 r.  Already  when 
t = n‘  x^2  we  have  \f(z)\/f(£)  = 0(exp(— n2e/(log n)2)).  And  when  |t|  > jr  we  have 
\f(z)\/f(0  < 1/1 1 + ™/k |n  = exp(-f  ln(l  + 7r2/£2)). 

44.  Set  u = na2t 2 in  (25)  to  obtain  5R  f0°°  e~u  exp(n~1/2c3(-u)3/2  + n~1c4(-u)2  + 

n~3/2c5(-u)s/2  - 1 ) du/Vna2u  where  ck  = (2/(£  + l))fc/2(£fe_1  + (~l)k(k  - l)!)/fc!  = 

aujaij2 . This  expression  leads  to 


b, 


1\1±L5  c^_  C42  C53 

2)  Lj  k^.  w. 

fcl+2fc2+3fc3+-..=2( 

fcl  4-fc2  + ^3H = 111 

k 1 ,k2,ks,...  >0 


a sum  over  partitions  of  21.  For  example,  61  = |c4  - i|c§. 

45.  To  get  wn/n\  we  replace  g(z)  by  ez  - (n  + l)lnz  in  the  derivation  of  (26). 
This  change  multiplies  the  integrand  in  the  previous  answer  by  1/(1  + it/£),  which 
is  1/(1  - n~1/2a(—u)1/2)  where  a = -s/2/(£  + 1).  Thus  we  get 


1 
2 

fc  + fcl-l-2fc2+3fc3-(-”*  = 2Z 

k\  +^2+^3-! = m 

k,ki  ,^2,^3,...  >0 

a sum  of  p(2l)  + p(2l  - 1)  + • • • + p( 0)  terms;  b(  = fc4  - If  c§  + f ac3  - \a2 . [The 
coefficient  b(  was  obtained  in  a different  way  by  L.  Moser  and  M.  Wyman,  Trans. 
Royal  Soc.  Canada  (3)  49,  Section  3 (1955),  49-54,  who  were  the  first  to  deduce  an 
asymptotic  series  for  zun . Their  approximation  is  slightly  less  accurate  than  the  result 
of  (26)  with  n changed  to  n + 1,  because  it  doesn’t  pass  exactly  through  the  saddle 
point.  Formula  (26)  is  due  to  I.  J.  Good,  Iranian  J.  Science  and  Tech.  4 (1975),  77-83.] 

46.  Equations  (13)  and  (31)  show  that  wnk  = (1  - £/n)kzun(  1 + O (n~x))  for  fixed  k 
as  n -A  00.  And  this  approximation  also  holds  when  k — n,  but  with  relative  error 
0((logn)2/n). 


b'i  = 


E 


V— 

) W.  k2\ 


kl  „k2  _fc  3 


k2\  k3\ 
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47.  Steps  (H1,H2,  ...,H6)  are  performed  respectively  (1,  vjn,  w„  -wn-x,  mn-i,  zun-i, 

tJ7n-i  ~ 1)  times.  The  loop  in  H4  sets  j «—  j — 1 a total  of  zzrn_2  + a7n_3  + • • • + vj\ 
times;  the  loop  in  H6  sets  bj  «—  m a total  of  (ro„_2  — 1)  + • • • + (tui  — 1)  times.  The 
ratio  wn-\/wn  is  approximately  (In n)/n,  and  (wn-2  + f w\)/wn  « (lnn)2/n2. 

48.  We  can  easily  verify  the  interchange  of  summation  and  integration  in 


evjx 

r(*  + i) 


l 

27ri 


e 


kx 


k\  2-+1 


dz 


OO 


= E 


j_  i 
A’!  27TI 


dz  = YJ 

k-0 


1 A* 

A!  T(a:  + 1) 


49.  If  £ = Inn  — In  Inn  + a;,  we  have  j3  = 1 — e 
inversion  formula  (exercise  4.7-8), 


— ax.  Therefore  by  Lagrange’s 


k= 1 j= 0 


where  /(f)  = t/(l  — e f).  So  the  result  follows  from  the  handy  identity 


1 — e" 


\ m 00 

0 -E 


m 

m — n 


( rn  — 1 )(m  — 2) . . . (m  — n) 


(This  identity  should  be  interpreted  carefully  when  n > m;  the  coefficient  of  zn  is  a 
polynomial  in  m of  degree  n,  as  explained  in  CMath  equation  (7.59).) 

The  formula  in  this  exercise  is  due  to  L.  Comtet,  Comptes  Rendus  Acad.  Sci. 
(A)  270  (Paris,  1970),  1085  1088,  who  identified  the  coefficients  previously  computed 
by  N.  G.  de  Bruijn,  Asymptotic  Methods  in  Analysis  (1958),  25-28.  Convergence  for 
n > e was  shown  by  Jeffrey,  Corless,  Hare,  and  Knut.h,  Comptes  Rendus  Acad.  Sci.  (I) 
320  (1995),  1449-1452,  who  also  derived  a formula  that  converges  somewhat  faster. 

(The  equation  = n has  complex  roots  as  well.  We  can  obtain  them  all  by 

using  Inn  + 2nim  in  place  of  In n in  the  formula  of  this  exercise;  the  sum  converges 
rapidly  when  m / 0.  See  Corless,  Gonnet,  Hare,  Jeffrey,  and  Knuth,  Advances  in 
Computational  Math.  5 (1996),  347-350.) 

50.  Let  £ = £(n).  Then  £'(n)  = £/((£  + l)n),  and  the  Taylor  series 

£(n  + fc)  = £ + fc£'(n)  + y£"(n)  H 

can  be  shown  to  converge  for  \k\  < n + 1/e. 

Indeed,  much  more  is  true,  because  the  function  £(n)  = —T(—n)  is  obtained  from 
the  tree  function  T(z)  by  analytic  continuation  to  the  negative  real  axis.  (The  tree 
function  has  a quadratic  singularity  at  z = e”1;  after  going  around  this  singularity 
we  encounter  a logarithmic  singularity  at  2 = 0,  as  part  of  an  interesting  multi-level 
Riemann  surface  on  which  the  quadratic  singularity  appears  only  at  level  0.)  The 
derivatives  of  the  tree  function  satisfy  zkT('k\z)  = R(z)k pk(R(z)),  where  R(z)  = 
T(z)/(  1 - T(z))  and  pk(x)  is  the  polynomial  of  degree  k - 1 defined  by  pk+1(x ) = 
(1  + x)2p'k(x)  + k( 2 + x)pk(x).  For  example, 

pi(x)  = l,  p2(x)  = 2 + x,  p3(x)  = 9+  10a; + 3i2,  p4(x)  = 64  + 113a;  + 70*2  + 15*3. 
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(The  coefficients  of  pk(x),  incidentally,  enumerate  certain  phylogenetic  trees  called 
Greg  trees:  \x3}pk(x)  is  the  number  of  oriented  trees  with  j unlabeled  nodes  and 
k labeled  nodes,  where  leaves  must  be  labeled  and  unlabeled  nodes  must  have  at 
least  two  children.  See  J.  Felsenstein,  Systematic  Zoology  27  (1978),  27-33;  L.  R. 
Foulds  and  R.  W.  Robinson,  Lecture  Notes  in  Math.  829  (1980),  110-126;  C.  Flight, 
Manuscripts  34  (1990),  122-128.)  If  qk(x ) = pk(—x),  we  can  prove  by  induction  that 
( — 1)  Ik  (*)  > 0 for  0 < a:  < 1.  Therefore  qk(x ) decreases  monotonically  from  kk~l 
to  (k  - 1)!  as  x goes  from  0 to  1,  for  all  k,m  > 1.  It  follows  that 


£(n  + fc)=£+--(-y^M+(^ 

n V n J 2!  V n 


kx\3  q3{x) 


J 3! 


£ + 1’ 


where  the  partial  sums  alternately  overshoot  and  undershoot  the  correct  value  if  k > 0. 
51.  There  are  two  saddle  points,  o = n-\-bf 4 — 1/2  and  o'  = — 1 — o.  Integration  on  a 
rectangular  path  with  corners  at  <r±im  and  o' ±im  shows  that  only  o is  relevant  as  n — h 
oo  (although  o'  contributes  a relative  error  of  roughly  which  can  be  significant 

when  n is  small).  Arguing  almost  as  in  (25),  but  with  g(z)  = z + 22/2  - (n  + 1)  In  2, 
we  find  that  tn  is  well  approximated  by 


n 

2ix 


.!  fn‘ 

* J-n « 


ag(cr)-a2t2 +a3it3 ^ ha,  (-it)1  +0(n(‘+1)e~ ~ 1)/2 ) 


dt, 


Ok  = 


o + 1 [fc  = 2] 

kok~ 1 2 


The  integral  expands  as  in  exercise  44  to 

n,e(n+a)/2 


(1  + by  + 62  + • • • + bm  + 0(1 


y. 


2crn  + 1 yJ'Kd'l  ' 

This  time  ck  = (0+  1 )o1~k(l  + l/(2o))~^2/k  for  k > 3,  hence  (2a  + 1 )3kokbk  is  a 
polynomial  in  a of  degree  2 k : for  example, 


6l  = T 


15  2 
4°4  - 16C3  = 


8a2  + 7a  - 1 


12a(2a  + l)3  ’ 

In  particular,  Stirling’s  approximation  and  the  61  term  yield 


tn  = -Ln"/2e-"/2+v4I 

V2 


rn  — 1/4  ^ 


1 ^ -1/2 

1+24n 


119  _!  7933  _ 

n 7 

1152  414720 


3/2 


+ 0(n~2)) 


after  we  plug  in  the  formula  for  a a result  substantially  more  accurate  than  equation 
5.1.4— (53),  and  obtained  with  considerably  less  labor. 

52.  Let  G(z)  = Pr(A  = k)zk,  so  that  the  jth  cumulant  k3  is  j\[t3]  lnG(e‘).  In 
case  (a)  we  have  G(z ) = ee(z-e(;  hence 


In  G{et)  = e 


ie* 




e<(e«(e‘_1) 


-l)  = e^(e'-l/|, 


Hi  = e 


k= 1 


Case  (b)  is  sort  of  3,  du<il  situation;  Hero  k — j — tOj  \j  7-  0]  bGC3usG 


G(z) 


J’k  j 


(e> 


-y 


[If  = 1 in  case  (a)  we  have  Kj  = ew  [j  / 0].  But  if  £e5  = n in  that  case,  the 
mean  is  «i  = n and  the  variance  a2  is  (£  + l)n.  Thus,  the  formula  in  exercise  45  states 
that  the  mean  value  n occurs  with  approximate  probability  l/y/2 no  and  relative  error 
0(l/n).  This  observation  leads  to  another  way  to  prove  that  formula.] 
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53.  We  can  write  lnG(e‘)  = pt  + er2f2/ 2 + K3t3/3!  + • ■ • as  in  Eq.  1.2.10-(23),  and 
there  is  a positive  constant  5 such  that  3 1 1,1/ j\  < <j2t2/6  when  |t|  < 5.  Hence,  if 

0 < e < 1/2,  we  can  prove  that 


[z»n+r]G{z)n  = ±_  J' 


* G(elt)n  dt 


1/2 


-irt 


+ 0(n3£-1/2))  dt  + 0{e~cn2‘) 


as  n — >■  oo,  for  some  constant  c > 0:  The  integrand  for  ne_1/2  < |f|  < <5  is  bounded  in 
absolute  value  by  exp(— cr2n2e/3);  and  when  8 < |f|  < n its  magnitude  is  at  most  an, 
where  a = max|G(e,()|  is  less  than  1 because  the  individual  terms  pkeklt  don’t  all  lie 
on  a straight  line  by  our  assumption.  Thus 


[^n+r]  G(z)n  = exp  (-irt  -^  + 0(n3e-1/2))  dt  + 0(e~ 

h />(-£(* + £)'-  ^ + <*■-■'■>)  * + <*«--’■> 


-r2/(2<r  2n) 

+ 0(n3e_1 ) 


a\[2i m 


By  taking  account  of  «3,  K4,  ...  in  a similar  way  we  can  refine  the  estimate  to  0(n~m) 
for  arbitrarily  large  m;  thus  the  result  is  valid  also  for  e = 0.  [In  fact,  such  refinements 
lead  to  the  “Edgeworth  expansion,”  according  to  which  [zfm+r]  G(z)n  is  asymptotic  to 


e-”2/!2"2")  y,  ( — l)s(2Z  + m)—  r2i+m-2s 

a^2nn  ^ ff4l+2m-2s2s  s\  nl+m-s 

k 1 +2/e2+3fc3H = m 

fcl  +^2  + ^3“! = i 

fcl ,^2  J&3 , . . . > 0 
0<s<i  + m/2 


1 

fci ! k2 ! ... 


(tror 


the  absolute  error  is  0(n-p'2),  where  the  constant  hidden  in  the  O depends  only  on  p 
and  G but  not  on  r or  n,  if  we  restrict  the  sum  to  cases  with  m < p — 1.  For  example, 
when  p = 3 we  get 


[z»n+r]G{z)n  = 


s-r2/(2<r2n) 

<j\f2irn 


1 - 


«3 

2cr4 


(^)  + S(^))+0(^)’ 


and  there  are  seven  more  terms  when  p = 4.  [See  P.  L.  Chebyshev,  Zapiski  Imp.  Akad. 
Nauk  55  (1887),  No.  6,  1-16;  Acta  Math.  14  (1890),  305-315;  F.  Y.  Edgeworth, 
Trans.  Cambridge  Phil.  Soc.  20  (1905),  36-65,  113-141;  H.  Cramer,  Skandinavisk 
Aktuarietidsskrift  11  (1928),  13-74,  141-180.] 

54.  Formula  (40)  is  equivalent  to  a = scoths  + s,  /3  = scoths  — s. 

55.  Let  c = ae~a . The  Newtonian  iteration  f3 0 = c,  fik+i  = (1  — fik)ce0k/{\  — ce~^k) 
rises  rapidly  to  the  correct  value,  unless  a is  extremely  close  to  1.  For  example,  /I7 
differs  from  In  2 by  less  than  10-75  when  a = In  4. 


56.  (a)  By  induction  on  n,  g 


(n+l) 


(*)  = (-!)"( 


ELo  Oin-k)z 

a(ez  — l)n+1 


,n+l 


*)• 


id 


t 


re 
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(b)  YjI=  o (fc)efc7n!  = fo  ■ ■ ■ fo  exp( Lm!  4 h Unjo-)  dui  . . . dun 

< fo  exp((“i  4 1-  Un)fr)  duj  . . . dun  = (e'T-l)n/an. 

The  lower  bound  is  similar,  since  |_iti  + (-  un J > ui  H + un  — 1. 

(c)  Thus  n!  (1— /3/a)  < (— cr)"g*n+1'(cr)  < 0,  and  we  need  only  verify  that  1— /3/a  < 
2(1  - /3),  namely  that  2 a/3  < a + /3.  But  a/3  < 1 and  a + /3  > 2,  by  exercise  54. 

57.  (a)  n + 1 — m = (n  + 1)(1  - 1/a)  < (n  + 1)(1  - /3/a)  = (n  + l)cr/a  < 21V  as  in 

answer  56(c).  (b)  The  quantity  a + a/3  increases  as  a increases,  because  its  derivative 

with  respect  to  a is  1 + /3  + /3(1  - a)/(l  - /3)  = (1  - a/3)/(  1 - /3)  + /3  > 0.  Therefore 

1 -/3  < 2(1  - 1/a). 

58.  (a)  The  derivative  of  \ea+lt  - 1|2/| er  + it|2  = (e'T+it  - l)(e,T_i‘  - l)/(cr2  + t2)  with 
respect  to  t is  ( a 2 + f2)  sint  - t(2sin  |)2  - (2sinh  f )2f  times  a positive  function.  This 
derivative  is  always  negative  for  0 < t < 2n.  because  it  is  less  than  f2 sin  t - 1(2 sin  |)2  = 
8usinucosu(u  — tanu)  where  t = 2u. 

Let  s = 2sinh  ^ . When  a > n and  2n  < t < 4tt,  the  derivative  is  still  negative, 
because  we  have  t < 4n  < s2  - <t2/(2t t)  < s2  - a2/t.  Similarly,  when  a > 2?r  the 
derivative  remains  negative  for  4n  < t < 1687r;  the  proof  gets  easier  and  easier. 

(b)  Let  t = ua/\/N.  Then  (41)  and  (42)  prove  that 


e(n+1)g(a+it) 


dt  = 


(ea  - l)m 

cr"V/V 


( -iu)3a3 

N 1/2 


( -iu)lai 
lVi/2-1 


+ o(iv((+1)£-(i-1)/2)jd«, 


where  (1  - P)ak  is  a polynomial  of  degree  k - 1 in  a and  /?,  with  0 < ak  < 2/k.  (For 
example,  6a3  = (2  - /3(a  + /3))/(l  - ^)  and  24a4  = (6  - /3(a2  + 4a/3  + /32))/(l  - /?).) 
The  monotonicity  of  the  integrand  shows  that  the  integral  over  the  rest  of  the  range  is 
negligible.  Now  trade  tails,  extend  the  integral  over  -00  < u < 00,  and  use  the  formula 
of  answer  44  with  ck  = 2 k/2ak  to  define  bx,  b2,  ...  . 

(c)  We  will  prove  that  \ez  — 1| ,ncrn+1  /((ea  — l)m|2|n+1)  is  exponentially  small  on 
those  three  paths.  If  a < 1,  this  quantity  is  less  than  1 / ( 2 7r) " + 1 (because,  for  example, 
ea  — 1 > a).  If  a > 1,  we  have  cr  < 2\z\  and  \ez  — 1|  < e”  — 1. 

59.  In  this  extreme  case,  a = 1 + n~l  and  (3  = 1 - n-1  + |n-2  + 0(n~3);  hence  N = 
1 + f + 0(n~2).  The  leading  term  P~n/V2nN  is  e/y/2n  times  1 - |n_1  + 0(n~2). 
(Notice  that  e/\/27r  w 1.0844.)  The  quantity  ak  in  answer  58(b)  turns  out  to  be 
1 /k  + 0(n_1).  So  the  correction  terms,  to  first  order,  are 


7/7  = L2  J exP( 


(-Y  -B2^2k~1)+o(f) 
V ^ 2k(2k-  1)J  + \n)7 


namely  the  terms  in  the  (divergent)  series  corresponding  to  Stirling’s  approximation 

1 -J—(\  1 1 139  571  \ 

1!  ~ \/2^\  12  + 288  + 51840  2488320 


60.  (a)  The  number  of  m-ary  strings  of  length  n in  which  all  m digits  appear  is  m!  { " } , 
and  the  inclusion-exclusion  principle  expresses  this  quantity  as  (™)mn  - (™)(m-  1)"  + 
• • • . Now  see  exercise  7.2.1.4-37. 
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(b)  We  have  ( m — 1 )n/{m  - 1)!  = (mn/m\)mexp(n  ln(l  — 1/m)),  and  ln(l  — 1/m) 
is  less  than  — ne_1. 

(c)  In  this  case  a > ne  and  fi  = ae~ae3  < ae1~a.  Therefore  1 < (1  - /3/a)m~n  < 
exp (nO(e~a));  and  1 > e~Pm  = e-<n+1)'3/“  > exp(-nO(e_a)).  So  (45)  becomes 
(mn/m!)(l  + 0(n-1)  + 0(ne~nC)). 

61.  Now  a = 1 + £ + 0(n2e~2)  and  /3  = 1 - £ + 0(n2e_2).  Thus  IV  = r + 0(n2e_1), 
and  the  case  l = 0 of  Eq.  (43)  reduces  to 


(This  approximation  meshes  well  with  identities  such  as  {„-J  = (2)  and  = 

2(”)  + ("I');  indeed,  we  have 


when  r is  constant,  according  to  formulas  (6.42)  and  (6.43)  of  CMath.) 

62.  The  assertion  is  true  for  1 < n < 10000  (with  m = [e?  - lj  in  5648  of  those 
cases).  E.  R.  Canfield  and  C.  Pomerance,  in  a paper  that  nicely  surveys  previous  work 
on  related  problems,  have  shown  that  the  statement  holds  for  all  sufficiently  large  n, 
and  that  the  maximum  occurs  in  both  cases  only  if  e?  mod  1 is  extremely  close  to  - . 
[Integers  2 (2002),  Al,  113.] 

63.  (a)  The  result  holds  when  pi  = • • • = pn  = p,  because  ak-i/ak  = (k/(n+l-k))  x 
((n  — p)/p)  < (n  — p)/(n  + 1 — p)  < 1.  It  is  also  true  by  induction  when  pn  = 0 or  1. 
For  the  general  case,  consider  the  minimum  of  ak  — ak-\  over  all  choices  of  (pi, , pn) 
with  pi  + ■ ■ ■ + pn  = p,:  If  0 < pi  < P2  < 1,  let  pi  = px  — & and  P2  = P2  + <5;  and  notice 
that  a'k  — a'k_i  = ak  — dfc-i  +<5(pi  — P2  — S) a for  some  a depending  only  on  P3,  . . . , pn. 
At  a minimum  point  we  must  have  a = 0;  thus  we  can  choose  (5  so  that  either  pi  = 0 
or  p'n  = 1 . The  minimum  can  therefore  be  achieved  when  all  pj  have  one  of  three  values 
{0,  l,p}.  But  we  have  proved  that  ak  — ak- 1 > 0 in  such  cases. 

(b)  Changing  each  pj  to  1 — pj  changes  p,  to  n — p and  ak  to  a„~k. 

(c)  No  roots  of  f(x)  are  positive.  Hence  f(z)/f(  1)  has  the  form  in  (a)  and  (b). 

(d)  Let  C(f)  be  the  number  of  sign  changes  in  the  sequence  of  coefficients  of  /; 
we  want  to  show  that  C((l  - x)2 f)  = 2.  In  fact,  C((l  - x)m  f)  = m for  all  m > 0. 
For  C((  1 — x)m)  = m,  and  C((a  + bx)f)  < C(f)  when  a and  b are  positive;  hence 
C((l  — x)m  f)  < m.  And  if  f(x)  is  any  nonzero  polynomial  whatsoever,  C((l  — x)f ) > 
C(f)\  hence  C((l  — x)m f)  > m. 

(e)  Since  Ylk  \^\xk  = x(a:  + l) . . . (x  + n — 1),  part  (c)  applies  directly  with  p = Hn. 
And  for  the  polynomials  f„(x)  = J2k  {l}xk,  we  can  use  part  (c)  with  p = wn+i/zon-l, 
if  fn(x ) has  n real  roots.  The  latter  statement  follows  by  induction  because  fn+i(x ) = 
x(fn(x)  + fn(x))'-  If  a > 0 and  if  f(x)  has  n real  roots,  so  does  the  function  g(x ) = 
eaxf(x).  And  g(x)  -+0asi->  -00;  hence  g'(x)  = eax(af(  x)  + f'(x))  also  has  n real 
roots  (namely,  one  at  the  far  left,  and  n — 1 between  the  roots  of  g(x)). 

[See  E.  Laguerre,  J.  de  Math.  (3)  9 (1883),  99-146;  W.  Hoeffding,  Annals  Math. 
St  at.  27  (1956),  713-721;  J.  N.  Darroch,  Annals  Math.  St  at.  35  (1964),  1317-1321; 
J.  Pitman,  J.  Combinatorial  Theory  A77  (1997),  297-303.] 

64.  We  need  only  use  computer  algebra  to  subtract  lnro„  from  lnvon-k. 
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65.  It  is  G7„ 1 times  the  number  of  occurrences  of  fc-blocks  plus  the  number  of  occur- 
rences of  ordered  pairs  of  fc-blocks  in  the  list  of  all  set  partitions,  namely  ((^)c7n_fc  + 
(fc)  (n^k)^n-2k)/zun,  minus  the  square  of  (49).  Asymptotically,  (£7^0(1  + 0(n4e~1)). 

66.  (The  maximum  of  (48)  when  n = 100  is  achieved  for  the  partitions  71625446372614 
and  71625446382513.) 

67.  The  expected  value  of  Mk  is  wn+k/zun.  By  (50),  the  mean  is  therefore  w„+i/wn  = 
n/£  + £/(2(£  -I-  l)2)  + 0(n_1),  and  the  variance  is 


2 

^n+2  _ ^n  + l 


€(2g+l) 

(£  + 1 )2n 


e 

(£  + l)2n 


n 

7C+T) 


+ 0(1). 


68.  The  maximum  number  of  nonzero  components  in  all  parts  of  a partition  is  n — 

n 1 + h nm;  it  occurs  if  and  only  if  all  component  parts  are  0 or  1.  Then  the  values 

of  l + 1 = n and  b = mn  1 + (m  — l)ri2  + • • • + nm  reach  their  maximum.  [Thus  it’s  best 
to  choose  names  of  the  multiset  elements  so  that  m < n2  < • • • < nm .] 

69.  At  the  beginning  of  step  M3,  if  k > b and  l = r - 1,  go  to  M5.  In  step  M5,  if  j = a 
and  ( Vj  — l)(r  — l)  < Uj,  go  to  M6  instead  of  decreasing  Vj. 

70-  (a)  |"lj|  + |"I2|  -I 1-  | , since  |"“*|  contain  the  block  {0, ...  ,0, 1}  with  k 0s. 

The  total,  also  known  as  p(n  — 1, 1),  is  p(n  — 1)  + • ■ ■ + p(l)  + p( 0). 

(b)  Exactly  N = {”7} + {"-2}  of  the  T'-block  partitions  of  {1, . . . ,n  — 1,  n}  are  the 
same  if  we  interchange  n-1  On.  So  the  answer  is  Ar+f({”}-./V)  = !({"}+jV),  which 
is  also  the  number  of  restricted  growth  strings  ai  . . ,an  with  max(ai, . . . ,a„)  = r - 1 
and  a„_i  < an.  And  the  total  is  + wn-\  + wn-2). 

71.  [f  («i  + l)  • • • (nm  + l)-  |J,  because  there  are  (m  + 1) . . . (nm  + l)-2  compositions 
into  two  parts,  and  half  of  those  compositions  fail  to  be  in  lexicographic  order  unless 
all  nj  are  even.  (See  exercise  7.2.1.4-31.  Formulas  for  up  to  5 parts  have  been  worked 
out  by  E.  M.  Wright,  Proc.  London  Math.  Soc.  (3)  11  (1961),  499-510.) 

72.  Yes.  The  following  algorithm  computes  ajk  = p(j,k)  for  0 < j,  k < n in  0(n4) 

steps:  Start  with  djk  i — 1 for  all  j and  k.  Then  for  l = 0,  1,  . . . , n and  m = 0, 
1,  . . . , n (in  any  order),  if  l + m > 1 set  a,jk  for  j = l,  . . . , n and 

k = m,  . . . , n (in  increasing  order). 

(See  Table  A-l.  A similar  method  computes  p(m, . . . , nm)  in  0(ni  . . . nm)2  steps. 
Cheema  and  Motzkin,  in  the  cited  paper,  have  derived  the  recurrence  relation 


OO 

nip(rn, . . . ,nm)  = ^ kip(m  - kil, . . . , nm  - kml), 

1=1 

but  this  interesting  formula  is  helpful  for  computation  only  in  certain  cases.) 


Table  A-l 

MULTIPARTITION  NUMBERS 


n 

0 

1 2 3 

4 

5 6 

n 

0 

1 

2 

3 

4 

5 

P(0,n) 

1 

1 2 3 

5 

7 11 

P(0,n) 

1 

2 

9 

66 

712 

10457 

p(l,n) 

1 

2 4 7 

12 

19  30 

P(l,n) 

1 

4 

26 

249 

3274 

56135 

p(2>  n) 

2 

4 9 16 

29 

47  77 

P(2,n) 

2 

11 

92 

1075 

16601 

325269 

p(3,n) 

3 

7 16  31 

57 

97  162 

P(3,n) 

5 

36 

371 

5133 

91226 

2014321 

p(4,  n) 

5 12  29  57  109  189  323 

P(4,n) 

15  135  1663 

26683 

537813  13241402 

p(5,n) 

7 19  47  97  189  339  589 

P(5,n) 

52  566  8155  149410  3376696  91914202 
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73.  Yes.  Let  P(m,n)  = p(  1, . . . , 1,  2, . . . , 2)  when  there  are  m Is  and  n 2s;  then 
P(m,  0)  = ujm,  and  we  can  use  the  recurrence 


2P(m,n+l)  = P(m  + 2,  n)  + P(m  + 1,  n)  + 

k 

This  recurrence  can  be  proved  by  considering  what  happens  when  we  replace  a pair 
of  x's  in  the  multiset  for  P(m,  n + 1)  by  two  distinct  elements  x and  x . We  get 
2P(m,  n + 1)  partitions,  representing  P(m  + 2 , n),  except  in  the  P(m  + l,n)  cases 
where  x and  x'  belong  to  the  same  block,  or  in  (")P(m,  n - k)  cases  where  the  blocks 
containing  x and  x'  are  identical  and  have  k additional  elements. 

Notes:  See  Table  A-l.  Another  recurrence,  less  useful  for  computation,  is 

«»+>.•> . £(;)(*■ 

The  sequence  P(0,  n)  was  first  investigated  by  E.  K.  Lloyd,  Proc.  Cambridge  Philos. 
Soc.  103  (1988),  277-284,  and  by  G.  Labelle,  Discrete  Math.  217  (2000),  237-248,  who 
computed  it  by  completely  different  methods.  Exercise  70(b)  showed  that  P(m,  1)  = 
(rum  + rcjm+i  + c7m+2)/2;  in  general  P(m,n)  can  be  written  in  the  umbral  notation 
Gj,nqn(xu),  where  qn(x)  is  a polynomial  of  degree  2n  defined  by  the  generating  function 
Qn{x)zn/n\  = exp((ez  + (x  + x2)z  — l)/2).  Thus,  by  exercise  31, 


x>( 


m^nj- 
ni 


(e*-l)/2  Z0(lk+m+l)(k+m+ 1)  Z 


2k 


k\  ' 


Labelle  proved,  as  a special  case  of  much  more  general  results,  that  the  number  of 
partitions  of  {1, 1, . . . , n,  n}  into  exactly  r blocks  is 


n\  [ xrzn ] e~x+x2^z~1'>l2  y czk(k+ 1)/2  x 

k\  ’ 


75.  The  saddle  point  method  yields  CeAn2/3+s’,1/3/n55/36,  where  A = 3C(3)1^3,  B = 
tt2C(3)-1/3/2,  and  C = C(3) 19/36 (2tt ) ~5/6 3“ 1/2  exp(l/3  + P2/4  + C'(2)/(2tt2)  - 7/12). 
[F.  C.  Auluck,  Proc.  Cambridge  Philos.  Soc.  49  (1953),  72-83;  E.  M.  Wright,  American 
J.  Math.  80  (1958),  643-658.] 

76.  Using  the  fact  that  p(n1,n2,n3, . . . ) > p(n i + n2,n3, . . .),  hence  P(m  + 2 ,n)  > 
P(m,  n + 1),  one  can  prove  by  induction  that  P(m,  n + 1)  > ( m + n+  1 )P(m,  n).  Thus 

2 P{m,n)  < P(m  + 2,  n - 1)  + P(m  + l,n  - 1)  + eP(rri,n  - 1). 

Iterating  this  inequality  shows  that  2nP(0,n)  = (zu2  + w)n  + 0(n(xu2  + = 

(nct72n-i+^2n)(l+0((logu)3/n)).  (A  more  precise  asymptotic  formula  can  be  obtained 
from  the  generating  function  in  the  answer  to  exercise  73.) 

78.  333321000 

100022320  (because  the  encoded  partitions 
221002102  must  all  be  (000000000)) 

210220013 

79.  There  are  432  such  cycles.  But  they  yield  only  304  different  cycles  of  set  partitions, 
since  different  cycles  might  describe  the  same  sequence  of  partitions.  For  example, 
(000012022332321)  and  (000012022112123)  are  partitionwise  equivalent. 
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80.  [See  F.  Chung,  P.  Diaconis,  and  R.  Graham,  Discrete  Mathematics  110  (1992), 
52-55.]  Construct  a digraph  with  wn-i  vertices  and  wn  arcs;  each  restricted  growth 
string  ai  . . . an  defines  an  arc  from  vertex  a\  . . . an-i  to  vertex  p(a 2 . . . an),  where  p is 
the  function  of  exercise  4.  (For  example,  arc  01001213  runs  from  0100121  to  0110203.) 
Every  universal  cycle  defines  an  Eulerian  trail  in  this  digraph;  conversely,  every  Eulerian 
trail  can  be  used  to  define  one  or  more  universal  sequences  of  restricted  growth  on  the 
elements  {0, 1, . . . , n — 1}. 

An  Eulerian  trail  exists  by  the  method  of  Section  2. 3. 4. 2,  if  we  let  the  last  exit 
from  every  nonzero  vertex  ai  . . . o„_  1 be  through  arc  a\  . . . an-\an- 1-  The  sequence 
might  not  be  cyclic,  however.  For  example,  no  universal  cycle  exists  when  n < 4;  and 
when  n = 4 the  universal  sequence  000012030110100222  defines  a cycle  of  set  partitions 
that  does  not  correspond  to  any  universal  cycle. 

The  existence  of  a cycle  can  be  proved  for  n > 6 if  we  start  with  an  Eulerian 
trail  that  begins  Qnxyxn~3u{uv)^n~2^2^v!Ln°id'i  for  some  distinct  elements  {u,v,x,y}. 
This  pattern  is  possible  if  we  alter  the  last  exit  of  0fc121n_3~fc  from  0fc_1121n_2_fe  to 
0k~1 121n~3~k2  for  2 < k < n— 4,  and  let  the  last  exits  of  0121n-4  and  0T'_32  be  respec- 
tively 010rl_4l  and  0n-310.  Now  if  we  choose  numbers  of  the  cycle  backwards,  thereby 
determining  u and  v,  we  can  let  x and  y be  the  smallest  elements  distinct  from  {0,  u,  ?;}. 

We  can  conclude  in  fact  that  the  number  of  universal  cycles  having  this  extremely 
special  type  is  huge  — at  least 

n — 1 

n(^-*)){  k ^)/((n  ~ !)•  (n  — 2)332”~522),  when  n > 6. 

fc=2  ' 

Yet  none  of  them  are  known  to  be  readily  decodable.  See  below  for  the  case  n = 5. 

81.  Noting  that  = 52,  we  use  a universal  cycle  for  {1,2, 3, 4, 5}  in  which  the 
elements  are  13  clubs,  13  diamonds,  13  hearts,  12  spades,  and  a joker.  One  such  cycle, 
found  by  trial  and  error  using  Eulerian  trails  as  in  the  previous  answer,  is 

(In  fact,  there  are  essentially  114,056  such  cycles  if  we  branch  to  at,  = ak- 1 as  a last 
resort  and  if  we  introduce  the  joker  as  soon  as  possible.)  The  trick  still  works  with 
probability  ||  if  we  call  the  joker  a spade. 

82.  There  are  13644  solutions,  although  this  number  reduces  to  1981  if  we  regard 


e 


1 


The  smallest  common  sum  is  5/2,  and  the  largest  is  25/2;  the  remarkable  solution 


is  one  of  only  two  essentially  distinct  ways  to  get  the  common  sum  118/15.  [This 
problem  was  posed  by  B.  A.  Kordemsky  in  Matematicheskaia  Smekalka  (1954);  it  is 
number  78  in  the  English  translation,  The  Moscow  Puzzles  (1972).] 

SECTION  7. 2. 1.6 

1.  It  could  “see”  a left  parenthesis  at  the  left  of  every  internal  node  and  a right 
parenthesis  at  the  bottom  of  every  internal  node.  Alternatively,  it  could  associate  right 
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parentheses  with  the  external  nodes  that  it  encounters  — except  for  the  very  last  □; 
see  exercise  20. 


2.  Zl.  [Initialize.]  Set  zk  2k  — 1 for  0 < k < n.  (Assume  that  n > 2.) 

Z2.  [Visit.]  Visit  the  tree-combination  Z1Z2  ■ ■ .zn. 

Z3.  [Easy  case?]  If  2n_ i < zn  — 1,  set  zn  <—  zn  — 1 and  return  to  Z2. 

Z4.  [Find  j]  Set  j <-  n - 1 and  zn  <-  2n  - 1.  While  Zj-i  = Zj  - 1,  set  zj  «-  2j  - 1 
and  j <—  j — 1 . 

Z5.  [Decrease  Terminate  the  algorithm  if  j = 1.  Otherwise  set  Zj  <—  Zj  - 1 
and  go  back  to  Z2.  | 


3.  Label  the  nodes  of  the  forest  in  preorder.  The  first  Zk  — 1 elements  of  ai  . . . a2n 
contain  k — 1 left  parentheses  and  Zk  — k right  parentheses.  So  there  is  an  excess  of 
2k-l-zk  left  parentheses  over  right  parentheses  when  the  “worm”  first  reaches  node  k; 
and  2k  — 1 — zk  is  the  level  (or  depth)  of  that  node. 

Let  <?i  ...  q„  be  the  inverse  of  pi  . . . pn,  so  that  node  k is  the  qk  th  node  in  postorder. 
Since  k occurs  to  the  left  of  j in  pi  . . . pn  if  and  only  if  qk  < qj,  we  see  that  ck  is  the 
number  of  nodes  j that  precede  k in  preorder  but  follow  it  in  postorder,  namely  the 
number  of  proper  ancestors  of  k;  again,  this  is  the  level  of  k. 

Alternative  proof:  We  can  also  show  that  both  sequences  z2  ...  zn  and  a ...cn  have 
essentially  the  same  recursive  structure  as  (5):  Zpq  = {Zp(q_x)  + lp),  l(Z(p_i)9  + lp_1) 
when  0 < p < q;  and  Cpq  = Cp(q_i),  {q-p)C(p-^q.  (Consider  the  mate  of  the  last, 
next-to-last,  etc.,  left  parenthesis.) 

Incidentally,  the  formula  lck+i  + dk  = ck  + T is  equivalent  to  (11). 


4.  Almost  true;  but  d\  . . . dn  and  z\  . . . zn  occur  in  decreasing  order,  while  pi  . ■ .pn 
and  ci  ...cn  are  increasing.  (This  lexicographic  property  for  a sequence  of  permutations 
Pi  ■■  Pn  is  not  automatically  inherited  from  lexicographic  order  of  the  corresponding 
inversion  tables  cj  . . . c„;  but  the  result  does  hold  for  this  particular  class  of  pi  . . .p„.) 

5.  di  ..  .dis  = 020020010320104;  ...215  = 125671011  1214151922232526; 

Pi  ..-Pis  = 215481097116131514123;  C1...C15  = 010121233421223. 

6.  Match  up  the  parentheses  as  usual;  then  simply  curl  the  string 
and  around  until  a2n  becomes  adjacent  to  ai,  and  notice  that  the 
distinction  between  left  and  right  parentheses  can  be  reconstructed 
from  the  context.  Letting  a 1 correspond  to  the  bottom  of  the  circle, 
as  in  Table  1,  yields  the  diagram  shown.  [A.  Errera,  Memoires  de 
la  Classe  Sci.  8°,  Acad.  Royale  de  Belgique  (2)  11,6  (1931),  26  pp.] 

7.  (a)  It  equals  ))()  ...  ();  setting  ai  <—  ‘ (’  will  restore  the  init 
string,  (b)  The  initial  binary  tree  (from  step  Bl)  will  have  been  restored, 
except  that  ln  = n + 1. 


8.  Zi  ...  /15  = 2 0 4 5 0 7 8 0 10  0 0 13  0 15  0;  ri  ...  n5  = 3 0 0 6 0 12  11  9 0 0 0 0 14  0 0; 
ei  ...  eis  = 1 0 3 1 0 2 2 0 1 0 0 2 0 1 0;  si  ...  s15  = 1 0 12  1 0 5 3 0 1 0 0 3 0 1 0. 

9.  Node  j is  a (proper)  ancestor  of  node  k if  and  only  if  j < k and  s3  + j > k.  (As  a 

consequence,  we  have  ci  -| fc„  = Si  -) h sn-) 

10.  If  j is  the  index  zk  of  the  fcth  left  parenthesis,  we  have  Wj  = ck  + 1 and  wp  = ck, 
where  j’  is  the  index  of  the  matching  right  parenthesis. 

11.  Swap  left  and  right  parentheses  in  a2n  ■ ■ ■ ai  to  get  the  mirror  image  of  ai  . . . a2n. 
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but  the  significance  of  transposition  is  clearer,  forest-wise,  if  we  draw  right-sibling  and 
left-child  links  horizontally  and  vertically,  then  do  a matrix-like  transposition: 


and  postorder(FH)  = preorder (F)R. 

(b)  Let  F correspond  to  the  binary  tree  B:  then  preorder(F)  = preorder(B) 
and  postorder(F)  = inorder(B),  as  noted  after  2.3.2-(6).  Therefore  preorder(FT)  = 
preorder  = postorder(B)fi  has  no  simple  relationship  to  either  preorder  ( F)  or 
postorder(F).  But  postorder(FT)  = inorder(Bfl)  = inorder(J3)fl  = postorder(F)fl. 

14.  According  to  answer  13,  postorder(FflT)  = preorder(F)  = preorder(B)  when  F 
corresponds  naturally  to  B ; and  postorder(FTB)  = preorder(F7  )H  = postorder(F). 
Therefore  the  equation  FRT  = F™  holds  if  and  only  if  F has  at  most  one  node. 

15.  If  Fr  corresponds  naturally  to  the  binary  tree  B' , the  root  of  B'  is  the  root  of  F’s 
rightmost  tree.  The  left  link  of  node  x in  B'  is  to  the  leftmost  child  of  x in  FR,  which 
is  the  rightmost  child  of  x in  F;  similarly,  the  right  link  is  to  x’s  left  sibling  in  F. 

Note:  Since  B corresponds  naturally  to  FRT,  answer  13  tells  us  that  inorder(F)  = 
postorder(Fflr)  = postorder(Ffl)fl  = preorder(F). 

16.  The  forest  F | G is  obtained  by  placing  the  trees  of  F below  the  first  node  of  G in 
postorder.  Associativity  follows  because  F|  (G| H)  = (HTGTFT)T  = (F|G)|/F  Notice, 
incidentally,  that  postorder(F  | G)  = postorder(F)  postorder(G),  and  that  F | ( GH ) = 
(F  | G)H  when  G is  nonnull. 

17.  Any  nonnull  forest  can  be  written  F = (G  | •)//,  where  • denotes  the  1-node  forest; 
then  Fr  = HR(GR  \ •)  and  FT  = (H7  \ -)GT . In  particular  we  cannot  have  FR  = FT 
unless  H is  the  null  forest  A,  since  the  first  tree  of  HR  can’t  be  H ' | ■ ; and  G must 
then  also  be  A.  Furthermore  F = FT  if  and  only  if  G = HT . In  that  case  we  cannot 
also  have  FR  — FRT  unless  G = A;  the  first  tree  of  GTR  would  otherwise  have  more 
nodes  than  G itself. 


7.2. 1.6 


ANSWERS  TO  EXERCISES  783 


It  appears  to  be  true  that  we  cannot  have  FRT  = FTR  unless  F — FR . Under 
that  assumption,  FRT  = FTR  if  and  only  if  F and  FT  are  both  self-conjugate.  David 
Callan  has  discovered  two  infinite  families  of  such  forests,  with  parameters  i,j,  k > 0: 


(In  these  examples,  i = 2,  j = 3,  and  k = 5.)  Are  there  any  other  cases? 

18.  The  C15  = 9,694,845  forests  are  partitioned  into  20,982  classes.  The  largest  is 
a cycle  of  length  58,968,  one  of  whose  elements  is  (( ()  (()))())()((()(())())())() . 
The  shortest  are  six  two-element  classes  (corresponding  to  exercise  17),  consisting  of 

()()()()()()()()()()()()()()(),  ()()()()((()()()()()))()()()(), 

()()()(((((()()()())))))()()(),  ()()((((((((()()()))))))))()(), 

()((()())((()(())()))(()()))(),  ()(((((((((((()())))))))))))(), 

and  their  transposes.  The  somewhat  strange  strings  (((((((())))))))()()()()()()(), 
()()()()()()()(((((((()))))))),  and  (((((((()()()()()()()())))))))  each  have 
wedge-shaped  binary  trees  and  form  a unique  class  of  size  3.  The  path  that  runs  from 
()((()(()()))(())((()())()))()  to  ((()())(()())(())(()())(()()))  has  3120  el- 
ements, one  of  which  is  (2).  According  to  the  conjecture  in  answer  19,  the  shortest 
possible  cycle  has  length  6;  when  n = 15  there  are  66  such  cycles.  (The  next-shortest 
cycle,  which  is  unique,  has  length  10  and  includes  ()(()()())()((((())()))((()))).) 

19.  The  transformation  from  Fj  to  Fj+ 1 by  Algorithm  P can  be  paraphrased  as  follows: 
“Find  the  last  node  in  preorder,  say  x,  that  has  a left  sibling,  say  y.  Remove  x from 
its  family  and  make  it  the  new  rightmost  child  of  y.  And  if  x < n,  change  all  of  x’s 
descendants  x + 1,  . . . , n into  trivial  one-node  trees.” 

The  transformation  that  takes  FR  into  FR+1  can  therefore  be  stated  as  follows, 
if  we  recall  that  the  kth  node  of  Fj  in  preorder  is  the  fcth-from-last  node  of  FR  in 
postorder:  “Find  the  first  node  in  postorder,  say  x,  that  has  a right  sibling,  say  y. 
Remove  x from  its  family  and  make  it  the  new  leftmost  child  of  y.  And  if  x > 1, 
change  all  of  x’s  descendants  x - 1,  . . . , 1 into  trivial  one-node  trees.” 

Similarly,  we  can  paraphrase  the  transformation  from  Gj  to  Gj+1  that  is  specified 
by  Algorithm  B:  “Find  j,  the  root  of  the  leftmost  nontrivial  tree;  then  find  k,  its 
rightmost  child.  Remove  k and  its  descendants  from  j’s  family,  and  insert  them  between 
j and  j’s  right  sibling.  Finally,  if  j > 1,  make  j and  its  right  siblings  all  children  of 
j — 1,  and  j — 1 a child  of  j — 2,  etc.” 

When  this  transformation  changes  the  left-sibling/right-child  representation  from 
Gj  T to  Gj+i  (see  exercise  15),  it  turns  out  to  be  identical  to  the  transformation  that 
takes  Fr  to  FR+l  in  the  left-child/right-sibling  representation.  Therefore  GRT  = FR, 
because  this  identity  clearly  holds  when  j = 1. 
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(It  follows  that  the  sequence  of  tables  ei  . ..en-i  for  the  binary  trees  generated 
by  Algorithm  B is  exactly  the  sequence  of  tables  dn-i  . • • di  for  the  parenthesis  strings 
generated  by  Algorithm  P;  this  phenomenon  is  illustrated  in  Tables  1 and  2.) 

Several  symmetries  between  lists  of  forests  have  been  explored  by  M.  C.  Er  in 
Comp.  J.  32  (1989),  76-85. 

20.  (a)  This  assertion,  which  generalizes  Lemma  2.3. IP,  is  readily  proved  by  induction, 
(b)  The  following  procedure  is,  in  fact,  almost  identical  to  Algorithm  P: 

Tl.  [Initialize.]  Set  63^-2  4—  3 and  b3k~i  4—  hut  4—  0 for  1 < k < n;  also  set  bo  4— 
&jv  4—  0 and  m <—  N — 3,  where  TV  = 3n  + 1. 

T2.  [Visit.]  Visit  bi  . . . bn-  (Now  bm  = 3 and  bm+i  . . . bjv  = 0 ...  0.) 

T3.  [Easy  case?]  Set  bm  4—  0.  If  6m-i  = 0,  set  bm-i  4—  3,  m 4—  m — 1,  and  go  to  T2. 

T4.  [Find  j.]  Set  j 4-  m - 1 and  k i-  N - 3.  While  bj  = 3,  set  bj  4-  0,  bk  4-  3, 
j *—  j — 1,  and  k 4 — k — 3. 

T5.  [Increase  bj.]  Terminate  the  algorithm  if  j = 0.  Otherwise  set  bj  4—  3,  m 4-  N — 3, 
and  return  to  T2.  | 

[See  S.  Zaks,  Theoretical  Comp.  Sci.  10  (1980),  63-82.  In  that  article,  Zaks  pointed 
out  that  it  is  even  easier  to  generate  the  sequence  z\  . . . zn  of  indices  j such  that  bj  = 3, 
using  an  algorithm  virtually  identical  to  the  answer  to  exercise  2,  because  a valid  ternary 
tree  combination  Z\  ...  zn  is  characterized  by  the  inequalities  Zk- 1 < zk  < 3 k - 2.] 

21.  For  this  problem  we  can  essentially  combine  Algorithm  P with  Algorithm  7.2.1 ,2L. 
We  shall  assume  for  convenience  that  nt>  0 and  ni  + ■ • • + nt  > 1. 

Gl.  [Initialize.]  Set  l 4—  N.  Then  for  j = t,  . . . , 2,  1 (in  this  order),  do  the  following 
operations  rij  times:  Set  bi-j  4-  j,  bt-j+1  4-  • • ■ 4-  b;_i  4-  0,  and  l 4-  l - j. 
Finally  set  bo  t—  bjv  4-  Co  4—  0 and  ra<-iV-(. 

G2.  [Visit.]  Visit  bi  . . . bjv-  (At  this  point  bm  > 0 and  bm+ 1 = •--==  bjv  = 0.) 

G3.  [Easy  case?]  If  bm-i  = 0,  set  bm-i  4—  bm,  bm  4—  0,  m 4—  m — 1,  and  return  to  G2. 

G4.  [Find  j .]  Set  ci  4—  bm,  bm  4—  0,  j 4—  m — 1,  and  k 4—  1.  While  bj  > c*,,  set 

k 4—  k + 1,  Ck  4—  bj,  bj  4—  0,  and  j 4—  j — 1. 

G5.  [Increase  bj.\  If  bj  > 0,  find  the  smallest  l > 1 such  that  bj  < c/ , and  interchange 
bj  4A  c,.  Otherwise,  if  j > 0,  set  bj  4—  ci  and  ci  4—  0.  Otherwise  terminate. 

G6.  [Reverse  and  spread  out.]  Set  j <-  k and  l 4-  N.  While  Cj  > 0,  set  bi-Cj  4-  Cj, 
l 4—  l — Cj,  and  j 4—  j — 1.  Then  set  m 4—  N — Ck  and  go  back  to  G2.  | 

This  algorithm  assumes  that  N > m +2n2  H \-tnt.  [See  SICOMP  8 (1979),  73-81.] 

22.  Note  first  that  d\  can  be  increased  if  and  only  if  n = 0 in  the  linked  representation. 
Otherwise  the  successor  of  dx  . . . dn-i  is  obtained  by  finding  the  smallest  j with  d3  > 0 
and  setting  dj  4—  0,  dj+ 1 4—  dj  + i + 1.  We  may  assume  that  n > 2. 

Kl.  [Initialize.]  Set  lk  4-  k + 1 and  rk  4-  0 for  1 < k < n;  also  set  ln  4-  rn  4-  0. 

K2.  [Visit.]  Visit  the  binary  tree  represented  by  I1I2  . . . ln  and  rir2  . . . rn. 

K3.  [Easy  cases?]  Set  y 4—  ri.  If  y = 0,  set  rq  4—  2,  l\  4—  0,  and  return  to  K2. 

Otherwise  if  Zi  = 0,  set  l\  4—  2,  ri  4—  r2,  r2  4—  Z2,  Z2  4—  0,  and  return  to  K2. 

Otherwise  set  j 4—2  and  A;  4 — 1. 

K4.  [Find  j and  fc.]  If  rj  > 0,  set  k 4-  j and  y 4-  r,  . Then  if  J # 3/  — 1,  set  j 4-  j + 1 
and  repeat  this  step. 
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K5.  [Shuffle  subtrees.]  Set  l3  4 - y,  r3  4-  ry,  ry  4-  ly,  and  ly  4-  0.  If  j = k,  go  to  K2. 

K6.  [Shift  subtrees.]  Terminate  if  y = n.  Otherwise,  while  k > 1,  set  k 4-  k - 1,  j 4- 
j - 1,  and  r3  4-  rk.  Then  while  j > 1,  set  j 4-  j - 1 and  r3  4-  0.  Return  to  K2.  | 

(See  the  analysis  in  exercise  45.  Korsh  [Comp.  J.  48  (2005),  488-497;  49  (2006), 
351-357]  has  shown  that  this  algorithm  can  be  extended  in  an  interesting  way  to  t-ary 
trees;  and  he  has  also  found  an  efficient  t-ary  generalization  of  Algorithm  B.) 

23.  (a)  Since  zn  begins  at  2n  - 1 and  goes  back  and  forth  Cn- i times,  it  ends  at 

2n  — l — (Cn- i mod  2),  when  n > 1.  Furthermore  the  final  value  of  Zj  is  constant  for 
all  n > j.  Thus  the  final  string  ziz2  ...  is  1 2 5 6 9 11  13  14  17  19  ... , containing  all 
odd  numbers  < 2 n except  3,  7,  15,  31, 

(b)  Similarly,  the  preorder  permutation  that  characterizes  the  final  tree  is  2k  2fc~1 
...  1 3 5 6 7 9 10  ... , where  k = LlgnJ.  Forestwise,  node  2j  is  the  parent  of  2j~l  nodes 
{2J  \ 2J  1 + 1, . . . , 2j  — 1},  for  1 < j < k,  and  the  trees  {2k  + 1, . . . , n}  are  trivial. 

Note:  If  Algorithm  N is  restarted  at  step  N2  after  it  has  terminated,  it  will  generate 
the  same  sequence,  but  backwards.  Algorithm  L has  the  same  property. 

24.  I0I1...I15  = 201030065080012114;  ri...n5  = 015  010  7009014130000; 
ki  ..  .k15  = 002  245  5484101111  10  2;  q3  ...g15  = 21  15  43108  5 76914111312;  and 
u i ■ • • U15  = 12  31005031001010.  (If  nodes  of  the  forest  F are  numbered  in  post- 
order, kj  is  the  left  sibling  of  j ; or,  if  j is  the  leftmost  child  of  p,  k3  = kp.  Stated 
another  way,  k3  is  the  parent  of  j in  the  forest  FTR.  And  k3  is  also  j - 1 - un+i-j,  the 
number  of  elements  to  the  left  of  j in  qi  . . . qn  that  are  less  than  j.) 

25.  Taking  a cue  from  Algorithms  N and  L,  we  want  to  extend  each  ( n — l)-node  tree 
to  a list  of  two  or  more  n-node  trees.  The  idea  in  this  case  is  to  make  n a child  of 
n ~ 1 in  the  binary  tree  at  the  beginning  and  the  end  of  every  such  list.  The  following 
algorithm  uses  additional  link  fields  p3  and  s3 , where  pj  points  to  the  parent  of  j in  the 
forest,  and  s3  points  to  j’s  left  sibling  or  to  j' s rightmost  sibling  if  j is  the  leftmost  in 
its  family.  (These  pointers  p3  and  s3  are,  of  course,  not  the  same  as  the  permutations 
Pi  ...pn  in  Table  1 or  the  scope  coordinates  si  . . . s„  in  Table  2.  In  fact  si  . . . s„  is  the 
permutation  A of  exercise  33  below.) 

Ml.  [Initialize.]  Set  l3  <-  j + 1,  r3  <-  0,  s3  <-  j,  Pj  j - 1,  and  o3  i 1 for  1 < j < n, 

except  that  ln  0. 

M2.  [Visit.]  Visit  l\  . . . ln  and  r\  . . .rn.  Then  set  j <—  n. 

M3.  [Find  j.]  If  o3  > 0,  set  k p3  and  go  to  M5  if  k ^ j - 1.  If  o3  < 0,  set  k <-  s3 

and  go  to  M4  if  k ^ j — 1.  If  k = j — 1 in  either  case,  set  o,  < Oj,  j j — 1, 

and  repeat  this  step. 

M4.  [Transfer  down.]  (At  this  point  k is  j’s  left  sibling,  or  the  rightmost  member  of 
j’s  family.)  If  k > j,  terminate  if  j = 1,  otherwise  set  x «—  pj,  lx  t—  0,  z 4-  k,  and 
fc  •<—  0 (thereby  detaching  node  j from  its  parent  and  heading  for  the  top  level). 
But  if  k < j,  set  x <—  p3  + 1,  z t—  sx,  rk  t—  0,  and  sx  t—  k (thereby  detaching 
node  j from  k and  going  down  a level).  Then  set  x k + 1,  y <-  sx,  sx  <-  z, 
si  V,  rv  j i and  x j.  While  x / 0,  set  px  t-  k and  x 4-  rx.  Return  to  M2. 

M5.  [Transfer  up.]  (At  this  point  k is  j’s  parent.)  Set  x •<-  ifc  + 1,  y <-  s3,  z <-  sx, 
sx  <—  y,  and  ry  <r-  0.  If  k ^ 0,  set  y <—  pk,  rk  j,  s3  4-  k,  sy+ 1 4—  z,  and  x 4—  j; 
otherwise  set  y 4—  j — 1,  ly  4—  j,  s3  4-  z,  and  x 4—  j.  While  x / 0,  set  px  4—  y and 
x 4—  rx.  Return  to  M2.  | 
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Running  time  notes:  We  can  argue  as  in  exercise  44  that  step  M3  costs  2 Cn  +3(C„_i  + 

1- Ci)  mems,  and  that  steps  M4  and  M5  together  cost  8C„  — 2(C„_i  + • • - + Ci),  plus 

twice  the  number  of  times  x «—  rx.  The  latter  quantity  is  difficult  to  analyze  precisely; 
for  example,  when  n = 15  and  j = 6,  the  algorithm  sets  x <—  rx  exactly  (1, 2, 3, 4,  5, 6) 
times  in  respectively  (45, 23,  7, 9,  2, 4)  cases.  But  heuristically  the  average  number  of 
times  x <—  rx  should  be  approximately  2 — 2J-n  when  j is  given,  therefore  about 
(2Cn  — (C„  — Cn— i)  — (Cn— i — Cn-2)/2  — (Cn- 2 — Cn-3)/4  — • ■ ■ )/Cn  ss  8/7  overall. 
Empirical  tests  confirm  this  predicted  behavior,  showing  that  the  total  cost  per  tree 
approaches  265/21  ss  12.6  mems  as  n — >-  oo. 

26.  (a)  The  condition  is  clearly  necessary.  And  if  it  holds,  we  can  uniquely  construct  F: 
Node  1 and  its  siblings  are  the  roots  of  the  forest,  and  their  descendants  are  defined 
inductively  by  noncrossing  partitions.  (In  fact,  we  can  compute  the  depth  coordinates 
Ci  . . .c„  directly  from  77’s  restricted  growth  string  a\  . . .a„:  Set  ci  t—  0 and  i0  <—  0. 
For  2 < j < n,  if  a,j  > max(ai, . . . ,a>_i),  set  Cj  t-  Cj-i  + 1 and  iaj  +-  Cj,  otherwise 
set  Cj  «-  i a ,-.) 

(b)  If  77  and  77'  satisfy  the  noncrossing  condition,  so  does  their  greatest  common 
refinement  77  V 77',  so  we  can  proceed  as  in  exercise  7.2.1.5-12(a). 

(c)  Let  xi,  . . . , Xm  be  the  children  of  some  node  in  F,  and  let  1 < j < k < m. 
Form  F'  by  removing  Xj+i,  . . . , Xk  from  their  family  and  reattaching  them  as  children 
of  Xj+ 1 — 1,  the  rightmost  descendant  of  Xj. 

(d)  Obvious,  by  (c).  Thus  the  forests  are  ranked  from  bottom  to  top  by  the  number 
of  nonleaf  nodes  they  contain  (which  is  one  less  than  the  number  of  blocks  in  77). 

(e)  Exactly  ]Pfc=0  ek(ek  — l)/2,  where  eg  = n — e i — ■ ■ ■ — en  is  the  number  of  roots. 

(f)  Dualization  is  similar  to  the  transposition  operation  in  exercise  12,  but  we  use 
left-sibling  and  right-child  links  instead  of  left-child  and  right-sibling,  and  we  transpose 
about  the  minor  diagonal: 


(“Right”  links  now  point  downward.  Notice  that  j is  the  rightmost  child  of  k in  F if 
and  only  if  j is  the  left  sibling  of  k in  F° . Preorder  of  F°  reverses  the  preorder  of  F, 
just  as  postorder  of  FT  reverses  postorder  of  F.) 

(g)  From  (f)  we  can  see  that  F'  covers  F if  and  only  if  F°  covers  F'D . (Therefore 
F°  has  n + 1 — k leaves  if  F has  k.) 

(h)  FAF'  = (FdYF,d)d. 

(i)  No.  If  it  did,  equality  would  necessarily  hold,  by  duality.  But,  for  example, 
0101  A 0121  = 0000  and  0101  V 0121  = 0123,  while  leaves(OlOl)  + leaves(0121)  / 
leaves(OOOO)  + leaves(0123). 

[Noncrossing  partitions  were  first,  considered  by  H.  W.  Becker  in  Math.  Mag.  22 
(1948),  23-26.  G.  Kreweras  proved  in  1971  that  they  form  a lattice;  see  the  references 
in  answer  2. 3. 4. 6-3.] 


7.2. 1.6 


ANSWERS  TO  EXERCISES  787 


27.  (a)  This  assertion  is  equivalent  to  exercise  2.3.3-19. 

(b)  If  we  represent  a forest  by  right-child  and  left-sibling  links,  preorder  cor- 
responds to  inorder  of  the  binary  tree  (see  exercise  2.3.2-5),  and  a,  is  the  size  of 
node  j s right  subtree.  Rotation  to  the  left  at  any  nonleaf  of  this  binary  tree  decreases 
exactly  one  of  the  scope  coordinates,  and  the  amount  of  decrease  is  as  small  as  possible 
consistent  with  a valid  table  si  . . . sn.  Therefore  F'  covers  F if  and  only  if  F is  obtained 
from  F by  such  a rotation.  (Rotation  in  the  left-child/ right-sibling  representation  is 
similar,  but  with  respect  to  postorder.) 

(c)  Dualization  preserves  the  covering  relation  but  exchanges  left  with  right. 

(d)  FT  F'  = (Fd  _L  F'd)d  . Equivalently,  as  noted  in  exercise  6.2.3-32,  we  can 
independently  minimize  the  left-subtree  sizes. 

(e)  The  covering  transformation  in  answer  26(c)  obviously  makes  s}  < s'  for  all  j. 

(f)  True,  because  FAF'kFHFIF  and  F 7\  F'  K F'  -\  F T F' . 

(g)  False;  for  example,  0121  V 0122  = 0123  and  0121  T 0122  = 0122.  (But  we  do 
have  FT  F1  -|  F V F',  by  taking  duals  in  (f).) 

(h)  1 he  longest  path,  of  length  (2),  repeatedly  decreases  the  rightmost  nonzero  Sj 
by  1.  The  shortest,  of  length  n — 1,  repeatedly  sets  the  leftmost  nonzero  sj  to  0. 

Answer  6.2.3-32  gives  many  references  to  the  literature  of  Tamari  lattices. 

28.  (a)  Just  compute  min(cl5  c[) . . . min(cn,  c'n)  and  max(c1,c']) . . . max(cn,  c/),  be- 
cause ci  ...  cn  is  a valid  sequence  if  and  only  if  ci  = 0 and  Cj  < cj-i  + 1 for  1 < j < n. 

(b)  Obvious  because  of  (a).  Note:  The  elements  of  any  distributive  lattice  can  be 

represented  as  the  order  ideals  of  some  partial  ordering.  In  the  case  of  Fig.  62,  a 
that  partial  ordering  is  shown  at  the  right,  and  a similar  triangular  grid  with  yV 
sides  of  length  n - 2 yields  Stanley’s  lattice  of  order  n.  / V\ 

(c)  Take  a node  k of  F that  has  a left  sibling,  j . Remove  k from  its  family  and 
place  it  as  a new  right  child  of  j . followed  by  its  former  children  as  new  children  of  /; 
the  former  children  of  k retain  their  own  descendants.  (This  operation  corresponds 
to  changing  )(  to  ()  in  a nested  parenthesis  string.  Thus  a “perfect”  Gray  code 
for  parentheses  corresponds  to  a Hamiltonian  path  in  the  cover  graph  of  Stanley’s 
lattice.  Exactly  38  such  paths  exist  when  n = 4,  namely  (8, 6, 6, 8, 4, 6)  from  0123  to 
(0001, 0010,  0012,0100, 0111, 0120)  respectively.) 

(d)  True,  because  the  cover  relation  in  (c)  is  left-right  symmetric.  (We  have 
F C F if  and  only  if  Wj  < w'j  for  0 < j < 2 n,  where  the  worm  depths  Wj  are  defined 
in  exercise  10.  If  w0  . . . w2n  is  the  worm  walk  of  F,  its  reverse  w2n  ■ ■ ■ w0  is  the  worm 
walk  of  F . Notice  that  the  cover  relation  changes  just  one  coordinate  wj.  One  can 
compute  F n F'  and  F U F'  by  taking  min  and  max  of  the  re’s  instead  of  the  c’s.) 

(e)  See  exercise  9.  (Thus  FiF'CFnF',  etc.,  as  in  exercise  27(f).) 

Notes:  Stanley  introduced  this  lattice  in  Fibonacci  Quarterly  13  (1975),  222-223. 
Since  three  important  lattices  are  defined  on  the  same  elements,  we  need  three  notations 
for  the  different  orderings;  the  symbols  K,  H,  and  C adopted  here  are  intended  to  be 
reminiscent  of  the  names  of  Kreweras,  Tamari,  and  Stanley  (who  is  CTemiH  in  Russia). 

29.  If  we  paste  six  regular  pentagons  together,  we  get  14  vertices  whose  coordinates 
after  suitable  rotation  and  scaling  are  respectively 

Pioio  — Poooo  = P 3000  = P2100  = (— 1,  n/3,  2/cf>); 

P0010  = P3100  = (<T2,  V3  </>,  0);  p3010  = p“00  = (0, 0, 2);  p3210  = p-200  = (2, 0,  2/0); 

P0210  = P3200  = (A  A,  0);  p1000  = P2000  = (-02,  A/0, 0); 
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here  ( x,y,z )*  means  ( x,-y,z ) and  ( x,y,z)~  means  (x,y,—z).  But  then  the  three 
4-edged  “faces”  are  not  squares;  in  fact,  they  don’t  even  lie  in  a plane. 

(One  can  however  get  a similar-looking  solid,  with  true  squares  but  irregular 
pentagons,  by  gluing  together  two  suitable  tetrahedra  and  lopping  off  the  three  glued- 
together  corners.  Alternative  sets  of  coordinates  for  the  associahedron,  of  substantial 
mathematical  interest  but  less  appealing  to  the  eye,  are  discussed  by  Gunter  Ziegler  in 
his  Lectures  on  Polytopes  (New  York:  Springer,  1995),  example  9.11.) 

30.  (a)  fn-i  • • • / i 0,  because  internal  node  j in  symmetric  order  has  a nonempty  right 
subtree  if  and  only  if  internal  node  j + 1 in  symmetric  order  has  an  empty  left  subtree. 

(b)  In  general  if  the  footprint  were  pi  o?l  + 1 1P2+1092+1  . . . iPfc+1o,fc  + 1,  we  would 
want  to  count  all  binary  trees  whose  nodes  in  symmetric  order  have  the  specification 
RP1  NLqi  BRP 2 N Lq 2 B . . . RPk  NLqk , where  B means  “both  subtrees  are  nonempty,” 
R means  “the  right  subtree  is  nonempty  but  not  the  left,”  L means  “the  left  subtree 
is  nonempty  but  not  the  right,”  and  N means  “neither  subtree  is  nonempty.”  This 
number  in  general  is 


and  in  particular  it  is  (°+°)  (*  j°)  (8+3 ) (°+°)  (°+°)  (°+2)  (°+°)  (^JC,  = 240240. 

(c)  dj  = 0 if  and  only  if  Cj+ 1 > Cj,  by  exercise  3. 

(d)  In  general,  the  footprint  of  F _L  F'  is  f1  . . . fn  A f[  . . . f'n,  by  exercise  27(a);  the 
footprint  of  F T F'  is  /i  ■ ■ ■ /n  V /[■■■/'„,  by  (a)  and  exercise  27(d). 

[The  fact  that  complements  always  exist  in  the  Tamari  lattice  is  due  to  H.  Lakser; 
see  G.  Gratzer,  General  Lattice  Theory  (1978),  exercise  1.6.30.] 

31.  (a)  2n_1;  see  exercise  6. 2. 2-5. 

(b)  ci  < • • • < c„;  di,  . . . , dn-i  < 1;  ej  > 0 implies  ej  + •••-)-  en  = n — j; 

kj+i  < kj  + 1;  pi  < • • • < pj  > ■ ■ ■ > pn  for  some  j;  Sj  > 0 implies  Sj  — n — j; 

'Hi  >■■  ■ > u,i ; Zj+ 1 < Zj  +2.  (Other  constraints,  which  apply  in  general,  whittle  down 
the  number  of  possibilities  to  2n_1  in  each  case.  For  example,  U\  ...  u„  must  be  a valid 
sequence  of  scope  coordinates.) 

(c)  True  in  only  n cases  out  of  2n_1.  (But  FT  is  degenerate.) 

(d)  The  degenerate  forest  with  footprint  f \ . . . fn  has  Cj+ 1 = Cj  + . Elements 

j < k are  siblings  if  and  only  if  fj  = fj+1  = ■■■  = fk_1  = 0.  Thus  if  F"  is  the 

degenerate  forest  with  footprint  /i  . . . /„  A f[  . . . fn,  then  F"  K F and  F"  K F';  hence 

F"  K F A F1  H F Y F' . And  we  also  have  F 1 F’  H F"  by  (b).  A similar  argument 
proves  that  F V F'  = F T F'  is  the  degenerate  forest  with  footprint  A ■ . . fn  V f[  ■ . . f'n. 

Thus,  when  the  Kreweras  and  Tamari  lattices  are  restricted  to  degenerate  forests, 
they  become  identical  to  the  Boolean  lattice  of  subsets  of  {1, . . . , n — 1}.  [This  result, 
in  the  case  of  Tamari  lattices,  is  due  to  George  Markowsky,  Order  9 (1992),  265-290, 
whose  paper  also  shows  that  Tamari  lattices  enjoy  many  further  properties.] 

32.  Suppose  F and  F'  have  scope  coordinates  sx  . . . sn  and  s'1...s'n.  Call  index  j 
frozen  if  Sj  < s'  or  j = 0.  We  want  to  specify  the  values  of  the  frozen  coordinates  and 
maximize  the  others.  Let  s0  = n,  and  for  0 < k < n let 


s'i  = Sj  - k + j,  where  j = max{ i \ 0 < i < k,  i is  frozen,  and  i + Si  > k}. 


Since  sk  < Sj  — (k  — j)  whenever  0 < k — j < Sj,  we  have  sk  > sk,  with  equality  when 
k is  frozen. 
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The  scopes  s'qs"  . . . s"  correspond  to  a valid  forest  according  to  the  condition  of 
exercise  27(a).  For  if  k > 0 and  0 < l < sk  = Sj  -k  + j and  4'+(  = sjt  - k - l + j' , we 
have  sk+l  + l < sk  if  0 < j'  — j < Sj,  because  Sj > + j'  — j < Sj  in  that  case.  And  we 
can’t  have  j > j'  or  j'  > j + Sj,  because  j + Sj  > k + l > j' . 

Let  F'"  be  a forest  with  scopes  satisfying  sk  < s'k  < sk.  Then  min(4,  4")  = sk, 
because  sk  = s'k  when  k is  frozen,  otherwise  sk  = s'k. 

Conversely,  if  F'"  is  a forest  with  F'  1 F'"  = F,  we  must  have  sk  < s'k  < sk. 
For  4"  < sk  would  imply  sk  < s'k.  And  if  k is  minimal  with  s'k  > sk,  we  have 
4 = Sj  - k + j for  some  frozen  j with  0 < j < k and  j + Sj  >k.  Then  s'"  > Sj  implies 
k — j < Sj  , hence  s'k  + k — j < s'" . If  j < k we  have  s'"  < s"  — sj,  a contradiction. 
But  j = k implies  min(s'fc", s'k)  > sk. 

To  get  the  first  semidistributive  law,  apply  this  principle  with  F replaced  by  F_1_G 
and  F'  replaced  by  F;  then  the  hypotheses  F H G ^ F"  and  F H H H F"  imply  that 
F H G T H H F" . The  second  semidistributive  law  follows  by  taking  duals  in  the  first. 

(Ralph  Freese  suggests  calling  F"  the  pseudo-complement  of  F'  over  F.) 

33.  (a)  Let  k\  = LLINK[fc]  if  LLINKffc]  / 0,  otherwise  RLINK[fc  — 1]  if  k 4 1,  otherwise 
the  root  of  the  binary  tree.  This  rule  defines  a permutation  because  kX  = j if  and  only 
if  & = parent(j)  + [j  is  a right  child],  or  k = 1 and  j is  the  root.  Also  kX  > k when 
LLINKffc]  = 0 and  kaX  < k when  RLINKfA:]  = 0.  [For  a generalization  to  f-ary  trees, 
see  P.  H.  Edelman,  Discrete  Math.  40  (1982),  171 -179.] 

(b)  Using  the  representation  of  (2)  in  answer  26(f),  we  see  that  X(F)  is  (31)(2) 
(12 6 4)(5)(11 7)(14  13)(98)(15)(10)  in  that  case.  In  general  the  cycles  are  the  families 
of  the  forest,  in  decreasing  order  within  each  cycle;  nodes  are  numbered  in  preorder. 
[See  Dershowitz  and  Zaks,  Discrete  Math.  62  (1986),  215-218.] 

(c)  X(Fd)  = paX(F)p,  where  p is  the  “flip”  permutation  (1  n)(2  n—  1) . . . , because 
the  dual  forest  interchanges  LLINK  ++  RLINK  and  flips  the  preorder  numbering. 

(d)  The  cycle  breakup  (xj  xk)(xx  . . . xm)  = (xi  . . . XjXk+i  ■ ■ ■ xm){xj+i  ...xk)  cor- 
responds to  answer  26(c). 

(e)  By  (d),  each  covering  path  corresponds  to  a factorization  of  (n  . . . 21).  Let 

qn  denote  the  number  of  such  factorizations.  Then  we  have  the  recurrence  qi  = 1 and 
In  = 5Z”=i  (n  “ because  there  are  n — l choices  with  k — j = l by  which 

the  first  transposition  breaks  the  cycle  into  parts  of  sizes  l and  n - then  (““* ) ways 
to  interleave  the  subsequent  factors.  The  solution  is  qn  = nn~2,  because 

e(41K‘(»-'>“- 


[See  J.  Denes,  Magyar  Tudomanyos  Akademia  Matematikai  Kutato  Intezetenek 
Kozlemenyei  4 (1959),  63-70.  It  is  natural  to  seek  a correspondence  between  factoriza- 
tions and  labeled  free  trees,  since  there  also  happen  to  be  n"~ 2 of  the  latter.  Perhaps 
the  simplest  is  the  following,  given  (1  2 ...  n)  = {xxy+j...  {x„-X  yn-i)  where  Xj  < yj: 
Suppose  the  cycle  containing  x3  and  y3  in  (x3  y3) . . . (xn-i  yn- 1)  is  (zi  ...  zm),  where 
zi  < ■ ■ ■ < zm-  If  Vj  = zm,  let  dj  = z\,  otherwise  let  aj  = min{2j  | z<  > x3}.  Then  one 
can  show  that  a i . . . an~i  is  a “wake-up  sequence”  for  parking  n — 1 cars,  and  exercise 
6.4-31  connects  it  to  free  trees.] 


n — 1 

!“»E(“7  )<*+o'-'(!/-o-- ' 

x-^0  x V ’ 
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34.  Each  covering  path  from  bottom  to  top  is  equivalent  to  a Young  tableau  of  shape 
(n  — 1,  n — 2, . . . , 1),  so  we  can  use  Theorem  5.1.4H.  (See  exercise  5.3.4-38.) 

[The  enumeration  of  such  paths  in  Tamari  lattices  remains  mysterious;  the  relevant 
sequence  is  1,  1,  2,  9,  98,  2981,  340549,  . . . .] 

35.  Multiply  by  n + 1,  then  see  AMM  97  (1990),  626-630. 


36.  We  might  as  well  generalize  to  f-ary  trees  for  arbitrary  t > 1,  by  making  obvious 
amendments  to  steps  T1-T5.  Let  C’n1  be  the  number  of  f-ary  trees  with  n internal 
nodes;  thus  Cn  = Cn  ’ and  ch'1  = ((t  — l)n  + 1)  If  h of  the  degrees  bj  are 

changed  between  visits,  we  have  h > x in  C%lx  cases.  So  the  easy  case  occurs  with 
probability  1 — C^^/Cn  ^ ~ 1 — (t  — 1)*  1/ti,  and  the  average  number  of  times  bj  <—  0 
in  step  T4  is  j + ■ • ■ + C[t))/C%)  « (t- l)t~1/(tt  - (t-l)4”1),  or  4/23  when  t = 3. 


Indeed,  we  can  also  study  the  t-ary  recursive  structure  A™  = 0 A1'*)  ...  t A^ 

P{Q—  1)  (p—  l)q 

when  0 < (f  - l)p  < q ^ 0,  generalizing  (5).  The  number  of  such  degree  sequences, 
CpJ,  satisfies  the  recurrence  (21)  except  that  Cpq  = 0 when  p < 0 or  (f  - l)p  > q.  The 
general  solution  is 


and  we  have  Ci!'1  = C^t_1)n).  The  triangle  for  t = 3 begins 
as  shown  at  the  right. 

37.  The  basic  lexicographic  recursion  for  all  such  forests  is 


A(n0,  ni, . . . , nt)  = 0 A(no  — l,m, . . . ,nt), 

1 .A(no,ni  — 1, . . . ,nt),  ...,  t A(n0,  ni, . . . , nt  - 1) 


1 

1 

1 1 

1 2 

13  3 

14  7 

1 5 12  12 

1 6 18  30 

1 7 25  55  55 

1 8 33  88  143 


when  no  > n2  + 2ri3  + • • • + (t—  l)nt  and  ni,  . . . , > 0;  otherwise  A(no,ni, . . . ,nt ) is 

empty,  except  that  A(0, . . . , 0)  = e is  the  sequence  consisting  of  the  empty  string  alone. 
Step  G1  computes  the  first  entry  of  ^4(no, . . . , nt).  We  want  to  analyze  five  quantities: 


C,  the  number  of  times  G2  is  executed  (the  total  number  of  forests); 
E,  the  number  of  times  G3  goes  to  G2  (the  number  of  easy  cases); 

K,  the  number  of  times  G4  moves  some  bi  into  list  c; 

L,  the  number  of  times  G5  compares  bj  with  some  ct; 

Z,  the  number  of  times  G5  sets  c\  •<—  0. 


Then  the  loop  in  step  G6  sets  bi-Cj  <-  Cj  a total  of  K - Z - m - • • • - nt  times. 

Let  n be  the  vector  (no,  n%, . . . , nt),  and  let  ej  be  the  unit  vector  with  1 in 

coordinate  position  j.  Let  |n|  = n0  + ml b nt  and  ||n||  = m + 2n2  H b tnt. 

Using  this  notation  we  can  rewrite  the  basic  recurrence  above  in  the  convenient  form 


A(n)  — 0 A(n  — eo),  1 A(n  — ei),  . . . , t A(n  — et)  when  |n|  > 
Consider  the  general  recurrence  relation 

t 

F(n)  = /W+(5:F(n-e,))[|n|>|M|], 

j= 0 
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with  F(n)  — 0 whenever  the  vector  n has  a negative  component.  If  /(n)  = [|n|  =0], 
then  F(n)  — C(n)  is  the  total  number  of  forests.  Answer  2.3.4.4-32  tells  us  that 


C(n)  = 


(M  - l)!(|w|  - ||n|| 
no!  ni! . . . nt! 


= E(i-j)( 

—'o  w..., 


|n|  - 1 
— 1,  n 


), 

j + 1 > • • • 5 fit  / 


generalizing  the  formula  for  CpJ  in  answer  36  (which  is  the  case  n0  = (t  - \)q  + 1 and 
nt  = p).  Similarly,  we  obtain  recurrences  for  the  other  quantities  E(n),  K(n ),  L(n), 
and  Z(n)  needed  in  our  analysis  by  choosing  other  kernel  functions  f(n): 


f(n) 

= [|n|  = n0  + 1 and  n0  > ||n||] 

yields 

F(n) 

= E{n ); 

f(n) 

= [|n|  > n0] 

yields 

F(n) 

= E(n)  + K(n)\ 

f(n) 

= [M  = INI  + 1] 

yields 

F(n) 

= C(n)  + K(n)  — Z(n 

f(n) 

= 1 '2l<j<k<t  nAnk  > 0] 

yields 

F(n) 

= L(n). 

The  symbolic  methods  of  exercise  2.3.4.4-32  do  not  seem  to  yield  quick  solutions 
to  these  more  general  recurrences,  but  we  can  readily  establish  the  value  of  C — E by 
noting  that  bm  + m < N in  step  G2  if  and  only  if  the  previous  step  was  G3.  Therefore 

t 

C(n)  - E(n)  = ^C(n  - /,),  where  fj  = ej  - (j- l)e0; 
i= i 

this  sum  counts  the  subforests  in  which  m + • • • + nt,  the  number  of  internal  (nonleaf) 
nodes,  has  decreased  by  1.  Similarly  we  can  let 

C{x)(n)  = ^{C(n  - iifi  itft)  | A -f f it  = x} 

be  the  number  of  subforests  having  n H b nt  - x internal  nodes.  Then  we  have 

M 

K(n)  - Z(n)  = ^C(x)(n), 

X = 1 

a formula  analogous  to  (20),  because  k - [bj  = 0]  > x > 1 in  step  G5  if  and  only 
it  bm-x  > 0 and  bm^x+i  > • • • > bm.  Such  preorder  degree  strings  are  in  one-to- 
one  correspondence  with  the  forests  of  C(a:)(n)  if  we  remove  bm-x+1 . . .bm  and  an 
appropriate  number  of  trailing  Os  from  the  string  bi  . . . b n- 

From  these  formulas  we  can  conclude  that  the  Zaks-Richards  algorithm  needs 

only  0(1)  operations  per  forest  visited,  whenever  m = n2  H b nt  + 0(1),  because 

C(n  — fj)/C(n)  = /{\n\  — 1 )i  < 1/4  + 0(|n|_1)  when  j > 1.  Indeed,  the  value 

of  A is  quite  small  in  nearly  all  cases  of  practical  interest.  However,  the  algorithm  can 
be  slow  when  m is  large.  For  example,  iff==l,  n0  = m + r+  l,  and  m = m,  the 
algorithm  essentially  computes  all  r-combinations  of  m + r things;  then  C(n)  = (m/'r) 
and  K(n ) — Z(n)  = ('//(’/)  = fl(mO(n))  when  r is  fixed.  [To  ensure  efficiency  in 
all  cases,  we  can  keep  track  of  trailing  Is;  see  Ruskey  and  Roelants  van  Baronaigien, 
Congressus  Numerantium  41  (1984),  53-62.] 

Exact  formulas  for  AT,  Z,  and  (especially)  L do  not  seem  to  be  simple,  but  we  can 
compute  those  quantities  as  follows.  Say  that  the  “active  block”  of  a forest  is  the  right- 
most substring  of  nonzero  degrees;  for  example,  the  active  block  of  302102021230000000 
is  2123.  All  permutations  of  the  active  block  occur  equally  often.  Indeed,  let  D(n) 
denote  the  sum  of  “trailing  zeros (/3)  — 1”  over  all  preorder  degree  strings  /3  for  forests 


792  ANSWERS  TO  EXERCISES 


7.2. 1.6 


of  specification  n.  Then  a block  with  n'  occurrences  of  j for  1 < j < t is  active  in 
exactly  D(n  — n[fi  — • ■ ■ — n[ft)  + [ n[  + • • • + n't  = n±  + ■ ■ • + nt]  cases.  For  example, 
given  the  string  3021020000,  we  can  insert  21230000  in  three  places  to  obtain  a forest 
with  active  block  2123.  The  contributions  to  K and  L when  the  active  block  is  flush 
left  (not  preceded  by  any  0s)  can  be  computed  as  in  exercise  7. 2. 1.2-6,  namely 

k(n)  =w(eni(z)...ent(z)),  l{n)  = w (eni  (z) . . . ent  (z)  ^ (m  - zri(z))rj(z)) 

in  the  notation  of  that  answer.  Analogous  contributions  occur  in  general;  therefore 

K(n)  = fc(n)+^D(n-n')/c(n'),  L(n ) = f(n)+]P£>(n-n')/(n'),  Z(n)  = J ^D(n-n'), 

summed  over  all  vectors  n!  such  that  n'  < nq  for  1 < j < t and  |n'|  - ||n'||  = |n|  - ||n|| 
and  n'i  + + n't  < rii  + ■ ■ ■ + nt  — 2. 

It  remains  to  determine  D(n).  Let  C*(n;  j)  be  the  number  of  forests  of  specification 
n = (no,  • • • , nt)  in  which  the  last  internal  node  in  preorder  has  degree  j.  Then  we  have 

t 

cin)  = 7!%))  and  C(n  + ei;  1)  = C(n  + e2;2)  = • • • = C(n  + et;t)  = C(n)  + D(n). 
i= i 


From  this  infinite  system  of  linear  equations  we  can  deduce  that  C(n ) + D(n ) is 


n2  nt 

£...£(-ir+-+i‘ 


12=0  it  =0 


/ *2  + • • • + it  \ 

V *2,-..,  it  ) 


C(n  + (l+i2  + 


+*«)ei  - i2/2  — 


Simpler  expressions  would  of  course  be  desirable,  if  they  exist. 

38.  Step  LI  obviously  uses  4n  + 2 mems.  Step  L3  exits  to  L4  or  L5  exactly  Cj  -Cj-i 
times  with  a particular  value  of  j ; therefore  it  costs  2Cn  + 3 (n  ~ J ) (ci  ~ ci  -l ) = 
2 Cn  + 3(Cn_i  + • • ■ + Ci  + Co)  mems.  Steps  L4  and  L5  jointly  cost  a total  of  6Cn  — 6. 
Therefore  the  entire  process  involves  9 + 0(n-1^2)  mems  per  visit. 

39.  A Young  tableau  of  shape  ( q,p ) and  entries  ytj  corresponds  to  an  element  of  Apq 
that  has  left  parens  in  positions  p + q + 1 — j/2 1 , . . . , p + q + 1 — j/2p  and  right  parens 
in  positions  p + q + 1 — yu,  p + q+  1 — ylq.  The  hook  lengths  are  (g  + 1,  q, . . . , 1, 
P)P~  !>•••, 1}  \ {<7 - P+1};  so  CPq  = (p  + q)\{q  — p+  l)/(p\(q+  1)!)  by  Theorem  5.1.4H. 

40.  (a)  Cpq  = (p+?)  - (pj^)  = (p+9)  + (p+’)  = (P+p+1)  (modulo  2);  now  use  exercise 
1.2.6-11.  (b)  By  Eq.  7.1.3-(36)  we  know  that  u(n  & (n  + 1))  = u(n  + 1)  — 1. 

41.  It  equals  C(wz)/(l -zC(wz))  = 1/(1  - z-wzC(wz))  = (l-wC(wz))/(l-w-z), 
where  C(z)  is  the  Catalan  generating  function  (18).  The  first  of  these  formulas,  C(wz)  + 

zC(wz)2  + z2C(wz)3-\ , is  easily  seen  to  be  equivalent  to  (24).  [See  P.  A.  MacMahon, 

Combinatory  Analysis  1 (Cambridge  Univ.  Press,  1915),  128-130.] 

42.  (a)  Elements  a\  . . ,an  determine  an  entire  self-conjugate  nested  string  ai  . . . a2„, 
and  there  are  Cq^n_q~)  possibilities  for  a\  . . ,a„  having  exactly  q right  parentheses.  So 
the  answer  is 


Ln/2j 

q=0 


tn/2] 


(b)  Exactly  C(7l_1)/2  [nodd],  because  a self-transpose  binary  tree  is  determined  by  its 
left  subtree.  And  (c)  has  the  same  answer,  because  F is  self-dual  if  and  only  if  FR  is 
self-transpose. 
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43.  Cpq  = Cq-{q  \ x)Cq- iH = Er=o(_1)r(’  r r)C?-r,  by  induction  on  q-  p. 

44.  The  number  of  mems  between  visits  is  3 j — 2 in  step  B3,  h + 1 in  step  B4,  and  4 
in  step  B5,  where  h is  the  number  of  times  y 4—  ry.  The  number  of  binary  trees  with 
h > x,  given  j and  x,  is  [zn~:,~x~1]  C(z)x+3  when  j < n,  because  we  get  such  trees  by 
attaching  x+3  subtrees  below  j+x+1  internal  nodes.  Setting  x = 0 tells  us  that  a given 
value  of  j occurs  C(n_;)_i)(n_:)+1)  = Cn+i-j  — C„-j  times,  using  (24)  and  exercise  43. 

Thus  over  all  binary  trees  is  n + Ylj=i  (Cn+i-i  ~ Cn-j)j  = Cn  + C„- 1 H (-Ci. 

Similarly,  £(h  + 1)  is  C(n-j-l)(n-J'+2)  = 

5Zj=i(Cn-j+2  — 2Cn-j+ 1)  = Cn+ 1 — ( Cn  + Cn- 1 H h Co).  So  overall,  the  algorithm 

costs  Cn+ 1 + 4Cn  T 2(Cn_i  -j-  •••-)-  Ci ) T O(n)  = (26/3  — 10/ (3n)  -T  0(n  2)) Cn  mems. 

45.  Each  of  the  easy  cases  in  step  K3  occurs  Cn-i  times,  so  the  total  cost  of  that  step  is 
3C„_i  +8Cn_i  + 2(Cn  — 2Cn-i)  mems.  Step  K4  fetches  Xi  a total  of  [zn~'~1]  C(z)t+2  = 
C(n_i_1)n  times;  summing  for  i > 2 gives  C(n_3)(n+1)  = Cn+i  - 3C„  + C„- 1 mems 
altogether  in  that  loop.  Step  K5  costs  6Cn  — 12Cn_i.  Step  K6  is  a bit  more  complicated, 
but  one  can  show  that  the  operation  Xj  4—  Xk  is  performed  Cn  — 3Cn-i  + 1 times  when 
n > 2,  while  the  operation  Xj  4—  0 is  performed  Cn_i  — n + 1 times.  The  total  number 
of  mems  therefore  comes  to  Cn+i  +7C„  — 9Cn_i  +n  + 3 = (8.75-9.375 /n  + 0(ra~2))C„. 

Although  this  total  is  asymptotically  worse  than  that  of  Algorithm  B in  answer  44, 
the  large  negative  coefficient  of  n~1  means  that  Algorithm  B actually  wins  only  when 
n > 58;  and  n won’t  ever  be  that  big. 

Skarbek  has,  however,  improved  Algorithm  B to  the  following  Algorithm  B*,  which 
generates  the  trees  in  reverse  order  and  uses  an  auxiliary  table  ci  . . . c„: 

Bl*.  [Initialize.]  Set  Ik  <-  Ck  4—  0 and  Xk  4—  k + 1 for  1 < k < n;  also  set  l„  4-  xn  4-  0, 
and  set  xn+\  4-1  (for  convenience  in  step  B3*). 

B2*.  [Visit.]  Visit  the  binary  tree  represented  by  hh  ■ ■ ■ ln  and  X\X2  ■ ■ - xn. 

B3*.  [Find  j.]  Set  j 4-  1.  While  r j = 0,  set  lj  4-  Cj  <—  0,  xj  4—  j + 1,  and  j 4—  j + 1. 
Then  terminate  the  algorithm  if  j > n. 

B4*.  [Demote  Xj.]  Set  x 4—  Xj,  Xj  4—  xx,  xx  4—  0,  z 4—  Cj , Cj  4—  x.  If  z > 0,  set  xz  4—  x; 
otherwise  set  lj  4—  x.  Return  to  B2*.  | 

If  the  values  of  n and  ci  are  maintained  in  registers,  this  algorithm  needs  only  4C„  + 
Cn-i  + 4(Cn-i  + Cn-2  + --  - + Co)  + 3n  — 6 = (67/12  + 73/(24 n)  + 0(n_2))Cn  mems  to 
generate  all  Cn  trees.  [See  W.  Skarbek,  Fundaments  Information  75  (2007),  505-536.] 

46.  (a)  Going  to  the  left  from  (pg)  increases  the  area  by  q — p. 

(b)  The  leftward  steps  on  a path  from  (rm)  to  (00)  correspond  to  the  left  parentheses 
in  ai  . . . a2 n,  and  we  have  q — p = c*,  at  the  fcth  such  step. 

(c)  Equivalently,  Cn+i(x)  = ^2k=0xkCk{x)C„-k(x).  This  recurrence  holds  be- 
cause an  (n  + l)-node  forest  F consists  of  the  root  of  the  leftmost  tree  together  with 
a fc-node  forest  Fi  (the  descendants  of  that  root)  and  an  ( n — fc)-node  forest  Fr  (the 
remaining  trees),  and  because  we  have 

internal  path  length(E)  = k + internal  path  length(Fj)  + internal  path  length(Fr). 

(d)  The  strings  of  Ap(p+r)  have  the  form  ao)au)  ...ctr_i)ar  where  each  otj  is 
properly  nested.  The  area  of  such  a string  is  the  sum  over  j of  the  area  of  <y.j  plus  i — j 
times  the  number  of  left  parens  in  aj . 

Notes:  The  polynomials  Cpq(x)  were  introduced  by  L.  Carlitz  and  J.  Riordan  in 
Duke  Math.  J.  31  (1964),  371-388;  the  identity  in  part  (d)  is  equivalent  to  their  formula 
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(10.12).  They  also  proved  that 


Cpq(x) 


£(  — 1 yxr(r~l)-(‘<-2p)fq-p-r 


generalizing  the  result  of  exercise  43.  From  part  (c)  we  have  the  infinite  continued 

fraction  C(x,z ) = l/(l  - z/(  1 - xz/(  1 - x2z/(  1 )))),  which  G.  N.  Watson  proved 

is  equal  to  F(x,  z)/ F(x,  z/x),  where 


w \ ^ (~l)nxn  zn 

see  J.  London  Math.  Soc.  4 (1929),  39—48.  We  have  already  encountered  the  same 
generating  function,  slightly  disguised,  in  exercise  5.2.1  15. 

The  internal  path  length  of  a forest  is  the  “left  path  length”  of  the  corresponding 
binary  tree,  namely  the  sum  over  all  internal  nodes  of  the  number  of  left  branches  on 
the  path  from  the  root.  The  more  general  polynomial 

Cn{x,y)  = E ^left  path  length(T)  ^right  path  length(T) 

summed  over  all  n-node  binary  trees  T , seems  to  obey  no  simple  additive  recurrence  like 
the  one  for  Cnn(x)  = Cn(x,  1)  studied  in  this  exercise;  but  we  do  have  Cn+i(x,y)  = 
52kxkCk(x,y)yn~kCn-k(x,y)-  Therefore  the  super  generating  function  C(x,y,z)  = 
E„  Cn(x,y)zn  satisfies  the  functional  equation  C(x,y,z)  = 1 + zC(x,  y,  xz)C(x,  y,  yz). 
(The  case  x = y was  considered  in  exercise  2. 3. 4. 5-5.) 

47.  Cn(x)  = x(  2 )cpq(x)C(n^q)(n_1_p-)(x)  for  0 < p < n. 

48.  Let  C(z)  = C(— l,z)  in  the  notation  of  exercise  46,  and  let  C(z)C(— z)  = F(z2). 
Then  C(z)  = 1 + zF(z2)  and  C(-z)  = 1 - zF(z2);  so  F(z)  = 1 - zF(z)2,  and 
F(z)  = C(—z).  It  follows  that  Cpq(- 1)  = [zp]  C(-z2)t(g~p)/21  ( 1 + zC(-z2))[,J_pevenl , 
which  is  ( — 1 )(p/2)C(p/2)(g/2-i)  [p even]  when  q is  even,  (-l)Lp/2JCLp/2j  L,/2j  when  q is 
odd.  A perfect  Gray  code  through  the  strings  of  Apq  can  exist  only  if  |CP,(-1)|  < 1, 
because  the  associated  graph  is  bipartite  (see  Fig.  62);  |CP,(-1)|  is  the  difference 
between  the  sizes  of  the  parts,  because  each  perfect  transposition  changes  ci  + ■ ■ ■ + c„ 
by  ±1. 

49.  By  Algorithm  U with  n=15  and  Al=106,  it  is  ()(()())(((()())))((((())()))) . 

50.  Make  the  following  changes  to  Algorithm  U:  In  step  Ul,  also  set  r •<-  0.  In  step 
U3,  test  if  am  = ' ) instead  of  testing  N K c . In  step  U4,  set  r < — r + c'  instead  of 
N <—  N — c'.  And  omit  the  assignments  to  am  in  steps  U3  and  LT4. 

The  string  in  (i)  turns  out  to  have  rank  3141592.  (Who  knew?) 

51.  By  Theorem  7.2.1.3L,  IV  = (*1)  + („*i1)+"-+(*i,);  hence  nnN  = + (n%)  + 

b (“o  ),  since  zn  > 1.  Now  note  that  N - nnN  is  the  rank  of  zxz2  ...zn,  because  of 

(23)  and  exercise  50.  (For  example,  let  z1...z4  = 1256,  which  has  rank  6 in  Table  1. 
Then  z\  ...  z.\  = 16 32,  N = 60,  and  K46O  = 54.  Notice  that  N is  fairly  large,  because 
zi  = 2n-  1;  Fig.  47  shows  that  k„N  usually  exceeds  N when  N is  smaller.) 

52.  The  number  of  trailing  right  parentheses  has  the  same  distribution  as  the  number 
of  leading  left  parentheses,  and  the  sequence  of  nested  strings  that  begin  with  i('c)’  is 


7.2. 1.6 


ANSWERS  TO  EXERCISES  795 


(k)  A(n_/e)(n_1).  Therefore  the  probability  that  d„  = k is  C(n_fc)(n_1)/Cn.  We  find 


' 2n  — 1 
n 


-))(:) 


( 2n ) _ 

( 2n  \ 

\n  + tj 

\n  + t + 1 / 

^7(n  — <)(n+t) 

using  Eq.  1.2.6-(25),  and  it  follows  that  the  mean  and  variance  are  respectively  equal 
to  3n/(n  + 2)  = 3-6/(n  + 2)  and  2n(2n2  - n - l)/((n  + 2)2(n  + 3))  = 4-f  0(n-1).  [The 
moments  of  this  distribution  were  first  calculated  by  R.  Kemp  in  Acta  Informatica  35 
(1998),  17-89,  Theorem  9.  Notice  that  c„  = d„  — 1 has  essentially  the  same  behavior.] 

53.  (a)  3n/(n+2),  by  exercise  52.  (b)  Hn,  by  exercise  6.2.2-  7.  (c)  2-2_n,  by  induction. 

(d)  Any  particular  (but  fixed)  sequence  of  left  or  right  branches  has  the  same 
distribution  of  steps  before  a leaf  is  encountered.  (In  other  words,  the  probability  that 
a node  with  Dewey  binary  notation  01101  occurs  is  the  same  as  the  probability  that 
00000  occurs.)  Thus  if  X = k with  probability  pk,  each  of  the  2k  potential  nodes  on 
level  k is  external  with  probability  pk.  The  expected  value  ’52k2kpk  is  therefore  the 
expected  number  of  external  nodes,  namely  n + 1 in  all  three  cases.  (One  can  of  course 
also  verify  this  result  directly,  with  pk  = C(n_*,)(„_1)/C'„  in  case  (a),  pk  = [”]/n!  in 
case  (b),  and  pk  = 2~fc+[A:=rl1  in  case  (c).) 

Notes:  The  average  level  of  a leaf  turns  out  to  be  Q(y/n),  0(logn),  and  0(n)  in 
these  three  cases;  thus  it  is  longer  when  the  expected  time  to  hit  the  leftmost  leaf 
is  shorter!  The  reason  is  that  ubiquitous  “holes”  near  the  root  force  other  paths 
to  be  long.  Case  (a)  has  an  interesting  generalization  to  f-ary  trees,  when  pk  = 
C(„l.fc)(((_i)n_  i)/Cn^  in  the  notation  of  answer  36.  Then  the  mean  distance  to  the  left- 
most leaf  is  (t+l)n/((f  — l)n+2),  and  it  is  instructive  to  prove  via  telescoping  series  that 


53  tkc\t] 


(n  — k)((t  — l)n- 


-»-(:)■ 


54.  Differentiating  with  respect  to  x we  have 

C'(x,  z)  = zC'(x,  z)C(x,xz)  + zC(x,z)(C'(x,xz)  + zC,(x,xz)), 

where  C,(x,z)  denotes  the  derivative  of  C(x,z)  with  respect  to  2.  Thus  C'(l,z)  = 
2 zC  (1,  z)C(z)  + z2C(z)C' (2);  and  since  C' (2)  = C(2)2  + 2zC\z)C' (z)  we  can  solve  for 
C'(l,2),  obtaining  z2C(z)3/{\ - 2zC(z))2.  Therefore  '£,(c1  + - ■ -+cn)  = [zn]C"(l,2)  = 

22n-i  _ + 

in  agreement  with  exercise  2. 3. 4. 5-5.  Similarly  we  find 


J2(^  + "-+Cn)2-=[zn]C"(l,z)=  ( 


5 nz  + 19  n + 6 


)CH>+fK 


Thus  the  mean  and  variance  are  ±y/nn3/2  + 0(n)  and  (|  - J)n3/2  + 0(n),  respectively. 

55.  Differentiating  as  in  answer  54,  and  using  the  formulas  of  exercises  46(d)  and 
5.2.1  14  together  with  [z"]C(2)7(l  - 4z)  = 22n+r  - £J=1  2r~J(2n+j),  yields 

r p,7  ,^z‘2C(z)r+3  , (r+l\zC(z)r+2\ 

crI,p+r)(l)  = M((r+i)-r-Z_+(  2 )^L=) 

r-f  1 


( (r  I ^C(z)r+1-2C{z)r+C(z)r~1  / r+!  N C (z)r+1  —C  (z)r  \ 

V ^ 1-42  V 2 ) y/1^  ) 


(r+l) 


,2p+r—  1 


/ 2p+r+l ^ 

V p 


)-£^-'(27))+(T)(n) 

7 = 1 r r 


id 


r 


§ 

I 


re 
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56.  Use  1.2.6-53(b).  [See  BIT  30  (1990),  67-68.] 

57.  2So(a,  6)  = (2a“)  (2b)  + (2°+26)  by  1.2.6-(2i).  Exercise  1.2.6-53  tells  us  that 


“ / 

£ ( 


( 2“  \ 

( 2b  \ 

\a  — k ) 

\b-kj 

k = (m  + l)(a  + b — m 


( 2a  \ 

( 2b  ) 

V 777  + 1 / 

\a  + b — 777/ 

therefore  2Si(a,6)  = (2a“)  (266)  ^ ■ And  since  b2Sp(a,b)  - Sp+2(a,b)  = Sp(a,b  - 1),  we 
find  2 S2(a,b)  = (2°+2t>)  2a+2i,-i’  2S3(a,6)  = (2“)  (26) a2b2/(a  + b)~.  Formula  (30)  follows 
by  setting  o = m,  b = n - m,  and  C(x-k)(x+k)  = (x2_\)  - 

Similarly,  the  average  of  w2m-i  is  o(2k~ 1 )C(m-k)(m+k-i)C(n-m-k+i)(n-m+k ) 

divided  by  Cn,  namely 


2S3(m,  n+l—m)  — £>2(777,77+1—771) 
m(n+l—m)Cn 


777(77+1  — 777) 
77 


/ 2m \ / 2t7+2— 2t77 \ / 1 2n\ 
V 777  / V 77+1  -777  / / V 77  / 


- 1. 


[R.  Kemp,  BIT  20  (1980),  157-163;  H.  Prodinger,  Soochow  J.  Math.  9 (1983),  193-196.] 
58.  Summing  over  cases  in  which  the  left  subtree  has  k internal  nodes,  we  have 


m— 1 n— 1 

tlmn  = [/  = 777  = 77  = 0]  + Cfcf(i-l)(m-k-l)(n-/c-l)  + Cn-\-kt(i~i)mk. 

k= 0 k=m 


Thus  the  triple  generating  function  t( v,  w,  z)  = Xu  m n timnVlwmzn  satisfies 
t(v,  w,  z)  = 1 + vwzC(wz)t(v,  w,  z)  + vzC(z)t(v,  w,  z)\ 


and  the  analogous  linear  relation  for  t(w,z ) = dt(v,w,z)/dv\v=i  follows,  because 
t(l,w,z)  = J2Zo  YZ.=o  Cnwmzn  = ( C(z ) - wC(wz))/(l  - w)  and  zC(z )2  = C(z)  - 1. 
Algebraic  manipulation  now  yields 

s _ C(z)  + wC(wz)  — (1  + w)  2wC(z)C(wz)  C(z)  — wC(wz) 

[W’Z>~  (1  - 777)2  ’ 


and  we  obtain  the  formula  tmn  — (m  + l)C„+i  - 2J2™=0(m  ~ k)CkCn~k  - Cn.  Now 


m — 1 

^^(k  + l)CkCn-i-k  = 
k= o 


777.  / 2 777  \ / 277  — 2m  \ 
2?7  V 777  / V 77  — 777  / 


can  be  proved  as  in  exercise  56,  and  it  follows  that 


/2777\  /2n  — 2777  \ (2m  + 1)(2t7  — 2 777  + 1) 
V 777  / V 77  — 777  / (?7+l)(77  + 2) 


for  0 < 777  < 77. 


[P.  Kirschenhofer,  J.  Combinatorics,  Information  and  System  Sciences  8 (1983),  44-60. 
For  higher  moments  and  generalizations,  see  W.  J.  Gutjahr,  Random  Structures  & 
Algorithms  3 (1992),  361-374;  A.  Panholzer  and  H.  Prodinger,  J.  Statistical  Planning 
and  Inference  101  (2002),  267-279.  Note  that  the  generating  function  t(v,w,z)  yields 


tlmn  ''z  ^ ^ ^ ^(m  — fc)(m  — l)C(n  — ro  — 7 + 7c)(n  — m — 1)  ■ 


Using  the  fact  that  Ylk  (r)C(»-/c)(m-i)  = C'(n_r)(m+r)  when  777  > 1,  we  obtain  the 
formula  tmn  + Cn  — Sfc(fc+l)C(m-fc)(m-l)  C'(n-m)(n-m+k+ 1)>  a sum  that  can  therefore 
(surprisingly)  be  expressed  in  closed  form.] 
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59.  T(w,z)  — — — ~ — wzC(z)C(wz)+zC(z)T(w,  z)  + wzC(wz)T(wz) 

= w((C(z)+C(wz)-2)/z  - (1  +w)C(z)C(wz)  - (1  -w){C(z)-C(wz))) 

(1  — w)2 

Hence  Tm„  = tmn  — J2k=m  CkCn-k ■ [Is  there  a combinatorial  proof?]  And 


/2m\  /2n+2— 2m\ 4ra(n+l  — m)  + n + 1 1 

V m ) l n+l-m  ) 2(n+  l)(n  + 2)  2Cn+1  ~ 


for  1 < m < n. 


60.  (a)  It  is  the  number  of  right  parentheses  in  co-atoms.  (Therefore  it  is  also  the 
number  of  k for  which  W2k-i  < 0 in  the  associated  “worm  walk.”) 

(b)  For  convenience  let  d(‘(’)  = +1  and  d(‘)’)  = —1. 

Al.  [Initialize.]  Set  i 4—  j 4—  1 and  k 4—  2n. 

A2.  [Done?]  Terminate  the  algorithm  if  j > k.  Otherwise  set  aj  4—  ‘(\  j 4—  j + 1. 

A3.  [Atom?]  If  bi  = *)’,  set  s 4 1,  i <—  i + 1,  and  go  to  A4.  Otherwise  set  s 4-  1, 

i <-  i + 1,  and  while  s > 0 set  a,  4-  bi,  j 4-  j + 1,  s 4-  s + d(bi),  i<-i  + 1.  Return 
to  A2. 


A4.  [Co-atom.]  Set  s s + d(bi).  Then  if  s < 0,  set  afc4—  bi,  k 4—  k — 1,  * < — i + 1,  and 
repeat  step  A4.  Otherwise  set  ‘) k <—  k — 1,  i 4—  i + 1,  and  return  to  A2.  | 

(c)  The  defect-11  inverse  of  (l)  is  (()))((())))))(()((())(()))(((.  In  general 
we  find  it  by  locating  the  subscript  m just  before  the  Zth-from-last  right  parenthesis,  and 
the  indices  {u0,v o),  . . . , (us- i,us_i)  of  matching  parentheses  such  that  Uj  < m < Vj. 

11.  [Initialize.]  Set  c <-  j <-  s i-  0,  k 4-  m +-  2 n,  and  u0  2n  + 1. 

12.  [Scan  right  to  left.]  If  k = 0,  go  to  15;  if  ak  = ‘)\  go  to  13;  if  ak  = ‘ go  to  14. 

13.  [Process  a *)’.]  Set  r,  <-  k,  j <-  j + 1,  c +-  c + 1.  If  c = l,  set  m «-  A;  - 1,  s +-  j, 
and  u„  4—  fc.  Then  decrease  A:  by  1 and  return  to  12. 

14.  [Process  a ‘ ( .]  (At  this  point  the  left  parenthesis  ak  matches  the  right  parenthesis 
arj_1-)  Set  j 4—  j — 1.  If  r-j  > m,  set  Uj  4—  k and  Vj  4 — r^.  Then  decrease  fc  by  1 
and  return  to  12. 


15.  [Prepare  to  permute.]  Set  z 4 — j 4 — 1,  A:  4 — 2 n,  and  c 4-  0. 

16.  [Permute.]  While  j uc,  set  bi  4—  a,j,  i 4—  i+  1,  j 4—  j + 1.  Then  terminate  if  c = s; 

otherwise  set  bt  4-  i <-  i + 1,  j 4-  j + 1.  While  k / vc,  set  bi  4-  a*,  i 4-  i + 1, 
k k — 1.  Then  set  6;  4—  ‘ i f-  i + 1,  A:  4—  k — 1,  c4—  c+1,  and  repeat  step  16.  | 

Notes:  The  fact  that  exactly  C„  balanced  strings  of  length  2 n have  defect  l,  for 
0 < / < n.  was  discovered  by  P.  A.  MacMahon  [Philosophical  Transactions  209  (1909), 
153-175,  §20],  then  rediscovered  by  K.  L.  Chung  and  W.  Feller  [Proc.  Nat.  Acad.  Sci. 
35  (1949),  605-608],  using  generating  functions.  A simple  combinatorial  explanation 
was  found  subsequently  by  J.  L.  Hodges,  Jr.  [Biometrika  42  (1955),  261-262],  who 
observed  that  if  /?i  . . . f3r  has  defect  / > 0 and  if  /3k  = ak  is  its  rightmost  co-atom,  the 
balanced  string  j3x  . . . /3k_1  (fdk+1  . . . /3r ) a'^  has  defect  l — 1 (and  this  transformation  is 
reversible).  The  efficient  mapping  in  the  present  exercise  is  similar  to  a construction  of 
M.  D.  Atkinson  and  J.-R.  Sack  [Information  Processing  Letters  41  (1992),  21-23]. 

61.  (a)  Let  Cj  — 1 — bj\  thus  Cj  < 1,  ci  + • • • + cjv  = /,  and  we  must  prove  that 


)d 

ir 


§ 


re 


ci  + C2  + • ■ ■ + ck  < / if  and  only  if  k < N 
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holds  for  exactly  / cyclic  shifts.  We  can  define  c,  for  all  integers  j by  letting  Cj±N  = Cj. 
Let  us  also  define  E j for  all  j by  letting  E0  = 0 and  E j = Ej_i  + c3  ; then  T,j+Nt  = 
Ej  A ft,  and  E^+i  < E j + 1 . It  follows  that  for  each  integer  x there  is  a smallest  integer 
j = j(x)  such  that  Ej  = x.  Moreover,  j(x)  < j(x  + 1);  and  j(x  + f)  = j(x)  + N.  Thus 
the  desired  condition  holds  if  and  only  if  we  shift  by  j(x)  mod  N for  x = 1,  2,  . . . , or  /. 
(The  history  of  this  important  lemma  is  discussed  in  answer  2. 3. 4. 4-32.) 

(b)  Start  with  / m <—  s 0.  Then  for  k = 1,  2,  . . . , N (in  this  order)  do  the 
following:  Set  s <—  s + 1 — bk\  and  if  s > m,  set  m -f-  s,  ji  <—  k,  and  /<—(/  + 1)  mod  /. 
The  answers  are  jo,  ■ ■ ■ , jf-i,  by  the  proof  in  part  (a). 

(c)  Start  with  any  string  &1&2  . . • b;v  containing  rij  occurrences  of  j for  0 < j < t. 
Apply  a random  permutation  to  this  string,  then  apply  the  algorithm  of  part  (b). 
Choose  randomly  between  ( j0 , ■ ■ ■ ,jf-i)  and  use  the  resulting  cyclic  shift  as  a preorder 
sequence  to  define  the  forest. 

[See  L.  Alonso,  J.  L.  Remy,  and  R.  Schott,  Algorithmica  17  (1997),  162  -182,  for 
an  even  more  general  algorithm.] 


62.  Bit  strings  (R  . . . Z„,  n . . . r„)  are  valid  if  and  only  if  61  ...  6n  is  valid  in  exercise  20, 
where  bj  = lj  + rj.  Therefore  we  can  use  exercise  61.  [See  J.  F.  Korsh,  Information 
Processing  Letters  45  (1993),  291-294.] 


64.  X — 2 k + b where  ( k,b ) = (0,1),  (2,1),  (0,0),  (5,1),  (6,0),  (1,1);  eventually 
LqLi  ...  L12  = 5 11  3 4 0 7 9 8 1 6 10  12  2. 

65.  See  A.  Panholzer  and  H.  Prodinger,  Discrete  Mathematics  250  (2002),  181  195; 
M.  Luczak  and  P.  Winkler,  Random  Structures  <fc  Algorithms  24  (2004),  420-443. 

66.  (a)  “Shrink”  the  white  edges,  merging  the  nodes  that  they  connect.  For  example, 

t/X  ^ sfc 

are  the  ordinary  trees  that  correspond  to  the  eleven  Schroder  trees  depicted  for  n = 3. 
Under  this  correspondence  a left  link  means,  “here  is  a child”;  a white  right  link  means, 
“look  here  for  more  children”;  a black  right  link  means,  “here’s  the  last  child.” 

(b)  Mimic  Algorithm  L,  but  between  rotations  use  an  ordinary  Gray  binary  code 
to  run  through  all  color  patterns  of  whatever  right  links  are  present.  (The  case  n = 3 
has,  in  fact,  been  illustrated  in  the  example.) 

Note  that  Schroder  trees  also  correspond  to  series-parallel  graphs,  as  in  (53).  They 
do,  however,  impose  an  order  on  the  edges  and/or  superedges  that  are  joined  in  parallel; 
so  they  correspond  more  precisely  to  series-parallel  graphs  as  embedded  in  the  plane 
(and  with  edges  and  vertices  unlabeled,  except  for  s and  t). 

67.  S(z)  = 1 + z5(2)(l  + 2 (S(z)  — 1)),  because  1 + 2 (S(z)  — 1)  enumerates  the  right 
subtrees;  therefore  S(z ) = (1  + z — \/l  - 6z  + z2)/(4z). 

Notes:  We’ve  seen  Schroder  numbers  in  exercise  2.3.4.4-31,  where  G(z)  = zS(z); 
and  in  exercise  2.2.1-11,  where  bn  = 2S'n_i  for  n > 2 and  where  we  found  the  recurrence 
(n  — 1)5,!  = (6n  — 3)Sn-i  — (n  — 2)5n-2-  They  grow  asymptotically  as  explored  in 
exercise  2.2.1-12.  A triangle  of  numbers  Spq , analogous  to  (22),  can  be  used  to  generate 
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random  Schroder  trees.  These  numbers  satisfy 


Spq  — Sp(,_!)  + S(p_!),  + S(p—2)q  H h Soq  — 5p(g_i)  + 2S(p_i)g  - S(p— !)(,_  1) 


g-P+1 

<7+1 


= [«;p29]  S(wz)/(1  — zS(wz )); 


the  double  generating  function  on  the  last  line  is  due  to  Emeric  Deutsch.  Many 
other  properties  of  Schroder  trees  are  discussed  in  Richard  Stanley’s  Enumerative 
Combinatorics  2 (1999),  exercise  6.39. 

68.  A single  row  that  contains  only  the  empty  string  e.  (The  general  rule  (36)  for 
going  from  n - 1 to  n converts  this  row  into  ‘0  T,  the  pattern  of  order  1.) 

69.  The  first  (8)  = 20  rows  are  the  Christmas  tree  pattern  of  order  6,  if  we  ignore  the 
TO’  at  the  beginning  of  each  string.  The  pattern  of  order  7 is  a bit  more  difficult  to  see; 
but  there  are  (3)  = 35  rows  in  which  the  leftmost  entry  begins  with  0.  Disregard  the 
rightmost  string  in  all  such  rows,  and  ignore  the  0 at  the  beginning  of  each  remaining 
string.  (Other  answers  are  also  possible.) 

70.  If  <7  appears  in  column  k of  the  Christmas  tree  pattern,  let  o'  be  the  string  in 
column  n — k of  the  same  row.  (If  we  think  of  parentheses  instead  of  bits,  this  rule 
takes  the  mirror  reflection  of  the  free  parentheses  in  the  sense  of  answer  11,  by  (39).) 

71.  Mtn  is  the  sum  of  the  t largest  binomial  coefficients  (£),  because  each  row  of  the 
Christmas  tree  pattern  can  contain  at  most  t elements  of  S,  and  because  we  do  get 
such  a set  S by  choosing  all  strings  o with  (n  — t)/ 2 < u(o)  < (n  + t — l)/2.  (The 
formula 

E (l) 

n — t<2k<.n-\-t  — l 

is  about  as  simple  as  possible;  however,  special  formulas  like  M(2 )n  = Mn+ 1 hold  for 
small  t , and  we  also  have  Mtn  = 2n  for  t > n.) 

72.  You  get  Msn,  the  same  number  as  in  the  previous  exercise.  In  fact,  one  can  prove 
by  induction  that  there  are  exactly  (n"fc)  - (fc"J  rows  of  length  s + n - 2k  > 0. 

73.  011001001000000000100101001100,  111001011011111111101101011100; see  (38). 

74.  By  the  lexicographic  property,  we  want  to  count  the  number  of  rows  whose  right- 
most elements  have  the  respective  forms  0*29,  10*28,  110*27,  111000*24,  11100100*22, 
111001010*21,  11100101100*19,  111001011010*18,  1110010110110*17,  ...,  namely  all 
30-bit  strings  that  precede  r = 111001011011111111101101011100. 

If  0 has  p more  Is  than  0s,  the  number  of  Christmas  tree  rows  ending  with  9*n  is 
the  same  as  the  number  of  rows  ending  with  lp*n;  and  this  is  M(p+1)„,  by  exercise  71, 
because  all  such  rows  are  the  n-step  descendants  of  the  starting  row  ‘0P  0P-11  ...  lp’. 
Consequently  the  answer  is  M0(29)  + M1(28)  + M2(27)  + M1(24)  -| b M(12)3  + 

M(13)2  = Efei  1 M(  2k-i-Zk)(n-zk)  = 0 +0  + O + (iD  + (12)  + • • - + 8 + 4 = 84867708, 
where  (z  1, . . . , Z21)  = (1,2, 3, 6, ... , 27,  28)  is  the  sequence  of  places  where  Is  occur  in  r. 

75.  We  have  = Afn_2,  because  row  is  the  bottom  descendant  of  the  first 

row  in  (33).  We  also  have  r\n+\  - rjn)  = Mj(n_W)  - = M0+1)(n_2_j) 


id 


r 


l 


re 
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by  the  formula  in  answer  74,  because  the  relevant  sequence  z\  . . . z„~i  for  row  r 
l3  01n~1~3 . Therefore,  since  Mjn/Mn  -4  j for  fixed  j as  n -4  oo,  we  have 


r(n) 

lim 

n—too  Mn 


E 


2k+ 1 


= 1 - 


3 + 2 
2J+1  ' 


(n) 


And  we’ve  also  implicitly  proved  that  Mk(n_k)  = Mn+ 1 — 1. 

76.  The  first  (2^*)  elements  of  the  infinite  sequence 

Q = 1313351313351335355713133513133513353557131335133535571335355735575779. . . 


are  the  row  sizes  in  the  pattern  of  order  2 n;  this  sequence  Q = <?i<?2<73  ...  is  the  unique 
fixed  point  of  the  transformation  that  maps  1 s4  13  and  n s4  (n— 2)nn(n+2)  for  odd 
n > 1,  representing  two  steps  of  (36). 

Let  f(x)  = limsup^^^  s(fiM„])/n  for  0 < x < 1.  This  function  apparently 
vanishes  almost  everywhere;  but  it  equals  1 when  x has  the  form  (<71  + • • • + ©)/ 2n, 
because  of  answer  72.  On  the  other  hand  if  we  define  g(x)  = limn_>00  s(\xMn])/\/n, 
the  function  g(x)  appears  to  be  measurable,  with  f*  g(x ) dx  = y/n,  although  g(x)  is 
infinite  when  f(x)  > 0.  (Rigorous  proofs  or  disproofs  of  these  conjectures  are  solicited.) 

77.  The  hint  follows  from  (39),  by  considering  worm  walks;  so  we  can  proceed  thus: 

XI.  [Initialize.]  Set  a-j  <—  0 for  0 < j < n;  also  set  x e—  1.  (In  the  following  steps  we 
will  have  x = 1 + 2(ai  + • • • + a„).) 

X2.  [Correct  the  tail.]  While  x < n,  set  ax  ■<—  1 and  x x + 2. 

X3.  [Visit.]  Visit  the  bit  string  ai  . . . an. 

X4.  [Easy  case?]  If  an  = 0,  set  an  4—  1,  x x + 2,  and  return  to  X3. 

X5.  [Find  and  advance  dj.]  Set  an  0 and  j <—  n-  1.  Then  while  a.j  = 1,  set  a}  •<—  0, 
x <-  x — 2,  and  j <—  j — 1 . Stop  if  j = 0;  otherwise  set  aj  <—  1 and  go  back  to  X2.  | 

78.  True,  by  (39)  and  exercise  11. 

79.  (a)  List  the  indices  of  the  0s,  then  the  indices  of  the  Is;  for  instance,  the  bit  string 
in  exercise  73  corresponds  to  the  permutation  1 4 5 7 8 10  11  12  13  20  23  25  29  30  2 3 
6 9 14  15  16  17  18  19  21  22  24  26  27  28. 

(b)  Using  the  conventions  of  (39),  the  P tableau  has  the  indices  of  left  parentheses 
and  free  parentheses  in  its  top  row,  other  indices  in  the  second  row.  Thus,  from  (38), 


1 

2 

3 

6 

8 

9 

11 

12 

13 

14 

15 

16 

17 

18 

19 

21 

22 

24 

26; 

27 

28 

4 

5 

7 

10 

20 

23 

25 

29 

30 

[See  K.-P.  Vo,  SIAM  J.  Algebraic  and  Discrete  Methods  2 (1981),  324-332,  for  a 
generalization  to  chains  of  submultisets.] 

80.  This  curious  fact  is  a consequence  of  exercise  79  together  with  Theorem  6 in  the 
author’s  paper  on  tableaux;  see  Pacific  J.  Math.  34  (1970),  709-727. 

81.  Suppose  cr  and  cr'  belong  respectively  to  chains  of  length  s and  s'  in  the  Christmas 
tree  patterns  of  order  n and  n' . At  most  min(s,  s')  of  the  ss'  pairs  of  strings  in  those 
chains  can  be  in  the  biclutter.  Furthermore,  because  of  (39),  those  ss'  pairs  of  strings 
actually  constitute  exactly  min(s,s/)  chains  in  the  Christmas  tree  pattern  of  order 
n + n , when  they  are  concatenated.  Therefore  the  sum  of  min(s,  s')  over  all  pairs  of 
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chains  is  Mn+ni,  and  the  result  follows.  We  have  incidentally  proved  the  nonobvious 
identity 

'y  ' min(m  -j-  1 2j,  n + 1 2 k)  Cj^m—j)  Ck(n—k)  ~ Mm+n • 

j,  k 

Notes:  This  extension  of  Sperner’s  theorem  was  proved  independently  by  G.  Katona 
[Studia  Sci.  Math.  Hungar.  1 (1966),  59-63]  and  D.  J.  Kleitman  [Math.  Zeitschrift  90 
(1965),  251  259].  See  Greene  and  Kleitman,  J.  Combinatorial  Theory  A20  (1976), 
80-88,  for  the  proof  given  here  and  for  further  results. 

82.  (a)  There  is  at  least  one  evaluation  in  each  row  m;  there  are  two  if  and  only 
if  s(m)  > 1 and  the  first  evaluation  yields  0.  Thus  if  / is  identically  1,  we  get  the 
minimum,  Mn;  if  / is  identically  0,  we  get  the  maximum,  Mn  + ]T]m[s(m)  > 1]  = Mn+ 1. 

(b)  Let  f(x{m,n/ 2))  = 0 in  the  Cn/2  cases  where  s(m)  = 1;  otherwise  let 
f(x(m,a))  = 1,  where  a is  defined  by  the  algorithm.  When  n is  odd,  this  rule  implies 
that  /(er)  is  always  1;  but  when  n is  even,  /(<r)  = 0 if  and  only  if  cr  is  first  in  its  row. 
(To  see  why,  use  the  fact  that  the  row  containing  cr'  in  (41)  always  has  size  s — 2.) 
This  function  / is  indeed  monotonic;  for  if  cr  < r and  if  a has  a free  left  parenthesis, 
so  does  r.  For  example,  in  the  case  n = 8 we  have 

f(x  1, . . . ,xs)  = xs  V x6x7  V X4Xs(x6  V xr)  V *2X3  (2:4  (xs  Vi6V  x7)  V x5(x6  V X7)). 

(c)  In  these  circumstances  (45)  is  the  solution  for  all  n. 

83.  At  most  3 outcomes  are  possible  in  step  114  — in  fact,  at  most  2 when  s(m)  = 1. 
[See  exercise  5.3.4-31  for  sharper  bounds;  in  the  notation  of  that  exercise,  there  are 
exactly  Sn  + 2 monotone  Boolean  functions  of  n Boolean  variables.] 

84.  For  this  problem  we  partition  the  2n  bit  strings  into  Mn  blocks  instead  of  chains, 
where  the  strings  {cr  1, . . . , crs } of  each  block  satisfy  ||Acrf  — AcrJ||  > 1 for  i / j\  then 
at  most  one  bit  string  per  block  can  satisfy  ||A<7r  — fe||  < 

Let  A'  denote  the  first  n — 1 columns  of  A,  and  let  v be  the  nth  column.  Suppose 
(crj, . . . , as}  is  a block  for  A1,  and  number  the  subscripts  so  that  vTA'crJ  is  the 
minimum  of  vTA'aJ . Then  rule  (36)  defines  appropriate  blocks  for  A,  because  we  have 
\\A(at0)T  - A{<tj 0HI  = \\A{aA)T  - A(os  1)^||  = || A'af  - A’aJ ||  and 

\\Aiaj  1)T  - A(ffl 0)T||2  = || A’aj  + v - A'a f ||2 

= ||  A'icrj  - a.fW2  + IMI2  + 2vTA’(a]  - <t,)T  > ||u||2  > 1. 

[And  more  is  true;  see  Advances  in  Math.  5 (1970),  155-157.  This  result  extends  a 
theorem  of  J.  E.  Littlewood  and  A.  C.  Offord,  Mat.  Sbornik  54  (1943),  277-285,  who 
considered  the  case  m = 2.] 

85.  If  V has  dimension  n — m,  we  can  renumber  the  coordinates  so  that 

(1,  0,  . . . , 0,  xn,  . . . , xim) 

(0,  1,  . . . , 0,  X21,  . . . , X2 m) 

(0,  0,  . . . , 1,  . . . , X(n_m)m) 

is  a basis,  with  none  of  the  row  vectors  Vj  = (xji, . . . , Xjm ) entirely  zero.  Let  vn_m+i  = 
( — 1, 0, . . . , 0),  . . . , vn  = (0, 0, . . . , — 1).  Then  the  number  of  0-1  vectors  in  V is  the  num- 
ber of  0-1  solutions  to  Ax  = 0,  where  A is  the  m x n matrix  with  columns  v\,  . . . , vn. 
But  this  quantity  is  at  most  the  number  of  solutions  to  ||Ax||  < \ min(||m||, . . . , )|un||), 
which  is  at  most  Mn  by  exercise  84. 


802  ANSWERS  TO  EXERCISES 


7.2. 1.6 


Conversely,  the  basis  with  m = 1 and  Xji  = (-1)J_1  yields  Mn  solutions.  [This 
result  has  application  to  electronic  voting;  see  Golle’s  Ph.D.  thesis  (Stanford,  2004).] 

86.  First  reorder  the  4-node  subtrees  so  that  their  level  codes  are  0121  (plus  a con- 
stant); then  sort  larger  and  larger  subtrees  until  everything  is  canonical.  The  re- 
sulting level  codes  are  0 1 2 3 4 3 2 1 2 3 2 1 2 0 1,  and  the  parent  pointers  are 
012343218981  12  0 14. 

87.  (a)  The  condition  holds  if  and  only  if  ci  < • • • < ck  > ck+ 1 > • • • > cn  for  some  k, 

so  the  total  number  of  cases  is  J2k  = 2n_1. 

(b)  Note  that  cx...ck  = c\  . . . c'k  if  and  only  if  p1...pk  = pi  . . . p'k ; and  in  such 
cases,  ck+1  < c'k+1  if  and  only  if  pk+1  < p'k+1. 

88.  Exactly  An+k  forests  are  visited,  and  Ak  of  them  have  pk  = • • • = pn  = 0. 
Therefore  04  is  performed  An  times;  and  pk  is  changed  Ak+i  - 1 times  in  step  05,  for 
1 ^ & < n.  Step  05  also  changes  p„  a total  of  An  — 1 times.  The  average  number  of 
mems  per  visit  is  therefore  only  2+3/(a-l)+0(l/n)  « 3.534,  if  we  keep  pn  in  a register. 
[See  E.  Kubicka,  Combinatorics,  Probability  and  Computing  5 (1996),  403-417.] 

89.  If  step  05  sets  pn  4-  p3  exactly  Qn  times,  it  sets  pk  4-  p3  exactly  Qk  + Afc+i  - 
Ak  times,  for  1 < k < n,  because  every  prefix  of  a canonical  pi  . . . pn  is  canonical. 
We  have  (Qi,Q2,--.)  = (0,0,1,2,5,9,22,48,118,288,...);  and  one  can  show  that 
Qn  = Sd>i  Ei<c<  n/d—i  a(n—cd)(n—cd—d) , where  ank  is  the  number  of  canonical  parent 
sequences  p\  . . . pn  with  pn  = k.  But  these  numbers  ank  remain  mysterious. 

90.  (a)  This  property  is  equivalent  to  2.3.4.4-(7);  vertex  0 is  the  centroid. 

(b)  Let  m = \n/2\.  At  the  end  of  step  Ol,  set  pm+1  4-  0,  and  also  p2rn+  i 4-  0 if 
n is  odd.  At  the  end  of  step  04,  set  i 4-  j and  while  pt  / 0 set  i 4-  (Then  i is  the 
root  of  the  tree  containing  j and  k .)  At  the  beginning  of  step  05,  if  k = i + m and 
i < j , set  j 4—  i and  d 4—  to. 

(c)  If  n is  even,  there  are  no  bicentroidal  trees  with  n + 1 vertices.  Otherwise  find 
all  pairs  (pi  . . .pm,p[  . . . p'n ) of  canonical  forests  on  m=  [n/2j  nodes,  with  p[  . . . p'rn  > 
pi  ...p";  let  pi  = 0,  Pj+i  = p'  + 1,  and  pm+J+i  = (p"  + m + l)[p"  > 0]  for  1 < j < m. 
(Two  incarnations  of  Algorithm  O will  generate  all  such  sequences.  This  algorithm  for 
free  trees  is  due  to  F.  Ruskey  and  G.  Li;  see  SODA  10  (1999),  S939-S940.) 

91.  Use  the  following  recursive  procedure  W(n):  If  n < 2,  return  the  unique  n-node 
oriented  tree.  Otherwise  choose  positive  integers  j and  d so  that  a given  pair  ( j , d) 
is  obtained  with  probability  dAdAn-jd/((n  — l)An).  Compute  random  oriented  trees 
T'  4—  W(n  — jd)  and  T"  4—  W(d).  Return  the  tree  T obtained  by  linking  j clones  of  T" 
to  the  root  of  T' . [ Combinatorial  Algorithms  (Academic  Press,  1975),  Chapter  25.] 

92.  Not  always.  [R.  L.  Cummins,  in  IEEE  Trans.  CT-13  (1966),  82-90,  proved  that 
the  graph  of  S(G)  always  contains  a cycle;  see  also  C.  A.  Holzmann  and  F.  Harary, 
SIAM  J.  Applied  Math.  22  (1972),  187-193.  But  their  constructions  are  unsuitable  for 
efficient  computation,  because  they  require  foreknowledge  of  the  parity  of  the  sizes  of 
intermediate  results.] 

93.  Yes.  Step  S7  undoes  step  S3;  step  S9  undoes  the  deletions  of  step  S8. 

94.  For  example,  we  can  use  depth-first  search,  with  an  auxiliary  table  ftj  ..  ,bn: 

i)  Set  b\  . . . bn  4—  0 ...  0,  then  v 4—  1,  w 4—  1,  bi  4—  1,  and  k 4—  n — 1. 

ii)  Set  e 4—  n„_i.  While  te  / 0,  do  the  following  substeps: 

a)  Set  u 4 — te.  If  bu  / 0,  go  to  substep  (c). 

b)  Set  bu  4—  w,  w 4—  u,  ak  4—  e,  k 4—  k — 1.  Terminate  if  k = 0. 

c)  Set  e 4 — ne. 
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iii)  If  w 1,  set  v 4—  w,  w <—  bw , and  return  to  (ii).  Otherwise  report  an  error:  The 
given  graph  was  not  connected. 

We  could  actually  terminate  as  soon  as  substep  (b)  reduces  k to  1,  since  Algorithm  S 
never  looks  at  the  initial  value  of  a i . But  we  might  as  well  test  for  connectivity. 

95.  The  following  steps  perform  a breadth-first  search  from  u,  to  see  if  v is  reachable 
without  using  edge  e.  An  auxiliary  array  b\  . . . bn  of  arc  pointers  is  used,  which  should 
be  initialized  to  0 ...  0 at  the  end  of  step  SI;  we  will  reset  it  to  0 ...  0 again. 

i)  Set  w <—  u and  bw  <—  v. 

ii)  Set  / <r-  nu-i-  While  tf  ^ 0,  do  the  following  substeps: 

a)  Set  v'  <—  tf.  If  bv / / 0,  go  to  substep  (d). 

b)  If  v'  / v,  set  bv'  <—  v,  bw  «—  v' , w <—  v' , and  go  to  substep  (d). 

c)  If  / / e © 1,  go  to  step  (v). 

d)  Set  / <—  ri.f. 

iii)  Set  u «—  bu.  If  u ^ v,  return  to  step  (ii). 

iv)  Set  u <-  te.  While  u / v,  set  w «-  bu,  bu  <-  0,  u 4-  w.  Go  to  S9  (e  is  a bridge). 

v)  Set  u i te . While  u ^ v,  set  w ■<—  bu,  bu  0,  u <—  w.  Then  set  u ■<—  te  again  and 

continue  step  S8  (e  is  not  a bridge). 

Two  quick  heuristics  can  be  used  before  starting  this  calculation:  If  du  = 1,  then  e is 
obviously  a bridge;  and  if  lie  / 0,  then  e is  obviously  a nonbridge  (because  there’s  an- 
other edge  between  u and  v).  Such  special  cases  are  detected  readily  by  the  breadth-first 
search,  yet  experiments  by  the  author  indicate  that  both  heuristics  are  definitely  worth- 
while. For  example,  the  test  on  lie  typically  saves  3%  or  so  of  the  total  running  time. 

96.  (a)  Let  e*,  be  the  arc  k — 1 — > k.  The  steps  in  answer  94  set  at,  t—  en+i_fc  for 

n > k > 1.  Then  at  level  k we  shrink  en-k , for  1 < k < n — 1.  After  visiting  the 

(unique)  spanning  tree  en_i  . . . e2en,  we  unshrink  en_*,  and  discover  quickly  that  it  is 
a bridge,  for  n — 1 > k > 1.  Thus  the  running  time  is  linear  in  n;  in  the  author’s 
implementation  it  turns  out  to  be  exactly  102n  - 226  mems  for  n > 3. 

However,  this  result  depends  critically  on  the  order  of  the  edges  in  the  initial 
spanning  tree.  If  step  SI  had  produced  “organ-pipe  order”  such  as 


en/ 2+1  en/2  en/2+2  en/2-i  • . . en_i  e2 


in  positions  a2-..an-\  when  n is  even,  the  running  time  would  have  been  Q(n2), 
because  f 1(n)  of  the  bridge  tests  would  each  have  taken  Q(n)  steps. 

(b)  Now  ak  is  initially  en-k  for  n > k > 1,  where  e\  is  the  arc  n — > 1.  The 
spanning  trees  visited,  when  n > 4,  are  respectively  en_2...eien,  e„_2  . . . eien_i, 
en_2  . . . e2en-ie„,  en-2  ■ • • e3e„_ie„ei,  ...,  en-iene\  . . . en-3-  Following  the  tree 
en-2  • • • ei,+2e„_ienei  . . . ek  the  computations  move  down  to  level  n — fc  — 3 and  up 
again,  for  0 < k < n — 4;  the  bridge  tests  are  all  efficient.  Thus  the  total  running  time 
is  quadratic  (in  the  author’s  version,  exactly  35.5n2  + 7.5n  - 145  mems,  for  n > 5). 

Incidentally,  Pn  is  board  (n,  (1, 0.  0, 1 . 0. 0)  in  the  notation  of  the  Stanford  Graph- 
Base,  and  Cn  is  board (n.  0, 0, 0. 1, 1,0);  the  SGB  vertices  are  named  0 through  n — 1. 


97. 


Yes,  when  {s,t}  is  {1,2},  {1,3},  {2,3},  {2,4},  or  {3,4},  but  not  {1,4}. 


98.  A'  = 


; this  is  the  “dual  planar  graph”  of  the  planar  graph  A. 


(The  near  trees  of  A'  are  complements  of  the  spanning  trees  of  A,  and  vice  versa.) 
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99.  The  stated  method  works,  by  induction  on  the  size  of  the  tree,  for  essentially  the 
same  reasons  that  it  worked  for  n-tuples  in  Section  7.2. 1.1 — but  with  the  additional 
proviso  that  we  must  successively  designate  each  child  of  an  uneasy  node. 

Leaf  nodes  are  always  passive,  and  they  are  neither  easy  nor  uneasy;  so  we  will 
assume  that  the  branch  nodes  are  numbered  1 tom  in  preorder.  Let  fp  = p for  all 
branch  nodes,  except  when  p is  a passive  uneasy  node  for  which  the  nearest  uneasy  node 
to  its  right  is  active;  in  the  latter  case,  fp  should  point  to  the  nearest  active  uneasy 
node  to  its  left.  (For  purposes  of  this  definition,  we  imagine  that  artificial  nodes  0 
and  m + 1 are  present  at  the  left  and  right,  both  of  which  are  uneasy  and  active.) 

FI.  [Initialize.]  Set  fp  p for  0 < p < m;  also  set  t0  1,  v0  <—  0,  and  set  each  zv  so 
that  rZp  = dp. 

F2.  [Select  node  p.]  Set  ?t-m;  then  while  tq  = vq  set  q <-  q — 1.  Set  p fq  and 
fq  A-  q;  terminate  the  algorithm  if  p = 0. 

F3.  [Change  dp.]  Set  s dp,  s'  <—  ra,  k vp,  and  dp  t—  s'.  (Now  k = vs  ^ tv.) 

F4.  [Update  the  values.]  Set  q •(-  s and  vq  •(-  k © 1.  While  dq  / 0,  set  q e-  dq  and 
vq  «-  k ® 1.  (Now  q is  a leaf  that  has  entered  the  config  if  k = 0,  left  it  if  k = 1.) 
Similarly,  set  q <—  s'  and  vq  <—  k.  While  dq  ^ 0,  set  q «—  dq  and  vq  4—  k.  (Now  q is 
a leaf  that  has  left  the  config  if  k = 0,  entered  it  if  k = 1.) 

F5.  [Visit.]  Visit  the  current  config,  represented  by  all  the  leaf  values. 

F6.  [Passivate  p?]  (All  uneasy  nodes  to  p’s  right  are  now  active.)  If  dp  / zp,  return 
to  step  F2.  Otherwise  set  zp  <-  s,  q <-  p - 1;  while  tq  = vq,  set  q q - 1.  (Now 
q is  the  first  uneasy  node  to  the  left  of  p;  we  will  make  p passive.)  Set  fp  /,, 
fq  «—  q,  and  return  to  F2.  | 

Although  step  F4  may  change  uneasy  nodes  to  easy  nodes  and  vice  versa,  the  focus 
pointers  need  not  be  updated,  because  they’re  still  set  correctly. 

100.  A complete  program,  called  GRAYSPSPAN,  appears  on  the  author’s  website.  Its 
asymptotic  efficiency  can  be  proved  by  using  the  result  of  exercise  110  below. 

102.  If  so,  ordinary  spanning  trees  can  be  listed  in  a strong  revolving-door  order , where 
the  edges  that  enter  and  leave  at  each  step  are  adjacent. 

Interesting  algorithms  to  generate  all  the  oriented  spanning  trees  with  a given 
root  have  been  developed  by  Harold  N.  Gabow  and  Eugene  W.  Myers,  SICOMP  7 
(1978),  280-287;  S.  Kapoor  and  H.  Ramesh,  Algorithmica  27  (2000),  120-130. 

103.  (a)  Toppling  increases  (*o,  x\,,, . , xn)  lexicographically,  but  does  not  change  xo  + 

• • • + xn.  If  we  can  topple  at  both  V)  and  V),  either  order  gives  the  same  result. 

(b)  Adding  a grain  of  sand  changes  the  16  stable  states  as  follows: 

Given  0000  0001  0010  0011  0100  0101  0110  0111  1000  1001  1010  1011  1100  1101  1110  1111 
+ 0001  0001  0010  0011  0001  0101  0110  0111  0101  1001  1010  1011  1001  1101  1110  1111  1101 

+ 0010  0010  0011  0001  0010  0110  0111  0101  0110  1010  1011  1001  1010  1110  1111  1101  1110 

+ 0100  0100  0101  0110  0111  1000  1001  1010  1011  1100  1101  1110  nil  0100  0101  0110  0111 

+ iooo  iooo  iooi  ioio  ion  noo  noi  mo  nil  0100  0101  ono  0111  1000  1001 1010  ion 

The  recurrent  states  are  the  nine  cases  with  X!  + x2  > 0 and  x3  + x4  > 0.  Notice  that 
repeated  addition  of  0001  leads  to  the  infinite  cycle  0000  -a  0001  -a  0010  -A  0011  -a 
0001  -A  0010  -A  • • • ; but  the  states  0001,  0010,  and  0011  are  not  recurrent. 

(c)  If  x = cr(x  + t)  then  also  x = o(x  + kt)  for  all  k > 0.  All  components 
of  t are  positive;  thus  x = <j(x  + max(di, . . . ,dn)t)  is  recurrent.  Conversely,  suppose 
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x — a(d  + y),  where  all  yi  > 0;  then  d + y + t topples  to  x + t and  it  also  topples  to 
a(d)  + y + t = d + y.  Therefore  a(x  + t)  = a(d  + y)  = x. 

(d)  There  are  N = det(a^)  classes,  because  elementary  row  operations  (exercise 
4.6.1-19)  triangularize  the  matrix  while  preserving  congruence. 

(e)  There  are  nonnegative  integers  raj,  . . . , mn,  mi,  . . . , m'n  such  that 

x + miai  + • • • + mnan  = x + m!xa\  + ■ ■ ■ + m'nan  = y,  say. 

For  sufficiently  large  k,  the  vector  y + kt  topples  inmH b mn  steps  to  x + kt,  and 

in  mi  + • • • + m'n  steps  to  x'  + kt.  Therefore  x = a(x  + kt)  = cr(x'  + kt)  = x' . 

(f)  The  triangularization  in  (d)  shows  that  x = x + Ny  for  arbitrary  vectors  y. 
And  toppling  preserves  congruence;  hence  every  class  contains  a recurrent  state. 

(g)  Since  a = oi  + • • • + an  in  a balanced  digraph,  we  have  x = x + a.  If  x is 
recurrent,  we  see  in  fact  that  every  vertex  topples  exactly  once  when  x + a reduces  to  x, 
because  the  vectors  {ai, . . . ,an}  are  linearly  independent. 

Conversely,  if  u{x  + a)  = x we  must  prove  that  x is  recurrent.  Let  zm  = cr(ma); 
there  must  be  some  positive  k and  m with  zm+k  = zm.  Then  every  vertex  topples 
k times  when  zm  + ka  reduces  to  zm',  hence  there  are  vectors  yj  = (yji, . . . ,Vjn)  with 
yjj  > dj  such  that  (m  + k)a  topples  to  yj.  It  follows  that  x + n(m  + k)a  topples  to 
x + yi  + ■ ■ ■ + yn,  and  a(x  + y\  + ■ • • + yn)  = a(x  + n(m  + k)a)  = x. 

(h)  Treating  subscripts  cyclically,  the  spanning  trees  with  arcs  Vj  — >-  Vo  for  j = i\, 
...,  ik  have  n - k other  arcs:  Vj  -»  Vj_i  for  ii  < j < it  + qi  and  Vj  —t  Vj+i  for 
L + qi  < j < ii+i-  The  recurrent  states,  similarly,  have  xj  = 2 for  j = ii,  . . . , ik,  and 
Xj  = 1 for  ii  < j < ii+ 1,  except  that  Xj  = 0 when  j = it  + qt  and  qi  > 0. 

(i)  In  this  case  state  x = (x\, . . . , xn ) is  recurrent  if  and  only  if  (n  — x i, . . . ,n  — xn) 
solves  the  parking  problem  in  the  hint,  because  t = (1,...,1),  and  a sequence  that 
doesn’t  get  parked  leaves  a “hole”  that  stops  x + t from  toppling  to  x. 

Notes:  This  sandpile  model,  introduced  by  Deepak  Dhar  [Phys.  Review  Letters 
64  (1990),  1613-1616],  has  led  to  many  papers  in  the  physics  literature.  Dhar  noted 
that,  if  M grains  of  sand  are  introduced  at  random,  each  recurrent  state  is  equally 
probable  as  M —>  oo.  The  present  exercise  was  inspired  by  the  work  of  R.  Cori  and 
D.  Rossin,  European  J.  Combinatorics  21  (2000),  447-459. 

Sandpile  theory  proves  that  every  digraph  D yields  an  abelian  group  whose 
elements  correspond  somehow  to  the  oriented  spanning  trees  of  D with  root  Vo.  In 
particular,  the  same  is  true  when  D is  an  ordinary  graph,  with  arcs  u — >■  v and  v -4  u 
whenever  u and  v are  adjacent.  Thus,  for  example,  we  can  “add”  two  spanning  trees; 
and  some  spanning  tree  can  be  regarded  as  “zero.”  An  elegant  correspondence  between 
spanning  trees  and  recurrent  states,  in  the  special  case  when  D is  an  ordinary  graph, 
has  been  found  by  R.  Cori  and  Y.  Le  Borgne,  Advances  in  Applied  Math.  30  (2003), 
44-52.  But  no  simple  correspondence  is  known  for  general  digraphs  D.  For  example, 
suppose  n = 2 and  (eio,  ei2,  e2o,  e2i)  = (p,  q,  r,  s );  then  there  are  pr  + ps  + qr  oriented 
trees,  and  the  recurrent  states  correspond  to  generalized  two-dimensional  toruses  as  in 
exercise  7 -137.  Yet  even  in  the  “balanced”  case,  when  p + q > s and  r + s > q,  no  easy 
mapping  between  spanning  trees  and  recurrent  states  is  apparent. 

104.  (a)  If  det(a/  — C)  = 0,  there  is  a vector  x = (xi,..,  ,xn)T  such  that  Cx  = ax 
and  max(ii, . . . , xn)  = xm  — 1 for  some  m.  Then  a = axm  = cmm  - ^2j^m  e-mjXj  > 
Cmm  — EJ?:m  emi  = 0.  (Incidentally,  a real  symmetric  matrix  whose  eigenvalues  are 
nonnegative  is  called  positive  semidefinite.  Our  proof  establishes  the  well-known  fact 
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that  any  real  symmetric  matrix  with  cmm  > | cmj- 1 for  1 < m < n has  this 

property.)  Thus  a0  > 0;  and  a0  = 0 because  (7(1, . . . , 1)T  = (0, . . . , 0)T. 

(b)  det(xl  — C(G))  = x(x  — an) . . . (x  — an- 1);  and  the  coefficient  of  x is  ( — l)n_1n 
times  the  number  of  spanning  trees,  by  the  matrix  tree  theorem. 

(c)  det(a/  — C(Kn))  = det((a  — n)I  + J)  = (a  — n)n~1a  by  exercise  1.2.3-36; 
here  J is  the  matrix  of  all  Is.  The  aspects  are  therefore  0,  n,  . . . , n. 

105.  (a)  If  etJ  = a + be't]  we  have  C(G)  = nal  - aj  + bC(G').  And  if  C is  any  matrix 
whose  row  sums  are  zero,  the  identity 

det(*/  + yj-  zC)  = zn  det((x/ z)I  - C) 

can  be  proved  by  adding  columns  2 through  n to  column  1,  factoring  out  (x  + ny)/x, 
subtracting  y/x  times  column  1 from  columns  2 through  n,  then  subtracting  columns 
2 through  n from  column  1.  Therefore,  by  setting  x = a - na,  y = a,  z = b,  a = 1,  and 
b = ~ 1>  we  find  that  G has  the  aspects  0,  n — an-x,  . . . , n — cti.  (In  particular,  this 
result  agrees  with  exercise  104(c)  when  G'  is  the  empty  graph  AT  ■) 

(b)  Sort  (af), . . . , a'ni_1,  Qq  , . . . , Q^/_x}  into  order.  (An  easy  case,  for  variety.) 

(c)  Here  G = G'  © G",  so  G’s  aspects  are  (0,n'  + n" ,n"  + a'1; . . . ,n"  + 

n'  + a" , ■ ■ . , ri  +a",/_1}  by  (a)  and  (b).  (In  particular,  G is  A'm,n  when  G'  = Km  and 
G"  = Kn,  hence  the  aspects  of  Km,n  are  {0,  (n  - 1)  ■ m,  (m  - 1)  ■ n,  m + n}.) 

(d)  G(G)  = In'  ®C(G")-\-C(G')®Inii , where  In  denotes  the  nxn  identity  matrix 

and  (g>  denotes  the  direct  product  of  matrices.  The  aspects  of  C(G)  are  (a)  + a'/  | 
0<j<n',0<k<  n"}-  for  if  A and  B are  arbitrary  matrices  whose  eigenvalues 
are  {A  1 ^ • 5 A m } and  (pi, . . . , Hn},  respectively,  the  eigenvalues  of  A <g>  In  + Im  <g>  jB 

are  the  mn  sums  A j + y,k.  Proof:  Choose  S and  T so  that  S~AS  and  T~BT  are 
triangular.  Then  use  the  matrix  identity  (A  ® B)(C  <g>  D)  = AC  ® BD  to  show  that 
(S®T)  (A<g>In+Im®B)(S<giT)  = (S  AS)®In+Im®{T~BT).  (In  particular,  repeated 
use  of  this  formula  shows  that  the  aspects  of  the  rc-cube  are  { (")  -0,  (")  -2,  . . . , (")  -2 n}, 
and  Eq.  (57)  follows  from  exercise  104(b).) 

(e)  When  G is  a regular  graph  of  degree  d,  its  aspects  are  aj  = d — AJ+i,  where 
Ai  > ■ • • > An  are  the  eigenvalues  of  the  adjacency  matrix  A = (a,,).  The  adjacency 
matrix  of  G'  is  A!  = BTB  — d!  Ini , where  B = ( bjj ) is  the  nxn'  incidence  matrix  with 
entries  b = [edge  i touches  vertex  j],  and  where  n = rid'/ 2 is  the  number  of  edges. 
The  adjacency  matrix  of  G is  A = BBT  - 2 1„ . Now  we  have 

xn  det(a :Ini  — BTB)  = xn  det (xln  — BBT); 

this  identity  follows  from  the  fact  that  the  coefficients  of  det(xl  — A)  can  be  expressed  in 
terms  of  trace(Afc)  for  k = 1,  2,  . . . , via  Newton’s  identities  (exercise  1.2.9  -10).  So  the 
aspects  of  G are  the  same  as  those  of  G',  plus  n-ri  aspects  equal  to  2d1.  [This  result 
is  due  to  E.  B.  Vakhovsky,  Sibirskii  Mat.  Zhurnal  6 (1965),  44-49;  see  also  H.  Sachs, 
Wissenschaftliche  Zeitschrift  der  Technischen  Hochschule  Ilmen au  13  (1967),  405-412.] 

(f)  A = A'®  A" , so  the  aspects  are  {d"a'j+d'ak-a'jak  \ 0<j<ri,0<k<  n"). 

(g)  A(G)  = Ini  ® A + A'  ® In'i  + A!  ® A"  — ( Ini  + A')  <g>  (7n//  + A")  — In  yields 
the  aspects  { (d"  + 1 )a'j  + (d'  + 1 )a'/  - a/a/  \ 0 < j < ri , 0 < k < n"}. 

(h)  When  G'  is  regular,  we  can  make  S~  A' S a diagonal  matrix  with  entries  d' —a/, 
while  simultaneously  S~Jn/S  is  a diagonal  matrix  with  entries  (ri , 0, . . . , 0),  because 
(1, . . . , 1)T  is  an  eigenvector  of  both  A'  and  .Jn> . Thus,  by  the  formula  of  answer  7-96(c), 
t he  aspects  t urn  out  to  be  { d + (d1 - a'  ri  [ j = 0 ] ) (d"  - a/ ) + (d!  - a) ) (d"  - a'/.  - ri'  [ k = 0 ] ) | 
0 < j < n',  0 < k < ri'},  where  d = d'(ri'  — d")  + (ri  - d')d" . 
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(i)  A similar  argument  yields  the  scaled  aspects  {n"a'-  | 0 < j < n'}  of  G', 
together  with  n'  copies  of  shifted  aspects  {d'n”  + ak  | 1 < k < n"}  of  G". 

106.  (a)  If  a is  an  aspect  of  the  path  P„,  there’s  a nonzero  solution  (x0,  xx, . . . , xn+x) 
to  the  equations  axk  = 2xk  — Xk-i  — xk+x  for  1 < k < n,  with  xo  = xx  and  xn  = xn+x. 
If  we  set  xk  = cos(2fc  — 1)9,  we  find  x0  = xx  and  2xk  -xk-i -xk+x  = 2xk  - (2  cos  26)xk\ 
hence  2 — 2 cos  28  = 4sin20  will  be  an  aspect  if  we  choose  9 so  that  xn  = xn+i  and  so 
that  the  Ps  are  not  all  zero.  Thus  the  aspects  of  Pn  turn  out  to  be  ao„,  . . . , <T(n-i)„. 

We  must  have  ax  . . . an-i  = n,  by  exercise  104(b),  since  c(Pn)  = 1;  therefore 

m — 1 n—  1 

c(PmdPn)  = ^ J (ffjm  + CTkn). 

3 = 1 k = l 

(b,  c)  Similarly,  if  a is  an  aspect  of  the  cycle  Cn,  there’s  a nonzero  solution  to  the 
stated  equations  with  xn  = xo-  For  this  case  we  try  xk  = cos 2 k9  and  find  solutions 
when  6 = jw/n  for  0 < j < [n/2] . And  xk  = sin  k9  gives  further,  linearly  independent 
solutions  for  [n/2]  < j < n.  The  aspects  of  Cn  are  therefore  a0n,  <r2„,  •••,  V(2n-2)n\ 
and  we  have 

m-ln-l 

c(PmO  c„)  — Tl  (CTjm  + 0'(2fc)n)i  c(CmO  Cn)  = mn  icr(2j)m^~0'(2k)n)- 

3= 1 k= 1 j= l fc=l 

Let  fn(x)  = (x  + (Tin) . . . (x  + (T(n_1)n)  and  gn(x ) = (x  + a2n) . . . (x  + <T(2n_2)n). 
These  polynomials  have  integer  coefficients;  indeed,  fn(x)  = Un-i{x/2  + l)  and  gn(x)  = 
2(Tn(x/2  + 1)  — l)/x,  where  Tn(x)  and  U„(x)  are  the  Chebyshev  polynomials  defined 
byTn(cos9)  = cos  n6  and  Un(cos9)  = (sin(n  + l)0)/sin0.  The  calculation  of  c(PmnPn) 
can  be  reduced  to  the  evaluation  of  an  m x m determinant,  because  it  is  the  resultant 
of  fm{x)  with  fn{-x );  see  exercise  4.6.1  12.  Similarly,  Lc(pmoC<rl)  and  -i-c(CrnnCn) 
are  the  respective  resultants  of  fm(x)  with  g„(-x)  and  of  gm(x)  with  gn(~x). 

Let  an(x)  = ]/[d\n  fd{xY{n/d)-  thus  ax{x)  = 1,  q2(x)  = x + 2,  a3(x)  = (x  + 3)  x 
(x+  1),  a4(x)  = x2  + 4x  + 2,  a5{x)  = (x2  + 5x  + 5)(*2  + 3x  + 1),  a6(x)  = x2  + 4x  + 1, 
etc.  By  considering  so-called  field  polynomials  one  can  show  that  an(x ) is  irreducible 
over  the  integers  when  n is  even,  otherwise  it  is  the  product  of  two  irreducible  factors 
of  the  same  degree.  Similarly,  if  /3n(x)  = Ud\n  9d(x)^{n/d) , it  turns  out  that  /3n(x) 
is  the  square  of  an  irreducible  polynomial  when  n > 3.  These  facts  account  for  the 
presence  of  fairly  small  prime  factors  in  the  results.  For  example,  the  largest  prime 
factor  in  c(Pm  □ Pn)  for  m < n < 10  is  1009;  it  occurs  only  in  the  resultant  of  06 (a:) 
with  a9(— x),  which  is  662913  = 32  • 73  • 1009. 

107.  There  are  (1,1,2,6,21)  nonisomorphic  graphs  for  n = ( 1 , . . . , 5) ; but  we  need 
consider  only  cases  with  < | Q)  edges,  because  of  exercise  105(a).  The  surviving  cases 
when  n = 4 are  free  trees:  The  star  is  the  complement  of  Kx  ® K3,  with  aspects  0,  1, 
1,  4;  and  P4  has  aspects  0,  2 — y/2,  2,  2 + \/2  by  exercise  106.  There  are  three  free  trees 
when  n = 5:  The  star  has  aspects  0,  1,  1,  1,  5;  P5’s  aspects  are  0,  2 - <f>,  3 - 4>,  1 + <j>, 
2 + 4>-  and  the  aspects  of  Jwh.  are  0,  n,  1,  r2,  r3,  where  (ri,r2,r3)  « (0.52,2.31,4.17) 
are  the  roots  of  a;3  — 7x2  + 13a:  — 5 = 0. 

Finally,  there  are  five  cases  with  a single  cycle:  £«£  is  K\  — (K2  ® A"/),  so  its 
aspects  are  0,  1,  1,  3,  5;  C3  has  aspects  0,  3 — </>,  3 — </>,  2 -|-  </>,  2 -|-  </>;  ^»-o-o  has  aspects 
0,  n,  r2,  3,  r3;  its  complement  «£x>  has  aspects  0,  5 - r3,  2,  5 - r2,  5 - n;  and  the 
aspects  of  turn  out  to  be  0,  (5  - ^13)/2,  3 - 4>,  2 + <f>,  (5  + Vu)/2. 
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108.  Given  a digraph  D on  vertices  {Vi, . . . , V'n  } , let  eZJ  be  the  number  of  arcs  from  Vl 
to  Vj.  Define  C(D)  and  its  aspects  as  before.  Since  C(D)  is  not  necessarily  symmetric, 
the  aspects  are  no  longer  guaranteed  to  be  real.  But  if  a is  an  aspect,  so  is  its  complex 
conjugate  a;  and  if  we  order  the  aspects  by  their  real  parts,  again  we  find  a0  = 0.  The 
formula  c(D ) = an  . . . a„_j/n  remains  valid  if  we  now  interpret  c(D)  as  the  average 
number  of  oriented  spanning  trees,  taken  over  all  n possible  roots  Vj.  The  aspects  of 
the  transitive  tournament  Kn,  whose  arcs  are  Vi  ->  Vj  for  1 < i < j < n,  are  obviously 
0,  1,  . . . , n — 1;  and  those  of  its  subgraphs  are  equally  obvious. 

The  derivations  in  parts  (a)-(d)  of  answer  105  carry  over  without  change.  For 
example,  consider  K i — K3,  which  has  aspects  0,  2,  3,  4;  this  digraph  D has  (2, 4, 6, 12) 
oriented  spanning  trees  with  the  four  possible  roots,  and  c(D)  is  indeed  equal  to 
2-3-4/4.  Notice  also  that  the  digraph  o — >oc3o  is  its  own  complement,  and  that  it 
has  the  same  aspects  as  K 3. 

Directed  graphs  also  admit  another  family  of  interesting  operations:  If  D'  and 
D"  are  digraphs  on  disjoint  sets  of  vertices  V’  and  V" , consider  adding  a arcs  v'  -»  v" 
and  b arcs  v"  — > v'  whenever  v'  £ V'  and  v"  £ V" . By  manipulating  determinants 
as  in  answer  105(a),  we  can  show  that  the  resulting  digraph  has  aspects  {0,  an"  + bn', 
an"  + a'i, . . . ,an"  + a'n,_1,bn'  + a", . . . ,bn'  + In  the  special  case  a = 1 and 

b = 0,  we  can  conveniently  denote  the  new  digraph  by  D'  ->•  D":  thus,  for  example, 

Kn  = K 1 ->■  Kn- 1.  The  digraph  Kni  ->■  Kn2  K„m  ontii+n2f h nm 

vertices  has  aspects  {0,  nm  ■ Sm , . • • , n2  * S2,  (ni  1)  • si},  where  sk  — n k T * ■ * T nm. 

The  aspects  of  the  oriented  path  Pn  from  Vi  to  Vn  are  obviously  0,  1,  . . . , 1.  The 
oriented  cycle  Cn  has  aspects  {0, 1 — w, . . , , 1 — wn-1},  where  ui  = e27ri/n. 

There  is  also  a nice  result  for  arc  digraphs:  The  aspects  of  IT  are  obtained  from 
those  of  D by  simply  adding  rk  - 1 copies  of  the  number  ok,  for  1 < k < n.  where  rk 
is  the  in-degree  of  Vk  and  ak  is  its  out-degree.  (If  rk  = 0,  we  remove  one  aspect  equal 
to  a*,.)  The  proof  is  similar  to,  but  simpler  than,  the  derivation  in  answer  2.3.4.2-21. 

Historical  remarks:  The  results  in  exercises  104(b)  and  105(a)  are  due  to  A.  K. 
Kelmans,  Avtomatika,  i Telemekhanika  26  (1965),  2194-2204;  27,2  (February  1966), 
56-65;  English  translation  in  Automation  and  Remote  Control  26  (1965),  2118-2129; 
27  (1966),  233-241.  Miroslav  Fiedler  [Czech.  Math.  J.  23  (1973),  298-305]  introduced 
exercise  105(d),  and  proved  interesting  results  about  the  aspect  ak,  which  he  called 
the  “algebraic  connectivity”  of  G.  Germain  Kreweras,  in  J.  Combinatorial  Theory 
B24  (1978),  202-212,  enumerated  spanning  trees  on  grids,  cylinders,  and  toruses,  as 
well  as  oriented  spanning  trees  on  directed  toruses  such  as  Cm  □ Cn-  An  excellent 
survey  of  graph  aspects  was  published  by  Bojan  Mohar  in  Graph  Theory,  Combinatorics 
and  Applications  (Wiley,  1991),  871-898;  Discrete  Math.  109  (1992),  171-183.  For  a 
thorough  discussion  of  important  families  of  graph  eigenvalues  and  their  properties, 
including  a comprehensive  bibliography,  see  Spectra  of  Graphs  by  D.  M.  Cvetkovic, 
M.  Doob,  and  H.  Sachs,  third  edition  (1995). 

109.  Perhaps  there  is  also  a sandpile-related  reason;  see  exercise  103. 

110.  By  induction:  Suppose  there  are  k > 1 parallel  edges  between  u and  v.  Then 
c(G)  = kc(Gi)  + c(G 2),  where  G 1 is  G with  u and  v identified,  and  G2  is  G with  those 
k edges  removed.  Let  du  = k + a and  dv  = k + b. 

Case  1:  G2  is  connected.  Then  ab  > 0,  so  we  can  write  a = x + 1 and  b = y + 1. 
We  have  c(G  1)  > a\/x  + y + 1 and  c(G2)  > a\fxy,  where  a is  a product  over  the  other 
n — 2 vertices;  and  it  is  easy  to  verify  that 


k\Jx  + y + 1 + i/xy  > \/(x  + k)(y  + k ). 
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Case  2.  There  are  no  such  u and  v for  which  G2  is  connected.  Then  every  multi- 
edge of  G is  a bridge;  in  other  words,  G is  a free  tree  except  for  parallel  edges.  In 
this  case  the  result  is  trivial  if  there’s  a vertex  of  degree  1.  Otherwise  suppose  u is  an 
endpoint,  with  du  = k edges  u — v.  If  dv  > k + 1,  we  have  c(G)  = fcc(Gi)  > otkyfi 
where  dv  = k + 1 + x,  and  it  is  easy  to  check  that  ky/x  > y/(k  - 1 )(k  + x)  when  x > 0. 
If  dv  = k we  have  c(G)  = k > y/(k  - l)2.  Finally  if  dv  = k + 1,  let  v0  = u,  Vl  = v,  and 
consider  the  unique  path  vi  i>2  — • • • — vr  where  r > 1 and  vr  has  degree  greater 
than  2;  only  one  edge  joins  Vj  to  Vj+ 1 for  1 < j < r.  Again  the  induction  goes  through. 

[Other  lower  bounds  on  the  number  of  spanning  trees  have  been  derived  by  A.  V. 
Kostochka,  Random  Structures  & Algorithms  6 (1995),  269-274.] 

111.  2 1 5 4 11  7 9 8 6 10  15  12  14  13  3. 

H2.  Either  p appears  on  an  even  level  and  is  an  ancestor  of  q,  or  q appears  on  an  odd 
level  and  is  an  ancestor  of  p. 

113.  prepostorder (F/,)=postpreorder(F)fl  and  postpreorder(Ffl)=prepostorder(F)fl. 

Hd.  The  most  elegant  approach,  considering  that  the  forest  might  be  empty,  is  to  set 
things  up  so  that  CHILD  (A)  points  to  the  root  of  the  leftmost  tree,  if  any.  Then  initiate 
the  first  visit  by  setting  Q «—  A,  L < 1,  and  going  to  step  Q6. 

115.  Suppose  there  are  ne  nodes  on  even  levels  and  nQ  nodes  on  odd  levels,  and 
that  n'  of  the  even-level  nodes  are  nonleaves.  Then  steps  (Ql, . . . , Q7)  are  performed 
respectively  (nc  + n„,  n0,  ne,  ne,  n'e , n0  + 1,  nc)  times,  including  one  execution  of  Q6 
because  of  answer  114. 

116.  (a)  This  result  follows  from  Algorithm  Q. 

(b)  In  fact,  non-ordinary  nodes  strictly  alternate  between  lucky  and  unlucky, 
beginning  and  ending  with  a lucky  one.  Proof:  Consider  the  forest  F'  obtained  by 
deleting  the  leftmost  leaf  of  F,  and  use  induction  on  n. 

117.  Such  forests  are  precisely  those  whose  left-child/right-sibling  representation  is  a 
degenerate  binary  tree  (exercise  31).  So  the  answer  is  2n_1. 

118.  (a)  tk~2,  for  k > 1;  luckiness  occurs  only  near  extreme  leaves. 

(b)  An  interesting  recurrence  leads  to  the  solution  (Fk  + 1 - ( k + 1)  mod  3) /2. 

119.  Label  each  node  x with  the  value  v(x)  = £{  2'c  | k is  an  arc  label  on  the  path 
from  the  root  to  x).  Then  the  node  values  in  prepostorder  are  exactly  the  Gray  binary 
code  r„,  because  exercise  113  shows  that  they  satisfy  recurrence  7.2.1.1-(5). 

(If  we  apply  the  same  value  labeling  to  the  ordinary  binomial  tree  Tn  and  traverse 
it  in  preorder,  we  simply  get  the  integers  0,  1,  . . . , 2"  - 1.) 

120.  False:  Only  four  of  the  “hollow”  vertices  in  the  illustration  can  appear 
next  to  the  two  “square”  vertices,  in  a Hamiltonian  cycle;  one  hollow  pair  is 
therefore  out  of  luck.  [See  H.  Fleischner  and  H.  V.  Kronk,  Monatshefte  fiir 
Mathematik  76  (1972),  112  117.] 

121.  Furthermore,  there  is  a Hamiltonian  path  from  u to  v in  / " if  and  only  if  similar 
conditions  hold;  but  we  retain  u and/or  v in  T<()  if  they  have  degree  1,  and  we  require 
that  the  path  in  (i)  be  inside  the  path  from  u to  v (excluding  u and  v themselves). 
Condition  (ii)  is  also  strengthened  by  changing  ‘vertices  of  degree  4’  to  ‘dangerous 
vertices  , where  a vertex  of  T{  1 is  called  dangerous  if  it  either  has  degree  4 or  has 
degree  2 and  equals  u or  v.  The  smallest  impossible  case  is  T = / 4 . with  u and  v chosen 
to  be  the  non-endpoints.  [Casopis  pro  Pestovam  Matematiky  89  (1964),  323-338.] 
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Consequently  T2  contains  a Hamiltonian  cycle  if  and  only  if  T is  a caterpillar, 
namely  a free  tree  whose  derivative  is  a path.  [See  Frank  Harary  and  A.  J.  Schwenk, 
Mathematika  18  (1971),  138-140.] 

122.  (a)  We  can  represent  an  expression  by  a binary  tree,  with  operators  at  the  internal 
nodes  and  digits  at  the  external  nodes.  If  binary  trees  are  implemented  as  in  Algo- 
rithm B,  the  essential  constraint  imposed  by  the  given  grammar  is  that,  if  rj  = k > 0, 
then  the  operator  at  node  j is  + or  — if  and  only  if  the  operator  at  node  k is  x or  /. 
Therefore  the  total  number  of  possibilities  for  a tree  with  n leaves  is  2nSn-i,  where  Sn 
is  a Schroder  number;  namely  10,646,016  when  n = 9.  (See  exercise  66,  but  interchange 
left  with  right.)  We  can  rather  quickly  generate  them  all,  encountering  exactly  1641 
solutions.  Only  one  expression,  namely  1 + 2/ ((3  - 4)/ (5  + 6)  - (7  - 8) /9),  does  the  job 
with  no  multiplications;  twenty  of  them,  such  as  (((1  — 2)/ ((3/ 4)  x 5 — 6))  x 7 + 8)  x 9, 
require  flve  pairs  of  parentheses;  only  15  require  no  parentheses  whatever. 

(b)  Now  there  are  1 + E!t=i  (*)  2fc+1£fc  = 23,463,169  cases,  and  3366  solutions. 
The  shortest,  of  length  12,  was  found  by  Dudeney  [The  Weekly  Dispatch  (18  June 
1899)],  namely  123  — 45  — 67  + 89;  but  he  wasn’t  sure  at  the  time  that  it  was  best.  The 
longest  solutions  have  length  27;  there  are  twenty  of  them,  as  mentioned  above. 

(c)  The  number  of  cases  rises  dramatically  to  2+^®=1(®)4fc+1Sfc  = 8,157,017,474, 
and  there  now  are  97,221  solutions.  The  longest,  which  is  unique,  has  length  40: 
((((.l/(.2  + .3))/.4)/.5)/(.6  — .7))/(.8  — .9).  There  are  five  amusing  examples  such  as 
.1  + (2  + 3 + 4 + 5)  x 6 + 7 + 8 + .9,  with  seven  +’s;  furthermore,  there  are  ten  like 
(1  — .2  — .3  — 4 — .5  — 6)  x (7  — 8 — 9),  with  seven  — ’s. 

There  is  in  fact  very  little  principle  in  the  thing, 
and  there  is  no  certain  way  of  demonstrating 
that  we  have  got  the  best  possible  solution. 

— HENRY  E.  DUDENEY  (1899) 

Notes:  Marie  Leske’s  Illustriertes  Spielbuch  fur  Madchen , first  published  in  1864, 
contained  the  earliest  known  appearance  of  such  a problem;  in  the  eleventh  edition 
(1889),  the  fact  that  100  = l+2+3+4+5+6+7+8x9  was  the  solution  to  puzzle 
16  in  section  553.  See  also  the  references  in  exercise  7. 2. 1.1  111. 

Richard  Bellman  explained  in  AMM  69  (1962),  640-643,  how  to  handle  the 
special  case  of  part  (a)  in  which  the  operators  are  restricted  to  be  either  + and  x, 
without  parentheses.  His  technique  of  dynamic  programming  can  be  used  also  in 
this  more  general  problem  to  reduce  the  number  of  cases  being  considered.  The  idea 
is  to  determine  the  rational  numbers  obtainable  from  every  subinterval  of  the  digits 

{1 ,n},  having  a given  operator  at  the  root  of  the  tree.  We  can  also  save  a good 

deal  of  computation  by  discarding  cases  for  the  subintervals  {1, . . . , 8}  and  {2, . . . , 9} 
that  cannot  lead  to  integer  solutions.  In  this  way  the  number  of  essentially  different 
trees  to  consider  is  reduced  to  (a)  2,735,136  cases;  (b)  6,813,760;  (c)  739,361,319. 

Floating  point  arithmetic  is  unreliable  in  this  application.  But  the  exact  rational 
arithmetic  routines  of  Section  4.5.1  do  the  job  nicely,  never  needing  to  work  with  an 
integer  greater  than  109  in  absolute  value. 

123.  (a)  2284;  but  2284  = (1  + 2 x 3)  x (4  + 5 x 67)  - 89.  (b)  6964;  but  6964  = 
(1/.2)  x 34  + 5 + 6789.  (c)  14786;  but  14786  = -1  + 2 x (.3  + 4 + 5)  x (6  + 789).  [If  we 
allow  also  a minus  sign  at  the  left  of  the  expression,  as  Dudeney  did,  we  actually  obtain 
1362,  2759,  and  85597  additional  solutions  to  problems  122(a),  (b),  and  (c),  including 
nineteen  longer  expressions  in  case  (a)  such  as  -(1  - 2)  x ((3  + 4)  x (5  - (6  - 7)  x 8) + 9). 
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With  such  an  extension,  the  smallest  unreachable  numbers  in  the  present  problem 
become  (a)  3802,  (b)  8312,  and  (c)  17722.]  The  total  number  of  representable  integers 
(positive,  negative,  or  zero)  turns  out  to  be  (a)  27,666;  (b)  136,607;  (c)  200,765. 

124.  Horton-Strahler  numbers  originated  in  studies  of  river  flows:  R.  E.  Horton,  Bull. 
Geol.  Soc.  Amer.  56  (1945),  275-370;  A.  N.  Strahler,  Bull.  Geol.  Soc.  Amer.  63  (1952), 
1117-1142.  Many  tree-drawing  ideas  are  explored  and  illustrated  in  a classic  paper  by 
Viennot,  Eyrolles,  Janey,  and  Arques,  Computer  Graphics  23,3  (July  1989),  31-40. 


SECTION  7.2. 1.7 

1.  Perhaps  under  hexagram  21,  “crunching”  (R);  however,  the  ancient  commentators 
related  this  hexagram  more  to  law  enforcement  than  to  the  interaction  of  electrons. 

2.  (a)  For  the  first  nucleotide  in  the  codon,  let  (T,  C,  A,  G)  be  respectively  represented 

by  (LI,  Li,  Li,  LI);  represent  the  second  nucleotide,  similarly,  by  (f4,  Pi, ±4,  ??);  repre- 
sent the  third  by  (f  i,  j-f,  f 7,  t — r);  and  superimpose  those  three  representations.  Thus, 
for  example,  hexagram  number  34  is  = = Lj  + + H;  it  represents  the  codon  TTC, 

which  maps  to  the  amino  acid  F.  Under  this  correspondence,  hexagrams  34  through  54 
inclusive  map  into  the  respective  values  (F,  G,  L,  Q,  W,  D,  S,  - , P,  Y,  K,  A,  I,  T,  N,  H,  M, 
R,  V,  E,C).  Moreover,  the  three  hexagrams  that  map  to  ‘ — ’ are  numbers  1,  9,  and  41, 
namely  m,  H,  and  |J,  which  mean  “creation,”  “taming,”  and  “removal  of  excess”  in 
the  I Ching — all  quite  appropriate  for  the  notion  of  completing  a protein. 

(b)  Consider  the  (6^,6AA^iAt3j362)2,2,2,2,2,2,2,2,i,i)  ~ 23  x 1()69  ways  to  permute 
the  elements  of  the  4 x 4 x 4 genetic  code  array.  Exactly 

2402880402175789790003993681964551328451668718750185553920000000  « 2.4  x 1063 


of  them  contain  at  least  one  run  of  21  distinct  consecutive  elements.  [Using  the  principle 
of  inclusion  and  exclusion  one  can  show  that  any  multiset  {(m  + 1)  -xi, . . . , (nr+l)-xr} 
with  r distinct  elements  and  nT  = 0 has  exactly 


r\-J2(n+l~k)k\(r-k)\ak 

fe=l  0<di,...,dr<l 

d \ +•  • •-\-dr=k 


/ n — k 

V 72-1  d\  5 , Tlr  d-p 


) 


such  permutations,  where  n — n\  + • • • + nr  and  is  the  number  of  indecomposable 
permutations  with  k elements  (exercise  7.2.1.2-100).]  Thus  only  about  one  out  of  every 
million  permutations  has  the  stated  property. 

But  there  are  4!3(2  ® 2)  = 1244160  ways  to  represent  codons  as  in  part  (a),  and 
most  of  them  correspond  to  different  permutations  of  the  amino  acids  (except  for 
interchanging  the  representations  of  T and  C in  third  position). 

Empirically,  in  fact,  about  31%  of  all  permutations  of  the  64  hexagrams  turn  out 
to  have  suitable  codon  mappings.  Thus  the  construction  in  part  (a)  gives  no  reason  to 
believe  that  the  authors  of  the  I Ching  anticipated  the  genetic  code  in  any  way. 

3.  Since  F31  — 10°  = F2g  + F22  + F20  + Fig  + Fie  + F14  + Fg,  the  millionth  is 


Going  the  other  way  is  easier:  Fn  - (F5  + Fg  + F10  + Fi6  + Fig  + F27  + F30)  = 314159. 

4.  One  of  the  two  appearances  of  7’7)33  on  line  4 should  be  77753;  similarly,  one 
31377’  line  8 should  be  37137’ ■ And  the  six  cases  with  rightmost  letters  73  appear 
twice,  in  lines  3 and  4,  while  the  cases  with  rightmost  73  are  missing.  These  glitches 
are  probably  typographical  and/or  scribal  errors,  not  made  by  Donnolo  himself. 
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5.  The  last  one  should  have  been  . not  . 

6.  The  nth  value  mn  in  Mersenne’s  list  agrees  with  n!  only  for  1 < n < 13  and 
15  < n < 38.  Mersenne  knew  that  14!  = 87178291200  / m14  = 8778291200,  because  he 
inserted  the  missing  T in  his  personal  copy  of  the  book  (now  owned  by  the  Bibliot.heque 
Nationale;  a facsimile  was  published  in  1963).  But  the  other  errors  in  his  table  were  not 
merely  typographical,  because  they  propagated  into  subsequent  entries,  except  in  the 
case  of  77750:  m39  = 39!  + 1026  - 1010;  m40  = 40m39;  m4 1 = 41m40  - 4 ■ 1025  - 14  • 1011; 
mn  = nmn-i  for  n = 42,  43,  44,  46,  47,  48,  49,  55,  60,  and  62;  m50  = 50 m49  + 1066; 
m5 1 = 51  ■ 50  • m,4g  • When  he  computed  m45  = 9 • 45  • m44  - 104°  + 1029,  he  apparently 
decided  to  take  a shortcut,  because  it’s  easy  to  multiply  by  5 or  by  9;  but  he  multiplied 
twice  by  9.  Most  of  his  errors  indicate  an  unreliable  multiplication  technique,  which 
may  have  depended  on  an  abacus:  m52  = 52m5i  + 5 ■ 1056  - 2 • 1047  + 1034;  m53  = 
53/7752  - 4-10  ; ms4  = 5477is3  + 1016;  77757  = 5777756  + 1033  + 1024;  77758  = 5877757  + 

1067  — 1035  + 1032  + 11  • 1 026 ; 77759  = 5977758  + 1066  + 1049  — 1 028;  77761  = 6I77760  — 5 • 1081; 
77763  = 6377762  + 10®2  - 1074;  77764  = 6477763  + 3 • 1081  + 1067  + 2 • 1038  - 2 ■ 1033  - 1023. 

The  remaining  case,  m5 e « 10.91277755  is  baffling;  it  is  = 5677155  (modulo  1017),  but 
its  other  digits  seem  to  satisfy  neither  rhyme  nor  reason.  Can  they  be  easily  explained? 

Notes:  Athanasius  Kircher  must  have  copied  from  Mersenne  when  he  tabulated  77! 
for  1 < 77  < 50  on  page  157  of  his  Ars  Magna  Sciendi  (1669),  because  he  repeated  all  of 
Mersenne’s  mistakes.  Kircher  did,  however,  list  the  values  10mi4,  77745/10,  and  10m49 
instead  of  ttii4,  tt745,  and  77749;  perhaps  he  was  trying  to  make  the  sequence  grow  more 
steadily.  It  is  not  clear  who  first  calculated  the  correct  value  of  39!;  exercise  1.2. 5-4 
tells  the  story  of  1000!. 

7.  The  basic  permutations  are  12345,  13254,  14523,  15432,  12453.  14235,  15324, 
13542,  12534,  15243,  13425,  14352.  But  then  we  find  that  all  60  of  the  even 
permutations  are  both  alive  and  dead,  because  (9)  differs  by  an  even  permutation 
from  (8).  (Moreover,  if  we  somehow  repair  the  case  n = 5,  half  of  the  live  permutations 
for  ?7  = 6 will  turn  out  to  be  odd.) 

8.  For  example,  we  can  replace  (9)  by 

ana3  . . . an-ia2ai,  aia4  . . . ana3a2,  ...,  an_1u2  . . . arl_2aian, 

thus  flipping  the  ends  and  cyclically  shifting  the  other  elements  in  the  permutations 
of  (8).  This  modification  works  because  all  permutations  have  the  correct  parity,  and 
because  the  live  and  dead  ones  both  have  01  in  every  possible  position.  (We  essentially 
have  a dual  Sims  table  for  the  alternating  group,  as  in  Eq.  7.2.1.2-(32);  but  our  elements 
are  named  (77, 77  - 1, . . . , 1)  instead  of  (0, 1, . . . , 77  - 1).) 

A simpler  way  to  generate  permutations  with  the  proper  signs  was  published  by 
E.  Bezout  [Memoires  Acad.  Royale  des  Sciences  (Paris,  1764),  292]:  Each  permutation 
±ai  . . .an_i  of  (1, ...  ,77-  1}  yields  77  others,  ±ai  . . ,a„_ian=Fai  • . . an~2anan-i  ± • ■ • . 

9.  (•,  \,  V,T,  i,  o,3,V,  A,  A);  or  perhaps  we  should  say  ( A,  A,  V,  1,  0,  £ , V,  X,  \ , • ).  Notes: 
A different  system  was  used  for  the  index  numbers  of  the  equations;  for  example,  \J 
stood  for  200.  Moreover,  it  should  be  noted  that  (11)  is  actually  a transcription  of  al- 
Samaw’al’s  work  into  modern  Arabic;  Ahmad  and  Rashed  based  their  work  on  a 14th- 
century  copy  that  used  similar  but  older  forms  of  the  digits:  (0,  \ , X, X,  f , 8 , 1, V,  A,  “\). 
Al-Samaw’al  himself  may  well  have  used  numerals  of  an  even  earlier  vintage. 

10.  If  the  56  cases  were  equally  likely,  the  answer  would  be  56 H5e  « 258.2,  as  in 
the  coupon  collector’s  problem  (exercise  3. 3. 2-8).  But  (6,30,20)  cases  occur  with  the 
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respective  probabilities  (1/216, 1/72, 1/36);  so  the  correct  answer  turns  out  to  be 

roc 

/ e-‘/216)6  (1  - e_t/72)30(i  _ e-t/36)20)  dt  ^ 546  6j 

Jo 

about  42%  of  the  upper  bound  2 UiHiu;.  [See  P.  Flajolet,  D.  Gardy,  and  L.  Thimonier, 
Discrete  Applied  Math.  39  (1992),  207-229.] 

11.  It  tabulates  the  (®)  = 20  combinations  of  (b,  c,  d,  B,  C,  D)  taken  three  at  a time; 
furthermore,  they  appear  in  lexicographic  order  if  we  regard  b<c<d<B<C<D. 
The  letter  t ( f ) means  “shift  from  lowercase  to  uppercase.”  [See  A.  Bonner,  Selected 
Works  of  Ramon  Hull  (Princeton:  1985),  596-597.]  There  are  two  typos:  ‘d’  should 
be  ‘b’  at  the  beginning  of  line  6;  ‘c’  should  be  ‘d’  at  the  end  of  line  18.  Line  1 would 
have  been  more  consistent  with  the  others  if  Llull  had  presented  it  as 

. b c Aci: 

but  in  that  line,  of  course,  no  case  shift  was  needed. 

12.  Multiply  Poinsot’s  cycle  by  5 and  add  2 (mod  7). 

13.  It’s  best  to  have  just  n lines  when  there  are  n different  letters: 

a.  aa.  aaa 

b.  ab.  aab.  aaab.  bb.  abb.  aabb.  aaabb 

Then,  assigning  the  weights  (a,b)  = (1,4)  gives  the  numbers  1 through  11  as  in  (18). 
(The  first  line  of  (16)  should  also  be  omitted.)  Similarly,  for  {a,a,a.,b,b,c}  we  would 
implicitly  give  c the  weight  12  and  add  the  additional  line 

c.  ac.  aac.  aaac.  be.  abc.  aabc.  aaabc.  bbc.  abbe.  aabbe.  aaabbe. 

[J.  Bernoulli  almost  did  it  right  in  Ars  Conjectandi , Part  2,  Chapter  6.] 

14.  ABC  ABD  ABE  ACD  ACE  ACB  ADE  ADB  ADC  AEB  AEC  AED  BCD  BCE  BCA  BDE 
BDA  BDC  BEA  BEC  BED  BAC  BAD  BAE  CDE  CDA  CDB  CEA  CEB  CED  CAB  CAD 
CAE  CBD  CBE  CBA  DEA  DEB  DEC  DAB  DAC  DAE  DBC  DBE  DBA  DCE  DCA  DCB 
EAB  EAC  EAD  EBC  EBD  EBA  ECD  ECA  ECB  EDA  EDB  EDC.  It’s  a genlex  ordering 
(see  Algorithm  7.2.1.3R),  proceeding  cyclically  through  the  letters  not  yet  used. 

[A  similar  ordering  had  been  used  to  form  all  120  permutations  of  five  letters  in  a 
kabbalistic  work  entitled  Sha‘ari  Tzedeq,  ascribed  to  the  13th-century  author  Natan  ben 
Sa'adyah  Har’ar  of  Messina,  Sicily;  see  Le  Porte  della  Giustizi a (Milan:  Adelphi,  2001).] 

15.  After  j we  place  the  ( n — l)-combinations  of  {/,...,  m}  with  repetition,  so  the 
answer  is  ((™+1-J>+("-1>-i)  = (m+"r/_1).  [Jean  Borrel,  also  known  as  Buteonis, 
pointed  this  out  on  pages  305-309  of  his  early  book  Logistica  (Lyon:  1560).  He 
tabulated  all  throws  of  n dice  for  1 < n < 4,  then  used  a sum  over  j to  deduce  that 
there  are  56  + 35  + 20+  10  + 4+  1 = 252  distinct  throws  for  n = 5,  and  462  for  n = 6.] 

16.  Nl.  [Initialize.]  Set  r <—  n,  t «—  0,  and  a0  -f — 0. 

N2.  [Advance.]  While  r > q,  set  t <-  t + 1,  ot  «-  q,  and  r +-  r - q.  Then  if  r > 0, 
set  t «—  t + 1 and  at  «—  r. 

N3.  [Visit.]  Visit  the  composition  a\  . . . at. 

N4.  [Find  j.]  Set  j t—  t,  t — 1,  . . . , until  a}  ^ 1.  Terminate  the  algorithm  if  j = 0. 
N5.  [Decrease  a,.}  Set  aj  <-  aj  - 1,  r <-  t - j + 1,  t -f-  j;  return  to  N2.  | 
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For  example,  the  compositions  for  n = 7 and  q = 3 are  331,  322,  3211,  313,  3121,  3112, 
31111,  232,  2311,  223,  2221,  2212,  22111,  2131,  2122,  21211,  2113,  21121,  21112,  211111, 
133,  1321,  1312,  13111,  1231,  1222,  12211,  1213,  12121,  12112,  121111,  1132,  11311, 
1123,  11221,  11212,  112111,  11131,  11122,  111211,  11113,  111121,  111112,  1111111. 

Narayana’s  sutras  79  and  80  gave  essentially  this  procedure,  but  with  the  strings 
reversed  (133,  223,  1123,  . . . ),  because  he  preferred  decreasing  colex  order.  [Sarngadeva, 
in  Sangltaratnakara  §5.316-375,  had  previously  developed  an  elaborate  theory  for  the 
set  of  all  compositions  (rhythms)  that  can  be  formed  from  the  basic  parts  {1,2, 4, 6}.] 

17.  The  number  Vn  of  visits  is  F^q_1  = Q(a q);  see  exercise  5.4.2-7.  The  number 

Xn  of  times  step  N4  tests  aj  = 1 satisfies  Xn  = X„-t  4 b Xn~q  + 1,  and  we  find 

Xn  = Vo  + • • • + Vn  = ( qVn  + (q  - l)V„-i  + • • • + Vn_,+1  - 1 ) / (q  - 1)  = 0(V„).  The 
number  Yn  of  times  step  N2  sets  at  <—  q satisfies  the  same  recurrence,  and  we  find 
Yn  = Xn-q.  And  the  number  of  times  step  N2  finds  r = 0 turns  out  to  be  Vn-q. 

18.  It  was  MDCLXVI  in  Roman  numerals,  where  M>D>C>L>X>V>I. 

19.  Lines  329  and  1022.  (Puteanus  included  139  such  verses  among  his  list  of  1022.) 

20.  With  ‘tria’  preceding  ‘lumina’,  there  are  5!  x 2!  x (11,12,12,16)  ways  having  a 
dactyl  in  the  (1st,  2nd,  3rd,  4th)  foot,  respectively;  with  ‘lumina’  preceding  ‘tria’  there 
are  5!  x 2!  X (16,12,12,11).  So  the  total  is  24480.  [Leibniz  considered  this  problem 
near  the  end  of  his  Dissertatio  de  Arte  Combinatoria,  and  came  up  with  the  answer 
45870;  but  his  argument  was  riddled  with  errors.] 

21.  (a)  The  generating  function  1 / ( ( 1 - zu  - yu2)(  1 - zv  - yv2)(  1 - zw  - yw2))  is 
clearly  equal  to  T,p,q,r,s,t>o  f(P’<b  s>  t)upvqwrzsyt . 

(b)  If  ‘tibi’  is  and  ‘Virgo’  is , the  number  is  3! 3!  times  £^=0(/(2ifc  + 1, 

6 - 2k,  2;  3, 3)  + f(2k,  6 - 2k,  2;  2, 3)),  namely  36((7+7)  + (9+5)  + (10+5)  + (14+7))  = 
2304.  Otherwise  ‘tibi’  is  — , ‘Virgo’  is  — -q  and  the  number  is  2!  3!  times  J2l=o(f( 2fc> 
5 — 2k,  2;  3,  2)  + f{2k,b  — 2k,  1 ; 3,  2) ) , namely  12  ((7+6)  + (5+4)  + (4+4)  + (0+6))  = 432. 

(c)  The  fifth  foot  begins  with  the  second  syllable  of  ‘caelo’,  ‘dotes’,  or  ‘Virgo’. 
Hence  the  additional  number  is  3!  3!  J2k= o /( 2fc,  5 - 2k,  2;  3,  2)  = 36(7  + 5 + 4)  = 576, 
and  the  grand  total  is  2304  + 432  + 576  = 3312. 

22.  Let  a € {quot,  sunt,  tot},  /3  € {cado,  dotes,  Virgo},  a = sidera,  and  r = tibi. 
Prestet’s  analysis  was  essentially  equivalent  to  that  of  Bernoulli,  but  he  forgot  to  include 
the  36  cases  aaaTf3f3a(3.  (In  his  favor  one  can  say  that  those  cases  are  poetically  sterile; 
Puteanus  found  no  use  for  them.)  The  1675  edition  of  Prestet’s  book  had  also  omitted 
all  permutations  that  end  with  t/3. 

Wallis  divided  the  possibilities  into  23  types,  Ti  U T2  U ■ • • U T23.  He  claimed  that 
his  types  6 and  7 each  yielded  324  verses;  but  actually  |T6|  = |T7]  = 252,  because  his 
variable  i should  be  7,  not  9.  He  also  counted  many  solutions  twice:  |T3  n Ts|  = 72, 

|t2  n t7|  = |t5  n r7|  = |t3  n t6|  = |t6  n Ti0|  = 36,  and  |r„  n t12|  = |t12  n t13|  = 

I /14  n T15I  = 12.  He  missed  the  36  possibilities  a/3f3aaar/3  (19  of  which  were  used  by 
Puteanus).  And  he  also  missed  all  the  permutations  of  exercise  21(c);  Puteanus  had 
used  250  of  those  576.  The  Latin  edition  of  Wallis’s  book,  published  in  1693,  corrected 
several  typographic  errors  in  this  section,  but  none  of  the  mathematical  mistakes. 

Whitworth  and  Hartley  omitted  all  cases  with  ‘tibi’  = — (see  exercise  19), 

possibly  because  people’s  knowledge  of  classical  hexameter  was  beginning  to  fade. 

[Speaking  of  errors,  Puteanus  actually  published  only  1020  distinct  permutations, 
not  1022,  because  lines  592  and  593  in  his  list  were  identical  to  lines  601  and  602.  But 
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he  would  have  had  no  trouble  finding  two  more  cases  — for  example,  by  changing  ‘tot 
sunt’  to  ‘sunt  tot.’  in  lines  252,  345,  511,  548,  659,  663,  678,  693,  or  797.] 

23.  Reading  each  diagram  left-to-right,  so  that  1 2 1 345  -H>  (][J],  we  get 

mn  urn  n mu  mu  nrn  urn  imi  n nm  nm  n nm 

mu  mu  imi  itm  imi  ran  mn  imi  nm  ii  mu  mu  mu 

nm  imi  n iim  imi  n iim  inn  inn  iim  nm  nm  imi 

nm  iim  iim  imi  n iim  nm  nm  n nm  nm  mn  mu 

24.  His  rule  was:  For  k = 0,  1,  . . . , n - 1,  and  for  each  combination  0 < ji  < • ■ ■ < 
jk  < n of  n-1  things  taken  A:  at  a time,  visit  all  partitions  of  {1, . . . , n-  l}\{j1, . . . , jk} 
together  with  the  block  {ji , . . . , jk,  n.}.  His  order  for  n = 5 was: 

iiiii  mu  mu  mu  mn  nm  nm  imi  ii  11  imi  imi  imi  ,d 

imi  imi  nm  nm  mn  mn  mn  nm  nm  im  m nm  nm 

nm  nm  n nm  nm  nm  n n n nm  nm  n nm 

liunmimiiiiinmimiiimnmnmiiii 

But  strictly  speaking,  the  answer  to  this  exercise  is  “No”  — because  Honda’s  rule  is  not  I 

complete  until  the  order  of  the  combinations  is  specified.  He  generated  combinations 
in  colex  order  (lexicographic  on  jt  ...ji).  Lexicographic  order  on  ji  ...jt  would  also  be 
consistent  with  the  list  given  for  n = 4,  but  it  would  put  |[Jj][fj[]  before  f|j|]  ffjj] . Reference: 

T.  Hayashi,  Tohoku  Math.  J.  33  (1931),  332-337. 

25.  No;  (28)  misses  1 4 1 2 3 5 (the  top-bottom  reflection  of  its  second  pattern). 

26.  Let  a„  be  the  number  of  indecomposable  partitions  of  {l,...,n},  and  let  a'n 
be  the  number  that  are  both  indecomposable  and  complete.  These  sequences  begin 
(ai,a2,...)  = (1,1,2,6,22,92,426,...),  (ai,a2,...)  = (0,1,1,3,9,33,135,...);  and 
the  answer  to  this  exercise  is  a'n  - 1 for  n > 2.  It  turns  out  that  an  is  also  the  number  of 
symmetric  polynomials  of  degree  n in  noncommuting  variables.  [See  M.  C.  Wolf,  Duke 
Math.  J.  2 (1936),  626—637,  who  also  tabulated  indecomposable  partitions  into  k parts.] 

If  A(z)  = YlnanZn j and  ^ B{z)  = is  the  non-exponential  generating 

function  for  Bell  numbers,  we  have  A(z)B(z)  = B(z)  - 1,  hence  A(z)  = 1 - 1/5(2). 

And  the  result  of  exercise  7.2.1.5-35  implies  that  'Yf,no!nzn  = zA(z)/(  1 + 2 — A(z))  = re 

2(5(2)  - 1)/(1  + zB(z)).  Unfortunately  B(z ) has  no  especially  nice  closed  form, 
although  it  does  satisfy  the  interesting  functional  relation  1 + zB(z)  = B{z/(1  + 2)). 

Notice  that  indecomposable  set  partitions  with  n > 1 correspond  to  vacillating  tableau 
loops  with  no  three  consecutive  As  equal  to  zero  (see  exercise  7.2.1.5-27). 

27.  The  problem  is  ambiguous  because  genji-ko  diagrams  are  not  well  defined.  Let’s  l- 

require  all  vertical  lines  of  a block  to  have  the  same  height;  then,  for  example,  145 1 236 

has  no  single-crossing  diagram  because  nfnl  is  not  allowed. 

The  number  of  partitions  with  no  crossing  is  Cn  (see  exercise  7.2.1.6-26).  For  one 
crossing,  the  elements  of  the  two  blocks  that  cross  must  appear  within  the  restricted 
growth  string  as  either  x'yxjyk  or  or  xiyjxykxl,  where  i,j,k,l  > 0. 

Suppose  the  pattern  is  x'yx3yk.  There  are  t = i + j + k + 2 “slots”  between  the 
i + 1 + j + k elements  of  this  pattern,  and  the  number  of  ways  to  fill  these  slots  with 
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noncrossing  partitions  is  {-it=n-i-j-k-i  C*i  ■ ■ -Cit.  We  can  express  this  number 

as 

r jn-i-j-k-l-i  ru  „\i+j  + k+2  _ n 
\~  J ^ (n  — i—j — k — l)n 

by  Eq.  7.2.1.6-(24).  Summing  on  k gives  C(n_i_J-_2)(„+i);  then  summing  on  j and  i 
gives  C(n_4)(n+3). 

Similarly,  the  other  two  patterns  contribute  C(n_5)(n+3)  and  C(n_5)(n+4).  The 
total  number  of  single-crossing  partitions  is  therefore  C(n_5)(n+3)  + C(n_4)(n+4) . 

28.  Order  the  divisors  of  cbbaaa  by  their  number  of  prime  factors  and  then  colexico- 
graphically:  1 -<  a -<  b -<  c -<  aa  -<  ba  -<  ca  -<  66  X cb  -<  aaa  -<  baa  -<  caa  -<  bba  < 
cba  -<  ebb  -<  baaa  -<  caaa  -<  bbaa  -<  ebaa  -<  ebba  -<  bbaaa  -<  ebaaa  -<  ebbaa  -<  cbbaaa. 
For  every  such  divisor  d,  in  decreasing  order,  let  d be  the  first  factor;  recursively  append 
all  factorizations  of  cbbaaa /d  whose  first  factor  is  R d. 

If  the  divisors  had  been  ordered  lexicographically  (namely  1 < o < aa  < aaa  < 
b < ba  < • • • < ebbaa  < cbbaaa),  Wallis’s  algorithm  would  have  been  equivalent  to 
Algorithm  7.2.1.5M  with  (ni, 712,713)  = (1,2,3).  He  probably  chose  his  more  compli- 
cated ordering  of  the  divisors  because  it  tends  to  agree  more  closely  with  ordinary 
numerical  order  when  a « b « c;  for  example,  his  ordering  is  precisely  numerical  when 
( a,b,c ) = (7,11,13).  By  generating  the  divisors  according  to  his  somewhat  complex 
scheme,  Wallis  was  essentially  generating  multiset  combinations,  which  we  noted  in 
Section  7. 2. 1.3  are  equivalent  to  bounded  compositions.  [Reference:  A Discourse  of 
Combinations  (1685),  126-128,  with  two  typographic  errors  corrected.] 

29.  The  factorizations  edeba,  edeb-a,  edca-b,  ...,  e-d-c-b-a  correspond  respectively  to 


II  II  II  il  11 


l!  II  H II  11  H il  Kill  H I 

30.  The  coefficient  is  zero  unless  H + 2i2  H = n;  in  that  case  it  is  (X)aol_fc  (j,  ) 

where  fc  = ii  + i2  + • • • . (Consider  ( a0z)m  times  (1  + (ai/ao)z  + (a2/ao)22  + ■ • • )m.) 

31.  The  order  produced  by  that  algorithm  is  decreasing  lexicographic,  the  reverse 
of  (31),  if  we  assume  that  partitions  a4  . . . a*,  have  a4  > • • ■ > an,;  de  Moivre’s  was 
increasing  colexicographic. 


fl 

mu 

m 

nui 

mn 

mn 

rim 

nm 

m 

mu 

mn 

mn 

it  is 

(TK 

32.  20  • 1 = 7 + 13  • 1 = 2 • 7 + 6 • 1 = 10  + 10  • 1 = 10  + 7 + 3 • 1 = 2 • 10.  In  general, 
Boscovich  suggested  starting  with  n ■ 1 and  computing  the  successor  ofa-lO  + fe-7-l-c-l 
as  follows:  If  c > 7,  the  successor  is  a - 10  + (b+  1)  • 7+  (c-  7)  • 1;  otherwise  if  c + 7b  > 10, 
the  successor  is  (a  + 1)  • 10  + (c  + 76  - 10)  ■ 1;  otherwise  stop. 


“I  may,"  said  Poirot  in  a completely  unconvinced  tone,  "be  wrong." 

— AGATHA  CHRISTIE,  After  the  Funeral  (1953) 
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— HARPO  MARX,  The  Cocoanuts  (1925) 
— MARCEL  MARCEAU,  Baptiste  (1946) 
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TABLES  OF  NUMERICAL  QUANTITIES 


Table  1 


QUANTITIES  THAT  ARE  FREQUENTLY  USED  IN  STANDARD  SUBROUTINES 
AND  IN  ANALYSIS  OF  COMPUTER  PROGRAMS  (40  DECIMAL  PLACES) 


V2  = 1.41421 

35623 

73095 

04880 

16887 

24209 

69807 

85697- 

V3  = 1.73205 

08075 

68877 

29352 

74463 

41505 

87236 

69428+ 

\/5  = 2.23606 

79774 

99789 

69640 

91736 

68731 

27623 

54406+ 

\/10  = 3.16227 

76601 

68379 

33199 

88935 

44432 

71853 

37196- 

V2  = 1.25992 

10498 

94873 

16476 

72106 

07278 

22835 

05703- 

^3  = 1.44224 

95703 

07408 

38232 

16383 

10780 

10958 

83919- 

V?  = 1.18920 

71150 

02721 

06671 

74999 

70560 

47591 

52930- 

In  2 = 0.69314 

71805 

59945 

30941 

72321 

21458 

17656 

80755+ 

In  3 = 1.09861 

22886 

68109 

69139 

52452 

36922 

52570 

46475- 

In  10  = 2.30258 

50929 

94045 

68401 

79914 

54684 

36420 

76011+ 

1/ln  2 = 1.44269 

50408 

88963 

40735 

99246 

81001 

89213 

74266+ 

1/ln  10  = 0.43429 

44819 

03251 

82765 

11289 

18916 

60508 

22944- 

7r  = 3.14159 

26535 

89793 

23846 

26433 

83279 

50288 

41972- 

1° 

= 7r/180  = 0.01745 

32925 

19943 

29576 

92369 

07684 

88612 

71344+ 

1/tt  = 0.31830 

98861 

83790 

67153 

77675 

26745 

02872 

40689+ 

7r2  = 9.86960 

44010 

89358 

61883 

44909 

99876 

15113 

53137- 

0F 

= T(l/2)  = 1.77245 

38509 

05516 

02729 

81674 

83341 

14518 

27975+ 

17(1/3)  = 2.67893 

85347 

07747 

63365 

56929 

40974 

67764 

41287- 

T(2/3)  = 1.35411 

79394 

26400 

41694 

52880 

28154 

51378 

55193+ 

e = 2.71828 

18284 

59045 

23536 

02874 

71352 

66249 

77572+ 

1/e  = 0.36787 

94411 

71442 

32159 

55237 

70161 

46086 

74458+ 

e2  = 7.38905 

60989 

30650 

22723 

04274 

60575 

00781 

31803+ 

7 = 0.57721 

56649 

01532 

86060 

65120 

90082 

40243 

10422- 

InTr  = 1.14472 

98858 

49400 

17414 

34273 

51353 

05871 

16473- 

4>  = 1.61803 

39887 

49894 

84820 

45868 

34365 

63811 

77203+ 

e7  = 1.78107 

24179 

90197 

98523 

65041 

03107 

17954 

91696+ 

en/i  = 2.19328 

00507 

38015 

45655 

97696 

59278 

73822 

34616+ 

sin  1 = 0.84147 

09848 

07896 

50665 

25023 

21630 

29899 

96226- 

cos  1 = 0.54030 

23058 

68139 

71740 

09366 

07442 

97660 

37323+ 

-C'(2)  = 0.93754 

82543 

15843 

75370 

25740 

94567 

86497 

78979- 

C(3)  = 1.20205 

69031 

59594 

28539 

97381 

61511 

44999 

07650- 

huf>  = 0.48121 

18250 

59603 

44749 

77589 

13424 

36842 

31352- 

1/ln  <f>  = 2.07808 

69212 

35027 

53760 

13226 

06117 

79576 

77422- 

-In  In  2 = 0.36651 

29205 

81664 

32701 

24391 

58232 

66946 

94543- 
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Table  2 

QUANTITIES  THAT  ARE  FREQUENTLY  USED  IN  STANDARD  SUBROUTINES 
AND  IN  ANALYSIS  OF  COMPUTER  PROGRAMS  (40  HEXADECIMAL  PLACES) 

The  names  at  the  left  of  the  “=”  signs  are  given  in  decimal  notation. 


0.1  = 0.1999  9999  9999  9999  9999  9999  9999  9999  9999  999A- 

0.01  = 0.028F  5C28  F5C2  8F5C  28F5  C28F  5C28  F5C2  8F5C  28F6- 

0.001  = 0.0041  8937  4BC6  A7EF  9DB2  2D0E  5604  1893  74BC  6A7F- 

0.0001  = 0.0006  8DB8  BAC7  10CB  295E  9E1B  089A  0275  2546  0AA6+ 

0.00001  = 0.0000  A7C5  AC47  1B47  8423  0FCF  80DC  3372  1D53  CDDD+ 

0.000001  = 0.0000  10C6  F7A0  B5ED  8D36  B4C7  F349  3858  3621  FAFD— 

0.0000001  = 0.0000  01AD  7F29  ABCA  F485  787A  6520  EC08  D236  9919+ 

0.00000001  = 0.0000  002A  F31D  C461  1873  BF3F  7083  4ACD  AE9F  0F4F+ 

0.000000001  = 0.0000  0004  4B82  FA09  B5A5  2CB9  8B40  5447  C4A9  8188- 

0.0000000001  = 0.0000  0000  6DF3  7F67  5EF6  EADF  5AB9  A207  2D44  268E- 

72  = 1.6A09  E667  F3BC  C908  B2FB  1366  EA95  7D3E  3ADE  C175+ 

73  = 1.BB67  AE85  84CA  A73B  2574  2D70  78B8  3B89  25D8  34CC+ 

75  = 2.3C6E  F372  FE94  F82B  E739  80C0  B9DB  9068  2104  4ED8- 

710  = 3.2986  075B  4B6A  5240  9457  9061  9B37  FD4A  B4E0  ABBO- 

TS =1.428A  2F98  D728  AE22  3DDA  B715  BE25  0D0C  288F  1029+ 
73  = 1.7137  4491  23EF  65CD  DE7F  16C5  6E32  67C0  A189  4C2B- 

72  = 1.306F  E0A3  1B71  52DE  8D5A  4630  5C85  EDEC  BC27  3436+ 

In  2 = 0 .B172  17F7  D1CF  79AB  C9E3  B398  03F2  F6AF  40F3  4326+ 

In  3=  1.193E  A7AA  D030  A976  A419  8D55  053B  7CB5  BE14  42DA- 

ln  10  = 2 .4D76  3776  AAA2  B05B  A95B  58AE  0B4C  28A3  8A3F  B3E7+ 

1/ln  2 = 1.7154  7652  B82F  E177  7D0F  FDA0  D23A  7D11  D6AE  F552- 

1/lnlO  = 0.6F2D  EC54  9B94  38CA  9AAD  D557  D699  EE19  1F71  A301+ 

7r  = 3 . 243F  6A88  85A3  08D3  1319  8A2E  0370  7344  A409  3822+ 

1°  = tt/180  = 0.0477  D1A8  94A7  4E45  7076  2FB3  74A4  2E26  C805  BD78- 

1/tt  = 0.517C  C1B7  2722  0A94  FE13  ABE8  FA9A  6EE0  6DB1  4ACD- 

7r2  = 9 . DE9E  64DF  22EF  2D25  6E26  CD98  08C1  AC70  8566  A3FE+ 

Ttt  = T(l/2)  = 1.C5BF  891B  4EF6  AA79  C3B0  520D  5DB9  383F  E392  1547- 

T(  1/3)  = 2.ADCE  EA72  905E  2CEE  C8D3  E92C  D580  46D8  4B46  A6B3- 

1/2/3) = 1.5AA7  7928  C367  8CAB  2F4F  EB70  2B26  990A  54F7  EDBC+ 
e = 2.B7E1  5162  8AED  2A6A  BF71  5880  9CF4  F3C7  62E7  160F+ 

1/e  = 0 . 5E2D  58D8  B3BC  DF1A  BADE  C782  9054  F90D  DA98  05AB- 

e2  = 7.6399  2E35  376B  730C  E8EE  881A  DA2A  EEA1  1EB9  EBD9+ 

7 = 0. 93C4  67E3  7DB0  C7A4  D1BE  3F81  0152  CB56  A1CE  CC3B- 

ln  7r  = 1.250D  048E  7A1B  D0BD  5F95  6C6A  843F  4998  5E6D  DBF4- 

0=  1.9E37  79B9  7F4A  7C15  F39C  C060  5CED  C834  1082  276C- 
e7  = 1.C7F4  5CAB  1356  BF14  A7EF  5AEB  6B9F  6C45  60A9  1932+ 

e*/4  = 2 . 317A  CD28  E395  4F87  6B04  B8AB  AAC8  C708  F1C0  3C4A+ 

sin  1 = 0.D76A  A478  4867  7020  C6E9  E909  C50F  3C32  89E5  1113+ 

cos  1 = 0.8A51  407D  A834  5C91  C246  6D97  6871  BD29  A237  3A89+ 

-C'(2)  = 0.F003  2992  B55C  4F28  88E9  BA28  1E4C  405F  8CBE  9FEE+ 
C(3)  = 1.33BA  004F  0062  1383  7171  5C59  E690  7F1B  180B  7DB1  + 

In  </>  = 0 . 7B30  B2BB  1458  2652  F810  812A  5A31  C083  4C9E  B233+ 

1/ln  </>  = 2.13FD  8124  F324  34A2  63C7  5F40  76C7  9883  5224  4685- 

- In  In  2 = 0.5DD3  CA6F  75AE  7A83  E037  67D6  6E33  2DBC  09DF  AA82- 
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Several  interesting  constants  with  less  common  names  have  arisen  in  connec- 
tion with  the  analyses  in  the  present  book.  Those  constants  have  been  evaluated 
to  40  decimal  places  in  Eqs.  7.1.4-(go)  and  7.2.1.5-(34),  and  in  the  answer  to 
exercise  7.1.4-191. 


Table  3 

VALUES  OF  HARMONIC  NUMBERS,  BERNOULLI  NUMBERS, 
AND  FIBONACCI  NUMBERS,  FOR  SMALL  VALUES  OF  n 


n 

Hn 

Fn 

n 

0 

0 

1 

0 

0 

1 

1 

-1/2 

1 

1 

2 

3/2 

1/6 

1 

2 

3 

11/6 

0 

2 

3 

4 

25/12 

-1/30 

3 

4 

5 

137/60 

0 

5 

5 

6 

49/20 

1/42 

8 

6 

7 

363/140 

0 

13 

7 

8 

761/280 

-1/30 

21 

8 

9 

7129/2520 

0 

34 

9 

10 

7381/2520 

5/66 

55 

10 

11 

83711/27720 

0 

89 

11 

12 

86021/27720 

-691/2730 

144 

12 

13 

1145993/360360 

0 

233 

13 

14 

1171733/360360 

7/6 

377 

14 

15 

1195757/360360 

0 

610 

15 

16 

2436559/720720 

-3617/510 

987 

16 

17 

42142223/12252240 

0 

1597 

17 

18 

14274301/4084080 

43867/798 

2584 

18 

19 

275295799/77597520 

0 

4181 

19 

20 

55835135/15519504 

-174611/330 

6765 

20 

21 

18858053/5173168 

0 

10946 

21 

22 

19093197/5173168 

854513/138 

17711 

22 

23 

444316699/118982864 

0 

28657 

23 

24 

1347822955/356948592 

-236364091/2730 

46368 

24 

25 

34052522467/8923714800 

0 

75025 

25 

26 

34395742267/8923714800 

8553103/6 

121393 

26 

27 

312536252003/80313433200 

0 

196418 

27 

28 

315404588903/80313433200 

-23749461029/870 

317811 

28 

29 

9227046511387/2329089562800 

0 

514229 

29 

30 

9304682830147/2329089562800 

8615841276005/14322 

832040 

30 
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r any  x,  let  Hx  = ^ (-  - 1 


n>  1 


\n  n + a: 


. Then 


#i/2  = 2-21n2, 

#1/3  = 3-  |7r/V3-  | In  3, 

#2/3  — | + V3  — | In  3, 

#1/4  = 4 — ^71  — 3 In  2, 

#3/4  = | + - 31n2, 

#1/5  = 5 - i7T<j!>3/25-1/4  - I In  5 - ii/51n<?i, 
#2/5  = | - |tt0-3/25-1/4  - | In  5 + | \/5  In  0, 

#3/5  = | + |7r^)-3/25-1/4  - | In  5 + A In  </>, 

#4/5  = f + iTT^S-1/4  - | In 5 - !v/51n</>, 

#1/6  = 6-  |tt  V3  — 2 In  2 — | In  3, 

#5/e  = f + |tt\/3  — 2 In  2 — | In  3, 


general,  when  0 < p < q (see  exercise  1.2.9-19), 

w _ 9 n i.  P ioio  2pn  . n 

™p/q—  — cot -7r  — In  2(/ + 2 > cos 7r-lnsm— 7i. 

P 2 (7 


l<n<g/2 


Reader,  if  you  ever  have  to  start  a computing  laboratory, 
be  warned  by  me  and  do  not  take  as  a computer  an  accountant, 
no  matter  how  honest  and  efficient.  Your  computer  must  work 
to  so  and  so  many  significant  figures,  whether  the  significance 
of  the  digits  begins  six  places  before  or  six  places  after  the 
decimal  point.  Your  accountant  works  to  cents,  and  he  will  work  to  cents 
until  hell  freezes  over.  Whatever  numbers  our  accountant  computed 
he  kept  at  all  stages  to  exactly  two  places  after  the  decimal  point  . . . 
This  was  his  conscience,  that  he  should  be  accurate  to  the  last  cent ; 
and  he  simply  could  not  understand  that  physical  quantities  are  not 
measured  in  cents  but  on  a sliding  scale  of  values  in  which 
the  cents  of  one  problem  might  be  the  dollars  of  another. 

— NORBERT  WIENER,  / am  a Mathematician  (1956) 
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INDEX  TO  NOTATIONS 

In  the  following  formulas,  letters  that  are  not  further  qualified  have  the  following 
significance: 

j,  k integer- valued  arithmetic  expression 
m,  n nonnegative  integer-valued  arithmetic  expression 
p,q  binary-valued  arithmetic  expression  (0  or  1) 
x , y real-valued  arithmetic  expression 
z complex-valued  arithmetic  expression 
/ integer- valued,  real- valued,  or  complex- valued  function 
G, H graph 
S , T set  or  multiset 

J7,  Q family  of  sets 

u,  v vertex  of  a graph 

a,  p string  of  symbols 

The  place  of  definition  is  either  a page  number  in  the  present  volume  or  a section 
number  in  a previous  volume.  Many  other  notations,  such  as  Kn  for  the  complete 
graph  on  n vertices,  appear  in  the  main  index  at  the  close  of  this  book. 


Formal 

symbolism 

Meaning 

Where 

defined 

V <—  E 

give  variable  V the  value  of  expression  E 

§1-1 

U o V 

interchange  the  values  of  variables  U and  V 

§1-1 

An  or  A[n] 

the  nth  element  of  linear  array  A 

§1-1 

Amn  or  A[m,n] 

the  element  in  row  m and  column  n of 

rectangular  array  A 

§1-1 

(. ft?  a:  b) 

conditional  expression:  denotes 

a if  relation  R is  true,  b if  R is  false 

96 

[R\ 

characteristic  function  of  relation  R:  (R‘!  1:  0) 

§1.2.3 

fijk 

Kronecker  delta:  [ j — k) 

§1.2.3 

[zn]f(z) 

coefficient  of  zn  in  power  series  f{z) 

§1.2.9 

Z1  + z2  + ■ ■ ■ + zn 

sum  of  n numbers  (even  when  n is  0 or  1) 

§1.2.3 

® 1 

product  or  string  or  vector  of  n elements 

{x\ , . . . , Xn ) 

vector  of  n elements 

(xix2  . ..X2k- 1) 

median  value  (middle  value  after  sorting) 

75 

L 
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Formal 

symbolism 

Meaning 

Where 

defined 

sum  of  all  f(k)  such  that  relation  R(k)  is  true 

§1.2.3 

riij(fc)  /w 

product  of  all  f(k)  such  that  relation  R(k)  is  true 

§1.2.3 

minfi(fc)  /(/e) 

minimum  of  all  f(k)  such  that  relation  R(k)  is  true 

§1.2.3 

maxfl(fe)  /(A:) 

maximum  of  all  f(k)  such  that  relation  R(k)  is  true 

§1.2.3 

union  of  all  S(k)  such  that  relation  R(k)  is  true 

shorthand  for  Ea<k<b  f(k) 

§1.2.3 

{a  | i?(a) } 

set  of  all  a such  that  relation  R(a ) is  true 

£{/(*)  1 *(*)} 

another  way  to  write  YLa.(k)  f(k) 

{al:  ^2*  • ■ • j ^n} 

the  set  or  multiset  {a^  | 1 < k < n} 

[*  • • y] 

closed  interval:  {a  \ x < a < y) 

§1.2.2 

(x..y) 

open  interval:  {a  \ x < a < y} 

§1.2.2 

[x..y) 

half-open  interval:  {a  \ x < a < y} 

§1.2.2 

(x  ■ ■ y\ 

half-closed  interval:  {a  \ x < a < y} 

§1.2.2 

\s\ 

cardinality:  the  number  of  elements  in  S 

I/I 

number  of  solutions  (when  / is  Boolean):  Ylx  f{x) 
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M 

absolute  value  of  x:  (x  > 0?  x:  —x) 

1*1 

absolute  value  of  z\  \fzH 

§1.2.2 

M 

length  of  a:  m if  a = aia2  . . . am 

[xj 

floor  of  x , greatest  integer  function:  maxfc<x  k 

§1.2.4 

M 

ceiling  of  x,  least  integer  function:  minfc>x  k 

§1-2.4 

x mod  y 

mod  function:  (y  = 0?  x:  x — y[x/y\) 

§1.2.4 

{x} 

fractional  part  (used  in  contexts  where 

a real  value,  not  a set,  is  implied):  x mod  1 

§1.2.11.2 

= x'  (modulo  y) 

relation  of  congruence:  x mod  y = x'  mod  y 

§1.2.4 

3\k 

j divides  k:  k mod  j — 0 and  j > 0 

§1.2.4 

S\T 

set  difference:  {s  | s in  S and  s not  in  T} 

S\t 

shorthand  for  5 \ {f } 

G\v 

G with  vertex  v removed 

13 

G\e 

G with  edge  e removed 

13 

G/e 

G with  edge  e shrunk  to  a point 

463 

sut 

shorthand  for  S U {f} 

5 tt)  T 

multiset  sum;  e.g.,  {a,b}  l±l  {a,c}  = {a,  a,  6,  c} 

§4.6.3 

gcd  {j,k) 

greatest  common  divisor:  (j—k= 0?  0:  ma Xd\j,d\kd) 

§1-1 

j 1 k 

j is  relatively  prime  to  k:  gcd  (j,  k)  = 1 

§1.2.4 
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Formal 

symbolism 

Meaning 

Where 

defined 

At 

transpose  of  rectangular  array  A:  AT[j,  fc]  = A[k,j] 

aR 

left-right  reversal  of  string  a 

aT 

conjugate  of  partition  a 

394 

xy 

x to  the  y power  (when  x > 0):  eyinx 

§1.2.2 

xk 

x to  the  k power:  (k  > 0?  x:  l/x~k) 

§1.2.2 

x~ 

inverse  (or  reciprocal)  of  x:  x~x 

§1.3.3 

xk 

x to  the  k rising:  T(a:  + k)/T(k)  — 

{k  > 0?  nJ=o(® + i)=  l/(*  + k)-k) 

§1.2.5 

X* 

x to  the  k falling:  x\/(x  — k)\  = 

(k  > o?  n?=o(®  - j)-  iKx  - k)—) 

§1.2.5 

n\ 

n factorial:  r(n  + 1)  = n- 

§1.2.5 

(l) 

binomial  coefficient:  ( k < 0?  0:  x-/k\) 

§1.2.6 

multinomial  coefficient  (when  n — ni  + • • ■ + nm) 

§1.2.6 

a 

Stirling  cycle  number:  Eo<fe1<...<fc„_m<n  • • • kn-m 

§1.2.6 

{;} 

Stirling  subset  number:  J2i<k1<-<kn.m<m  • • • kn-m 

§1.2.6 

(") 

\m/ 

Eulerian  number:  Er=o(~l)fe  (”k  *)  (TO  + 1 — &)” 

§5.1.3 

1 n 1 

1 ml 

m-part  partitions  of  n:  Ei<ki<--<km  [fci  + • • • + km  = n] 

399 

ia0.a_!  . . . )b 

radix-6  positional  notation:  Ek  akbk 

§4.1 

real  part  of  z 

§1.2.2 

imaginary  part  of  z 

§1.2.2 

z 

complex  conjugate:  — i 

§1.2.2 

~p  or  p 

complement:  1 — p 

49 

~x  or  x 

bitwise  complement 

135 

pAq 

Boolean  conjunction  (and):  pq 

49 

x Ay 

minimum:  min{x,  y) 

63 

x & y 

bitwise  AND 

134 

P v? 

Boolean  disjunction  (or):  pq 

49 

rVy 

maximum:  max{r,  y) 

63 

x | y 

bitwise  OR 

134 

p®  q 

Boolean  exclusive  disjunction  (xor):  ( p + q)  mod  2 

50 

x®y 

bitwise  XOR 

134 

x-y 

saturated  subtraction,  x monus  y:  max{0,  x — y} 

§1.3.1' 

x <C  k 

bitwise  left  shift:  [2kx\ 

135 

x 3>  A; 

bitwise  right  shift:  x -C  (—k) 

135 

V 

“zipper  function”  for  interleaving  bits,  x zip  y 

147 
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Formal 

symbolism 

Meaning 

Where 

defined 

logfcZ 

logarithm,  base  b,  of  x (defined  when  x > 0, 

6 > 0,  and  6 / 1):  the  y such  that  x = by 

§1.2.2 

111  X 

natural  logarithm:  loge  x 

§1-2.2 

\gx 

binary  logarithm:  log2  x 

§1.2.2 

An 

binary  logsize  (when  n > 0):  [_lg  tt,J 

142 

exp  a: 

exponential  of  x:  ex  = J2T-0  xk/^ 

§1.2.9 

pn 

ruler  function  (when  n > 0):  max2™\„m 

140 

vn 

sideways  sum  (when  n > 0):  ]Tfe>o  ((n  > k)  k l) 

143 

{■ Xn ) 

the  infinite  sequence  X0,  X\,  X2,  ... 

(here  the  letter  n is  part  of  the  symbolism) 

§1.2.9 

f'(x) 

derivative  of  / at  x 

§1.2.9 

/"(*) 

second  derivative  of  / at  x 

§1.2.10 

is(x) 

■H-  n 

harmonic  number  of  order  x:  J2k- 1 1 /kx 

§1.2.7 

Hn 

harmonic  number:  Hn1 

§1-2.7 

Fn 

Fibonacci  number:  (n  < 1?  n:  Fn_i  + Fn_2) 

§1.2.8 

Bn 

Bernoulli  number:  n!  [ zn ] z/(ez  — 1) 

§1.2.11.2 

det(A) 

determinant  of  square  matrix  A 

§1.2.3 

sign  (a;) 

sign  of  x:  [x  > 0]  - [x  < 0] 

C(x) 

zeta  function:  lim^oo  Hn  } (when  x > 1) 

§1.2.7 

T{x) 

gamma  function:  (x  - 1)!  = 7(3,  00) 

§1.2.5 

7 (x,y) 

incomplete  gamma  function:  Q e~ttx~1dt 

§1.2.11.3 

7 

Euler’s  constant:  -r'(l)  = lim „_yoo(Fn  - Inn) 

§1.2.7 

e 

base  of  natural  logarithms:  Yln>o  Vn- 

§1.2.2 

7T 

circle  ratio:  4 E„>o(-l)7(2n  + 1) 

§1.2.2 

OO 

infinity:  larger  than  any  number 

A 

null  link  (pointer  to  no  address) 

§2.1 

0 

empty  set  (set  with  no  elements) 

e 

empty  string  (string  of  length  zero) 

e 

unit  family:  {0} 

273 

<t> 

golden  ratio:  (l  + \/5  )/2 

§1.2.8 

<p(n) 

Euler’s  totient  function:  J2k~o  [&-Lra] 

§1.2.4 

XK,y 

x is  approximately  equal  to  y 

§1.2.5 

G = H 

G is  isomorphic  to  H 

14 

0{f(n)) 

big-oh  of  /(n),  as  the  variable  n — > oo 

§1.2.11.1 

0(f(z)) 

big-oh  of  f(z),  as  the  variable  z — t 0 

§1.2.11.1 

n(f(n)) 

big-omega  of  /(n),  as  the  variable  n ->  00 

§1.2.11.1 

©(/(»)) 

big-theta  of  /(n),  as  the  variable  n — > 00 

§1.2.11.1 

id 


re 
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Formal 

symbolism 

Meaning 

Where 

defined 

G 

complement  of  graph  (or  uniform  hypergraph)  G 

26 

G | U 

G restricted  to  the  vertices  of  set  U 

13 

u — V 

u is  adjacent  to  v 

13 

u —/—  V 

u is  not  adjacent  to  v 

13 

u — >v 

there  is  an  arc  from  u to  v 

18 

u — V 

transitive  closure:  v is  reachable  from  u 

159 

d(u,  v) 

distance  from  u to  v 

16 

GuH 

union  of  G and  H 

26 

G®  H 

direct  sum  (juxtaposition)  of  G and  H 

26 

G — H 

join  of  G and  H 

26 

G — >H 

directed  join  of  G and  H 

26 

GnH 

Cartesian  product  of  G and  H 

27 

G®H 

direct  product  (conjunction)  of  G and  H 

28 

GmH 

strong  product  of  G and  H 

28 

GaH 

odd  product  of  G and  H 

28 

GoH 

lexicographic  product  (composition)  of  G and  H 

28 

ej 

elementary  family:  {{j}} 

273 

Tug 

union  of  families:  {5|5eJ’or5e^} 

273 

Tug 

intersection  of  families:  {5  5 £ T and  S E g} 

273 

T\g 

difference  of  families:  {5  \ S € T and  S (f-_  g} 

273 

t ®g 

symmetric  difference  of  families:  (T  \ g)  U (C?  \ T) 

273 

Tug 

join  of  families:  {S  UT  \ S £ T,  T E g) 

273 

Tug 

meet  of  families:  {5  fl  T \ S £ T,  T £ gj 

273 

Tmg 

delta  of  families:  {S  ® T \ S £ T,  T £ g} 

273 

T/g 

quotient  (cofactor)  of  families 

273 

T mod  g 

remainder  of  families:  J\(5u  (T/g)) 

273 

T§k 

symmetrized  family,  if  T = ej1  U ej2  U • • • U ejn 

274 

JT 

maximal  elements  of  T: 

{S  £ T \ T £ T and  S CT  implies  S = T} 

276 

JT 

minimal  elements  of  T: 

{S  £ T \ T £ T and  S D T implies  S = T} 

276 

T/g 

nonsubsets:  {S  £ T \ T £ g implies  S £ T} 

276 

T\g 

nonsupersets:  {S  £ T \ T £ g implies  S 2 T) 

276 

T/g 

subsets:  {S  £ T \ T £ g implies  S C T}  = T \ (T  / g) 

669 

T\g 

supersets:  {S  £ T \ T £ g implies  S U T}  — T \ (T  \ g) 

669 

i 

end  of  algorithm,  program,  or  proof 

§1-1 
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Formal 

symbolism 

Meaning 

Where 

defined 

XY 

dot  product  of  vectors:  xx yx  + x2y2  H b xnyn, 

if  X = xxx2  ...xn  and  Y = yiy2  ...yn 

12 

X C Y 

containment  of  vectors:  Xk  < yu  for  1 < k < n, 

ii  X X\X2  . . . xn  and  Y ~ y\y2  . . . yn 

135 

OL  O (3 

melding  of  truth  tables 

218 

And  to  auoide  the  tediouse  repetition  of  these  woordes  : is  equaiie  to  : 
I will  sette  as  I doe  often  in  woorke  use,  a paire  of  paralleles, 
or  Gemowe  lines  of  one  lengthe,  thus:  — , 

bicause  noe  .2.  thynges,  can  be  moare  equalle. 
— ROBERT  RECORDE,  The  Whetstone  of  Witte  (1557) 


Prof.  Le  Gendre,  in  the  treatise  that  we  shall  often  have  occasion  to  cite, 
used  the  same  sign  for  both  equality  and  congruence. 

To  avoid  ambiguity  we  have  made  a distinction. 
— C.  F.  GAUSS,  Disquisitiones  Arithmeticae  (1801) 


Someone  told  me  that  each  equation  I included  in  the  book 

would  halve  the  sales. 
— STEPHEN  HAWKING,  A Brief  History  of  Time  (1987) 


APPENDIX  C 


INDEX  TO  ALGORITHMS  AND  THEOREMS 


Algorithm  7B,  22-23. 
Program  7B,  523-524. 
Theorem  7B,  17. 

Algorithm  7H,  30. 
Corollary  7H,  30-31. 
Lemma  7M,  30. 

Algorithm  7.1. IB,  538. 
Algorithm  7.1. 1C,  59-60. 
Corollary  7.1. 1C,  69. 
Theorem  7.1. 1C,  68. 
Algorithm  7. LIE,  83. 
Corollary  7.1. IF,  74. 
Theorem  7.1.  IF,  73-74. 
Theorem  7.1.1G,  66. 
Algorithm  7.1.1H,  69. 
Theorem  7.1. 1H,  57. 
Subroutine  7.1.11,  70. 
Algorithm  7.1. IK,  553. 
Theorem  7.1. IK,  62. 
Lemma  7.1.1M,  65. 
Algorithm  7.1. IP,  538-539. 
Theorem  7.1. IP,  63-64. 
Corollary  7.1.1Q,  55. 
Theorem  7.1.1Q,  55. 
Theorem  7.1. IS,  72. 
Theorem  7.1. IT,  77. 
Algorithm  7.1.  IX,  82. 
Algorithm  7.1.2L,  100. 
Theorem  7.1. 2L,  111-112. 
Algorithm  7.1.2S,  127. 
Theorem  7.1.2S,  109-110. 
Algorithm  7.1.2T,  128. 
Algorithm  7.1.2U,  562. 
Lemma  7.1.3A,  156. 
Algorithm  7.1.3B,  154. 
Lemma  7.1.3B,  157. 
Algorithm  7.1.3C,  610. 
Algorithm  7.1.31,  612-613. 


Corollary  7.1.31,  156. 
Algorithm  7.1.3K,  600. 
Corollary  7.1.3L,  157. 
Algorithm  7.1.3M,  603. 
Corollary  7.1.3M,  158. 
Algorithm  7.1.3N,  618. 
Theorem  7.1. 3P,  157-158. 
Theorem  7.1.3P',  159. 
Algorithm  7.1.3Q,  603-604. 
Algorithm  7.1. 3R,  159. 
Program  7.1.3R,  160. 
Program  7.1.3R',  603. 
Theorem  7.1.3R,  157. 
Theorem  7.1.3R',  158. 
Algorithm  7.1.3S,  606. 
Algorithm  7.1.3T,  178-179. 
Algorithm  7.1.3V,  166-167. 
Corollary  7.1.3W,  156-157. 
Algorithm  7.1.4A,  624. 
Theorem  7.1.4A,  248. 
Algorithm  7.1.4B,  209. 
Theorem  7.1. 4B,  237-238. 
Algorithm  7.1.4C,  207. 
Algorithm  7.1.4C',  624. 
Algorithm  7.1.4E,  645. 
Algorithm  7.1.4H,  640. 
Algorithm  7.1.41,  622. 
Algorithm  7.1.4J,  243-244. 
Theorem  7.1.4J+,  242. 
Theorem  7.1.4J-,  242. 
Algorithm  7.1.4K,  625-626. 
Theorem  7.1.4K,  238. 
Theorem  7.1.4M,  214-215. 
Algorithm  7.1.4N,  636. 
Algorithm  7.1. 4R,  216-217. 
Algorithm  7.1.4S,  222-223. 
Corollary  7.1.4S,  256. 
Theorem  7.1.4S,  256. 


Algorithm  7.1.4T,  647-648. 
Algorithm  7.1.4U,  227. 
Theorem  7.1.4U,  234. 
Theorem  7.1.4W,  246-247. 
Theorem  7.1.4X,  247. 
Corollary  7.1.4Y,  249. 
Theorem  7.1.4Y,  248. 
Algorithm  7.2.1.1A,  303. 
Algorithm  7. 2.1. IB,  680. 
Corollary  7. 2. 1.1B,  294-295. 
Algorithm  7. 2.1. 1C,  679. 
Algorithm  7. 2. 1.1D,  304-305. 
Theorem  7. 2. 1.1D,  294. 
Algorithm  7.2.1. IE,  700. 
Algorithm  7.2.1. IF,  307. 
Algorithm  7.2.1.1G,  286. 
Algorithm  7.2.1.1H,  300. 
Algorithm  7.2.1.1J,  680. 
Algorithm  7. 2.1. IK,  301. 
Algorithm  7.2.1.1L,  290. 
Algorithm  7.2.1.1M,  282. 
Algorithm  7.2. 1.1N,  678-679. 
Algorithm  7. 2. 1.1P,  683. 
Definition  7.2.1. IP,  305. 
Theorem  7. 2. 1.1P,  306. 
Algorithm  7.2.1.1Q,  683. 
Definition  7.2.1.1Q,  306. 
Theorem  7.2.1.1Q,  306. 
Algorithm  7.2.1.1R,  304. 
Algorithm  7. 2.1. IS,  302. 
Algorithm  7.2.1. IT,  684. 
Algorithm  7. 2. 1.1U,  696. 
Algorithm  7.2.1.1V,  680. 
Algorithm  7.2. 1.1  W,  291. 
Algorithm  7.2.1. IX,  691. 
Algorithm  7.2. 1.1  Y,  691. 
Algorithm  7.2. 1.1Z,  679. 
Algorithm  7.2.1.2A,  325. 
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Algorithm  7.2.1.2C,  336. 
Algorithm  7.2.1.2E,  337. 
Algorithm  7.2.1.2E',  349. 
Algorithm  7.2.1.2G,  329,  331. 
Algorithm  7.2.1.2H,  335. 
Algorithm  7.2.1.2L,  319. 
Algorithm  7.2.1.2L',  702-704. 
Algorithm  7.2.1.2M,  703. 
Algorithm  7.2.1.2P,  322. 
Algorithm  7.2.1.2P',  705. 
Algorithm  7.2.1.2Q,  705. 
Algorithm  7.2.1.2R,  704-705. 
Theorem  7.2.1.2R,  338. 

Lemma  7.2.1.2S,  327. 
Algorithm  7.2.1.2T,  323-324. 
Algorithm  7.2.1.2V,  343. 
Algorithm  7.2.1.2X,  334. 
Algorithm  7.2.1.2Y,  719-720. 
Algorithm  7.2.1.2Z,  350-351. 
Algorithm  7.2.1.3A,  725. 
Algorithm  7.2.1.3C,  367. 
Corollary  7.2.1.3C,  379. 
Algorithm  7.2.1.3CB,  733. 
Algorithm  7.2.1.3CC,  732-733. 
Algorithm  7.2.1.3F,  361. 
Theorem  7.2.1.3K,  372. 
Algorithm  7.2.1.3L,  358. 


Theorem  7.2.1.3L,  360. 
Theorem  7.2.1.3M,  373. 
Algorithm  7.2.1.3N,  724. 
Theorem  7.2.1.3N,  365. 
Theorem  7.2.1.3P,  369-370. 
Algorithm  7.2.1.3Q,  735-736. 
Algorithm  7.2.1.3R,  363. 
Algorithm  7.2.1.3S,  724. 
Lemma  7.2.1.3S,  377. 
Algorithm  7.2.1.3T,  359. 
Algorithm  7.2.1.3V,  726. 
Algorithm  7.2.1.3W,  734. 
Theorem  7.2.1.3W,  376-378. 
Algorithm  7.2.1.4A,  749. 
Algorithm  7.2.1.4B,  749. 
Algorithm  7.2.1.4C,  747. 
Theorem  7.2.1.4C,  403. 
Theorem  7.2.1.4D,  398. 
Theorem  7.2.1.4E,  400-401. 
Algorithm  7.2.1.4H,  392. 
Theorem  7.2.1.4H,  404. 
Algorithm  7.2.1.4K,  758-759. 
Algorithm  7.2.1.4M,  757. 
Algorithm  7.2.1.4N,  411. 
Algorithm  7.2.1.4P,  392. 
Theorem  7.2.1.4S,  407. 
Algorithm  7.2.1.5E,  764. 


Algorithm  7.2.1.5H,  416-417. 
Algorithm  7.2.1.5L,  761. 
Algorithm  7.2.1.5M,  429-430. 
Algorithm  7.2.1.6A,  797. 
Algorithm  7.2.1.6B,  444. 
Algorithm  7.2.1.6B*,  793. 
Algorithm  7.2.1.6F,  804. 
Algorithm  7.2.1.6G,  784. 
Algorithm  7.2.1.6H,  460. 
Algorithm  7.2.1.61,  797. 
Algorithm  7.2.1.6K,  784-785. 
Algorithm  7.2.1.6L,  448. 
Algorithm  7.2.1.6M,  785. 
Algorithm  7.2.1.6N,  446. 
Algorithm  7.2.1.60,  462. 
Algorithm  7.2.1.6P,  443. 
Algorithm  7.2.1.6Q,  470-471. 
Algorithm  7.2.1.6R,  457. 
Algorithm  7.2.1.6S,  464-465. 
Theorem  7.2.1.6S,  470. 
Algorithm  7.2.1.6T,  784. 
Algorithm  7.2.1.6U,  452. 
Algorithm  7.2.1.6W,  453. 
Algorithm  7.2.1.6X,  800. 
Algorithm  7.2.1.6Z,  781. 
Algorithm  7.2.1.7N,  813. 


[An  inverted  list]  provides  duplicate,  redundant  information 
in  order  to  speed  up  secondary  key  retrieval. 
— DONALD  E.  KNUTH,  Sorting  and  Searching  (1973) 


APPENDIX  D 


INDEX  TO  COMBINATORIAL  PROBLEMS 

The  purpose  of  this  appendix  is  to  present  concise  descriptions  of  the  major  problems 
treated  in  the  present  book,  and  to  associate  each  problem  description  with  the  name 
under  which  it  can  be  found  in  the  main  index.  Some  of  these  problems  can  be  solved 
efficiently,  while  others  appear  to  be  very  difficult  in  general  although  special  cases 
might  be  easy.  No  indication  of  problem  complexity  is  given  here. 

Combinatorial  problems  have  a chameleon-like  tendency  to  assume  many  forms. 
For  example,  certain  properties  of  graphs  and  hypergraphs  are  equivalent  to  other 
properties  of  0 1 matrices;  and  an  m x n matrix  of  Os  and  Is  can  itself  be  regarded  as  a 
Boolean  function  of  mn  Boolean  variables,  with  0 representing  TRUE  and  1 representing 
FALSE.  Each  problem  also  has  many  flavors:  We  sometimes  ask  only  whether  a solution 
to  certain  constraints  exists  at  all;  but  usually  we  ask  to  see  at  least  one  explicit  solution, 
or  we  try  to  count  the  number  of  solutions,  or  to  visit  them  all.  Often  we  require  a 
solution  that  is  optimum  in  some  sense. 

In  the  following  list  — which  is  intended  to  be  helpful  but  by  no  means  complete  — 
each  problem  is  presented  in  more-or-less  formal  terms  as  the  task  of  “finding”  some 
desired  objective.  This  characterization  is  then  followed  by  an  informal  paraphrase  (in 
parentheses  and  quotation  marks),  and  perhaps  also  by  further  comments. 

Any  problem  that  is  stated  in  terms  of  directed  graphs  is  automatically  applicable 
also  to  undirected  graphs,  unless  the  digraph  must  be  acyclic,  because  an  undirected 
edge  u — v is  equivalent  to  the  two  directed  arcs  u — > v and  v — > u. 

• Satisfiability:  Given  a Boolean  function  / of  n Boolean  variables,  find  Boolean  values 
X\,  . . . , xn  such  that  f(x i, . . . , xn)  = 1.  (“If  possible,  show  that  / can  be  true.”) 

• fcSAT:  The  satisfiability  problem  when  / is  the  conjunction  of  clauses,  where  each 
clause  is  a disjunction  of  at  most  k literals  Xj  or  Xj.  (“Can  all  the  clauses  be  true?”) 
The  cases  2SAT  and  3SAT  are  most  important.  Another  significant  special  case  arises 
when  / is  a conjunction  of  Horn  clauses,  each  having  at  most  one  negated  literal  Xj. 

• Boolean  chain:  Given  one  or  more  Boolean  functions  of  n Boolean  values  x\,  . . . , xn, 
find  xn+ i,  . . . , xn  such  that  each  Xk  for  n < k < N is  a Boolean  function  of  Xi  and  Xj 
for  some  i < k and  j < k , and  such  that  each  of  the  given  functions  is  either  constant 
or  equal  to  xi  for  some  l < N.  (“Construct  a straight-line  program  to  evaluate  a given 
set  of  functions,  sharing  intermediate  values.”)  (“Build  a circuit  to  compute  a given 
collection  of  outputs  from  the  inputs  0,  1,  xi,  . . . , xn,  using  2-input  Boolean  gates  with 
unlimited  fanout.”)  The  goal  is  usually  to  minimize  N. 

• Broadword  chain:  Like  a Boolean  chain,  but  using  bitwise  and/or  arithmetic  opera- 
tions on  integers  modulo  2d  instead  of  Boolean  operations  on  Boolean  values;  the  given 
value  of  d can  be  arbitrarily  large.  (“Work  on  several  related  problems  at  once.”) 

• Boolean  programming:  Given  a Boolean  function  f of  n Boolean  variables,  to- 
gether with  given  weights  w\,  ...,  wn,  find  Boolean  values  xi,  ...,  x„  such  that 
f(xi,  . . . ,xn)  = 1 and  w\X\  + •••  + wnxn  is  as  large  as  possible.  (“How  can  / be 
satisfied  with  maximum  payoff?” ) 
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• Matching:  Given  a graph  G,  find  a set  of  disjoint  edges.  (“Pair  up  the  vertices  so 
that  each  vertex  has  at  most  one  partner.”)  The  goal  is  usually  to  find  as  many  edges 
as  possible;  a “perfect  matching”  includes  all  the  vertices.  In  a bipartite  graph  with  m 
vertices  in  one  part  and  n vertices  in  the  other,  matching  is  equivalent  to  selecting  a 
set  of  Is  in  an  m x n matrix  of  Os  and  Is,  with  at  most  one  selected  in  each  row  and 
at  most  one  selected  in  each  column. 

• Assignment  problem:  A generalization  of  bipartite  matching,  with  weights  associated 
with  each  edge;  the  total  weight  of  the  matching  should  be  maximized.  (“What 
assignment  of  people  to  jobs  is  best?”)  Equivalently,  we  wish  to  select  elements  of 
an  m x n matrix,  at  most  one  per  row  and  at  most  one  per  column,  so  that  the  sum  of 
selected  elements  is  as  large  as  possible. 

• Covering:  Given  a matrix  Ajk  of  Os  and  Is,  find  a set  of  rows  R such  that  we  have 

> 0 for  all  A: . (“Mark  a 1 in  each  column  and  select  all  rows  that  have  been 
marked.”)  The  goal  is  usually  to  minimize  |J?|. 

• Exact  cover:  Given  a matrix  Ajk  of  Os  and  Is,  find  a set  of  rows  R such  that 
Sjgi?  Aifc  = 1 for  all  k.  (“Cover  with  mutually  orthogonal  rows.”)  The  perfect 
matching  problem  is  equivalent  to  finding  an  exact  cover  of  the  transposed  incidence 
matrix. 

• Independent  set:  Given  a graph  or  hypergraph  G,  find  a set  of  vertices  U such  that 
the  induced  graph  G \ U has  no  edges.  (“Choose  unrelated  vertices.”)  The  goal  is 
usually  to  maximize  \U\.  Classical  special  cases  include  the  8-queens  problem,  when  G 
is  the  graph  of  queen  moves  on  a chessboard,  and  the  no-three-on-a-line  problem. 

• Clique:  Given  a graph  G,  find  a set  of  vertices  U such  that  the  induced  graph  G | U 
is  complete.  (“Choose  mutually  adjacent  vertices.”)  Equivalently,  find  an  independent 
set  in  ~G.  The  goal  is  usually  to  maximize  |f/|. 

• Vertex  cover:  Given  a graph  or  hypergraph,  find  a set  of  vertices  U such  that 
every  edge  includes  at  least  one  vertex  of  U.  (“Mark  some  vertices  so  that  no  edge 
remains  unmarked.”)  Equivalently,  find  a covering  of  the  transposed  incidence  matrix. 
Equivalently,  find  U such  that  V \ U is  independent,  where  V is  the  set  of  all  vertices. 
The  goal  is  usually  to  minimize  \U\. 

• Dominating  set:  Given  a graph,  find  a set  of  vertices  U such  that  every  vertex  not 
in  U is  adjacent  to  some  vertex  of  U.  (“What  vertices  are  within  one  step  of  them 
all?”)  The  classic  5-queens  problem  is  the  special  case  when  G is  the  graph  of  queen 
moves  on  a chessboard. 

• Kernel:  Given  a directed  graph,  find  an  independent  set  of  vertices  U such  that 
every  vertex  not  in  U is  the  predecessor  of  some  vertex  of  U . (“In  what  independent 
positions  of  a 2-player  game  can  your  opponent  force  you  to  remain?” ) If  the  graph  is 
undirected,  a kernel  is  equivalent  to  a maximal  independent  set,  and  to  a dominating 
set  that  is  both  minimal  and  independent. 

• Coloring:  Given  a graph,  find  a way  to  partition  its  vertices  into  k independent  sets. 
(“Color  the  vertices  with  k colors,  never  giving  the  same  color  to  adjacent  points.”) 
The  goal  is  usually  to  minimize  k. 

• Shortest  path:  Given  vertices  u and  v of  a directed  graph  in  which  weights  are 
associated  with  every  arc,  find  the  smallest  total  weight  of  an  oriented  path  from  u 
to  v.  (“Determine  the  best  route.”) 
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• Longest  path:  Given  vertices  u and  v of  a directed  graph  in  which  weights  are 
associated  with  every  arc,  find  the  largest  total  weight  of  a simple  oriented  path  from 
u to  v.  (“What  route  meanders  the  most?”) 

• Reachability:  Given  a set  of  vertices  U in  a directed  graph  G,  find  all  vertices  v such 
that  u — >*  v for  some  u £ U.  (“What  vertices  occur  on  paths  that  start  in  [/?”) 

• Spanning  tree:  Given  a graph  G,  find  a free  tree  F on  the  same  vertices,  such  that 
every  edge  of  F is  an  edge  of  G.  (“Choose  just  enough  edges  to  connect  up  all  the 
vertices.”)  If  weights  are  associated  with  each  edge,  a minimum  spanning  tree  is  a 
spanning  tree  of  smallest  total  weight. 

• Hamiltonian  path:  Given  a graph  G,  find  a path  P on  the  same  vertices,  such  that 
every  edge  of  P is  an  edge  of  G.  (“Discover  a path  that  encounters  every  vertex  exactly 
once.”)  This  is  the  classic  knight’s  tour  problem  when  G is  the  graph  of  knight  moves 
on  a chessboard.  When  the  vertices  of  G are  combinatorial  objects  — for  example, 
tuples,  permutations,  combinations,  partitions,  or  trees  — that  are  adjacent  when  they 
are  “close”  to  each  other,  a Hamiltonian  path  is  often  called  a Gray  code. 

• Hamiltonian  cycle:  Given  a graph  G,  find  a cycle  C on  the  same  vertices,  such  that 
every  edge  of  C is  an  edge  of  G.  ( “Discover  a path  that  encounters  every  vertex  exactly 
once  and  returns  to  the  starting  point.”) 

• Traveling  Salesrep  Problem:  Find  a Hamiltonian  cycle  of  smallest  total  weight,  when 
weights  are  associated  with  each  edge  of  the  given  graph.  (“What’s  the  cheapest  way  to 
visit  everything?”)  If  the  graph  has  no  Hamiltonian  cycle,  we  extend  it  to  a complete 
graph  by  assigning  a very  large  weight  IT  to  every  nonexistent  edge. 

• Topological  sorting:  Given  a directed  graph,  find  a way  to  label  each  vertex  x with  a 
distinct  number  l(x)  in  such  a way  that  x — >y  implies  l(x)  < l(y).  (“Place  the  vertices 
in  a row,  with  each  vertex  to  the  left  of  all  its  successors.” ) Such  a labeling  is  possible 
if  and  only  if  the  given  digraph  is  acyclic. 

• Optimum  linear  arrangement:  Given  a graph,  find  a way  to  label  each  vertex  x with 

a distinct  integer  l(x),  such  that  v \l(u ) - l(v)|  is  as  small  as  possible.  (“Place  the 

vertices  in  a row,  minimizing  the  sum  of  the  resulting  edge  lengths.”) 

• Knapsack  problem:  Given  a sequence  of  weights  w i,  ...,  wn , a threshold  IT,  and 
a sequence  of  values  vi,  , vn,  find  K C {1, . . . ,n}  such  that  J2keK  wk  < W and 
YlkeK  Vk  *s  maximum.  (“How  much  value  can  be  carried?”) 

• Orthogonal  array:  Given  positive  integers  m and  n,  find  an  rn  x n2  array  with 
entries  Ajk  £ {0, 1, . . . , n — 1}  and  with  the  property  that  j j'  and  k ^ k'  implies 
(Ajk,Aj'k)  ^ {Ajk' , Ajik').  (“Construct  m different  n x n matrices  of  n-ary  digits  in 
such  a way  that  all  n2  possible  digit  pairs  occur  when  any  two  of  the  matrices  are 
superimposed.”)  The  case  m = 3 corresponds  to  a latin  square,  and  the  case  m > 3 
corresponds  tom-2  mutually  orthogonal  latin  squares. 

• Nearest  common  ancestor:  Given  nodes  u and  v of  a forest,  find  w such  that  every 
inclusive  ancestor  of  u and  of  v is  also  an  inclusive  ancestor  of  w.  (“Where  does  the 
shortest  path  from  u to  v change  direction?”) 

• Range  minimum  query:  Given  a sequence  of  numbers  ai,  . . . , a„,  find  the  minimum 
elements  of  each  subinterval  ai,...,aj  for  1 < i < j < n.  (“Solve  all  possible  queries 
concerning  the  minimum  value  in  any  given  range.”)  Exercises  150  and  151  of  Section 
7.1.3  show  that  this  problem  is  equivalent  to  finding  nearest  common  ancestors. 
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• Universal  cycle:  Given  6,  k,  and  N,  find  a cyclic  sequence  of  elements  xo,  x\,  . . . , 
XN-1,  xo,  ...  of  6-ary  digits  {0, 1, . . . ,6—1}  with  the  property  that  all  combinatorial 
arrangements  of  a particular  kind  are  given  by  the  consecutive  A:- tuples  xoxi  . . -Xk-i, 
xiX2-..Xk,  ...,  xn-iXq  . . . Xk-2-  (“Exhibit  all  possibilities  in  a circular  fashion.”) 
The  result  is  called  a de  Bruijn  cycle  if  N = bk  and  all  possible  A:-tuples  appear;  it’s  a 
universal  cycle  of  combinations  if  N = (£)  and  if  all  A;-combinations  of  6 things  appear; 
and  it’s  a universal  cycle  of  permutations  if  N = b\,  k = b—  1,  and  if  all  (6—  l)-variations 
appear  as  A;-tuples. 


In  most  cases  we  have  been  able  to  give  a set-theoretic  definition 
that  describes  the  problem  completely,  although  the  need  for  conciseness 
has  often  led  to  some  obscuring  of  the  intuition  behind  the  problem. 
— M.  R.  GAREY  and  D.  S.  JOHNSON,  A List  of  NP-Complete  Problems  (1979) 
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Indexes  need  not  necessarily  be  dry. 
— HENRY  B.  WHEATLEY,  How  to  Make  an  Index  (1902) 


When  an  index  entry  refers  to  a page  containing  a relevant  exercise,  see  also  the  answer  to 
that  exercise  for  further  information.  An  answer  page  is  not  indexed  here  unless  it  refers  to  a 
topic  not  included  in  the  statement  of  the  exercise. 


see  smile. 

# (number  sign  or  hash  mark),  x. 

d (shadow),  372. 

Q (upper  shadow),  372. 

H (FALSE),  202-208,  249,  250,  253-254, 
259,  272,  273,  676-677. 

(T]  (TRUE),  202-209,  250,  259,  273, 
676-677. 

-1  (the  constant  ( • • • 111)2),  135,  140, 
141,  182,  581,  586,  619. 

0-origin  indexing,  326. 

0-preserving  functions,  see  Normal 
Boolean  functions. 

0-1  matrices,  20,  32-35,  44,  46,  125,  130, 
199-202,  230,  238,  251,  264,  267, 
269-270,  279,  580,  623,  632,  643,  665, 
757,  830—831,  see  also  Adjacency 
matrices  of  graphs,  Bitmaps, 
multiplication  of,  182-183,  188, 

230,  264,  619. 

transposing,  33,  147,  188,  199,  201, 
591-592. 

triangularizing,  200,  725. 

0-1  principle,  68,  186. 

0- 1  vectors,  480. 

1- decision  list  functions,  654. 

2- adic  chains,  155-159,  169,  193,  602,  608. 

2-adic  fractions,  141,  193,  585,  see  also 

Magic  masks. 

2-adic  integers:  Infinite  binary  strings 
(. . . £2Xixo)2  subject  to  arithmetic 
and  bitwise  operations,  134,  140,  147, 
153,  185,  187,  193,  311. 
as  a metric  space,  584. 
with  unsigned  ordering,  581. 

2-bit  encoding  for  3-state  data, 

160-163, 195. 

2-coloring  problem,  17,  22-23,  41. 

2-cube  equivalence,  161-162. 

2-dimensional  data  allocation,  147-148. 

2-level  redundancies  function,  see 
Covering  function. 

2-monotonic  functions,  see  Regular 
Boolean  functions. 

2-nomial  coefficients,  726. 

2-partite  graphs  (2-colorable  graphs), 
see  Bipartite  graphs. 


2- variable  functions,  47-50,  79-80, 

259,  272,  279. 
table,  49. 

2ADDU  (times  2 and  add  unsigned), 

590,  596,  620. 

2CNF,  57,  72,  86-87,  91,  545,  see  also 
Krom  functions. 

2SAT  functions,  see2CNF,  Krom  clauses. 
2SAT  problem,  57,  60-62,  72,  86,  830. 
2m-way  multiplexer  (Mm(x\y)),  109, 

127,  131,  214,  243,  263,  266,  272, 

627,  630,  638,  647,  659. 
permuted,  235,  239,  267,  269. 

3- colorable  tilings,  274. 

3-colored  tilings,  634. 

3-coloring  problem,  39,  42,  529. 

3-cube,  14,  346,  387. 

3-partite  graphs  (3-colorable  graphs), 

265,  277. 

3-regular  graphs,  14,  15,  39,  531. 

3-state  encodings,  160-163,  195. 

3-uniform  hypergraphs,  32-33,  672. 

3-valued  logic,  163,  195. 

3- variable  functions,  63,  99,  104-105. 
table,  78. 

3CNF,  56,  85. 

3SAT  problem,  56,  85,  542,  830. 

4- colored  graphs,  233,  246,  258,  265. 

4-coloring  problem,  17,  39,  530. 

4-cube,  90,  327-328,  347,  468-469,  680,  693. 

4-cycles,  69,  522. 

4-neighbors,  172,  see  Rook-neighbors. 

4- variable  functions,  79,  98-105,  112-114, 

122,  126,  129,  641. 

5- queens  problem,  672,  831. 

5-variable  functions,  79,  105-106,  126, 

267,  277,  572. 

8-cube,  297,  315. 

8-neighbors,  172,  see  King-neighbors. 
8-queens  problem,  604,  672,  831. 

00  (infinity),  63,  140,  187. 

01(H)  (independence  number  of  a graph 
or  hypergraph),  35,  44. 

7 (Euler’s  constant),  as  source  of  “random” 
data,  516,  818-819. 

Fn,  see  Gray  binary  code. 

(i-maps,  595. 
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5-shifts,  148,  189. 
cyclic,  149,  190. 

5-swaps,  145-148,  182,  187-188,  619. 
e (the  empty  string),  266,  272. 
e (the  unit  family  {0}),  273,  658,  662, 
669-670. 

Kf(n)  (Kruskal’s  lower  function),  373-375, 

385- 388,  477,  739. 

A (the  null  link),  21,  222-223,  647-648. 

Ax  (|_lg  xj),  x,  see  Binary  logsize  function. 

A t(n)  (Kruskal's  upper  function), 

374-375,  386-387. 

p (average  memory  access  time),  340, 
see  Mems. 

n(n)  (Mobius  function),  306. 

Pk  and  see  Magic  masks. 

Pt{n)  (Macaulay  function),  374-375, 

386- 387,  739. 

vx  (Is  count),  see  Sideways  sum. 

(2-adic  valuation),  589,  see  Ruler 
function. 

7r  (circle  ratio),  as  source  of  “random” 
data,  76,  98,  118,  128,  149,  205,  247, 
272,  310,  317,  345,  348,  354,  356, 

367,  381-383,  388,  434-435,  459,  479, 
486,  516,  641,  650,  681,  759,  794,  811, 
818-819,  see  also  Pi  function. 
zon,  418,  see  Bell  numbers. 
zj'n  (singleton-free  partitions),  436. 
ronfc,  418,  see  Peirce  triangle. 
px  (ruler  function),  x,  see  Ruler  function. 
p(< t):  restricted  growth  string  function,  432. 
cr (n):  sum  of  divisors,  409. 

<7— r path,  338-339,  351. 

(T-cycles,  437. 

E(/)  (true-vector  sum),  76-77,  92,  95. 
t(x)  (Takagi  function),  374-375,  386—387. 
v (MMIX  instruction  cycle  time),  340. 

4>  (golden  ratio),  196,  236,  246,  270,  514. 
as  source  of  “random”  data,  348, 

516,  818-819. 

4>(k)  (flip  permutation),  330—331,  349. 

X(H)  (chromatic  number  of  a graph  or 
hypergraph),  35,  39,  44,  46. 
ui(G)  (clique  number  of  a graph),  35,  44. 

a-codes,  82,  see  Asterisk  codes  for  subcubes. 
Abacus,  812. 

Abel,  Niels  Henrik,  751. 

Abelian  (commutative)  groups,  414, 

532,  805. 

Aborhey,  Samuel  Edmund  Nii  Sai,  639. 
Absorbent  sets,  see  Dominating  sets. 
Absorption  laws,  50,  135,  255-256. 

Abstract  algebra,  212-213,  see  also 
Family  algebra. 

Abstract  RISC  (reduced-instruction-set 
computer)  model,  158. 

ACq  complexity  class,  194. 


Ackland,  Bryan  David,  176. 

Acronyms,  203-204. 

Active  bits,  366. 

Acyclic:  Containing  no  cycles,  15. 

Acyclic  digraphs,  31-32,  165,  203,  215, 
253,  257,  259,  550,  830. 

Adachi,  Fumie  (^CjUlSSL),  504. 

Adam,  Andras,  687. 

Addition,  135. 

binary  (radix  2),  107-108,  127-128, 

151,  215-216,  262. 
bytewise,  151,  599. 
modulo  3,  129. 
modulo  4,  126. 
modulo  5,  129,  192. 
of  sparse  integers,  278. 
scattered,  150,  189. 
sideways,  see  Sideways  addition, 
unary  (radix  1),  192. 

Addition  chains,  675. 

Addition  table  modulo  n,  36. 

Additive  alphametics,  324-325, 

332-333,  348. 

Address  bits,  641,  650. 

Adena,  Michael  Anthony,  673. 

Adjacency  lists,  21-22,  194. 

Adjacency  matrices  of  graphs,  19—20,  26, 
27,  40-41,  43,  123,  133,  160,  194, 

267,  529,  533,  632,  806. 

Adjacent  interchanges,  240-246,  320-325, 
349,  353,  369-371,  384,  650,  719,  720. 

Adjacent  subsets  of  vertices,  231-232,  636. 

Adjacent  vertices,  13. 

Adventure  game,  597. 

jElfric  Grammaticus,  abbot  of  Eynsham, 
280. 

Affine  Boolean  functions,  96,  563. 

Affirmation  function  (T),  49. 

Agrawal,  Dharma  Prakash  (srf"  SfWST 
3TCP4T7T),  582. 

Ahlswede,  Rudolph,  745. 

Ahmad,  Salah  (j^.i  493,  812. 

Ahrens,  Wilhelm  Ernst  Martin  Georg, 

516,  672. 

Aiken,  Howard  Hathaway,  104. 

Ainley,  Stephen,  673. 

Ajtai,  Miklos,  91. 

Akers,  Sheldon  Buckingham,  Jr.,  87, 

257,  677. 

Akl,  Selim  George  (Jit.  .4...).  719. 

al-Samaw’al  (=  as-Samaw’al), 

ibn  Yahya  ibn  Yahuda  al-Maghribl 
(JtjiuJI  I <jjl), 

493,  812.' 

Albers,  Susanne,  599. 

Aldous,  David  John,  454. 

Alekseyev,  Valery  Borisovich  (AjieKceeB, 
iiajiepHH  BopncoBHu),  560. 
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Algebraic  connectivity,  808. 

Algebraic  normal  form,  see  Multilinear 
representation  of  a Boolean  function. 
Aliquot  parts,  505. 

All-Os  matrix,  27,  526. 

All-ls  matrix,  26,  27,  526. 

All-equal  function  (So,n),  131. 

All-zero  rows  or  columns,  267. 

Allouche,  Jean-Paul,  589. 

Almkvist,  Gert  Einar  Torsten,  754. 

Almost  linear  recurrence,  303. 

Almost  symmetric  Boolean  functions,  650. 
Alon,  Noga  (-|1>N  rui),  556,  579,  580. 
Alonso,  Laurent,  798. 

Alpha  channels,  191. 

Alphabetic  data,  152,  191. 

Alphabetic  order,  38. 

Alphametics,  324,  432. 

additive,  324-325,  332-333,  348. 
doubly  true,  347. 
multiplicative,  347. 
pure,  325,  346-347,  432. 

Alternating  combinatorial  number 
system,  363,  381. 

Alternating  group,  323,  354,  812. 

Amano,  Kazuyuki  — $),  248,  271. 
Amarel,  Saul,  581. 

Amino  acids,  511. 

Amir,  Yair  ("Uny  TN>),  555. 

AMM:  American  Mathematical  Monthly , 
published  by  the  Mathematical 
Association  of  America  since  1894. 
Anagrams,  491,  519. 

Analog-to-digital  conversion,  283-284,  295. 
Analysis  of  algorithms,  41,  82-84,  130, 

187,  233-235,  242-243,  246-250,  263, 
267-270,  274,  308,  317,  318,  344-349, 
352-353,  358-359,  379,  381,  383, 
403-405,  412,  438,  450,  476,  480, 

483,  512,  596,  638,  786. 

Ancestors  in  a forest,  165,  444,  781,  809. 

nearest  common,  165-167,  196,  832. 

AND  (bitwise  conjunction,  &),  22,  58, 

74,  82,  84,  134-136,  538. 

AND  function  (A),  48-51,  53,  57,  63,  81. 
AND  gates  (A),  32,  33,  97. 

with  vacuum  tubes,  104. 

AND-OR  chains,  125,  132-133. 
AND-OR-NOT  chains,  131,  see  Canalizing 
chains. 

AND  subroutine,  225-227,  229-230,  264, 
272,  273,  279,  660,  663. 

ANDAND  subroutine,  229-230,  264,  663. 
Andersen,  Lars  Dpvling,  516. 

Andrews,  George  W.  Eyre,  391,  754. 
Animating  functions,  185,  189. 

Anti-Gray  code,  315. 

Antichains  of  subsets,  see  Clutters. 
Antipodal  words,  291. 


Antisymmetric  digraphs,  62,  91. 

Appel,  Kenneth  Ira,  17. 

Applying  a permutation,  326-328. 

Apportionment,  8. 

Approximately  decomposable  functions,  576. 

Approximating  functions,  264-265. 

Arabic  mathematics,  493,  499,  516. 

Arabic  numerals,  493,  512. 

Arbogast,  Louis  Frangois  Antoine,  419. 

Arborescences,  see  Oriented  trees. 

Arc  digraph  of  a digraph,  669,  808. 

Arc  lists,  21-22,  194. 

Arc  variables,  21,  23. 

Archimedes  of  Syracuse  ( ApxqxriSrn 
6 Eupoocoumo?),  solids,  475. 

Arcs  in  a graph,  18-23. 
as  edges,  18. 

ARCS(u)  (first  arc  of  a vertex),  21. 

Arima,  Yoriyuki  (^,^flfi),  504,  679. 

Arimura,  Hiroki  (%fttf  fE),  673,  675. 

Arisawa,  Makoto  (%i'p||^),  708. 

Aristotle  of  Stagira,  son  of  Nicomachus 
( ’ApiotoTeXri?  Ntxopuxxpu  6 Erayipitrii;), 
496. 

Aristoxenus  (’Aptax6$evoi;),  490. 

Arithmetic  mean,  414,  438. 

Arithmetic  progressions,  38. 

Ariyoshi,  Hiromu  (^ff  cJllA),  604. 

Arndt,  Jorg  Uwe,  587,  596,  683. 

Arnold,  David  Bryan,  453. 

Arques,  Didier,  811. 

Array  storage  allocation,  147-148, 

154,  186,  191. 

Articulation  points,  255. 

Artificial  intelligence,  346,  681. 

Ascents  of  a permutation,  720. 

Aschbacher,  Michael  George,  523. 

ASCII:  American  Standard  Code  for 
Information  Interchange,  191,  201. 

Ashar,  Pranav  Navinchandra 
(iw  arrar),  631. 

Ashenhurst,  Robert  Lovett,  117,  120. 

Aspects  of  a graph,  482-483. 

Aspvall,  Bengt  Ingemar,  87,  545. 

Assignment  problem,  344,  831. 

Associahedron,  475. 

Associative  block  designs,  56. 

Associative  laws,  27,  28,  42,  50,  65,  68,  80, 
135,  212,  261,  270,  273,  447,  472,  475, 
484,  536,  539,  566,  569,  584,  634. 

Asterisk  codes  for  subcubes,  54,  82-84, 

150,  195,  535. 

Asymptotic  methods,  109-112,  128-129, 

234,  236,  263,  267,  313,  353,  396-402, 
410-412,  419-426,  437-439,  450, 

461,  538,  540,  550,  555,  558-559, 

627,  641,  653,  658,  699,  708,  711, 

719,  741,  776-777,  793. 

Atkin,  Arthur  Oliver  Lonsdale,  751. 
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Atkinson,  Michael  David,  797. 

Atomic  strings  of  parentheses,  478. 
Attributed  edges,  see  Complement  links. 
Attributes  of  God,  494-496. 

Aubert,  Jacques,  693. 

Auluck,  Faqir  Chand  ' ^3" 

SffrW),  779. 

Austin,  Richard  Bruce,  641. 

Automata  theory,  257,  279,  601. 

cellular,  172-175,  197-198. 

Automated  deduction,  548. 

Automorphisms,  14-15,  39,  45,  327-328,  346, 
347,  523,  528,  532-533,  654,  688. 
Autosifting,  245,  649. 

AVAIL  stack,  216-217,  262. 

Availability  polynomial  of  a Boolean 

function,  see  Reliability  polynomials. 
Avalanches,  482. 

Avann,  Sherwin  Parker,  89,  718. 

Average  nodes  on  level  k,  234,  272,  677. 
Average  weight  of  a solution,  276. 

Averages,  bytewise,  151,  191. 

6-ary  codewords,  38. 

6-codes,  82,  see  Bit  codes  for  subcubes. 
.B-schemes,  free,  257. 

B(f)  (the  BDD  size  of  /),  205,  233-235. 
B(fi,  • ■ -s  fm)  (the  BDD  size  of 
{/l,...,/m}),  216,  229,  240. 

^raax  239-240,  268. 

^min  (/ii  ■ ■ • i /m),  239  240,  268—269. 
Babbage,  Charles,  116,  569,  574. 

Babbage,  Henry  Provost,  569. 

Bach,  Johann  Sebastian,  xvi. 

Background  of  an  image,  174-175. 

Backtrack  method,  295,  430,  503,  553,  665. 
Backward-computation  principle,  102. 

Bad  pairs,  118-120,  130-131. 

Baez,  John  Carlos,  685. 

Bahl,  Lalit  Rai  (eiRin  TTtr  ),  677. 

Bakos,  Tibor,  687. 

Balanced  ANDing,  663,  665. 

Balanced  branching  functions,  185. 

Balanced  digraphs,  482. 

Balanced  Gray  codes,  294-297,  313, 

315,  688. 

Balanced  partitions,  407. 

Balanced  permutations,  354. 

Balanced  strings,  478. 

Balanced  ternary  notation,  195,  590,  729. 
Balderic,  494. 

Ball,  Michael  Owen,  541,  542. 

Ballot  numbers  Cpq , 451-452,  459, 

476,  676,  816. 
generalized,  476-477. 
table,  451. 

Balls  in  urns,  390. 

Bandelt,  Hans-Jiirgen,  551. 


Bandwidth  of  n-cube,  315. 

Bang,  Thpger  Sophus  Vilhelm,  514. 

Banyan  networks,  592. 

Barbara  Milla,  Daniel,  88. 

Barbour,  Andrew  David,  533. 

Barnard,  Robert,  10. 

Baron,  Gerd,  515. 

Baronaigien,  see  Roelants  van  Baronaigien. 
Barwell,  Brian  Robert,  347. 

Barycentric  coordinates,  25,  88. 

Baseball,  380. 

Bases  of  a matroid,  674. 

Bases  of  vector  spaces,  canonical,  380,  385. 
Basic  RAM  (random-access  machine) 
model,  158-159,  194,  602. 

Basis  theorem,  388. 

Bassanio  of  Venice,  1. 

Batcher,  Kenneth  Edward,  189. 
baud:  One  transmission  unit  (e.g.,  one 
bit)  per  second,  284. 

Baudot,  Jean  Maurice  Emile,  284-285. 
Baugh,  Charles  Richmond,  553,  554,  559. 
Bauhuis,  Bernard  (=  Bauhusius, 

Bernardus),  500—502. 

Baumgart,  Bruce  Guenther,  144. 

Bays,  John  Carter,  588. 

BDD:  A reduced,  ordered  binary  decision 
diagram,  202,  203. 

BDD  base:  One  or  more  BDDs  having  no 
duplicate  nodes,  215-216,  257,  259, 

261,  262,  264,  279,  656. 

BDIF  (byte  difference/monus),  152,  598. 
Beads,  204-205,  215,  219,  233-235,  240,  250, 
259,  261,  266,  629,  638,  676-677. 

Bears,  California  Golden,  31. 

Beckenbach,  Edwin  Ford,  359,  510. 

Becker,  Harold  William,  766,  771,  786. 
Beckett,  Samuel  Barclay,  314-315. 

Beidler,  John  Anthony,  324. 

Beissinger,  Janet  Simpson,  654. 

Bell,  Eric  Temple,  418. 

numbers,  418-419,  434-438,  505, 

760,  762,  815. 

numbers,  asymptotic  value,  422-423, 
437-438.  ' 

Bell  ringing,  319,  322-323,  339,  722. 
Bell-shaped  curve,  424,  428,  438. 

Bell-shaped  sequence,  439. 

Bell  Telephone  Laboratories,  115. 

Bellhouse,  David  Richard,  494. 

Bellman,  Richard  Ernest,  xi,  373,  810. 

Benes,  Vaclav  Edvard,  145. 

Bennett,  Grahame,  653. 

Bennett,  William  Ralph,  284. 

Bent  functions,  96. 

Bentley,  Jon  Louis,  607. 

Berele,  Allan,  768. 

Berge,  Claude,  34,  622,  671. 

Berlekamp,  Elwyn  Ralph,  153,  573,  584,  609. 
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Berman,  Charles  Leonard,  627. 

Bern,  Jochen,  639. 

Bernays,  Paul  Issak,  53. 

Bernoulli,  Jacques  (=  Jakob  = James),  370, 
486,  499,  502-503,  704,  813,  814. 
numbers,  418,  655,  751,  820. 

Bernshtein,  Sergei  Natanovich  (BepiriiiTenH. 

Ceprefi  HaiaHOBn'i),  614. 

Bernstein,  Arthur  Jay,  683. 

Bernstein,  Benjamin  Abram,  535. 

Berthet,  Christian,  635. 

BESM-6  (BECM-6)  computer,  594. 

Bessel,  Friedrich  Wilhelm,  function,  398. 
Betweenness,  65,  89-90. 

Beyer,  Wendell  Terry,  174,  461. 

Bezier,  Pierre  Etienne,  splines,  180, 

198-199,  615. 

Bezout,  Etienne,  812. 

Bhaskara  II,  Acarya,  son  of  Mahesvara 
(RTRvfTTBj-,  ‘RPtH'i’l),  491,  492. 
Bhattotpala  (=  Utpala,  H^tcTeT),  500. 
bi-book  graphs,  23. 
biJisa  graphs,  24. 

Biclutters,  480. 

Big-endian  convention,  138-140,  144, 

152,  588,  615,  620. 

Biggs,  Norman  Linstead,  15. 

Bigraphs,  17,  see  Bipartite  graphs. 
Bijunctive  clauses,  seeKrom  clauses. 

Billon,  Jean-Paul,  677. 

Binary  arithmetic,  487. 

addition,  107-108,  127-128,  215-216,  262. 
comparison,  92,  120,  575-577. 
multiplication,  228-229,  247-249, 

264,  271-272,  278. 
subtraction,  135. 

Binary  basis,  582. 

Binary  Boolean  operations,  see  Synthesis  of 
BDDs,  Two-variable  functions. 
Binary-coded  decimal  digits,  114,  192. 
Binary  decision  dags,  203. 

Binary  decision  diagrams,  202-280. 
compared  to  ZDDs,  250-253,  272, 

273,  663-665. 
mixed  with  ZDDs,  256. 
toolkits  for,  220,  224,  257,  677. 

Binary  decoder,  109,  567. 

Binary  Gray  codes,  292-297,  313-315. 

enumeration  of,  293,  313. 

Binary  logsize  function  (Ax  = [lg  a;J ) , 

x,  134,  142-143,  153-154,  157,  165-167, 
187-188,  192-194,  196,  202,  234, 

262,  585,  626,  638,  677. 

Binary  majorization  lattices,  92-93. 

Binary  moment  diagrams,  675. 

Binary  number  system,  47,  75-76,  80, 

82,  90,  92,  281,  284. 


Binary  operator:  A function  of  two 
variables,  49. 

Binary  partitions,  414. 

Binary  recurrence  relations,  108,  109,  126, 
140,  142,  143,  187,  549,  566,  681,  699. 

Binary  relation:  A Boolean  function  of  two 
not  necessarily  Boolean  variables,  416. 

Binary  search,  460. 

Binary  search  trees,  196,  221,  477,  485,  590. 

Binary  strings,  54,  67,  90,  92-93. 

Binary  trees,  85,  97,  98,  164,  185,  203,  275, 
441-449,  471-479,  509-511,  570. 
as  representations  of  general  trees  and 
forests,  381,  441-442. 
complete,  81. 
decorated,  456,  478. 
degenerate,  475,  477,  809. 
drawings  of,  441,  454,  455,  485, 

509,  782,  786. 

extended,  441,  456,  472,  477,  485. 

Gray  codes  for,  446—449,  473. 
linked,  444-449,  473. 
random,  456-457,  478,  485. 
representation  of,  444,  448,  456, 

476,  478,  511. 
rotations  in,  447-449,  787. 

Binary  tries,  310. 
complete,  38-39. 

Binary  valuation,  see  Ruler  function. 

Binary  vector  spaces,  380,  385. 

Binate  covering  problem,  see  Boolean 
programming  problem. 

Bing,  R.  H.,  514. 

Binomial  coefficients,  355,  386. 
generalized,  387,  see  also  <y-nomial 
coefficients. 

identities  between,  477. 
summation  techniques  for,  640-641. 

Binomial  number  system,  see  Combinatorial 
number  system. 

Binomial  trees,  360-361,  381,  484,  809. 

Bioch,  Jan  Corstiaan,  547. 

Bipartite  graphs,  17,  22-25,  35,  39,  41, 

43,  120,  133,  146-147,  252,  277, 

522,  529,  608,  663. 

corresponding  to  hypergraphs,  33,  44. 

Bipartite  hypergraphs,  529. 

Bipartite  matching  problem,  125,  831. 

Bipartite  subgraphs,  119,  277. 

Bipartiteness  testing,  22-23. 

Bipartitions  (two-dimensional  partitions), 
429-431,  778-779. 

Birkhoff,  Garrett,  548,  763. 

Bishop  moves  on  a chessboard,  25,  26,  31. 

Bit  boards,  164,  195. 

Bit  codes  for  subcubes,  82,  84-85,  150,  195. 

Bit  permutations,  145-149,  157,  182. 

Bit  reversal,  144-145,  149,  157,  159,  187, 
188,  308,  311,  592,  608. 
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Bit  slices,  151,  202,  595. 

Bitburger  Brauerei,  xiv. 

Bitmaps,  171-180,  196-200. 
cleaning,  197. 
drawing  on,  180. 

filling  contours  in,  176-180,  198-199. 
rotation  and  transposition  of,  199. 

Bitner,  James  Richard,  289,  362. 

Bitonic  permutations,  189,  594. 

Bitstring  notation:  Data  represented  as  a 
string  of  0s  and  Is,  355-356,  380,  382. 
Bitwise  operations,  47,  74,  82,  100-102,  126, 
133-202,  259,  284,  291-292,  312,  341, 
358,  534,  562,  683,  732,  735,  746. 

AND  (&),  22,  58,  74,  82,  84,  134-136,  538. 
complementation,  134,  135,  184,  217,  604. 
medians,  67,  71,  72,  91,  549. 

OR  (|),  50,  74,  84,  134-135. 

XOR  (©),  90,  134-136,  538. 

Bjorner,  Anders,  739. 

Black,  Max,  252. 

Black  pixels,  136,  172-173,  179-180,  199. 
Blake,  Archie,  539. 

Blissard,  John,  765-766. 

Block  codes,  279. 

Blocks  of  a set  partition,  415. 

Blum,  Manuel,  624. 

Blum,  Norbert  Karl,  122. 

BMD:  A binary  moment  diagram,  675. 
board  graphs,  25,  26,  31,  41. 

Boas,  Peter  van  Emde,  164. 

Bochenski,  Jozef  (=  Innocenty)  Maria,  49. 
Bolker,  Ethan  David,  771. 

Bollig,  Beate  Barbara,  238,  243,  269, 

644,  647,  657. 

Bollobas,  Bela,  560. 

Bolyai,  Janos,  168. 

Bondy,  John  Adrian,  14,  533. 

Bonferroni,  Carlo  Emilio,  753. 

Bonner,  Anthony  Edmonde,  813. 
book  graphs,  23. 

Bookworm  problem,  186. 

Boole,  George,  48,  52,  133,  211,  570. 

Boolean  binary  operators,  47-51,  80,  87,  92. 
table,  49. 

Boolean  chains,  96-133,  197,  279,  830. 
AND-OR,  125,  132-133. 

AND-OR-NOT,  126,  131,  132,  570. 

canalizing,  126,  131,  132,  570. 

definition  of,  96,  830. 

median,  133. 

monotone,  125,  132-133. 

of  3 variables,  99,  104-105. 

of  4 variables,  98-105,  112-114, 

122,  126,  129. 

of  5 variables,  105-106,  126,  572. 
of  many  variables,  109-112,  117-133. 
optimization  of,  121-122,  576. 
optimum,  101-106,  126,  133. 


with  several  outputs,  107-109,  112-117, 
121-122, 126-130. 

Boolean  difference  quantifier  (0),  231, 

265,  634-635,  637. 

Boolean  function  calculator,  220. 

Boolean  functions,  33,  47-96,  388,  460, 

480,  see  also  Boolean  chains, 
affine,  96,  563. 
bent,  96. 

canalizing,  78,  79,  95,  261,  654. 
duals  of,  534,  538,  552. 
enumeration  of,  79. 

Horn,  58,  79,  87,  95,  266,  271,  661,  673. 
Krom,  60,  72,  79,  81,  95,  266,  see 
also  2CNF. 

majority,  63,  68,  76,  550,  see  Medians, 
monotone,  536—537,  see  Monotone 
Boolean  functions, 
normal,  100,  102,  110,  113,  125, 

279,  565,  577. 

of  2 variables,  47-50,  79-80. 

of  3 variables,  63,  78,  99,  104-105. 

of  4 variables,  79,  98-105,  112-114, 

122,  126,  129,  641. 
of  5 variables,  79,  105-106,  126, 

267,  277,  572. 
random,  56,  83,  541. 
regular,  93,  263,  271,  559. 
self-dual,  see  Self-dual  Boolean  functions, 
symmetric,  see  Symmetric  Boolean 
functions. 

threshold,  see  Threshold  functions, 
unate,  265,  537. 

versus  families  of  sets,  250,  253, 

273-276,  669. 

Boolean  games,  86. 

Boolean  lattices,  788. 

Boolean  matrices,  182,  201,  see  also 
0-1  matrices,  Bitmaps, 
multiplication  of,  182-183,  188, 

230,  264,  619. 

Boolean  programming  problem,  206, 
209-211,  258,  261,  663,  830. 
generalized,  261,  280. 

Boolean  values,  32. 

Bootstrapping,  565,  570. 

Boppana,  Ravi  Babu,  579,  580. 

Borel,  Emile  Felix  Edouard  Justin,  699. 

Borkowski,  Ludwik  Stefan,  163. 

Boros,  Endre,  120,  541. 

Borrel,  Jean  (=  Buteonis,  Ioannes),  813. 

Borrows,  598-599,  607,  678. 

Boskovic,  Ruder  Josip  (BouiKOBHh, 

PyJjep  Jocnn  = Boscovich,  Ruggiero 
Giuseppe  = Boscovich,  Roger  Joseph), 
507,  513,  754. 

Bose,  Raj  Chandra  5. 

Bossen,  Douglas  Craig,  519. 

Boswell,  James,  xiv. 
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Botermans,  Jacobus  (=  Jack)  Petrus 
Hermana,  694. 

Bottom-up  algorithms,  207,  209,  212, 

257,  636,  663-664,  678. 

Bottom-up  synthesis,  103-105,  126,  563-564. 
Boundary  curves,  digitized,  176-180. 
Bounded  compositions,  370,  384,  385,  816. 
Bourgogne-Artois,  Jeanne  de,  496. 
Boustrophedon  product,  316,  695. 

Bouton,  Charles  Leonard,  582. 

Boyer,  Robert  Stephen,  46. 

Brace,  Karl  Steven,  257. 

Branch  instructions,  158,  602,  see  also 
Branchless  computation. 

Branch  nodes:  Nonleaves,  202,  209, 

249,  280,  466-467. 

Branch-and-cut  methods,  531. 

Branching  functions,  185,  188. 

Branching  programs,  206-207,  257,  637. 
Branchless  computation,  142,  155-159, 
180-181,  193,  201,  202. 

Brandt,  Jprgen,  761. 

Braymore,  Caroline,  133. 

Brayton,  Robert  King,  122,  661. 

Breadth-first  search,  70,  603,  608,  803. 
Breadth-first  synthesis,  222-223,  264. 

compared  to  depth-first,  227-229. 

Breisch,  Richard  Lewis,  707. 

Breitbart,  Yuri  (Bpeirr6apT,  lOpuii 
HKOBJieBHu),  638,  676. 

Brent,  Richard  Peirce,  568,  594. 

Bresenham,  Jack  Elton,  614. 

Brette,  Jean,  515. 

Breuer,  Melvin  Allen,  605. 

Brewster,  George,  9. 

Bridges  of  a graph,  464-465,  481,  484, 

803,  809. 

Briggs,  Henry,  621. 

Brightwell,  Graham  Richard,  560,  718. 
Brinkmann,  Gunnar,  530. 

Broadword  chains,  155-159,  192-194, 

197,  259,  607,  830. 
strong,  193. 

Broadword  computations,  153-159, 

192-194,  197,  611. 

Brodal,  Gerth  Stalting,  154. 

Brodnik,  Andrej,  159. 

Bron,  Coenraad,  604. 

Brooker,  Ralph  Anthony,  134. 

Brooks,  Rowland  Leonard,  525. 

Brown,  Charles  Philip,  489. 

Brown,  David  Trent,  183. 

Brown,  John  Wesley,  516. 

Brown,  Joseph  Alexander,  707. 

Brown,  Robert,  see  Brownian  excursion. 
Brown,  William  Gordon,  523. 

Brownian  excursion,  454. 

Browning,  Elizabeth  Barrett,  436. 

Brualdi,  Richard  Anthony,  517. 


Bruck,  Richard  Hubert,  518. 

Bruijn,  Nicolaas  Govert  de,  302,  426, 

457,  773. 

cycles,  142,  302-307,  316-318,  354, 

489,  626,  658,  702,  833. 
toruses,  318. 

Bryant,  Randal  Everitt,  235,  237,  254-255, 
257,  265,  630,  636,  675. 

Brylawski,  Thomas  Henry,  755. 

Bubble  sort,  321. 

Biichi,  Julius  Richard,  124,  585. 

Buchner,  Morgan  Mallory,  Jr.,  683. 

Buck,  Marshall  Wilbert,  384. 

Bucket  sort,  217,  222-223,  631,  645. 

Buckley,  Michael  Robert  Warren,  346. 

Buddies,  82. 

Bui,  Alain,  2. 

Bulgarian  solitaire,  415. 

Burckhardt,  Johann  Karl  (=  Jean 
Charles),  508. 

Burger,  Alewyn  Petrus,  673. 

Burley  (=  Burleigh),  Walter,  51. 

Buteonis,  Ioannes  (=  Borrel,  Jean),  813. 

Butler,  Jon  Terry,  626. 

BUTNOT  gates  (5),  97,  100,  110. 

BUTNOT  subroutine,  272,  273,  633, 

634,  660,  662,  677. 

Butterfly  networks,  188-189,  595. 

Bypassing  blocks  of  permutations,  331-334, 
348,  719-720. 

Byte:  An  8-bit  quantity,  139-140,  615. 

Byte  permutations,  182. 

Bytes,  parallel  operations  on,  see  Multibyte 
processing. 

C language,  22,  50. 

c(G ):  The  number  of  spanning  trees 
of  G,  482. 

C(/):  Length  of  shortest  Boolean  chain 
for  /,  97,  111-112,  266,  see  Cost  of 
a Boolean  function. 

C(/i  107. 

C+(/):  Length  of  shortest  AND-OR 
chain,  132,  133. 

Cm(f)'-  Length  of  shortest 
minimum-memory  chain, 

102-103,  126,  581. 

Cn  (Catalan  number),  450-453,  456, 
476-477,  509,  511,  797,  815-816. 

C„  (cycle  graph  of  order  n),  x,  13,  28, 

39,  41,  481,  803,  807. 

Cn  (oriented  cycle  digraph  of  order  n), 

18,  41,  259. 

Cpq  (ballot  number),  451-452,  459, 

476-477,  676,  816. 

Cache  hit  patterns,  416. 

Cache  memory,  137,  167,  181,  226, 

587,  603,  631. 
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Cache  memos,  226-230,  232-233,  256,  264, 
265,  636,  647,  659-660,  670. 

Cache-oblivious  array  addressing,  see  Zip. 

CACM:  Communications  of  the  ACM, 
a publication  of  the  Association  for 
Computing  Machinery  since  1958. 

Caged  Life,  270. 

Cahn,  Leonard,  172. 

Cai,  Ning  (Hx),  745. 

Calabi,  Eugenio,  726. 

Calderbank,  Arthur  Robert,  681. 

Calendar  dates,  packed,  136,  192. 

California  Golden  Bears,  31. 

Callan,  Columcille  David,  783. 

Cambridge  Forty-Eight,  322,  323. 

Cameron,  Robert  Douglas,  620. 

Camion,  Paul  Frederic  Roger,  258. 

Canalizing  chains,  126,  131,  132,  570. 

Canalizing  functions,  78,  79,  95,  261,  654. 

Canalizing  operators,  123,  126,  563,  578. 

Cancellation  laws,  76,  80,  582-583. 

Canfield,  Earl  Rodney,  718,  777. 

Canoe  puzzle,  694. 

Canonical  bases  of  vector  spaces,  380,  385. 

Canonical  delta  sequences,  293,  688. 

Canonical  forms,  484,  see  Full  conjunctive 
normal  form,  Full  disjunctive  normal 
form,  Integer  multilinear  representation, 
Multilinear  representation  of  a 
Boolean  function. 

Canonical  ordered  forests,  461-462,  480-481. 

Cantor,  Georg  Ferdinand  Ludwig 
Philipp,  597. 

Cantor,  Moritz  Benedikt,  508. 

Capitol,  Montana,  254. 

Caplin,  Alfred  Gerald  (=  A1  Capp),  79. 

Cardano,  Girolamo  (=  Hieronymus 
Cardanus),  679. 

Cardinal,  Stanford,  31. 

Cardinality  of  a set,  143. 

Cares,  129,  264,  see  also  Don’t-cares. 

Carlitz,  Leonard,  759,  770,  793-794. 

Caron,  Jacques,  730. 

Carroll,  Lewis  (=  Dodgson,  Charles 
Lutwidge),  10-12,  48,  79,  521. 

Carry  bits,  107,  127,  128,  150-151,  157,  216, 
282,  569,  598-599,  657,  675,  701. 

Cartesian  coordinates,  176. 

Cartesian  product  of  graphs,  x,  27-28, 

42-44,  67,  467,  483,  526. 

Cartesian  trees,  590,  606. 

Casting  out  nines,  708. 

Castown,  Rudolph  William,  291. 

Catalan,  Eugene  Charles,  450,  724. 

Catalan  numbers,  278,  450-453,  456, 

476-477,  509,  511,  678,  738,  797, 
815-816. 

generalized,  476-477. 
tables,  450-451. 


Catalan  triangle,  451-452,  459,  476-477. 
t-ary,  790,  795. 

Categorical  product  of  graphs,  28,  see 
Direct  product  of  graphs. 

Caterpillar  graphs,  810. 

Cat’s  game,  86,  115,  117,  574. 

Cattell,  Kevin  Michael,  700. 

Cauchy,  Augustin  Louis,  403,  411. 

Cavior,  Stephan  Robert,  682. 

Cayley,  Arthur,  338,  509,  757. 
digraphs,  45. 

graphs,  45,  338,  349-352,  355,  532,  713. 
Hamilton  theorem,  683. 

Cellular  automata,  172-175,  197-198. 
Censorship,  10-11. 

Center  of  gravity,  297. 

Centroid  of  an  oriented  tree,  481. 

Cesare,  Giulio  (pen  name  of  Dani  Ferrari, 
Luigi  Rafaiani,  Luigi  Morelli,  and 
Dario  Uri),  707. 

Chain  in  a partial  order,  353. 

Chaining  with  separate  lists,  222. 

Chains  of  submultisets,  800. 

Chains  of  subsets,  457-461. 

Chambers,  Ephraim,  v. 

Chandra,  Ashok  Kumar  (3n?ffar  <$HI  < 
R^TT),  540,  557,  624. 

Chang,  Angel  Xuan  ($^),  653. 

Change  ringing,  319,  322-323,  339. 
Changing  coins,  408. 

Characteristic  polynomial  of  a Boolean 
function,  see  Reliability  polynomials. 
Characteristic  polynomial  of  a matrix, 

482,  620,  683,  806. 

Chase,  Phillip  John,  365-366,  382,  733. 

sequence,  94,  365-367,  370,  382-383. 
Chaucer,  Geoffrey,  519. 

Chebyshev  (=  Tschebyscheff),  Pafnutii 
Lvovich  (HebbimeB,  lIa(juiyTUH 
JlbBOBHM),  775. 
polynomials,  617,  807. 

Checkerboard,  663. 

Cheema,  Mohindar  Singh  (h!  few  < fR^ 
RtRT),  431,  778. 

Chen,  Kuo-Tsai  306. 

Chen,  William  Yong-Chuan  ($URJI|),  768. 
Chen,  Yirng-An  ( |?S , 675. 

Cheng,  Ching-Shui  (SBflSfTfC),  692. 

Cheong,  Matthew  Chao  ((JUS),  631. 
Cheshire  cat,  174-175,  197,  198,  611. 
Chess,  543. 

Chessboard-like  graphs, 
bishop  moves,  25,  26,  31. 
generalized  piece  moves,  41. 
king  moves,  43. 
knight  moves,  15,  25. 
queen  moves,  26,  44. 
rook  moves,  26,  41. 
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Chessboards,  164,  195,  251-252,  269-270, 
274,  276,  277. 

Chien,  Robert  Tien  Wen  519. 

Childs,  Roy  Sydney,  707. 

Chimani,  Markus,  531. 

Chinese  mathematics,  486-487. 

Chinese  remainder  algorithm,  207,  701. 
Chinese  ring  puzzle,  285-286,  308, 

382,  679-680. 

Chords,  364,  384. 

Chorees,  489-490. 

Chow,  Chao  Kong  (JnJfpJjt),  76. 

parameters  N(f)  and  E(/),  76-77,  92,  95. 
Christ,  Wilhelm  von,  490. 

Christian  mathematics,  493-503,  512. 
Christie  Mallowan,  Agatha  Mary  Clarissa 
Miller,  18-19,  816. 

Christmas  tree  pattern,  457-461, 

479-480,  558,  799. 

Chromatic  index,  see  Edge-chromatic 
number. 

Chromatic  number  x(G).  35,  39,  44,  46. 
Chung,  Kai  Lai  797. 

Chung,  Kin-Man  (IliitR),  149,  190. 

Chung  Graham,  Fan  Rong  King 
549,  745,  780. 

Chvatal,  Vaclav  (=  Vasek),  14. 

graph,  14,  39,  44,  530. 

Cl-nets,  72-74,  91. 

Cigar-shaped  curves,  613-614. 

Circles,  digitized,  176,  179. 

Circuit  complexity,  194. 

Circuits,  Boolean,  97,  see  Boolean  chains. 
Circular  lists,  194,  612. 

Claesson,  Anders  Karl,  762. 

Clapham,  Christopher  Robert  Jasper,  528. 
Clause:  A disjunction  of  literals,  54,  81. 
Clausen,  Thomas,  5. 

Claw  graph  (K  1,3),  522. 

Cleaning  images,  197,  610. 

Clearing  the  cache,  647. 

Clements,  George  Francis,  378-379, 

388,  742,  743. 

Clift,  Neill  Michael,  620. 

Clique  number  ui {G),  35,  44. 

Cliques,  35,  44,  267,  277,  385,  580,  644,  831. 
covering  by,  35,  277. 
maximal,  194-195,  259,  277. 

Clone  of  a node,  221,  223. 

Closed  bitmaps,  197. 

Closed  item  sets,  see  Closure  of  a family. 
Closure  of  a family  (/n),  277. 

Closure  under  intersections,  57,  132. 

Closure  under  medians,  72. 

Clustering,  vii. 

Clutters,  263,  388,  459,  631-632,  661,  669. 
CMath:  Concrete  Mathematics,  a book 
by  R.  L.  Graham,  D.  E.  Knuth,  and 
O.  Patashnik,  514,  658. 


CNF,  53,  see  Conjunctive  normal  form. 
Co-atoms,  478. 

Coalescence,  432,  770. 

Coalitions,  416,  550. 

Cobham,  Alan,  257. 

Cock,  John  Crowle,  701. 

Cockayne,  Ernest  James,  673. 

Cocke,  John,  677. 

Cocliques,  see  Independent  vertices. 

Codes  for  difficulty  of  exercises,  xi-xiii. 
Codewords,  310. 
b-ary,  38. 

Coding  theory,  519,  677. 

Codons,  511. 

Cofactors,  273,  634. 

Coforests,  448-449,  471-472,  476,  483. 
Cognate  forests,  472. 

Cographs,  42. 

Cohen,  Philip  Michael,  520. 

Cohn,  Martin,  687,  690,  693. 

Coins,  408. 
biased,  96. 
unbiased,  209. 

Colex  order:  Lexicographic  from  right 
to  left,  359,  392,  407,  445,  473,  488, 
500,  512,  719,  756,  814—816,  see  also 
Reverse  colex  order, 
of  integers,  238,  590. 

Collation  of  bits,  134. 

Colleges,  31. 

Collinear  points,  277. 

Collisions  in  a hash  table,  222,  226. 

Colman,  George,  the  younger,  133. 

Colman,  Walter  John  Alexander,  752. 
Coloring  of  graphs,  17,  35,  42,  44, 

46,  120-121,  233,  246,  258,  265, 

274,  277,  831. 

Coloring  of  hypergraphs,  32,  35,  44. 
Colthurst,  Thomas  Wallace,  759. 

Column  sums,  414. 

Combination  generation,  355-372, 

379-385,  389. 

Gray  codes  for,  362-372. 
homogeneous,  364-365,  370-371, 

382-383,  729,  733,  736. 
near-perfect,  365-371,  383. 
perfect,  369-371,  384. 
revolving-door,  362-364,  370—371, 
381-383. 

Combinational  complexity,  97,  111-112,  see 
Cost  of  a Boolean  function. 
Combinations:  Subsets  of  a given  size, 
355-390,  439,  443,  446,  492-500,  508, 
512,  586,  784,  791,  813,  815. 
dual,  356-358,  380-381,  383. 
null,  499. 

of  a multiset,  370,  372,  379,  387, 

498,  512,  816. 
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with  repetitions,  356-357,  365,  370-373, 
379,  387,  390,  393,  493-494,  499,  512. 

Combinatorial  explosion,  v,  224,  673. 

Combinatorial  number  system,  360,  381, 
385-386,  412,  725,  735,  761. 
alternating,  363,  381. 
generalized,  387. 

Combinatorics,  1-7,  see  also  Graphs. 

Comedy  festival,  60-62,  86. 

Comet,  Stig,  394. 

Common  subfunctions,  216,  224,  229, 

627,  628. 

Commutative  groups,  414,  532,  805. 

Commutative  laws,  28,  50,  65,  90,  135,  227, 
261,  270,  273,  539,  566,  582,  634,  652. 

Comp.  J.:  The  Computer  Journal,  a 

publication  of  the  British  Computer 
Society  since  1958. 

Comparator-inverter  networks,  72-74,  91. 

Comparator  modules,  72-74,  91,  127, 

190,  551. 

Comparison  function,  binary,  92,  120, 
575-577. 

Comparison  of  bytes,  153,  192. 

Compiler  technology,  58. 

Complement  in  a torus,  375. 

Complement  links,  279. 

Complements,  486,  489. 
laws  of,  50-51. 

of  Boolean  functions,  49,  55,  57,  78-79, 
99-100,  107,  246-247,  279,  534. 
of  families  of  sets  represented  by 
ZDDs,  272,  660,  666. 

of  graphs,  26,  27,  35,  41,  42,  483,  526,  807. 
of  r-uniform  hypergraphs,  32. 
of  simple  digraphs  without  loops,  42. 

Complementary  elements  of  a lattice,  475. 

Complementary  Gray  codes,  293, 

296-297,  313,  688. 

Complementation,  bitwise,  134,  135, 

184,  217,  604. 

Complete  bigraphs  (ATmj„),  17,  26,  39,  42, 
133,  497,  522,  580,  806. 

Complete  binary  relations,  354. 

Complete  binary  trees,  81,  165,  214, 

584,  727. 
infinite,  185. 

Complete  binary  tries,  38-39. 

Complete  bipartite  graphs  ( Km,n ),  17,  26, 

39,  42,  133,  497,  522,  580,  806. 

Complete  digraphs  (Jn),  18,  522,  524. 

Complete  graphs  ( K„ ),  13,  26-27,  39,  41-43, 
481,  482,  495-498,  525,  687,  745. 

Complete  fc-partite  graphs  (Kni  ), 

17,  26-27,  40,  44. 

Complete  r-uniform  hypergraphs,  32. 
bipartite,  44. 

Complete  rhyme  schemes,  513. 

Complete  f-ary  trees,  484. 


Complete  ternary  trees,  487,  520. 

Complete  tripartite  graphs  (RTm,n,r),  17,  42. 
Completing  the  square,  397,  775. 

Completion  of  a matrix,  46. 

Complex  conjugates,  808. 

Complexity  theory,  194. 

Components,  connected,  16,  18,  26,  40, 

42,  43,  314,  686. 
of  size  two,  265. 

COMPOSE  subroutine,  232,  265,  633. 
Composition  of  Boolean  functions,  232-233, 
263,  265,  569,  637. 

Composition  of  graphs,  28,  483,  see 
Lexicographic  product  of  graphs. 
Composition  of  permutations,  185,  188-189. 
Compositions:  Ordered  partitions  of  an 

integer,  25,  308-309,  356-358,  365,  379, 
390,  410,  488,  492,  512,  726,  778,  816. 
bounded,  370,  384,  385. 

Compression  of  a set,  377,  387,  743. 
Compression  of  data,  204,  233,  496. 
Compression  of  scattered  bits,  148-149, 

189,  594. 

Compton,  Robert  Christopher,  339,  350. 
Computed  table,  see  Memo  cache. 

Comtet,  Louis,  418,  719,  773. 

Concatenation,  305,  315. 

Concentric  wheels,  497. 

Concordant  bit  strings,  480. 

Concurrent  computing,  vii,  352,  682,  706. 
Condensation  of  variables,  213,  261. 
Condensation  principle,  89. 

Conditional  expression,  see  If-then-else 
function. 

Conditional-set  instructions,  141-142, 
180-181,  588-590,  600. 

Conditional-sum  adders,  127-128. 

Configs  of  a node,  467,  481. 

Confucius  (?Lfn  = JLftM  = ),  486. 

Conic  sections,  digitizing,  176-180,  198-199. 
Conic  splines,  615. 

Conjugate  of  a forest,  448-449,  471-472, 

476,  483. 

Conjugate  of  a partition,  29-30,  43,  394, 

408,  412,  414,  528,  755. 
of  a joint  partition,  749. 
of  a set  partition,  434. 

Conjugate  of  one  permutation  by 
another,  330. 

Conjunction  (A),  49,  219,  see  AND. 

in  3-valued  logic,  163. 

Conjunction  of  graphs,  28,  483,  see  Direct 
product  of  graphs. 

Conjunctive  normal  form  (CNF),  53,  56-57, 
72,  81,  85,  87,  97,  271,  277,  563,  571. 
full,  53. 

monotone,  81,  538. 
relation  to  DNF,  538. 

Conjunctive  prime  form,  54,  81. 
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Connected  components,  16,  18,  26,  40, 

42,  43,  314,  686. 
of  size  two,  265. 

Connected  digraphs,  18. 

Connected  graphs,  16,  33,  43,  44,  464, 

470,  484. 

Connectedness  function,  211-212,  256, 

262,  277,  624. 

Connectivity  of  a graph,  viii,  531. 
Connectivity  structure  of  an  image, 
173-175,  197-198. 

Connectivity  test,  803. 

Consecutive  Is,  84,  86,  127,  128;  see 
also  Runs  of  Is. 
forbidden,  208,  259,  263. 
required,  641. 

Consecutive  arcs,  19. 

Consecutive  integers,  408. 

Consensus  of  subcubes,  83,  195. 
Consonants,  38. 

Constants,  fundamental,  818-820. 
Constrained-by  operation  (fig),  264-265. 
Context-free  grammars,  85,  484,  658. 
Contiguous  United  States  of  America, 

15,  34,  39-40,  210-211,  231-233, 
244-246,  250,  254-255,  265,  269, 

276,  277,  636,  670. 

Contingency  tables,  372,  385,  414. 
Continuant  polynomials,  508,  617. 

Contour  integration,  419-424. 

Contraction  of  a graph,  463-465,  798. 
Contradiction  (_L),  49. 

Contrapositive,  61. 

Control  grids,  104. 

Control  points,  180. 

Converse  implication  (C),  49. 

Converse  nonimplication  (c),  49,  80. 
Converse  of  a digraph,  525. 

Convex  hull  of  points,  24,  68-69. 

Convex  optimization,  597. 

Convex  sets,  68-69,  90. 

Conway,  John  Horton,  172,  269,  354,  573, 
583-585,  609-610,  685. 
field  of  nimbers,  184. 

Cook,  Stephen  Arthur,  542. 

Cooke,  George  Erskine,  581. 

Cooke,  Raymond  Mark,  690,  693,  723. 
Coolean  algebra:  An  undiscovered  sequel 
to  Boolean  algebra. 

Coordinates,  293. 

Coppersmith,  Don,  532. 

Core  of  a Horn  function,  58,  86,  543,  545. 
Core  set  in  a torus,  376-377,  387. 

Cori,  Robert,  805. 

Corless,  Robert  Malcolm,  773. 

Coroutines,  350-351. 

recursive,  304-305,  316-317. 

Corteel,  Sylvie  Marie-Claude,  749. 


Cost  of  a Boolean  function,  97,  107, 

111-112, 126-132. 
statistics,  101,  105,  563-564,  581. 

Coteries,  88,  93. 

Coudert,  Olivier  Rene  Raymond,  258,  625, 
635,  663,  669,  670,  674. 

Counting  the  number  of  solutions,  206-207, 
251,  257,  259. 

Coupon  collector’s  problem,  812. 

Cover,  Thomas  Merrill,  13. 

Covering  function  (C(x\y)),  267. 

Covering  in  a lattice,  93,  412,  433, 

457,  473-476. 

Covering  problems,  11,  831. 

exact,  2,  7,  8,  35,  37,  515-526,  831. 
minimum,  34-35,  44,  258. 

Crama,  Yves  Jean-Marie  Mathieu 
Franz,  541. 

Cramer,  Carl  Harald,  775. 

CRC  (cyclic  redundancy  check),  183,  202. 
Crelle:  Journal  fur  die  reine  und  angewandte 
Mathematik,  an  international  journal 
founded  by  A.  L.  Crelle  in  1826. 

Cremer,  William  Henry,  Jr.,  694. 

Crests,  Japanese  heraldic,  504. 

Crette  de  Palluel,  Frangois,  8. 

Cribbage,  389. 

Cross-intersecting  sets,  385. 

Cross  order,  374-379,  387,  745. 

Crossbar  modules,  146-147,  190. 

Crossings  in  a diagram,  14,  531. 

Crossings  in  a set  partition,  473,  513,  768. 
Cryptarithms,  324. 

CSNZ  (conditional  set  if  nonzero),  142, 
180-181,  600. 

CS0D  (conditional  set  if  odd),  590. 

CSZ  (conditional  set  if  zero),  141,  588,  589. 
Cube-connected  computers,  682. 

Cube  of  a graph,  470. 

Cubes,  66,  see  Hypercubes,  n-cube,  Partial 
cubes,  Subcubes. 

Cubic  graphs,  14,  15,  39,  531. 

Cummings,  Larry  Jean,  696. 

Cummins,  Richard  Lee,  802. 

Cumulants  of  a distribution,  438,  775. 
Curious  properties,  653. 

Curtis,  Herbert  Allen,  96,  120-121. 
Curvature:  Reciprocal  of  the  radius,  614. 
Custering,  171,  176,  196-197. 

Cutler,  Robert  Brian,  538. 

Cvetkovic,  Dragos  Mladen  ( U,BeTKOBnh, 
flparom  MnaAeH),  468,  808. 

Cycle  graph  ( Cn ),  x,  13,  28,  39,  41,  42,  44, 
208,  249,  259,  481,  803,  807. 

Cycle  leaders,  311. 

Cycle  Lemma,  478. 

Cycle  structure  of  a permutation,  40, 

42,  326,  330,  762. 
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Cycles,  see  also  Universal  cycles, 
in  a graph,  147,  254,  275. 
in  a hypergraph,  33. 
oriented,  18,  19,  32,  40,  41. 
undirected,  346. 

Cyclic  permutations,  353,  437,  476. 

Cyclic  redundancy  checking,  183,  202. 
Cyclic  shifts,  149,  188,  306,  336,  338,  341, 
348,  478,  592,  595,  598. 

Cylinder  graphs,  28,  41,  469,  483,  808. 
Cylinders,  hyperbolic,  171,  608. 

Czerny,  Carl,  735. 

0 (differential  quantification),  231,  265, 
634-635,  637. 

99,  see  Depth  of  a Boolean  function. 
d(u,v)  (distance  in  a graph),  16,  19,  43. 

generalized,  16-17. 
d+(n)  (out-degree  of  v ),  18. 
d~  (v)  (in-degree  of  v),  18. 
da  Vinci,  Leonardo  di  ser  Piero,  9,  24. 
Dactyls,  489,  501,  512. 

Dags  (directed  acyclic  graphs),  31-32,  203, 
215,  253,  257,  259,  830. 

Dahlheimer,  Thorsten,  ix,  546,  597, 

623,  654,  655. 

Dallos,  Jozsef,  141,  590. 

Dally,  William  James,  682. 

Dancing  links  method,  2,  7,  8,  11,  464, 
515-516,  664. 

Danh,  Tran-Ngoc,  745. 

Dante  Alighieri,  771. 

Darroch,  John  Newton,  777. 

Dashed  lines  in  BDD  diagrams,  202. 
Dashed  lines  in  ZDD  diagrams,  249. 

Data  replication,  88. 

Data  structures  for  graphs,  21-23, 

464-466,  523-524. 

Dates  of  the  calendar,  packed,  136,  192. 
Davidson,  George  Henry  (=  Dee,  G.),  746. 
Davies,  Roy  Osborne,  2,  514. 

Day  kin,  David  Edward,  738,  745. 
de  Bruijn,  Nicolaas  Govert,  302,  426, 

457,  773. 

cycles,  142,  302-307,  316-318,  354, 

489,  625,  657,  702,  833. 
toruses,  318. 

de  Jaenisch,  Carl  Friedrich  Andreevitch 
(HHHini,  Kapjib  AH«peeBHHb),  672. 
De  Micheli,  Giovanni,  122. 
de  Moivre,  Abraham,  451,  506-508,  513. 
de  Montmort,  Pierre  Remond,  506,  513. 

De  Morgan,  Augustus,  51,  355,  410. 

laws,  51,  81,  534. 
de  Palluel,  Frangois  Crette,  8. 
de  Polignac,  Camille  Armand  Jules 
Marie,  15. 

Dead  nodes,  228,  633. 

Debugging,  659. 


Debye,  Peter  Joseph  William  (=  Debije, 
Petrus  Josephus  Wilhelmus),  420. 
Decimal  number  system,  282,  298-299, 

319,  324-325,  346-347,  762. 

Decoder,  binary,  109,  567. 

Decomposition  of  functions,  117-121, 
130-131,  268. 

Decomposition  of  partial  functions,  120-121. 
Decomposition  or  development  laws,  51,  52. 
Decorated  binary  trees,  456,  478. 

Dedekind,  Julius  Wilhelm  Richard,  398. 
sums,  398. 

Defect  of  a balanced  string,  478. 

Definite  Horn  clauses,  58-60,  86. 

Definite  Horn  functions,  58,  85,  95. 

Degen,  Carl  Ferdinand,  685. 

Degenerate  binary  trees,  475,  477,  809. 
Degree  of  a node,  472. 

Degree  of  a vertex,  14,  19,  39,  43,  44, 

464,  483,  529. 

Degree  one,  nodes  of,  466,  479. 

Degree  one,  vertices  of,  467,  484,  809. 

Degree  sequences,  29-31,  43,  46,  533. 
Delaunay,  Boris  Nikolaevich  ()J,ejiOHe,  Bopuc 
HHKOjraeBHu),  triangulation,  24. 

Dellac,  Hippolyte,  655. 

permutations,  271. 

Delta  operation  (/  EEj  g),  273. 

Delta  sequences,  293,  349,  734,  735. 

Denes,  Jozsef,  789. 

Deng,  EvaYu-PingJXPiT),  768. 

Deo,  Narsingh  (=hR1iJ  %T),  510. 

Dependency  on  a variable,  204,  225,  231, 

629,  637,  644-645,  674,  675. 

Depth  coordinates  c*,,  444,  461-462,  471, 

473,  477,  480,  786. 

Depth  of  a Boolean  function,  99,  100, 
124-128,  132,  145. 
statistics,  101,  105,  563-564,  581. 
Depth-first  search,  23,  41,  217,  575,  802-803. 
Depth-first  synthesis,  225-233,  264. 
Derangements,  271,  353. 

Derbes,  Joseph,  760. 

Dereferencing,  632,  660-661,  668-669. 
Derivative  of  a function,  386. 

Derivative  of  a graph,  484. 

Derivative  of  a reliability  polynomial, 

212,  260. 

Dershowitz,  Nachum  ('pniWhl  Din:),  789. 
Descartes,  Rene,  coordinates,  176; 

see  also  Cartesian  product. 

Descendant,  in  a tree  structure,  444, 

781,  809. 

Descents  of  a permutation,  430,  479,  759. 
Determinants,  40,  121,  130,  492,  526, 

532,  806-807. 

Deutsch,  Emeric,  798. 
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Dewey,  Melville  (=  Melvil)  Louis  Kossuth, 
notation  for  binary  trees  (due  to 
Galton),  795. 
notation  for  trees,  452. 

Dhar,  Deepak  *TT),  805. 

Diaconis,  Persi  Warren,  745,  780. 

Diagonal  matrices,  532. 

Diagonalization,  124. 

Diagonally  dominant  matrix,  806. 
Diagrams  for  digraphs,  18-19,  42. 
Diagrams  for  graphs,  14-15,  26-28,  39,  42. 
Diameter  of  a free  median  graph,  550. 
Diameter  of  a graph,  16,  24,  39,  41, 

42,  44,  526. 

Diamond  lemma,  756. 

Dice,  493-494,  505,  506,  512,  813. 

Dictator  functions,  49,  see  Projection 
functions. 

Dictionaries,  data  structures  for,  226, 
252-253. 

Dictionaries  of  English,  v,  10,  34,  38, 

47,  48,  520. 

Dietz,  Henry  Gordon,  151,  597. 

Difference  operation  (/  \ g),  273. 
Differential  quantification  (Q),  231,  265, 
633-634,  636. 

Digital  Century  puzzle,  484. 

Digitization  of  contours,  24,  176-180, 
198-199. 

Digraphs,  18,  see  Directed  graphs. 
Dijkstra,  Edsger  Wybe,  322,  596-597. 
Diker  Yiicel,  Melek,  561. 

Dilated  numbers,  see  Scattered 
arithmetic,  Zip. 

Dilation  of  an  embedded  graph,  315. 
Dillon,  John  Francis,  561. 

Dilogarithm  function,  410,  751,  754. 
Dimension  of  a vector  space,  380. 
Diomedes  (AiO[rf|8ric),  489. 

Direct  product  of  graphs,  28,  42-43,  483. 
Direct  product  of  matrices,  43,  684,  806. 
Direct  sum  of  graphs,  26-27,  42,  43,  483. 
Direct  sum  of  matrices,  27,  43. 

Directed  acyclic  graphs,  31-32,  203,  215, 
253,  257,  259,  830. 

Directed  distance  d(u,v),  19. 

Directed  graphs,  12,  18-22,  40,  42, 
482-483,  526. 
complete,  18,  522,  524. 
components  of,  18. 
converse  of,  525. 
degree  sequences  of,  43,  414. 
random,  25. 

representation  of,  19—22. 
simple,  18,  19,  40,  43,  525,  526. 
strong  components  of,  40,  61-62,  86, 

522,  545,  575. 

Directed  hypergraphs,  44. 

Directed  join  of  digraphs,  26-27. 


Directed  torus  graphs,  352,  808. 

Dirichlet,  Johann  Peter  Gustav  Lejeune, 
generating  function,  589. 

Discrete  Fourier  transform,  94,  289, 

308,  536,  560,  685. 

Discrete  logarithm,  see  Binary  logsize 
function. 

Discrete  torus,  414. 

Disjoint  decomposition,  117-120,  268. 

Disjoint  graphs,  26. 

Disjoint  sets:  Sets  with  no  common 
elements,  25. 

Disjoint  unions,  family  of,  661. 

Disjointness  testing,  190. 

Disjunction  (V),  49,  see  OR. 
in  3-valued  logic,  163. 

Disjunctive  normal  form  (DNF),  53-55, 

81,  85,  97,  571. 
full,  53-54,  81,  84,  557. 
irredundant,  95,  538,  541. 
monotone,  81,  82,  538. 
orthogonal,  84-85,  92. 
relation  to  CNF,  538. 
shortest,  55,  82,  83,  95,  258,  271. 

Disjunctive  prime  form,  54-55,  64,  71,  255. 

Distance  between  2-adic  integers,  584. 

Distance  d(u,v)  in  a graph,  16,  19,  43. 
generalized,  16-17. 

Distance  of  a code,  38. 

Distinct  bytes,  testing  for,  191. 

Distinct  columns,  33. 

Distinct  parts,  408,  409,  411,  412,  431. 

Distinct  rows,  46. 

Distributed  systems,  88. 

Distribution  networks,  see  Mapping 
networks. 

Distributive  lattices,  92,  551,  554. 

Distributive  laws,  43,  48,  50,  80,  87,  93, 
125,  135,  212-213,  261,  273,  474,  526, 
536-537,  582-583,  634. 
for  medians,  65,  67,  87,  90,  548. 

Divide  and  conquer  paradigm,  109, 

144,  148,  566-568. 

Divisibility  by  3,  202. 

Division,  186. 

avoiding,  136,  186. 
by  10,  156. 

by  powers  of  2,  135-136. 
in  Conway’s  field  of  nimbers,  184. 
of  2-bit  numbers,  191. 

Divisors  of  a number,  315,  498,  505. 
sum  of,  409. 

DNA,  511. 

DNF,  53,  see  Disjunctive  normal  form. 

Dobiriski,  G.,  419. 

Dodecahedron,  15. 

Dodgson,  Charles  Lutwidge  (=  Lewis 
Carroll),  10-12,  48,  79,  521. 

Doerr,  Benjamin,  561. 
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Dominance  order,  see  Majorization  lattices. 
Dominated  coteries,  88. 

Dominating  sets,  277,  831. 
minimal,  258,  277. 
minimum,  609. 

Dominoes,  251-252,  274,  389,  440. 

Donnolo,  Shabbetai  ben  Avraham 
(tinm  omnN  p ’raw),  490,  512. 
Don’t-cares,  46,  114,  116,  120-122,  129,  131, 
150,  161-162,  264,  574-575,  592,  605. 
Doob,  Michael,  808. 

Dot-minus  (x  — y),  see  Monus  operation. 

Dot  product  of  vectors,  12,  34,  37. 

Double  order  for  traversing  trees,  612-613. 
Doublets  game,  11. 

Doubly  bounded  partitions,  403,  411,  413. 
Doubly  Gray  code,  350. 

Doubly  linked  lists,  70,  301,  464-465,  696. 
Doubly  true  alphametics,  347. 

Doutte,  Edmond,  515. 

Dovetailing,  see  Interleaving. 

Dowling,  William  Francis,  60. 

Doyle,  Arthur  Ignatius  Conan,  1. 

Drawing  a binary  tree,  441,  454,  455, 

485,  782,  786. 

Drawing  on  a bitmap,  180. 

Drechsler,  Nicole,  646. 

Drechsler,  Rolf,  646. 

Drexel  (=  Drechsel  = Drexelius),  Jeremias 
(=  Hieremias),  490-491. 

Du,  Rosena  Ruo-Xia  768. 

Dual  boustrophedon  product,  695. 

Dual  identities,  534. 

Dual  of  a Boolean  function:  FD(x)  = F(x), 
266,  271,  534,  538,  552,  621,  653,  669, 
see  also  Self-dual  Boolean  functions, 
computing  monotone  CNF  from  DNF,  538. 
Dual  of  a combination,  356-358, 

380-381,  383. 

Dual  of  a forest,  448-449,  472-474. 

Dual  of  a hypergraph,  33,  35,  44. 

Dual  of  a planar  graph,  803. 

Dual  of  a set  partition,  768. 

Dual  permutation  generation,  335-337,  348. 
Dual  set  in  a torus,  376-377. 

Dual  size  vector,  388. 

Duality,  276,  387,  611,  743. 

Dubrova,  Elena  Vladimirovna  (pybpoBa. 

EjieHa  BnaflHMHpoBHa),  622,  646. 
Duckworth,  Richard,  322. 

Dudeney,  Henry  Ernest,  285,  324,  347,  432, 
484,  521,  672,  673,  702,  708,  810. 
Dughmi,  Shaddin  Faris 

JjU,),  674. 

Duguid,  Andrew  Melville,  145. 

Dull,  Brutus  Cyclops,  265. 

Dumont,  Dominique,  655. 
pistols,  271,  659. 


Dunham,  Bradford,  94,  557. 

Durfee,  William  Pitt,  393. 
rectangle,  748. 
square,  393-394,  402,  749. 

Duval,  Jean-Pierre,  700. 

DVIPAGE  program,  616. 

Dvorak,  Stanislav,  725. 

Dyck,  Walther  Franz  Anton  von,  510. 
paths,  see  Nested  parentheses, 
words,  510. 

Dyckman,  Howard  Lloyd,  316,  694. 
Dynamic  programming,  xi,  534,  810. 
Dynamic  reordering  of  variables,  243-246, 
251,  268-269,  663. 

Dynamic  storage  allocation,  227-228,  264. 
Dyson,  Freeman  John,  751. 

e,  as  source  of  “random”  data,  475, 

516,  771,  818-819. 

e*.  (an  elementary  family),  273-275,  659. 
Eades,  Peter  Dennis,  370,  734. 

Early  neighbors,  69,  90. 

Easy  nodes,  467,  481. 

Ebbenhorst  Tengbergen,  Cornelia  van,  457. 
econ  graphs,  31. 

Eckhoff,  Jurgen,  739. 

Edelman,  Paul  Henry,  789. 

Edge-chromatic  number  x(L(G)),  44. 

Edge  covering,  315. 

Edges  between  pixels,  176-180,  198-199. 
Edges  in  a graph,  11,  13. 

as  arcs,  19,  21-22. 

Edges  in  a hypergraph,  32-35. 

Edgeworth,  Francis  Ysidro,  expansion,  775. 
EDSAC  computer,  134,  143. 

Egiazarian,  Karen,  559. 

Egyptian  fractions,  658. 

Ehrlich,  Gideon  (p!nN  ynni),  289,  337,  350, 
362,  407,  417,  418,  705,  706,  730,  770. 
swap  method,  337-338,  349-350. 
Eigenvalues  and  eigenvectors  of  a matrix, 
40,  482-483,  723,  806,  808. 

Eight  queens  problem,  604,  672. 

Eiter,  Thomas  Robert,  654. 

Ekin,  Oya,  541. 

Elaborated  truth  tables,  212-213, 

260-261,  274. 

Electoral  college  (median  of  medians), 

268,  644. 

Electoral  districts,  8. 

Electrical  engineers,  96,  97,  105,  107, 

109,  114,  122. 

Elementary  families  (ejt),  273-275,  659. 
Elementary  symmetric  functions,  536,  757. 
Elements  (earth,  air,  fire,  water),  486,  497. 
Elgot,  Calvin  Creston,  124,  552. 

Ellipses,  176-179,  455,  613-615. 

Elliptic  functions,  398. 

Elusive  functions,  see  Evasive  functions. 
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Embedding  of  partial  order  into  linear  order, 
342,  see  Topological  sorting. 

Emde  Boas,  Peter  van,  164. 

Empty  family,  250,  273. 

Empty  graphs  (Kn),  26,  27,  41-43, 

46,  530,  806. 

Empty  quotation,  817. 

Empty  set,  499. 

Empty  string,  799. 

Encoding  of  ternary  data,  160-163,  195. 
End-around  swaps,  384. 

Endo-order,  368,  383,  446,  765. 

Endpoint  of  a graph,  484,  809. 

Engel,  Konrad  Wolfgang,  744. 

Enggren,  Willy,  346,  347. 

English  language,  v,  9-10,  38-39. 

Enns,  Theodore  Christian,  734. 

Enumeration,  1,  281. 

Enumeration  of  Boolean  functions,  79. 
asymptotic,  558. 

Enumeration  of  solutions,  206-207, 

251,  257,  259. 

Enveloping  series,  401,  411,  439,  774. 

Eofill  (even/odd  filling),  179. 

Eppstein,  David  Arthur,  620,  706. 

Equality  of  bytes,  152,  191,  192. 

Equality  testing  of  Boolean  functions, 

225,  257,  259. 
probabilistic,  260. 

Equidistant  cities,  44. 

Equivalence,  in  3-valued  logic,  195. 
Equivalence  classes,  482. 

Equivalence  operator  (=),  49-50, 

534-535,  553. 

Equivalence  relations,  45-46,  416, 

432,  518,  827. 

Equivalence  under  permutations,  78-79. 

and  complementations,  78-79,  104,  559. 
Equivalent  Gray  codes,  314. 

Er,  Meng  Chiau  334,  784. 

Eratosthenes  of  Cyrene  (’EpaxooSevri? 

6 Kup-qvatoc) , sieve  (xoaxivov),  137,  186. 
Erdos,  Pal  (=  Paul),  373,  389,  400,  411,  509, 
527,  528,  531,  550,  555,  714-715. 

Erdos,  Peter  L.,  763. 

Eriksson,  Henrik,  618. 

Eriksson,  Kimmo,  618. 

Errera,  Alfred,  781. 

Error-correcting  codes,  37-38,  279,  310. 
Errors,  ix,  487,  490-492,  498,  502-503, 

505,  511-513,  814,  816. 

Escher,  Giorgio  Arnaldo  (=  George 
Arnold),  169. 

Escher,  Maurits  Cornelis,  169. 

Etienne,  Gwihen,  761. 

Ettingshausen,  Andreas  von,  508. 

Etzion,  Tuvi  (yP3>l  mo,  born  TOlTI 
mo),  305. 


Euclid  (EuxXeiSric),  168. 
numbers,  658. 

Euclidean  distance,  10,  12. 

Euclidean  plane,  17. 

Euler,  Leonhard  (EiLnept,  JleoHapm  = 
9iijiep,  JleoHapfl),  3-7,  36,  395,  404, 
408,  409,  507,  516,  655,  759. 
summation  formula,  396,  410,  708. 
trails  in  a digraph,  498,  722,  745,  746,  780. 

Eulerian  numbers,  438,  479,  751. 

Evaluation  of  Boolean  functions,  96-133, 

206,  261,  see  Boolean  chains. 

Evasive  functions,  261. 

EVBDD,  204. 

Even  permutations,  40,  323,  354,  492, 

512,  812. 

Evolution  of  random  graphs,  25. 

Evolutionary  trees,  774. 

Ewing,  Ann  Catherine,  539. 

Exact  cover  problems,  2,  7,  8,  35,  37, 

251-252,  274,  503,  515-516,  665,  831. 

Exclusive  disjunction  (©),  49,  seeXOR. 

Exclusive  normal  form,  52,  see  Multilinear 
representation  of  a Boolean  function. 

Exercises,  notes  on,  xi-xiii. 

Exhaustive  functions,  see  Evasive  functions. 

Existential  quantification  (3),  85,  87, 

230,  265. 

EXISTS  subroutine,  230,  265,  633. 

Expander  graphs,  24. 

Exponential  generating  functions,  419, 

436,  771,  779. 

Exponential  growth:  2®(n),  225,  236, 

238,  242,  268,  396. 

Exponential  series,  partial  sums  of,  704. 

Exponential  time,  538. 

Exponentiation  mod  2n,  202. 

Extended  binary  trees,  441,  456,  472, 

477,  485. 

Extended  real  numbers:  Real  numbers 
together  with  — oo  and  +oo,  63. 

Extended  ternary  trees,  472. 

Extended  truth  tables,  241,  648. 

Extending  a partial  order,  342. 

Extension  of  a string,  306. 

External  nodes,  441,  455,  456,  472,  477, 

485,  781,  795. 

Extracting  bits,  134,  136,  140,  151. 

and  compressing  them,  148-149,  189,  594. 
the  least  significant  only  (2px), 

140-142,  150,  153. 

the  most  significant  only  (2Xx),  143, 
192-194,  600. 

Eyrolles,  Georges,  811. 

/-vectors,  see  Size  vectors. 

/n:  Closure  of  family  /,  277. 

/^:  Maximal  elements  of  family  /,  276,  717. 

/■*•:  Minimal  elements  of  family  /,  276,  717. 
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fs:  Minimal  hitting  sets  of  family  /, 

276,  671,  674. 

fn:  f with  variables  permuted  by  n, 
236-237,  266. 

fc  (family  of  complements),  276. 
fD  (dual  of  /),  266,  271,  621,  653,  669. 
fR  (reflection  of  /),  266,  647. 
fz(xi, . . . , xn)  (Z-transform  of  /),  272. 
Factorial  number  system,  500,  703. 
Factorial  ruler  function,  348. 

Factorials,  tables  of,  499,  512. 

Factorization  of  a graph,  28. 

Factorization  of  an  n-cycle,  476. 
Factorization  of  strings,  317. 
algorithm  for,  700. 

Factorizations  of  an  integer,  505,  513. 
Failing  units,  80. 

Faloutsos,  Christos  (4?aXouxoo5, 

Xprioxo;),  681. 

FALSE,  202. 

Falsehood  (_L),  49,  63,  79. 

Families  of  sets,  32,  250,  253,  263,  272-278, 

659,  669,  see  also  Hypergraphs, 
elementary  (e*,),  273-275,  659. 
generating  functions  for,  662,  668, 

670,  671. 

unit  (e),  273,  658,  662,  669-670. 
universal  (p),  275,  660-662,  666,  669. 
Families  of  subsets,  87-88. 

Family  algebra,  253,  255,  273,  275-277, 

660,  673. 

Fan-in:  The  number  of  inputs  to  a 
gate,  97,  104,  124. 

Fan-out:  The  number  of  uses  of  a gate, 

97,  104,  570. 

Fast  Fourier  transform,  188,  308. 

Fast  Walsh  transform,  312. 

Fault-tolerant  systems,  267. 

Faultfree  tilings,  274. 

FBDDs:  Free  BDDs,  204,  257,  261,  266. 
Feder,  Tomas,  73,  313,  551. 

Feller,  Willibald  (=  Vilim  = Willy  = 
William),  797. 

Felsenstein,  Joseph,  774. 

Fenichel,  Robert  Ross,  379. 

Fenner,  Trevor  Ian,  747,  748. 

Fenwick,  Peter  McAulay,  569. 

Fermat,  Pierre  de,  theorem,  318. 

Ferranti  Mercury  computer,  134. 

Ferrari,  Dani,  841. 

Ferrers,  Norman  Macleod,  393. 

diagrams,  393-394,  399,  402,  405,  426, 
747,  755,  757,  760,  768. 
diagrams,  generalized,  749. 

Feussner,  Friedrich  Wilhelm,  462. 

Feynman,  Richard  Phillips,  621. 

Fibonacci,  Leonardo,  of  Pisa  (=  Leonardo 
filio  Bonacii  Pisano),  numbers,  168, 


236,  246,  270,  316,  354,  396,  488,  566, 
569,  623,  675,  718,  811,  820. 
generalized,  488,  814. 

Fibonacci  number  system,  168,  196,  see  also 
NegaFibonacci  number  system, 
odd,  608. 

Fibonacci  polynomials,  199-200. 

Fibonacci  strings,  36. 

Fibonacci  threshold  functions,  92,  125,  261. 
Fibonacci  trees,  484,  485. 

Fiduccia,  Charles  Michael,  532. 

Fiedler,  Miroslav,  808. 

Field  polynomials,  807. 

Fields,  algebraic,  182,  184,  617. 

Fields,  finite,  50,  182,  184,  311-312, 

518,  617. 

Fields  of  data,  see  Packing  of  data. 

fragmented,  150,  201. 

Filling  a contour  in  a bitmap,  176-180, 
198-199. 

Final  vertex  of  an  arc,  18. 

Finck,  Hans-Joachim,  530. 

Fingerprints,  172. 

Finikov,  Boris  Ivanovich  (<J>hhhkob,  Bopnc 
MBaHOBHu),  125. 

Finite  fields,  50,  182,  184,  311-312,  518,  617. 
Finite-state  automata,  279,  601. 

Finite-state  transducers,  128. 

Fink,  Jiff,  687. 

Finozhenok,  Dmitriy  Nikolaevich  (Ohho- 
JKeHOK,  ^MHTpuft  HnKOJiaemiu) , 673. 
First-element  swaps,  337-338,  350, 

370-371,  384. 

First-order  predicate  calculus,  545. 

Fischer,  Johannes  Christian,  607. 

Fischer,  Ludwig  Joseph,  704. 

Fischer,  Michael  John,  127,  128,  570. 
Fischler,  Martin  Alvin,  556. 

Fiser,  Petr,  55. 

Fisher,  Randall  James,  151,  597. 

Fisher,  Ronald  Aylmer,  753. 

Five-letter  English  words,  9-12,  16, 

38-39,  43,  252-253,  275-277,  291, 

312,  318,  346,  432. 

Five-variable  Boolean  functions,  79, 

105-106,  126,  267,  277,  572. 

Fixed  point  arithmetic,  614-615. 

Fixed  points  of  a permutation,  434. 

Flag:  A 1-bit  indicator,  152,  191,  192. 
Flajolet,  Philippe  Patrick  Michel,  813. 
Flavors,  492. 

Fleischner,  Herbert,  809. 

Flight,  Colin,  774. 

Flip-flops  in  Life,  270. 

Flip  operation/permutation,  330-331, 

349,  351,  354,  710,  789. 

Floating  point  arithmetic,  142,  207, 

589,  810. 

Flores,  Ivan,  693. 
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Flows  of  money,  31. 

Floyd,  Robert  W,  190. 

Lemma,  vi. 

Flye  Sainte-Marie,  Camille,  745. 

FOCS:  Proceedings  of  the  IEEE  Symposia 
on  Foundations  of  Computer  Science 
(1975—),  formerly  called  the  Symposia 
on  Switching  Circuit  Theory  and 
Logic  Design  (1960-1965),  Symposia 
on  Switching  and  Automata  Theory 
(1966-1974). 

Focus  pointers,  290-291,  300-301, 

467,  481,  696. 

Folland,  Gerald  Budge,  v. 

Fomin,  Sergey  Vladimirovich  (Oomhh, 
Cepreii  B naAHMHpoBHu) , 768. 

Fontana  Tartaglia,  Niccolo,  499. 

Football  scores,  31. 

Footprints  of  a Boolean  function, 

100-101,  113,  114,  122,  125,  126, 

564,  571-572,  598,  605. 

Footprints  of  a forest,  475,  788. 

Forcade,  Rodney  Warring,  532. 

Forcing  functions,  78. 

Forests,  275,  278,  300-301,  440-485,  509. 
canonical,  461-462,  480-481. 
cognate,  472. 

conjugate  of,  448,  471-472,  476,  483. 
dual  of,  448-449,  472-474. 

Gray  codes  for,  446-449,  473. 
oriented,  165-167,  461-462,  480. 
outline  of,  453. 
random,  453-454,  478. 
representation  of,  see 
Left-child  / right-sibling 
links,  Nested  parentheses, 
Right-child/left-sibling  links, 
roots  of,  441. 
shape  of,  453-454. 
super-root  of,  470,  483. 
transpose  of,  471-472,  476. 
triply  linked,  470-471,  480,  606-607, 

612,  785. 

Formula  complexity,  see  Length  of  a 
Boolean  function. 

Forney,  George  David,  Jr.,  677. 

Fortet,  Robert  Marie,  258. 

Fortune,  Steven  Jonathon,  257. 

Foulds,  Leslie  Richard,  774. 

Four  Color  Theorem,  17. 

Four-letter  words,  10. 

Four-variable  Boolean  functions,  79,  98-105, 
112-114,  122,  126,  129,  642. 

Fourier,  Jean  Baptiste  Joseph, 
series,  287,  397. 

transform,  discrete,  94,  289,  308, 

536,  560,  685. 

Fox,  Ralph  Hartzler,  306. 

Fractal  dimension,  479. 


Fractals,  200,  589. 

Fractional  precision,  136,  201. 

Fraenkel,  Aviezri  S (bpns  ntyrtN),  721,  727. 
Fraer,  Ranan  (1«13  pyi),  87. 

Fragmented  fields,  150,  190. 

Fraisse,  Henri,  669. 

France,  queen  of,  496. 

Frankl,  Peter,  739,  740-741. 

Franklin,  Fabian,  408,  411. 

Fredman,  Michael  Lawrence,  154,  192, 

313,  538,  674,  686. 

Fredricksen,  Harold  Marvin,  306,  307. 

Free  binary  decision  diagrams,  257,  261,  266. 
Free  groups,  510. 

Free  median  algebras,  70-71,  91. 

Free  parentheses,  459,  800,  801. 

Free  systems,  535. 

Free  trees,  17,  44,  67,  91,  462,  480-481, 

484,  509,  577,  789,  807,  809. 

Freed,  Edwin  Earl,  187. 

Freese,  Ralph  Stanley,  789. 

Frequency  of  usage  in  English,  10. 

Frey,  Peter  William,  606. 

Fridshal,  Richard,  94,  557. 

Friedman,  Steven  Jeffrey,  646. 

Fringe,  301,  696. 

Fristedt,  Bert,  755. 

Frontiers,  628,  667. 

Fuchs,  David  Raymond,  611. 

Fulkerson,  Delbert  Ray,  528. 

Full  adders,  107-108,  126,  565,  569-570, 

575,  609. 

for  balanced  ternary  numbers,  195. 

Full  conjunctive  normal  form,  53. 

Full  disjunctive  normal  form,  53-54, 

81,  84,  558. 

Fully  elaborated  truth  tables,  212-213, 
260-261,  274. 

Functional  composition,  232-233, 

263,  265,  637. 

Functional  decomposition,  117-121,  130-131. 
Fundamental  constants,  818-820. 

Funk,  Isaac  Kauffman,  48. 

Gp:  Gigamems  = billions  of  memory 
accesses,  672-673. 

Gabow,  Harold  Neil,  607,  804. 

Gaddum,  Jerry  William,  530. 

Gadzhiev,  Makhach  Mamaevich  (raflJKneB, 
Maxau  MaMaeBHu),  558. 

Gale,  David,  757. 

Galen,  Claudius  (KXauSto?  FocXrivoc),  49. 
Galilei,  Galileo,  505. 

Gallai  (=  Griinwald),  Tibor,  527. 

Gallian,  Joseph  Anthony,  715. 

Gallier,  Jean  Henri,  60. 

Galois,  Evariste,  327. 

Galton,  Francis,  846. 
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Games,  58,  86,  172,  184,  195,  197, 

493-494,  503,  512,  597,  604,  622, 
see  also  Tic-tac-toe. 

games  graphs,  31. 

Gamma  function,  421-422,  751. 

Ganter,  Bernhard,  516-517. 

Gaps,  408. 

between  prime  numbers,  588. 
between  Ulam  numbers,  605. 
in  a scattered  accumulator,  596. 

Garbage  collection,  159,  227-229,  264, 

268,  647,  670. 

Garcia-Molina,  Hector,  88. 

Gardner,  Martin,  9,  11,  172,  337,  345,  497, 
547,  574,  609,  610,  673,  694,  702,  761. 

Gardy,  Daniele,  813. 

Garey,  Michael  Randolph,  833. 

Garsia,  Adriano  Mario,  769. 

Garvan,  Francis  Gerard,  751. 

Gates,  networks  of,  32,  97,  104. 

Gather-flip  operation,  149,  189,  190. 

Gathering  bits,  149,  189,  190,  594. 

Gaufi  (=  Gauss),  Johann  Friderich  Carl 
(=  Carl  Friedrich),  5,  17,  168,  827. 

Gebhardt,  Dieter,  617. 

Gegalkine  (=  Zhegalkin),  Jean  Jean 

(>KerajiKHH,  HBaH  HBaHOBjm),  51,  535. 

General  permutation  generator,  328-331, 
340-341,  347-348. 

Generalization,  sweeping,  212-213, 

260-261,  280. 

Generalized  Bell  numbers,  435,  438. 

Generalized  Binomial  coefficients,  387,  see 
also  q-nomial  coefficients. 

Generalized  Catalan  numbers,  476-477. 

Generalized  chess  pieces,  41. 

Generalized  consensus,  83,  120. 

Generalized  distance  d(v i, . . . ,i>/t),  16-17. 

Generalized  Fibonacci  numbers,  488,  814. 

Generalized  Stirling  numbers,  436,  765. 

Generalized  toruses,  45-46. 

Generating  all  solutions  to  a Boolean 
equation,  206,  259. 

Generating  functions,  techniques  for  using, 
187,  189,  345,  383,  395,  399,  408-409, 
411,  415,  419,  436,  450,  476,  477, 

479,  481,  513,  640-641,  655,  678, 

700,  704-705,  719,  735,  771,  779, 
793-795,  799,  815,  816. 
computed  from  BDDs,  206,  211,  255, 

260,  261,  650. 

computed  from  ZDDs,  662,  668,  670,  671. 
Dirichlet,  589. 

Generation  of  combinatorial  patterns,  1,  281. 
constant  amortized  time,  678. 
loopless,  289-292,  300,  308,  309,  316, 

346,  680,  706,  719. 

Generation  of  simple  cycles,  254,  275. 


Generator  routines  for  graphs,  23-26, 

30-32,  41. 

Genetic  code,  511. 

Genji-ko  (jift#),  503-505,  509,  513. 

Genlex  order,  363-367,  370-371,  382-383, 
732,  737,  813. 
for  Gray  codes,  385. 

Genocchi,  Angelo,  655. 
derangements,  271. 
numbers,  655,  659. 

Geometric  mean,  414,  438. 

Geometric  nets,  37-39. 

Gershwin,  George,  498. 

Gherardini,  Lisa,  see  Mona  Lisa. 

Giant  component  of  a graph,  16,  25,  39,  522. 
Gigamems  (G/r):  One  billion  memory 
accesses,  229,  256,  672-673. 

Gilbert,  Edgar  Nelson,  314. 

Gilbert,  William  Schwenck,  281. 

Gill,  Stanley,  143. 

Gillies,  Donald  Bruce,  143. 

Girth  of  a graph,  15,  24,  39-41,  44,  521. 
Gladwin,  Harmon  Timothy,  140. 

Glagolev,  Valery  Vladimirovich  ( IViaE  ojieB, 
Bajiepnii  BjiaAHMupoBHu),  558. 

Global  variables,  220,  221,  634,  637. 

Globally  optimum  solutions,  34-35. 

God,  494-496. 

Goddyn  de  la  Vega,  Luis  Armando, 

314,  689. 

Godfrey,  Michael  John,  36. 

Golden  Bears,  California,  31. 

Goldman,  Jay  Robert,  769. 

Goldstein,  Alan  Jay,  341. 

Golle,  Philippe,  480,  802. 

Golomb,  Solomon  Wolf,  356,  379,  519,  762. 
Gomes,  Peter  John,  xiv. 

Gonnet  Haas,  Gaston  Henry,  773. 
Gonzalez-Morris,  German  Antonio,  707. 
Good,  Irving  John,  772. 

Good  pairs,  118,  130-131. 

Gordian  Knot  puzzle,  315. 

Gordon,  Basil,  431. 

Gordon,  Leonard  Joseph,  519. 

Gosper,  Ralph  William,  Jr.,  188,  202. 

hack,  136,  186. 

GOST  cipher,  129. 

Goto,  Eiichi  ),  552,  554. 

Gradenigo,  Pietro,  495. 

Graeco-Latin  squares,  4-5,  8,  36,  518. 
Graham,  Ronald  Lewis  (UalIS),  90, 

549,  664,  745,  780,  842. 

Grammar,  context-free,  58,  484. 

Grandsire  Doubles,  323. 

Grant,  Jeffrey  Lloydd  Jagton,  520. 

Graph  homomorphisms,  73. 

Graph-paper  graphs,  28. 

Graph  theory,  introduction  to,  13-19. 
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Graphical  degree  sequences,  29-31, 

43,  46,  533. 

Graphs,  11-35,  39-45,  146-147,  264, 
462-470,  481-484,  830-832. 
algebra  of,  26-28,  42-45. 
algorithms  on,  159-160,  194-195,  258. 
bipartite,  see  Bipartite  graphs, 
complete,  13,  26-27,  39,  41-43,  525. 
empty  (null),  26,  27,  41-43,  46,  530. 
generators  for,  23-26,  30-32,  41. 
fcth  power  of,  470,  484. 
labeled  versus  unlabeled,  15,  16,  532. 
median,  67-74,  90,  550. 
of  orders  3 and  4,  42,  46. 
products  of,  27-28,  42-44,  526. 
random,  25,  41,  46. 
regarded  as  digraphs,  19-22. 
regular,  14,  24-25,  33,  40-44. 
representation  of,  19-22. 

Gratzer,  Gyorgy  (=  George),  788. 

Gray,  Elisha,  285. 

Gray,  Frank,  284. 

Gray  binary  code  (rn),  282-292,  296, 
308-313,  316,  321,  362,  484,  515, 

566,  584,  595,  601,  687,  696,  737, 

746,  765,  798. 

as  a permutation,  283,  311. 
restricted  to  combinations,  362-363, 
381-382. 

Gray  binary  trie,  310. 

Gray  code:  A sequence  of  adjacent 
objects,  510,  832. 
quasi-,  469-471. 

Gray  code  for  binary  partitions,  758. 

Gray  code  for  binary  trees,  446-449,  473. 

Gray  code  for  combinations,  362-372, 

381- 384,  446. 

homogeneous,  364-365,  370-371, 

382- 383,  729,  733,  736. 
revolving-door,  362-364,  370-371, 

381-383. 

Gray  code  for  forests,  446-449,  473. 

Gray  code  for  partitions,  405-407,  414,  759. 

Gray  code  for  matchings,  718-719. 

Gray  code  for  mixed  radices,  321,  705, 

710,  714,  719. 

Gray  code  for  n-tuples,  292,  295,  298. 
advantages  of,  286,  291-292. 
binary,  see  Binary  Gray  codes,  Gray 
binary  code, 
limitations  of,  678,  702. 
modular,  299-300,  315,  481. 
reflected,  299-300,  315,  316,  321,  382,  447, 
682,  693,  695,  705,  710,  719,  759. 

Gray  code  for  nested  parentheses,  446, 

473,  477,  787. 

Gray  code  for  permutations,  349-350, 

498,  710,  718-720. 

Gray  code  for  Schroder  trees,  479. 


Gray  code  for  set  partitions,  417-418,  433. 
Gray  code  for  spanning  trees,  463. 

Gray  code  for  trees,  446-449,  473. 

Gray  code  for  weak  orders,  720-721. 

Gray  cycle:  A cyclic  Gray  code,  292,  295. 
Gray  fields,  311-312. 

Gray  levels  in  image  data,  191,  199. 

Gray  path,  295,  see  Gray  code. 

Gray  stream,  314. 

Gray  ternary  code,  299,  316. 

Greatest  lower  bounds,  see  Lattices. 

Greedy  algorithm,  113,  488. 
Greedy-footprint  heuristic,  113-114,  122, 
571-572,  598,  605. 

Greek  logic,  48-49. 

Greek  poetry,  489,  501. 

Greene,  Curtis,  459,  756,  801. 

Greg,  Walter  Wilson,  trees,  774. 

GREG  (global  register  definition),  141,  144. 
Grid  graphs,  28,  41,  211-212,  252,  254, 

256,  260,  277,  469,  483,  609,  628, 

808,  see  also  Pentagrid. 
triangular,  25,  525,  554,  787. 

Grid  paths,  356-357,  379. 

Gries,  David  Joseph,  534. 

Griggs,  Jerrold  Robinson,  761. 

Gropes,  535. 

Gros,  Luc  Agathon  Louis,  285. 

Groth,  Edward  John,  Jr.,  8. 

Grouping  bits,  see  Gather-flip  operation, 
Sheep-and-goats  operation. 

Groupoids,  multiplication  tables  for, 

163,  195. 

Groups,  commutative,  414. 

Groups  of  functions,  185. 

Groups  of  permutations,  327-328,  338,  710. 
Griinbaum,  Branko,  530. 

Grundy,  Patrick  Michael,  582. 

Gualterus  Burleus  (=  Walter  Burley),  51. 
Guibas,  Leonidas  John  (rxipxas,  AearviSai; 
Icoavvou),  616. 

Guilielmus  ab  Occam  (=  William  of 
Ockham),  51. 

Guittone  d’Arezzo,  436. 

Gulliver,  Lemuel  (=  Jonathan  Swift),  620. 
Gumbel,  Emil  Julius,  distribution,  see 
Fisher,  Ronald  Aylmer. 
gunion  (union  of  SGB  graphs),  26. 

Gunther,  Wolfgang  Albrecht,  646. 

Guo,  Zicheng  Charles  (fP^fiS)),  173,  197. 
Gupta,  Hansraj  (SHTFJT  Jp^TT),  753. 

Gurvich,  Vladimir  Alexander  (rypBHH, 

BnauHMHp  AneKcaHApoBHu),  120,  537. 
Guthrie,  Francis,  17. 

Gutjahr,  Walter  Josef,  796. 

Guy,  Richard  Kenneth,  573,  584,  609,  641. 
Gvozdjak,  Pavol,  314. 
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/i-ordered  permutation,  353. 

hn,  see  Hidden  weighted  bit  function. 

Hachtel,  Gary  Deane,  122. 

Hacks,  133-202,  735. 

Hadamard,  Jacques  Salomon,  685. 
inequality,  551. 
matrices,  623. 

transform,  289,  312,  536,  560,  684,  685. 
Hagauer,  Johann  (=  Hans),  69. 

Hagerup,  Torben,  599. 

Haigh,  John,  428. 

Haken,  Armin,  542. 

Haken,  Wolfgang,  17. 

Hakimi,  Seifollah  Louis,  511. 

HAKMEM,  158,  582,  586. 

Half  adders,  107-108,  609. 

for  balanced  ternary  numbers,  195. 

Hall,  Marshall,  Jr.,  ix,  514,  518,  752,  766. 
Hall,  Richard  Wesley,  Jr.,  173,  197. 
Hamburg,  Michael  Alexander,  585. 

Hamilton,  William  Rowan,  15,  see 
also  Cayley. 

cycles,  15,  276,  293,  313-314,  321, 

338-339,  349-352,  355,  687,  713, 

734,  810,  832. 

paths,  15,  85,  254-255,  275-276,  295,  313, 
321,  338-339,  350-351,  355,  370,  384, 
484,  668,  712-714,  734,  736,  787,  832. 
Hamiltonian  graph:  A graph  that  contains  a 
Hamiltonian  cycle,  15,  44,  470,  484,  521. 
Hamley,  William,  and  sons,  694. 

Hammer,  Peter  Laszlo  (=  Peter  Leslie 
= Ivanescu,  Petru  Ladislav),  120, 

258,  541. 

Hamming,  Richard  Wesley,  677. 
code,  279. 

distance,  12,  28,  37-38,  90,  275,  549,  624. 
Hammond,  Eleanor  Prescott,  494. 

Hammons,  Arthur  Roger,  Jr.,  681. 
Handshaking  at  a circular  table,  444,  471. 
Handy  identity,  773. 

Hankel,  Hermann,  422,  771. 
contour,  438. 

Hansel,  Georges,  460-461,  480. 

Haralambous,  Yannis  (XapaXocpTrou^, 
Icnawric),  883. 

Har’ar,  Natan  ben  Sa'adyah 
(-nn  nnvc  p ru),  813. 

Harary,  Prank,  xiv,  18,  509,  522,  802,  810. 
Hardware  versus  software,  97,  569. 

Hardy,  Godfrey  Harold,  398,  399,  410, 

411,  586,  750,  758. 

Hare,  David  Edwin  George,  773. 

Harel,  Dov  (bNtil  n),  165. 

Hariharan,  Ramesh  (/tGldsi^ j syo'f srrreoir) , 

804. 

Harmonic  numbers,  820-821,  825. 

Harmuth,  Henning  Friedolf,  287. 

Harriot,  Thomas,  505. 


Hartley,  William  Ernest,  502-503,  513. 
Harvard  University  Computation 
Laboratory,  104,  126,  564. 

Hash  tables,  221-223,  226-227,  253,  268, 
see  also  Universal  hashing. 

Hash  values,  260. 

Hastad,  Johan  Torkel,  94,  602-603. 

Havel,  Vaclav  (mathematician),  29. 

Havel,  Vaclav  (playwright  and  statesman), 
8-9. 

Hawaii,  346. 

Hawking,  Stephen  William,  827. 

Hayashi,  Tsuruichi  ($£t§ — ),  815. 

Heap,  Brian  Richard,  331,  333,  339, 

348,  352,  706. 

Heap,  Mark  Andrew,  627. 

Heaps,  164-167,  195-196,  727. 
Hebraic-Graeco-Latin  squares,  36,  518. 
Hebrew  letters,  36,  490,  497,  811,  813. 
Heckel,  Paul  Charles,  593. 

Hedayat,  Samad  (=  Abdossamad, 

,.\fl,r>llj^.),  516,  518. 

Hedetniemi,  Sarah  Lee  Mitchell,  461. 

Heine,  Heinrich  Eduard,  409. 

Heinen,  Franz,  17. 

Hell,  Pavol,  74. 

Hellerman,  Leo,  104-105. 

Henrici,  Peter  Karl  Eugen,  397. 

Hensel,  Kurt  Wilhelm  Sebastian,  526. 
Herrmann,  Francine,  168. 

Heun,  Volker,  607. 

Hexadecimal  constants,  x,  819. 

Hexadecimal  digits,  201,  327,  722. 
Hexadecimal  notation  for  truth  tables, 

105,  132,  572. 

Hexadecimal  puzzle,  680. 

Hexagrams,  486-487,  511,  811. 

Hexameter,  500-503,  512. 

HI  field  in  a decision  diagram,  202-203,  216, 
221,  226,  241,  249,  250,  259,  263. 
Hickerson,  Dean  Robert,  476,  664. 

Hickey,  Michael,  370,  734. 

Hidden  nodes,  240-242,  648. 

Hidden  weighted  bit  function  ( hn ),  235-238, 
240,  262,  266-267,  269,  646. 
two-way,  676. 

Hight,  Stuart  Lee,  120. 

Highways,  31. 

Hilbert,  David,  basis  theorem,  388. 

Hilewitz,  Yedidya,  189. 

Hilton,  Anthony  John  William,  385,  739. 
Hindenburg,  Carl  Friedrich,  319,  392, 

419,  507-508. 

Hindman,  Neil  Bruce,  550. 

Hindu  mathematics,  487-489,  491-492, 
499-500,  507,  512. 

Hitting  sets  of  a family,  minimal  (/*), 

276,  671,  674. 

Hlavicka,  Jan,  55. 
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Ho,  Chih-Chang  Daniel  761. 

Hoare,  Arthur  Howard  Malortie,  761. 

Hobby,  John  Douglas,  180,  615,  883. 

Hodges,  Joseph  Lawson,  Jr.,  797. 

Hoeffding,  Wassily,  777. 

Hoffman,  Alan  Jerome,  523. 

Holes  in  images,  174-175. 

Hollis,  Jeffrey  John,  194. 

Holmes,  Thomas  Sherlock  Scott,  1. 

Holton,  Derek  Allan,  531,  673. 

Holzmann  Poisson,  Carlos  Alfonso,  802. 
Homer  (“Op/npo?),  9,  501. 

Homogeneous  generation  of  combinations, 
364-365,  382-384,  733. 
scheme  Kst,  364,  370-371,  383,  729,  736. 
Homogeneous  polynomials,  388. 
Homomorphisms  in  median  algebras,  67. 
Homomorphisms  of  graphs,  74. 

Honda,  Toshiaki  (^EBf'JBfl),  504,  513. 

Hook  lengths,  723. 

Hooks,  748,  749. 

Hopcroft,  John  Edward,  257,  683. 

Horiyama,  Takashi  (4H  ill  Tit  56),  637. 

Horn,  Alfred,  57. 

clauses,  57,  85,  86,  119,  548,  575. 
functions,  58,  79,  95,  266,  271,  661,  673. 
functions,  renamed,  87. 
satisfiability,  60,  85-87,  830. 

Horn,  Gavin  Bernard,  677. 

Horton,  Robert  Elmer,  811. 

Horton-Strahler  numbers,  485,  520. 

Hosaka,  Kazuhisa  ),  647. 

Hotels  and  comedians,  60-62,  86. 

HP  Compaq  2510p  Notebook  PC,  883. 
Hsiao,  Ben  Mu-Yue  = ^f||j§),  519. 

Hudson,  Richard  Howard,  588. 

Huffman,  David  Albert,  94,  676. 

Hugo,  Victor  Marie,  23. 

Hunt,  Harry  Bowen,  III,  638. 

Hunt,  Neil,  616. 

Hunter,  James  Alston  Hope,  324. 
Huntington,  Edward  Vermilye,  548. 

Hurlbert,  Glenn  Howland,  701,  722,  746. 
Hurwitz,  Adolf,  526. 

Hutchinson,  George  Allen,  416,  431. 
Hyperarcs,  529. 

Hyperbolas,  176,  198,  586,  614,  615. 
Hyperbolic  functions,  438. 

Hyperbolic  plane  geometry,  167-171, 

179,  196,  608. 

Hypercubes,  54,  74,  see  n-cube. 
retracts  of,  74,  91. 
subgraphs  of,  90. 

Hyperedges,  32. 

Hyperfloor  function  (2Xx),  143,  192-194, 

585,  600. 

Hyperforests,  44. 


Hypergraphs,  32-35,  44,  252,  372,  669. 
3-uniform,  32-33,  673. 
dual,  33. 

r-uniform,  32-33,  44. 

Hyperrectangles  (Pni  □ ■ • • □ P„m),  28,  67. 

Hypotheses,  59-60. 

I (identity  matrix),  26,  526,  620,  806. 

I Ching  (Mt),  486-487,  511-512. 

Iambuses,  489-490,  501,  512. 

Ibaraki,  Toshihide  120,  541, 

546,  547,  637,  654. 

IBDD,  204. 

IBM  Type  650  computer,  v. 

Ibn  al-Hajj,  Muhammad  ibn  Muhammad 
(^LJI  n),  516. 

Ibn  Mun‘im,  Ahmad  al-‘Abdari 
( aa.^1),  499. 

ID(g),  the  id  of  the  SGB  graph  g,  12,  22. 

Ide,  Mikio  604. 

Ideals  in  a median  algebra,  65. 

Idel,  Moshe  (t>T>N  TW O),  497. 

Identities  for  bitwise  operations,  135,  136, 
184,  185,  187,  585,  588,  598. 

Identity  matrix,  26,  526,  620,  806. 

Identity  permutation,  327. 

IEEE  Transactions,  citations  from,  ix. 

If-then-else  function  (/?  g:  h ),  96,  206-207, 
229,  561,  566,  568,  569,  see  also 
MUX  subroutine, 
nested,  see  Junction  function. 

Igarashi,  Yoshihide  (1+)®,#^),  557. 

Igusa,  Kiyoshi  761. 

ILLIAC  I computer,  143. 

Image,  digitized,  24. 

Image  of  an  element  under  a mapping  or 
permutation,  149,  236,  326. 

Implicants,  53—54,  81,  674,  see  also  Prime 
implicants  of  Boolean  functions. 

Implication  (3),  48-49. 
in  3-valued  logic,  163. 

Implicit  data  structures,  21  22,  164-171, 
195-196. 

Implicit  graphs,  232. 

IMPLIES  subroutine,  634. 

Imrich,  Wilfried,  28,  69,  549. 

In-degree  of  a vertex,  18,  19,  41,  43, 

262,  482,  808. 

in  situ  permutation,  308,  311. 

in  situ  transformation,  289. 

Incidence  matrix  of  a graph  or  hypergraph, 
33,  35,  44,  806. 

Inclusion  and  exclusion  principle,  286,  400, 
411,  540,  665,  719,  771,  776,  811. 

Inclusive  ancestors  of  a node:  The 

node  itself  together  with  its  proper 
ancestors,  41,  165. 

Inclusive  disjunction  (v),  49,  see  OR. 
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Incompatible  columns,  120. 

Incomplete  gamma  function,  421. 

Incremental  changes,  90. 

ind  a:  the  index  of  a,  431. 

Indecomposable  partitions,  513. 

Indecomposable  permutations,  353,  811. 

Independence  number  a(H)  of  a graph 
or  hypergraph,  35,  44. 

Independent-set  function,  231-233,  244,  269. 

Independent  sets,  34,  35,  44,  208,  210-211, 
231-232,  249,  250,  658,  831. 
maximal,  see  Kernels  of  a graph, 
of  a hypergraph,  34,  35,  673. 

Index  of  a permutation,  431,  759. 

Indian  mathematics,  487-489,  491-492, 
499-500,  507,  512. 

Indian  numerals,  491. 

Induced  fc-partite  subgraphs,  277. 

Induced  subgraphs,  13,  18,  39,  42,  43,  46. 

Induced  subhypergraphs,  32. 

Infinite  binary  trees,  164,  185. 

Infinite  exclusive-or  operation,  585. 

Infinite-precision  numbers,  134,  136,  184. 

Infinite  sets,  673. 

Infinity,  the  projective  point  at,  24. 

Initial  vertex  of  an  arc,  18. 

Inline  expansion,  291-292. 

Inorder  (symmetric  order  of  nodes), 

165,  441,  447,  448,  455,  476,  477, 

511,  782,  787,  788. 

Inshuffles,  201,  591. 

Inside  of  a curve,  176. 

Integer  multilinear  representation,  80, 

84,  94,  211,  536,  555,  see  Reliability 
polynomials. 

Integer  partitions,  391-415,  428-431, 

434,  505-508,  513,  767. 

Integer  programming  problems,  vii,  258. 

Integer  variables  represented  as  several 
Boolean  variables,  258. 

Interchanging  adjacent  variables, 

240-246,  650. 

Interchanging  selected  bits  of  two 
registers,  581. 

Interchanging  two  bits  of  a register,  187. 

Interleaving,  38,  191,  231,  317,  634, 

688,  see  also  Zip. 

Internal  path  length,  477. 

Internet,  iv,  viii,  ix,  xii,  10,  380,  511, 

520,  762,  804. 

Interpolating  polynomials,  536. 

Intersecting  family:  A family  containing 
no  disjoint  sets,  88,  555. 

Intersecting  two  families  of  sets,  273. 

Intersection  operation  (/  n j),  273,  see 
also  AND  subroutine. 

Intervals  in  graphs,  66,  67,  89. 

Intervals  in  median  algebras,  65-66. 


Intervals  of  the  majorization  lattice, 

403,  411,  413. 

Intruders,  36. 

Inverse  function  of  Gray  binary  code, 

284,  311. 

Inverse  of  a binary  matrix,  202. 

Inverse  of  a permutation,  20,  182,  342-343, 
346,  448,  591,  717. 

Inverse  of  an  odd  integer  mod  2n,  621. 
Inversion  tables,  321,  352,  444,  471, 

703,  718,  781. 

Inversions  of  a permutation,  321,  323, 

395,  435. 

Inverter  gates,  32,  see  Complements  of 
Boolean  functions. 

Inverter  modules,  72-74,  91,  550-551. 
Involutions:  Self-inverse  permutations,  266, 
353-354,  438,  632,  713,  718-720,  768. 
Irredundant  DNFs,  95,  538,  541. 

Irwin,  Joseph  Oscar,  765. 

Isaak,  Garth  Timothy,  701,  722. 

Ising,  Ernst,  configurations,  380,  385,  726. 
Islamic  mathematics,  493,  499,  516. 

Isolated  vertices:  Vertices  of  degree  0,  25, 
231-232,522,525,642. 

Isometric:  Distance-preserving,  90,  584. 

subgraphs,  90,  91,  551. 

Isomorphic  graphs,  13-15,  28,  39,  532-533. 
directed,  18. 

Isomorphic  Gray  cycles,  314. 

Isomorphism  of  BDDs,  259. 

Isotone  functions,  537. 

Isozaki,  Hideki  669. 

Isthmuses,  see  Bridges  of  a graph. 

Istrate,  Gabriel,  543. 

ITE,  see  If-then-else  function. 

Iteration  of  functions,  312,  683. 

Iteration  versus  recursion,  366-368,  383. 
Ivanescu,  Petru  Ladislav  (=  Hammer, 

Peter  Leslie),  120,  258,  541. 

Ivanyi,  Antal  Miklos,  701. 

Ives,  Frederick  Malcolm,  348. 

Izquierdo,  Sebastian,  498-499,  512. 

J (all-ones  matrix),  26,  27,  526. 

Jn  (complete  digraph  of  order  n), 

18,  522,  524. 

J(x;  /)  function,  262,  635,  642. 

Jackson,  Bradley  Warren,  354,  746. 

JACM:  Journal  of  the  ACM,  a publication 
of  the  Association  for  Computing 
Machinery  since  1954. 

Jacobi,  Carl  Gustav  Jacob,  396,  410. 
symbol,  752. 

Jaenisch,  Carl  Friedrich  Andreevitch  de 
(HHHmi,,  Kapjrb  AHupeoBU'rb),  672. 
James,  Henry,  xiii. 

Jaillet,  Christophe  Andre  Georges,  2. 

Jain,  Jawahar  (sraif^tr),  630. 
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Janey,  Nicolas,  811. 

Janson,  Carl  Svante,  46,  533. 

Japanese  mathematics,  492,  503-505, 

513,  679. 

Jardine,  Nicholas,  604. 

Jeffrey,  David  John,  773. 

Jelinek,  Frederick,  677. 

Jenkyns,  Thomas  Arnold,  365. 

Jeong,  Seh-Woong  p§Aj]-§-),  626. 

Jesus  of  Nazareth,  son  of  Joseph 
(nra  p iop  p in vu,  TnooO? 
ulo?  too  Tcooficp  6 ano  NafopeO),  491. 

Jevons,  William  Stanley,  48,  51,  535. 

Jewish  mathematics,  490,  497,  813. 

Jha,  Pranava  Kumar  (5TW  ' 3TT),  69. 

Jiang,  Ming  (ivBJ]),  339. 

Johnson,  Allan  William,  Jr.,  707. 

Johnson,  David  Stifler,  604,  833. 

Johnson,  Samuel,  v,  xiv. 

Johnson,  Selmer  Martin,  346. 

Joichi,  James  Tomei  (MTfOKHfj),  749,  769. 

Join  of  graphs,  26  -27,  483,  530. 

Join  operation  (/  U g),  273,  275-278. 

Joint  partitions,  409. 

Joke,  666. 

Jolivald,  Philippe  (=  Paul  de  Hijo),  745. 

Jordan,  Marie  Ennemond  Camille,  curve 
theorem,  176. 

Jump-down,  242-243,  269. 

Jump-up,  242-243,  269. 

Junction  function  ( J(x;f )),  262,  635,  642. 

Just,  Winfried,  560. 

Juxtaposition  of  forests,  472. 

Juxtaposition  of  graphs,  26,  483,  see 
Direct  sum  of  graphs. 

fc-ary  trees,  124. 

fc-colorable  graphs  or  hypergraphs,  17, 

32,  35,  42,  44,  831. 

fc-edge-colorable  graphs,  26,  42,  44. 

fc-in-a-row  function,  127. 

fc-partite  graphs  or  hypergraphs,  17, 

32,  35,  42,  44. 

complete,  17,  26-27,  40,  44. 

Kg,:  Kilomems  = thousands  of  memory 
accesses,  707. 

Kn  (complete  graph  of  order  n),  13, 

26-27,  41-43,  525. 

(transitive  tournament  of  order  n), 

18,  27,  40,  41. 

Kn  ' (complete  r-uniform  hypergraph),  32. 

K\t 3 (claw  graph),  522. 

^3,3  (utilities  graph),  17,  39,  42,  521. 

Km,n  (complete  bipartite  graph),  17,  26, 
39,  42,  133,  497,  522,  580,  806. 

Km,n  (complete  r-uniform  bipartite 
hypergraph),  44. 

Knit...tnk  (complete  k- partite  graph), 

17,  26-27,  40,  44. 


Kaas,  Robert,  164. 

Kaasila,  Sampo  Juhani,  615. 

Kabbalah,  490,  497,  813. 

Kahan,  Steven  Jay,  702,  707. 

Kak,  Subhash  Chandra  (1JRTT 
^TRT),  489. 

Kameda,  Tiko  (=  Tsunehiko)  (H,BE|tjI3>), 
546,  555. 

Kaneda,  Takayuki  ( ife  E3  ra  $ ) , 647. 

Kaplansky,  Irving,  509. 

Kapoor,  Sanjiv  (MuTlq  diV  ),  804. 

Karnaugh,  Maurice,  309. 

Karonski,  Michal,  533. 

Karp,  Richard  Manning,  121. 

Karpiriski  (=  Karpinski),  Marek 
Mieczyslaw,  546. 

Katajainen,  Jyrki  Juhani,  167. 

Katona,  Gyula  (Optimalis  Halmaz), 

373,  801. 

Kauffman,  Stuart  Alan,  78. 

Kautz,  William  Hall,  556. 

Kavut,  Selguk,  561. 

Ke  Zhao  (=  Chao  Ko,  ^SJQ),  555. 

Kedara  Bhatta  (%3TTRf).  488,  507. 

Keil,  Heinrich,  490. 

Keister,  William,  115,  680. 

Kelly,  Patrick  Arthur,  673. 

Kelly,  Paul  Joseph,  528. 

Kelmans,  Alexander  Kolmanovich 

(KejibManc,  AjieKcaHnp  KojibMaHOBnu), 
808. 

Kemp,  Rainer,  703,  795,  796. 

Kempe,  Alfred  Bray,  521. 

Kent  Treble  Bob  Major,  320. 

Kerbosch,  Joseph  (=  Joep)  Auguste 
Gerard  Marie,  604. 

Kernels  of  a digraph,  259,  831. 

Kernels  of  a graph  (maximal  independent 
sets),  34,  44,  195,  208-211,  231-232, 

249,  250,  258-260,  269,  277,  604, 

671,  673,  831. 

Keyboard,  364,  384. 

Khachiyan,  Leonid  Genrikhovich  (XaunnH, 
JleoHHA  reHpHXOBHu),  537-538,  674. 

Khrapchenko,  Valerii  Mikhailovich 

(XpanueHKO,  Bajiepufi  MnxaiijTOBHu), 
568. 

Kiefer,  Jack  Carl,  692. 

Kilomems  (K/t):  One  thousand  memory 
accesses,  333. 

Kimono,  504. 

King,  Andrew  Douglas,  719. 

King  moves  on  a chessboard,  43. 

King-neighbors,  172,  197,  611-612. 

King  Wen  of  Chou  (#H  = JIJjtiE), 
486-487,  512. 

King’s  tours,  276. 

Kingwise  connected  components,  173-175, 
197-198,  609. 
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Kingwise  torus,  44. 

Kircher,  Athanasius,  490,  491,  497,  512,  812. 

Kirchhoff,  Gustav  Robert,  law,  403. 

Kirkman,  Thomas  Penyngton,  15. 

Kirsch,  Russell  Andrew,  172,  197. 

Kirschenhofer,  Peter,  796. 

Kiss,  Stephen  Anthony,  548. 

Kitaev,  Sergey  Vladimirovich  (KHTaeB, 
Cepreii  B jiaanMHpoBHu) , 762. 

Klavzar,  Sandi,  28,  69. 

Kleber,  Michael  Steven,  759. 

Klee,  Victor  La  Rue,  Jr.,  ix. 

Klein,  Peter,  127. 

Kleine  Biining,  Hans  Gerhard,  544-546. 

Kleitman,  Daniel  J (Isaiah  Solomon),  459, 
480,  528,  550,  756,  801. 

Kleppis,  Gregor  (=  Kleppisius, 

Gregorius),  512. 

Klimko,  Eugene  Martin,  747. 

Kline,  John  Robert,  548. 

Kliigel,  Georg  Simon,  331,  508. 

Knapsack  problem,  75,  361,  832. 

Knight  moves  on  a chessboard,  15,  25,  195. 

Knight’s  tour,  northeasterly,  352. 

Knobloch,  Eberhard  Heinrich,  505,  506. 

Knodel,  Walter,  604. 

Knopp,  Marvin  Isadore,  751. 

Knott,  Gary  Don,  511. 

Knuth,  Donald  Ervin  ii,  iv,  ix, 

xvi,  7,  9-10,  12,  32,  62,  105,  115,  154, 
254,  457,  517,  537,  549,  569,  588, 

589,  605,  611,  615,  616,  618,  633-635, 
649-650,  663-666,  670,  677,  696,  726, 
773,  800,  803,  804,  829,  842. 

Knuth,  John  Martin  (jfi<HS),  see  Truth. 

Ko,  Chao  (=  Ke  Zhao,  M'S),  389,  555. 

Koch,  John  Allen,  17. 

Koda,  Yasunori  (JtEHft S),  300-301. 

Kogan,  Alexander  (KoraH,  AjieKcaHAp 
lOpteBHu),  120,  541. 

Kolibiar,  Milan,  548. 

Komlos,  Janos,  91. 

Kompel’makher,  Vladimir  Leont’evich 
(KoMnejiBMaxep,  BjiauMMup 
JleoHTbeBHu),  350. 

Konig,  Denes,  17. 

Konvalina,  John,  560. 

Kordemsky,  Boris  Anastas’evich 

(KopfleMCKHit,  BopHC  AuaCTRCbCBU'l), 
780. 

Korobkov,  Vitaly  Konstantinovich 
(Kopo6noB,  BirrajiHii 
KoHCTaHTHHOBHu),  461. 

Korsh,  James  F.,  444,  473,  726,  785,  798. 

Korshunov,  Aleksey  Dmitrievich 

(KopinyHOB,  AjieKceu  ^MHTpueBHu), 

558. 

Kostochka,  Alexandr  Vasilievich  (KocTouxa, 
AneKcaHAp  BacujibeBHu),  809. 


Kotani,  Yoshiyuki  ),  664. 

Kowalewski,  Arnold,  525. 

Krajecki,  Michael,  2. 

Kramp,  Christian,  419. 

Kratochvll,  Jan,  46,  533. 

Krause,  Karl  Christian  Friedrich,  704. 
Kreher,  Donald  Lawson,  511. 

Kreweras,  Germain,  786,  787,  808. 
lattice,  473-476,  788. 

Krichevsky,  Rafail  Evseevich  (Kphmcbckhh, 
PacfjaHji  EBceeBuu),  570. 

Krom,  Melven  Robert,  62,  545. 
clauses,  57,  72,  85,  87,  575. 
functions,  60,  72,  79,  81,  95,  266, 
see  also  2CNF. 

satisfiability,  57,  60-62,  72,  86. 

Kronecker,  Leopold,  product,  526. 

Kronk,  Hudson  Van  Etten,  809. 

Kruskal,  Joseph  Bernard,  Jr.,  373-374. 
function  ret,  373-375,  385-388,  477,  739. 
function  \t,  374-375,  386-387. 
Kruskal-Katona  theorem,  373. 

Kschischang,  Frank  Robert,  677. 

Kubicka,  Ewa,  802. 

Kuhn,  Markus  Gunther,  619. 

Kumar,  Panganamala  Vijay 
has &j  ) , 681. 

Kung,  Hsiang  Tsung  (JL#Jt),  568. 

Kusuba,  Takanori  (ffi||P§tj5),  500. 

L(f) , 99,  see  Length  of  a Boolean  function. 
L(G)  (line  graph  of  G),  26,  35,  42,  483, 

522,  526,  529. 

Labeled  graphs,  15,  532. 

Labeled  objects,  390,  432,  774. 

Labelle,  Gilbert,  779. 

Labels  of  graph  vertices,  67-74,  90. 

Laborde,  Jean-Marie,  540. 

LADDERS  program,  32. 

Ladner,  Richard  Emil,  127,  128,  570. 
Lafferty,  John  David,  677. 

Lagrange  (=  de  la  Grange),  Joseph  Louis, 
Comte,  inversion  formula,  773. 
Laguerre,  Edmond  Nicolas,  777. 

Lahdesmaki,  Harri,  559. 

Lakhtakia,  Akhlesh  (3ll*MJ(|  H^dRi^l), 

586. 

Lakser,  Harry,  788. 

Lambert,  Johann  Heinrich,  507. 

Lamport,  Leslie  B.,  88,  151,  152,  192. 
Landau,  Hyman  Garshin,  413,  522. 

Lander,  Leon  Joseph,  587-588. 

Langdon,  Glen  George,  Jr.,  337,  341,  352. 
Langford,  Charles  Dudley,  7,  9,  515. 
pairs,  1-3,  8,  36,  274. 
triples,  36. 

Lapko,  Olga  Georgievna  (JIanKO,  Ojibra 
reoprueBHa),  883. 
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Laplace  (=  de  la  Place),  Pierre  Simon, 
Marquis  de,  421. 

Large  megabytes:  220  bytes,  588. 

Large  numbers,  names  for,  707. 

Largest  element  of  a set,  143. 

Larrivee,  Jules  Alphonse,  286. 

Larvala,  Samuli  Kristian,  594. 

Las  Vegas  hotels,  60-62,  86. 

Latch  gates,  32. 

Late  neighbors,  69. 

Latin-1  supplement  to  ASCII,  597. 

Latin  poetry,  500-503,  512. 

Latin  squares,  3-8,  36-38,  516-518,  529. 
Lattice  paths,  356-357,  379,  395. 

Lattices,  see  also  Majorization  lattices, 
distributive,  92,  551,  554. 
of  partitions,  412-413,  432-433. 
of  trees,  473-476. 

Law  of  large  numbers,  438. 

Lawrence,  George  Melvin,  295,  689. 

Lawrie,  Duncan  Hamish,  592. 

LDTU  (load  tetra  unsigned),  616. 

Le  Borgne,  Yvan  Franccoise  Andre,  805. 

Le  Corre,  Jean  Pierre,  145. 

Leader,  Imre,  560. 

Leading  bit  of  a product,  247,  272. 

Leaf  nodes,  466,  473,  475,  786. 

Leahy,  Francis  Theodore,  Jr.  (=  Ted),  514. 
Leap  year,  599. 

Least  common  ancestors,  see  Nearest 
common  ancestors. 

Least  recently  used  replacement,  771. 

Least  significant  1 bit  (2px),  140-141,  153. 
Least  upper  bounds,  see  Lattices. 

Leek,  Uwe,  745. 

Lee,  Chester  Chi  Yuan  (^^tu)  = Chi 
Lee  ($}$),  257,  680. 
distance,  309. 
weight,  309. 

Lee,  Gilbert  Ching  Jye  ($^T^),  572. 

Lee,  Ruby  Bei-Loh  ($(ftR),  151,  189,  595. 
Leeuwen,  Marcus  Aurelius  Augustinus 
van,  ix,  750,  768. 

Left-child/right-sibling  links,  441,  445, 

447,  470,  667,  678,  727,  782,  783, 
786-787,  809. 

Left  complementation  (L),  49,  80. 

Left  path  length,  794. 

Left  projection  (L),  49,  63,  80,  534-535. 
Left-sibling/right-child  links,  445,  472, 

476,  783,  786-787. 

Left-to-right  minima,  432,  606. 

Leftmost  bits,  142-143,  154,  187. 

Legendre  (=  Le  Gendre),  Adrien  Marie,  827. 
Legitimate  lattices  of  functions,  132,  133. 
Lehmer,  Derrick  Henry,  39,  136,  320,  359, 
384,  410,  510,  734,  752. 

Lehner,  Joseph,  400,  411. 


Leibniz,  Gottfried  Wilhelm,  Freiherr  von, 
487,  501-502,  505-506,  512,  814. 

Leighton,  Robert  Eric,  519. 

Leiserson,  Charles  Eric,  142,  187. 

Lempel,  Abraham  (73Db  omiN),  305. 

Lenfant,  Jacques,  591. 

Length  of  a Boolean  function,  99, 

103,  125,  129. 

statistics,  101,  105,  563-564,  581. 

Lenin,  Vladimir  Ilyich  (JleHHH, 

BjiaAHMHp  HnbHu),  86,  365. 

Lenstra,  Hendrik  Willem,  Jr.,  184,  584. 

Leonardo  di  ser  Piero  da  Vinci,  9,  24. 

Leske,  Marie,  810. 

Lettmann,  Theodor  August,  543,  545. 

Level  coordinates  c*,,  442,  444,  452,  461-462, 
471,  473,  474,  477,  480,  786. 

Level  order  of  nodes,  748. 

Level  set  method,  177-180,  198. 

Levialdi  Ghiron,  Stefano,  174-175. 

Levy,  Paul,  454. 

Lewis,  Harry  Roy,  544. 

Lexicographic  generation,  319,  330,  333,  334, 
344-345,  358-361,  370-373,  379-381, 
383,  385,  391-392,  394,  407-408,  416, 
429-431,  433,  508,  719-720,  735,  755. 

Lexicographic  order,  3,  38,  41,  75,  85,  86, 
150,  200,  214,  282-283,  305,  309,  319, 
326,  442-443,  459,  461,  471,  477,  480, 
487,  491,  493,  494,  499,  505,  506,  508, 
509,  512,  546,  623,  663,  685,  813,  816. 

Lexicographic  product  of  graphs,  28, 

42-43,  483,  526. 

Lexicographic  successor,  320. 

Lexicographically  largest  solution,  259. 

Lexicographically  mth  smallest 
solution,  280. 

Lexicographically  smallest  solution,  206,  257. 

lg  (base  2 logarithm),  see  also  Binary 
logsize  function. 

Li,  Gang  (=  Kenny)  (^#3),  696,  802. 

Li2  (dilogarithm),  410,  751,  754. 

Liang,  Franklin  Mark,  105. 

Liaw,  Heh-Tyan  (HTH1  tfl),  638. 

Lieves,  310. 

Life  game,  172,  197,  269-270. 

Lights  Out  puzzle,  617. 

Lillywaite,  Peregrine,  10. 

Limericks,  436. 

Lin,  Bill  Chi  Wah  (#;&^  = 258. 

Lin,  Chen-Shang  (i^H^),  638. 

Lindstrom,  Bernt  Lennart  Daniel,  378-379, 
388,  554,  744. 

Line  graph  of  a graph,  26,  35,  42,  483, 

522,  526,  529. 

Linear  block  codes,  279. 

Linear  Boolean  programming,  206, 

209-211,  258,  261,  663. 
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Linear  extensions,  342,  see  Topological 
sorting. 

Linear  inequalities,  258,  551-553. 

Linear  ordering,  542. 

Linear  polynomials,  52. 

Linear  programming,  vii,  92. 

Linear  subgraphs,  522. 

Linear  time,  57,  82. 

Linear  transformations,  639. 

Linked  allocation,  21,  308,  309, 

333-334,  719. 

Linked  binary  trees,  generation  of, 

444-449,  473. 
random,  456-457,  478. 

Linked  lists,  381,  407,  432,  603,  631, 
647-648,  652,  727. 

Linusson,  Hans  Svante,  743. 

Lipschutz,  Seymour  Saul,  726. 

Lipski.  Witold,  Jr.,  709. 

Liskovets,  Valery  Anisimovich  ( JlucKOBeq, 
BajrepnH  Ahhchmobhu),  350. 

Listing  versus  generating,  281. 

Literals,  53,  212,  634. 

Little-endian  convention,  138-140,  144, 

152,  160,  587,  588. 

Littlewood,  Dudley  Ernest,  757. 

Littlewood,  John  Edensor,  586,  758,  801. 

Liu,  Chao-Ning  ),  362. 

Lloyd,  Edward  Keith,  15,  779. 

Llull,  Ramon  (=  Lullus,  Raimundus), 
494-497,  512. 

LO  field  in  a decision  diagram,  202-203,  216, 
220-221,  226,  241,  249,  250,  259,  263. 

Lo  Shu  magic  square,  573. 

Lobachevsky,  Nikolai  Ivanovich 
(JIo6aueBCKiii,  HnKojiafi 
IlBaHOBHUt),  168. 

Lobbing,  Martin,  238,  243,  269. 

L0C  (memory  location),  22. 

Local  optimizations,  121. 

Locality  of  reference,  222,  263. 

Locally  optimal  solutions,  34-35. 

Logarithm,  as  a multivalued  function, 

422,  773. 

Logic,  48-51,  123-124,  132. 

Loizou,  Georghios  (=  George;  AoiCou, 

recnpyioj),  747,  748. 

London,  John,  496. 

Long  distributive  law,  65,  67,  89. 

Longest  paths,  276,  832. 

Loony  Loop  puzzle,  316. 

Loopless  generation,  289-292,  300,  308,  309, 
316,  346,  362,  379,  381,  382,  470,  680, 
706,  718,  719,  729,  733-735,  747. 

Loops  from  a vertex  to  itself,  13,  18, 

19,  41,  462,  465,  529. 

Lorenz,  Max  Otto,  757. 

Louchard,  Guy,  454. 


Loukakis,  Emmanuel  (Aouxaxr);, 

Mav<iXric),  604. 

Lovasz,  Laszlo,  386,  739. 

Lovejoy,  Jeremy  Kenneth,  749. 

Lower  bounds  on  broadword  computation, 
155-159,  193-194,  616. 

Lower  bounds  on  combinational  complexity, 
103-104,  109-112,  122-124,  131-132. 
Lowercase  letters,  191. 

Lowest  common  ancestors,  see  Nearest 
common  ancestors. 

Loyd,  Samuel,  xiv,  588,  672. 

Loyd,  Walter  (=  “Sam  Loyd,  Jr.”),  1. 

Lu,  Yuan  (a®),,  630. 

Lucas,  Frangois  Edouard  Anatole,  745. 

numbers,  623. 

Lucas,  Joan  Marie,  447. 

Lucky  nodes,  483-484. 

Luczak  (=  Luczak),  Malwina  Joanna,  798. 
Ludus  Clericalis,  493,  512. 

Lukasiewicz,  Jan,  163,  195. 

Liineburg,  Heinz,  727. 

Lunnon,  William  Frederick,  766. 

Lupanov,  Oleg  Borisovich  (JlynaHOB, 

Oner  Eophcobhh),  110,  112,  129, 

565,  570,  573. 

Lutz,  Rudiger  Karl  (=  Rudi),  613. 

Lydgate,  John,  494. 

Lynch,  William  Charles,  143. 

Lyndon,  Roger  Conant,  306. 

words,  306,  see  Prime  strings. 

Lynn,  Richard  John,  487. 

m-ary  digit:  An  integer  between  0 and 
m — 1,  inclusive,  282,  302. 

Mfi:  Megamems  = millions  of  memory 
accesses,  232,  245. 

M(g)  (the  number  of  arcs  in  the  SGB 
graph  g),  22. 

M(n)  (binary  majorization  lattice),  554. 
Mm(x\y)  (2m-way  multiplexer),  214,  235, 
239,  243,  263,  266,  267,  269,  272, 

627,  630,  638,  647,  659. 

Mn  (middle  binomial  coefficient), 

457,  479-480. 
mxn  cylinders,  28,  41. 
mxn  grids,  28,  41. 
mxn  rook  graphs,  26,  41. 
mxn  toruses,  x,  28,  41. 
directed,  41. 

Macaulay,  Francis  Sowerby,  373,  388,  738. 

function  m , 374-375,  386-387,  739. 
Macchiarulo,  Luca,  622. 

MacDonald,  Peter  S.,  346. 

MacMahon,  Percy  Alexander,  414,  415, 

429,  792,  797. 

MacNeish,  Harris  Franklin,  5. 
Macro-processor,  291. 

Madre,  Jean  Christophe,  625,  635,  669,  674. 
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Maghout,  Khaled  (LjiL  jlU.),  258. 

Magic  Fifteen,  129. 

Magic  masks  (pk  and  pa.k),  141,  143-145, 
148,  154,  169,  186,  582,  585,  587, 
589-597,  600,  601,  608,  616. 

Magic  squares,  36,  573. 

Magic  trick,  440. 

Maiorana,  James  Anthony,  306,  307. 

Majority  element,  46. 

Majority  functions,  63,  68,  104,  159, 

519,  see  Medians. 

Majority  law,  65,  89. 

Majority  of  odd,  see  Median  of  odd. 

Majorization,  757-758. 

Majorization  lattices,  of  binary  vectors, 
92-94. 

of  n-tuples,  30,  528,  554. 
of  partitions,  412-414,  763. 

Makino,  Kazuhisa  (4ftffifOA),  555,  654. 

Malfatti,  Giovanni  Francesco  Giuseppe,  752. 

Malgouyres,  Remy,  611. 

Manchester  Mark  I computer,  134. 

Mann,  Henry  Berthold,  516. 

Mann,  William  Fredrick,  609. 

Mantel,  Willem,  303. 

Mapping  modules,  190,  592. 

Mapping  networks,  190,  592. 

Mapping  three  items  into  two-bit  codes, 
160-163,  195. 

Mappings  of  bits,  149,  190,  592. 

Marceau  (=  Mangel),  Marcel,  817. 

Marcisova,  Tamara,  548. 

Marckert,  Jean-Frangois,  454. 

Margenstern,  Maurice,  168. 

Mark  II  computer  (Manchester/Ferranti), 
134. 

Markov  (=  Markoff),  Andrei  Andreevich 
(MapKOB,  AHApeft  AHApeeBHu),  the 
elder,  process,  487. 

Markowsky,  George,  540,  557,  788. 

Marshall,  Albert  Waldron,  758. 

Martin,  Monroe  Harnish,  142,  307-308. 

Martinelli,  Andres,  646. 

Maruoka,  Akira  (^L|B]$),  248,  271. 

Marx,  Adolph  (=  Arthur  = Harpo),  817. 

Mary,  Saint  ("Ayia  Maptoc  ©eoxoxo?, 
navayta,  napGevog),  500-501. 

Mask:  A bit  pattern  with  Is  in  key 
positions,  141,  144-145,  148-150, 

152,  181,  182,  201. 

Masked  integers,  see  Scattered  arithmetic. 

Masking:  ANDing  with  a mask,  163. 

Master  profile  chart,  239,  240,  245, 

267-268,  643. 

Master  2-profile  chart,  272. 

Matchings  in  a graph:  Sets  of  disjoint 
edges,  35,  44,  831. 

perfect,  119,  252,  313-314,  343,  353,  442, 
444,  471,  686-687,  718-719,  768,  831. 


MATE  (the  converse  arc),  21-22,  464. 

Mate  of  a bit  string,  479. 

Mates  in  a Boolean  chain,  131. 

Math.  Comp.:  Mathematics  of  Computation 
(I960-),  a publication  of  the  American 
Mathematical  Society  since  1965; 
founded  by  the  National  Research 
Council  of  the  National  Academy 
of  Sciences  under  the  original  title 
Mathematical  Tables  and  Other  Aids 
to  Computation  (1943-1959). 

Mathews,  Edwin  Lee  (=  41),  278. 

Mathon,  Rudolf  Anton,  516-517. 

Matrices  of  0s  and  Is,  199-202,  230,  238, 
251,  264,  267,  269-270,  279,  623,  632, 
643,  665,  see  also  Bitmaps. 

multiplication  of,  182-183,  188, 

230,  264,  619. 

transposing,  33,  147,  188,  199,  201, 
591-592. 

triangularizing,  200,  725. 

Matrix  (Bush),  Irving  Joshua,  702. 

Matrix  multiplication,  20,  182-183,  188, 

230,  264,  526,  580,  619,  731. 

Matrix  transposition,  147,  188,  199, 

201,  591  592. 

Matrix  tree  theorem,  481,  482,  723,  806. 

Matroids,  674. 

Matsumoto,  Makoto  (^^jM-),  740-741. 

Matsunaga,  Yoshisuke  419,  504. 

max  (maximum)  function,  63-64,  134, 

163,  192,  213. 

Maximal  chains,  474,  476. 

Maximal  cliques,  44,  194-195,  259,  277. 

Maximal  elements  (/I),  276,  717. 

Maximal  independent  sets,  see  Kernels 
of  a graph. 

Maximal  induced  bipartite  subgraphs,  277. 

Maximal  intersecting  families,  88. 

Maximal  planar  graphs,  39. 

Maximal  proper  subsets,  190. 

Maximal  subcubes,  54-55,  82-83. 

Maximal  versus  maximum,  34-35,  671. 

Maximization,  206,  209-211,  258, 

261,  279,  663. 

Maximum  independent  sets,  34-35,  831. 

Maximum  likelihood,  279. 

Maximum  operator  (max(x,  y)  or 

max{x,y}),  63,  64,  134,  163,  192,  213. 

Maxterms,  see  Minclauses. 

Maybe,  163. 

Mayeda,  Wataru  (fuffll®),  511. 

Mayr,  Ernst  Wilhelm,  551. 

McCarthy,  David,  365. 

McClintock,  William  Edward,  295. 

McCluskey,  Edward  Joseph,  Jr.,  55. 

McCranie,  Judson  Shasta,  605. 

McCravy,  Edwin  Parker,  Jr.,  346. 

McCulloch,  Warren  Sturgis,  75. 
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McCune,  William  Walker,  Jr.,  548. 

McKay,  Brendan  Damien,  517,  714,  745. 
McKay,  John  Keith  Stuart,  391. 

McKellar,  Archie  Charles,  118,  120,  131. 
McLean,  Iain  Sinclair,  496. 

McManus,  Christopher  DeCormis,  38. 
McMillan,  Kenneth  Lauchlin,  627. 
McMullen,  Curtis  Tracy,  661. 

Mean  values,  276,  414,  438. 

Median  algebras,  64-67,  89. 

Median  chains,  133. 

Median  expansion  formula,  87. 

Median  Genocchi  numbers,  655. 

Median  graphs,  67-74,  90,  550. 

Median  labels,  67-74,  549. 

Median  sets,  72-74,  91. 

Median  words,  275. 

Medians  (( xyz )),  x,  62-74,  87-91,  102, 

125,  133,  153,  202-205,  212,  229,  235, 
255,  263,  264,  311,  598,  637. 
bitwise,  67,  71,  72,  91,  549. 
of  any  odd  number,  64,  75-76,  91-92,  94. 
of  five,  64,  68,  71,  76,  77,  87,  91. 
of  medians,  268,  644. 
of  seven,  133. 

Medicine,  492,  495. 

Meet  operation  (/  n g),  273,  275, 

276,  671-673. 

Megamems  (M/a):  One  million  memory 
accesses,  232,  245. 

Meinel,  Christoph,  639. 

Meifiner,  Otto,  760. 

Melding  operation  (/  o g),  218-219,  225, 
231,  242,  262-263,  630,  648-649. 
Mellin,  Robert  Hjalmar,  transforms, 

396,  410. 

Melodies,  491,  498,  505. 

Memo  cache,  226-230,  232-233,  256,  264, 
265,  637,  647,  659-660,  670. 
Memoization  technique,  226,  233. 

Mems  (/a):  Memory  accesses  (usually 
to  64-bit  numbers),  2,  6,  224,  403, 
468-469,  476,  786,  802,  803. 

Menon,  Vairelil  Vishwanath  (oaojoraeiloS 
aflcrajcTOio  Gia(Btr)on3) , 525. 

Merge  sorting,  181. 

Meringer,  Markus  Reinhard,  530. 

Mero,  Laszlo,  617. 

Mersenne,  Marin,  491,  505,  512. 
METRFONT,  12,  615,  883. 

METRPOST,  883. 

Meters,  poetic,  487-490,  492,  500-503, 

508,  512-513. 

Metrical  feet,  489,  501,  512-513. 

Metropolis,  Nicholas  Constantine 

(MrvtpoTtoXTic,  NixoXao?  Kcnvaxoomvoo), 
557. 

mex  (minimal  excludant)  function,  184. 


Meyer,  Albert  Ronald  da  Silva,  viii, 

123,  124,  579. 

Meyerowitz,  Aaron  David,  89,  550. 

Meynert,  Alison,  517. 

Mezei,  Jorge  Esteban  (=  Gyorgy  Istvan),  92. 
Middle  bit  of  a product,  229,  247-249, 
271-272. 

Middle  levels  conjecture,  735. 

Miers,  Charles  Robert,  700. 

Miiller,  Henry  Sedwick,  64. 

Mikado  pattern,  618. 

Mikami,  Yoshio  (H_hti;£:),  492. 
miles  graphs,  31,  44. 

Mileto,  Franco,  541. 

Military  sayings,  281. 

Miller,  Donald  John,  527. 

Miller,  Jeffrey  Charles  Percy,  143. 

Miller,  Joan  Elizabeth,  362,  381. 

Mills,  Burton  E.,  536,  539. 

Milne,  Stephen  Carl,  433. 

Milnor,  John  Willard,  88,  547. 

Miltersen,  Peter  Bro,  159. 

min  (minimum)  function,  63-64,  134, 

163,  192. 

Min-plus  algebra,  625. 

Min-plus  matrix  multiplication,  731. 

Minato,  Shin-ichi  ),  249,  258,  278, 

662,  669,  673,  675,  677. 

Minclauses,  53. 

Minimal  dominating  sets,  258,  277. 

Minimal  elements  (/-*-),  276,  717. 

Minimal  excludant,  184. 

Minimal  hitting  sets  of  a family  (/#), 

276,  671,  674. 

Minimal  partitions,  412. 

Minimal  solutions,  255. 

Minimal  versus  minimum,  34-35. 

Minimal  vertex  covers,  259,  537. 

Minimization  reduced  to  maximization,  260. 
Minimum  element  in  subarray,  196. 
Minimum-memory  evaluation,  101-103, 

106,  125,  126,  566-567. 

Minimum  operator  (min (x,y)  or  min{a;,r/}), 
63,  64,  134,  163,  192. 

Minimum  spanning  trees,  260,  277,  832. 
Minimum  vertex  covers,  34-35,  44,  831. 
Minnick,  Robert  Charles,  77,  552. 

Minsky,  Marvin  Lee,  198. 

Minterms,  52-54,  109,  111,  126,  253, 

260,  536. 

MIP-years,  2. 

Mirror  images,  444,  471,  782,  786,  799. 

Mirror  pairs,  38. 

Misiurewicz,  Michal,  760. 

Misra,  Jayadev  (QQ6Q0  q«j),  534,  680. 

Missing  subset  sum,  202. 

Mitchell,  Christopher  John,  305. 
Mittag-Leffler,  Magnus  Gosta 
(=  Gustaf),  508. 
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Mixed-radix  majorization  lattices,  554. 

Mixed-radix  number  systems,  192,  282, 
299-301,  315,  335,  345,  555,  693, 

695,  703,  760,  813. 

MMIX  computer,  iv,  viii,  41,  137,  139-142, 
144,  151,  152,  160,  180-183,  186,  187, 
189,  191,  192,  194,  199,  201,  202,  216, 
339-341,  352,  384,  583,  588,  590,  596, 
597,  598,  606,  616,  679. 

MMIXAL  assembly  language,  41. 

Mnemonics,  526. 

Mod  2 canonical  form,  see  Multilinear 
representation  of  a Boolean  function. 

Mod-5  arithmetic,  192. 

mod  (remainder)  function,  136. 
for  families  of  sets,  273,  661,  666. 

Modal  logic,  163,  195. 

Modular  arithmetic,  207,  515. 

Modular  Gray  codes  for  tuples,  299-300, 

315,  481. 
decimal,  299. 

m-ary,  304,  682,  690-693,  696. 
mixed-radix,  692-693,  714. 
quaternary,  681,  688. 
ternary,  685,  690. 

Modular  universal  cycles,  722. 

Modules  in  a network,  214-215,  262. 

modulo  77  (equivalence  modulo  a 
partition),  416. 

Mohanram,  Kartik  (tn8 8 zh/x&jsxD),  219. 

Mohar,  Bojan,  808. 

Moivre,  Abraham  de,  451,  506-508,  513. 

Moments  of  a distribution,  434,  778. 

Mona  Lisa,  9,  24,  31. 

Monadic  logic:  Logic  with  only  unary 
operators,  123-124,  132. 

mone,  586,  see  —1. 

Monic  polynomial,  681. 

Monomial  symmetric  functions,  506,  757. 

Monomials,  388. 

Monominoes,  252,  274. 

Monotone  Boolean  chains,  125,  132-133. 

Monotone  Boolean  functions,  55,  63,  79, 

81,  82,  85,  87,  95,  202,  231,  255,  256, 
258,  263,  265,  270-271,  277,  278, 

388,  460,  480,  536-537. 
computing  CNF  from  DNF,  537-538. 
decreasing,  658. 

prime  implicants  of,  255-256,  631-632, 
653-654,  674. 

self-dual,  63-64,  70,  79,  87-89,  92-93,  133, 
256,  263,  268,  550,  631,  647. 
shellability  of,  84-85,  537,  541,  552. 
threshold,  75-76,  92. 

Monotone  complexity,  106,  125,  132-133. 

Monotone-function  function  (/rn),  223—224, 
228,  263,  631. 

Monotonic  binary  Gray  codes,  295-298,  315. 

Monotonic  portions  of  curves,  177-179,  198. 


Monte  Carlo  estimates,  712. 

Montmort,  Pierre  Remond  de,  506,  513. 
Monus  operation  (x  — y = max{0,x— y}), 
x,  49,  84,  152,  156,  193,  278,  550, 

593,  608,  735. 

Moody,  John  Kenneth  Montague 
(=  Ken),  194. 

Moon,  John  Wesley,  531. 

Moore,  Edward  Forrest,  676. 

Moore,  Eliakim  Hastings,  518. 

Moore,  J Strother,  46. 

Moore,  Ronald  Williams,  537. 

Mor,  Moshe  (410  nwo),  720-721,  727. 

M0R  (multiple  or),  144,  151,  182-183,  188, 
201-202,  264,  597,  606,  616,  619. 
Morales,  Linda,  722. 

Morelli,  Luigi,  841. 

Morgenstern,  Oskar,  550,  622. 

Morphic  sequence,  800. 

Morreale,  Eugenio,  539. 

Morris,  Ernest,  322. 

Morris,  Scot  Anderson,  38,  707. 

Morse,  Harold  Calvin  Marston,  623. 

sequence,  209-210,  260. 

Morse,  Samuel  Finley  Breese,  code, 

316,  488,  508,  696. 

Morton,  Guy  Macdonald,  597. 

Moser,  Leo,  425,  531,  770,  772. 

Most  recently  used  replacement,  771. 

Most  significant  1 bit  (2Ax),  134,  143, 
192-194,  600. 

Motzkin,  Theodor  (=  Theodore)  Samuel 
(y>p31»  PN10W  -IVTIN’Jl),  431,  765,  778. 
Moundanos,  Konstantinos  (=  Dinos; 

MouvSavoc,  Kcovctavilvoc;),  630. 
Mountain  passes,  420. 

MP3  (MPEG-1  Audio  Layer  III),  183. 
Muirhead,  Robert  Franklin,  757. 

Mulder,  Henry  Martyn,  548. 

Muller,  David  Eugene,  143,  567. 

Multibyte  encoding,  200-201. 

Multibyte  processing,  151-155,  191-193. 
addition,  151,  192,  599. 
comparison,  152-153. 
max  and  min,  192,  599. 
modulo  5,  192. 
potpourri,  598. 
subtraction,  191,  192,  599. 
Multicombinations:  Combinations  with 
repetitions  permitted,  356-357, 

365,  370-373,  379,  387,  390,  393, 
493-494,  499,  512. 

Multifamily  of  sets,  278. 

Multigraphs,  13,  19-21,  40,  41,  44,  526,  532. 
Multihypergraphs,  529. 

Multilevel  logic  synthesis,  122,  see 
Boolean  chains. 

Multilinear  representation  of  a Boolean 
function,  52,  80,  96,  130,  231,  536, 
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563,  619,  675,  see  also  Reliability 
polynomiais. 

Multinomial  coefficients,  309,  345,  505. 

Multinomial  theorem,  506,  508,  513. 

Multipairs,  19. 

Multipartition  numbers  p(ni, . . . , nfc), 

429,  439. 
table,  778-779. 

Multipartitions:  Partitions  of  a multiset  or 
vector,  429-431,  439-440,  505,  513,  780. 

Multiple  outputs,  107-109,  112-117, 
121-122, 126-130. 

Multiple-precision  arithmetic,  138. 

Multiple-precision  constants,  236,  423, 

658,  818-819. 

Multiplexer,  2-way  (/?  g:  h),  see  If-then-else 
function,  Mux  operation. 

2m-way  (. Mm(x;y )),  109,  127,  131,  214, 
235,  239,  243,  263,  266,  267,  269,  272, 
627,  630,  638,  647,  659. 

Multiplication,  8,  136,  142-143,  154, 

193,  588. 

avoiding,  153,  154,  191,  597-598. 
binary,  228-229,  247-249,  264, 

271-272,  278. 
by  powers  of  2,  135,  588. 
in  Conway’s  field  of  nimbers,  184. 
in  groupoids,  163,  195. 
lower  bounds  for,  154,  158,  194. 
of  0-1  matrices,  188,  see  also  M0R,  MX0R. 
of  permutations,  326. 
of  polynomials  mod  2,  202. 
of  signed  bits,  161-162. 
of  sparse  integers,  278. 

Multiplicative  alphametics,  347. 

Multiprecision  arithmetic,  207. 

Multiset  combinations:  Submultisets 
of  a given  size,  370,  372,  379,  387, 

498,  512,  816. 

Multiset  union  (/  ttl  g),  278. 

Multisets:  Sets  with  repetitions  permitted, 
12,  18,  19,  356,  724. 
combinations  of,  370,  372,  379,  387, 

498,  512,  816. 

permutations  of,  319-321,  342,  345,  351, 
358,  368-369,  383,  384,  395,  491,  500, 
502,  507,  723,  726,  735,  760. 

Mundy,  Peter,  322. 

Munro,  James  Ian,  159. 

Murasaki  Shikibu  (—  Lady  Murasaki, 

503. 

Muroga,  Saburo  (jgJtrJP ) , 77,  538, 

552-554,  559. 

Murphy’s  Law,  529. 

Museum  of  Science  and  Industry,  115. 

Music,  488-491,  497-498,  505. 
notation,  490-491. 
rhythm,  488-490,  498. 

Musical  graph,  44,  45. 


Mutilated  chessboard,  252,  274. 

Mutual  exclusion,  88. 

Mutually  incomparable  sets,  263. 

Mutually  orthogonal  latin  squares,  37-38. 

MUX  (multiplex),  182,  594,  597,  616,  619. 

Mux  (multiplex)  operation,  96,  561,  566, 
568,  569,  see  also  If-then-else  function. 

MUX  subroutine,  229,  232,  264,  272, 

630,  660. 

MX0R  (multiple  xor),  182-183,  188,  201-202, 
264,  352,  583,  598,  619-620. 

Mycroft,  Alan,  152. 

Myers,  Eugene  Wimberly,  Jr.,  804. 

Mynhardt,  Christina  (Kieka)  Magdalena, 
673. 

Myrvold,  Wendy  Joanne,  517,  717. 

n- ary  Boolean  functions,  51-55,  78-79,  95. 

n-ary  strings,  37. 

n-cube:  The  graph  of  n-bit  strings,  adjacent 
when  they  differ  in  only  one  position, 
28,  41,  73,  129,  240,  257,  293,  295, 
313-314,  327-328,  346,  468-469,  483, 
806;  see  also  3-cube,  4-cube,  8-cube, 
bandwidth  of,  315. 
perfect  matchings  of,  313,  686-687. 
subcubes  of,  54,  82-84,  310-311,  535, 

541,  557,  570-571. 

symmetries  of,  327-328,  346,  347,  see 
also  Octahedral  groups. 

n-extension  of  a string,  306. 

n-tuple:  A sequence  or  string  of  length  n, 
281-282,  390. 

N(g)  (the  number  of  vertices  in  the  SGB 
graph  g),  22,  523. 

Nakagawa,  Noriyuki,  511. 

Name  servers,  88. 

NAME(u)  (the  name  of  a vertex),  21. 

NAND  (A),  49-51,  80-81,  104,  563. 

Nanocomputer  simulation,  32. 

Narayana  Pandita,  son  of  Nrsimha 

(=TFTW  Hpu-sn  , HT:),  319,  488, 

491,  499-500,  507,  512,  703. 

Natural  correspondence  between  forests 
and  binary  trees,  275,  441,  472,  see 
Left-child/right-sibling  links. 

Naude,  Philippe,  junior,  395. 

Navigation  piles,  167,  196. 

Near-perfect  combination  generation, 
365-371,  383. 

Near-perfect  Gray  code  for  nested 
parentheses,  446. 

Near-perfect  permutation  generation, 

369,  383. 

Near  trees,  463-468,  481,  803. 

Nearest  common  ancestors,  165-167, 

196,  832. 

Nebesky,  Ladislav,  548. 

Necessity,  in  3-valued  logic,  195. 
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Nechiporuk,  Eduard  Ivanovich 

(HeunnopyK,  Oflyapfl  MBaHOBHu),  573. 

Necklaces,  215. 

Needham,  Noel  Joseph  Terence  Montgomery 
487. 

NEG  (negation),  181,  587,  620. 

Negabinary  number  system,  184,  675. 

Negadecimal  number  system,  169. 

NegaFibonacci  number  system, 

168-171,  196. 

Negation,  135,  184,  195. 

Negative  literals,  277-278. 

Neighboring  vertices,  13. 

Nemeth,  Evelyn  (=  Evi)  Hollister 
Pratt,  689. 

Nemhauser,  George  Lann,  542. 

Nesetril,  Jaroslav,  526. 

Nested  parentheses,  186,  278,  440-446, 
450-453,  455-456,  459,  471-472,  477, 
478,  509-511,  781,  787. 

Nestings  in  a set  partition,  768. 

Netto,  Otto  Erwin  Johannes  Eugen, 

508-510. 

Network  model  of  computation, 

214-215,  262. 

Networks:  Graphs  or  digraphs  together 
with  auxiliary  data,  31-32,  44. 

Neuman,  Frantisek,  484. 

Neumann,  John  von  (=  Margittai  Neumann 
Janos),  550,  622. 

Neural  networks,  75. 

New  England,  210,  255. 

Newbies,  240-242,  648. 

Newline  symbol,  152. 

Newton,  Isaac,  identities,  806. 
rootfinding  method,  423,  775. 

NEXT  (a)  (the  next  arc  with  the  same 
initial  vertex),  21. 

Neyman,  Jerzy,  692. 

Nicely,  Thomas  Ray,  588. 

Nievergelt,  Jiirg,  510. 

Nigmatullin,  Roshal’  Gabdulkhaevich 
(HurMaTyjiJiHH,  Pomajib 
rabrjyjixaeBHM),  558. 

Nijenhuis,  Albert,  338,  362,  411,  481,  510. 

Nijon,  Herman,  346. 

Nikolskaia,  Ludmila  Nikolaievna 

(HuKOJTbCKaa,  JIiOflMHjra  HnKOJiaeBHa), 
642. 

Nikolskaia,  Maria  (=  Macha)  Nikolaievna 
(HuKOjibCKaa,  Map  ha  HnKOjraeBHa) , 

642. 

Nim  and  nimbers,  134,  184,  622. 
addition,  134,  184. 
division,  184. 
multiplication,  184,  584. 
second-order,  184. 

No-three-on-a-line  problem,  277,  831. 
with  no  two  queens  attacking,  672. 


Nodes  in  SGB  format,  21-23,  523. 

Noisy  data,  197. 

Non-Euclidean  geometry,  167-168,  608. 

Nonbinary  Gray  codes,  see  Modular  Gray 
codes  for  tuples,  Reflected  Gray 
codes  for  tuples. 

Noncommuting  variables,  815. 

Nonconjunction  (A),  49,  seeNAND. 

Noncrossing  chords,  444,  471. 

Noncrossing  partitions,  473,  511,  513. 

Nondisjunction  (V),  49,  see  NOR. 

Nonimplication  (3),  49. 

Nonlocal  Gray  codes,  296-297,  314. 

Nonstandard  ordering  of  variables,  236, 
see  Reordering  of  variables. 

Nonsubsets  (/  Y1  g),  276,  673. 

Nonsupersets  (/  \ g),  276,  670-671,  674. 

Nonuniform  Turing  machines,  257. 

Nonzero  bytes,  testing  for,  152-153. 

Nonzero  register,  converted  to  mask,  181. 

Nooten,  Barend  Adrian  Anske  Johannes 
van,  488. 

NOR  (V),  49-51,  104-105. 

Nordhaus,  Edward  Alfred,  530. 

Nordstrom,  Alan  Wayne,  310. 

Normal  Boolean  functions,  100,  102,  110, 
113,  125,  279,  565,  577. 

Normal  distribution,  428. 

Normal  families  of  sets,  677. 

Normal  forms,  see  Full  conjunctive  normal 
form,  Full  disjunctive  normal  form, 
Integer  multilinear  representation, 
Multilinear  representation  of  a 
Boolean  function. 

Normal  numbers,  699. 

Normalization  of  a Boolean  function, 

100,  561. 

Normalized  BDDs,  279. 

Northeasterly  knight’s  tour,  352. 

NOT  (bitwise  complementation),  134, 

135,  184,  217,  604. 

NOT  gates,  32,  33,  104. 

Notations,  x,  26,  48,  132,  526,  535,  589, 
592,  669,  787. 

for  Boolean  binary  operators,  48-50. 
index  to,  822-826. 

NOTBUT  gates  (c),  97,  100. 

NOTBUT  subroutine,  633. 

Noughts  and  crosses,  see  Tic-tac-toe. 

Novels,  9,  23. 

Novra,  Henry,  694. 

Nowhere  differentiable  function,  386. 

NP-complete  problems,  viii,  35,  55,  189, 
240,  265,  538,  706. 

Nucleotides,  511. 

Null  case,  499,  799,  817. 

Null  graphs  (Kn),  26,  27,  41-43,  46, 

530,  806. 

Null  link  (A),  21,  222-223,  647-648. 
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Null  spaces,  200. 

Numbers,  Catalan,  440. 

Numerals,  Arabic,  493,  512. 

Roman,  707,  814. 

Sanskrit,  491. 

NXOR  (not  xor),  590. 

Nybble:  A 4-bit  quantity,  144,  340—341,  615. 
Nylan,  Michael,  487. 

Nyp:  A 2-bit  quantity,  144,  615. 

O (all-zeros  matrix),  27. 

O-notation,  238. 

OBDD,  204. 

Objects  in  images,  174. 

Oblivious  sorting,  72. 

Ockham,  William  of  (=  Guilielmus 
ab  Occam),  51. 

Octabyte  or  octa:  A 64-bit  quantity, 

139-140. 

Octacode,  310. 

Octahedral  groups,  706. 

Octonions,  685. 

Odd  Fibonacci  number  system,  608. 

Odd  length  runs,  696. 

Odd  permutations,  40,  323,  492,  512, 

712,  713. 

Odd  product  of  graphs,  28,  42-43,  483,  526. 
Odlyzko,  Andrew  Michael,  399. 

ODNFs,  84-85,  92. 

Oettinger,  Ludwig,  760. 

OFDD,  204. 

Offord,  Albert  Cyril,  801. 

Ofman,  Yuri  Petrovich  (OcJJMaH,  fOpnii 
PleTpoBHu),  570,  595. 

OKFDD,  204. 

Okuno,  Hiroshi  “Gitchang”  669. 

Olive,  Gloria,  734. 

Olkin.  Ingram,  758. 

Olver,  Frank  William  John,  426. 

Omega  network  for  routing,  188-189,  595. 
Omphaloskepsis,  235. 

One-to-many  mapping,  149,  162. 

Onegin,  Eugene  (OHf.rnH'b,  EBremft),  437. 
Ones  counting,  see  Sideways  sum. 

Online  algorithms,  174-175,  198. 

op:  Four-bit  binary  operation  code,  220-221. 

Open  bitmaps,  197. 

Optical  character  recognition,  172,  197. 
Optimal  versus  optimum,  34-35,  246. 
Optimization  of  Boolean  chains, 

121-122,  576. 

Optimizing  the  order  of  variables,  239-240, 
245,  246,  267-269,  644. 
for  ZDDs,  272. 

Optimum  algorithm,  461. 

Optimum  Boolean  evaluation,  101-106, 

126,  133. 

Optimum  coteries,  93. 


Optimum  linear  arrangement  problem, 

268,  832. 

Optimum  solutions  to  Boolean  equations, 
206,  209-211,  251,  258,  261,  279,  663. 

OR  (bitwise  disjunction,  I),  50,  74, 

84,  134-135. 

OR  function  (inclusive  or,  V),  48-51, 

53,  63,  81. 

OR  gates  (V),  32,  33,  97. 
with  vacuum  tubes,  104. 

OR  subroutine,  272,  273,  660,  677. 

Ord-Smith,  Richard  Albert  James 
(=  Jimmy),  330,  331,  336,  347, 

348,  508,  703. 

Order  ideals,  387,  554,  787. 

Order  of  a digraph,  18. 

Order  of  a graph,  13,  44. 

Order  of  a group  element,  338,  710. 

Order  of  a latin  square,  37. 

Order  of  a set  partition,  763. 

Order  of  an  orthogonal  array,  37. 

Ordered  BDDs,  202,  203,  216,  257, 

259,  637,  655. 

Ordered  factorizations,  760. 

Ordered  forests,  509. 

Ordered  pair  of  two  Boolean  functions,  219. 

Ordered  partitions  (compositions),  25, 
308-309,  356-358,  365,  379,  390,  410, 
488,  492,  512,  726,  778,  816. 
bounded,  370,  384,  385. 

Ordered  trees,  509,  511. 

Ordering  of  variables,  216,  236,  271, 

279,  626. 

by  local  transformations,  240-246,  649. 
optimum,  239-240,  245,  248,  267-270, 
272,  644,  657,  659. 

Ordinal  numbers,  583. 

Ore,  Oystein,  ix,  42. 

Organ-pipe  order,  239,  267,  368,  515,  626, 
646-647,  713,  720,  803. 

Organic  illustrations,  485. 

Oriented  binary  trees,  570. 

Oriented  cycles,  18,  19,  32,  40,  41,  259,  276. 

Oriented  forests  and  trees,  165-167, 

174  175,  461-462,  480. 

Oriented  paths,  18,  19,  41,  159,  196,  253. 

Oriented  spanning  paths,  40. 

Oriented  spanning  trees,  481-482,  808. 

Oriented  tree  numbers,  table,  461. 

Oriented  trees,  432,  461-462,  480-482, 

509,  511,  774. 

OROR,  661,  see  ANDAND  subroutine. 

Orthogonal  arrays,  37,  519,  832. 
generalized,  517-518. 

Orthogonal  DNFs,  84-85,  92. 

Orthogonal  families  of  sets,  273. 

Orthogonal  latin  squares,  3-8,  36-38. 

Orthogonal  strings,  37. 

Orthogonal  vectors,  34,  37,  288,  312. 
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Ostergard,  Patric  Ralf  Johan,  673. 

Otter  theorem-proving  program,  548. 

Ourotoruses,  318-319. 

Out-degree  of  a vertex,  18,  19,  21,  40, 

41,  43,  482,  808. 

Outline  of  a forest,  453. 

Outshuffles,  188,  201. 

Outside  of  a curve,  176. 

Overflowing  memory,  603. 

Overlapping  subtrees,  97,  203,  257. 

Overpartitions,  see  Joint  partitions. 

Ozanam,  Jacques,  3,  7,  9. 

p (power  set,  the  family  of  all  subsets), 

275,  660-662,  666,  669. 

P = NP(?),  55. 

P-partitions,  414. 

Pm  (permutation  function),  238,  272. 

P„  (path  of  order  n),  13,  28,  39,  469,  481, 
483,  537,  803,  807,  809. 

Pn  (oriented  path  of  order  n ),  18,  41. 

Packages  for  BDD  operations,  224,  257,  677. 

Packages  for  ZDD  operations,  272, 

273,  276,  677. 

Packed  data,  operating  on,  136,  151-153, 
163,  191-192,  195,  201. 

Packing  of  data,  136-138,  147-148,  163, 

186,  196,  201,  202,  594. 

Page  faults,  191. 

Page  in  a virtual  address,  263. 

Paige,  Lowell  J.,  5-7. 

Pak,  Igor  Markovich  (IlaK,  Hropb 
MapKOBuu),  713,  749. 

Paley,  Raymond  Edward  Alan  Christopher, 
186,  586,  684. 
functions,  312. 

Palindromes,  38,  516,  543. 

Palluel,  Frangois  Crette  de,  8. 

Pan-digital  puzzles,  319,  347,  484—485. 

Panholzer,  Alois,  796,  798. 

Papadimitriou,  Christos  Harilaos 

( [lo'.Tio'.OTijjLrjTpLou,  XptcTTo?  Xapikaoo), 
ix,  604. 

Papert,  Seymour  Aubrey,  198. 

Parabolas,  176,  198,  614. 

Parallel  addition,  108,  127-128,  570. 

Parallel  computation,  vii,  91,  108,  124, 

352,  682,  706. 

Parallel  edges  of  a multigraph,  19,  41. 

Parallel  lines,  37. 

Parallel  processing  of  subwords,  151-155, 
191-193,  202. 

Parent  pointers,  461-462,  470-471,  480. 

Parentheses,  278,  440—446,  450-453, 
455-456,  459,  471-472,  477,  478, 
509-511,  781,  787. 

Parenthesis  traces,  186. 

Parity  bits,  38,  286,  308,  309. 

Parity  check  matrix,  279. 


Parity  function,  51,  77,  94-95,  98,  105,  131, 
132,  159,  194,  210,  564,  584,  590. 
suffix,  187,  201,  603. 

Parity  patterns,  199-200. 

Parker,  Ernest  Tilden,  5-7,  516. 

Parkin,  Thomas  Randall,  587-588. 

Parking  problem,  789,  805. 

Part-count  form,  393,  407,  432. 

Partial  cubes,  90;  see  also  Subcubes. 

Partial  functions,  113-114,  131. 

Partial  ordering,  342,  352,  353,  414. 
Partial-tautology  functions  (tj), 

660-662,  675. 

Partially  symmetric  Boolean  functions, 

95,  269,  642,  650. 

Partition  lattice,  432-433. 

Partition  numbers  p(n),  395-401,  409-411. 

tables,  396,  400,  778. 

Partitions,  390-440. 

of  a multiset,  428-431,  439,  505,  513,  780. 
of  a set,  see  Set  partitions. 

Partitions  of  an  integer,  25,  30,  347, 

391-415,  428-431,  434,  439,  505-508, 
513,  529,  726,  767,  769. 
balanced,  407. 

doubly  bounded,  403,  411,  413,  723. 
ordered,  see  Compositions, 
random,  400-402,  411,  426-428. 
sums  over,  393,  419,  772,  775. 
with  distinct  parts,  408,  409,  411, 

412,  431. 

without  singletons,  399,  436,  754, 

768,  771. 
parts  graphs,  25. 

Party  games,  494,  503. 

Pascal,  Ernesto,  360. 

Patashnik,  Oren,  842. 

Patents,  112,  284,  590,  594,  620,  680,  694. 
Paterson,  Kenneth  Graham,  305. 

Paterson,  Michael  Stewart,  126,  127, 

154,  601,  602. 

Path  graph  (Pn),  13,  28,  39,  469,  481, 

483,  537,  803,  807,  809. 

Paths  in  a graph,  13,  41. 
oriented,  18,  19,  41. 
shortest,  viii,  12,  16,  32,  66,  646. 
simple,  253-255,  275-276. 

Paths  on  a grid,  356-357,  379,  395. 

Patricia,  257. 

Pattern  recognition,  172. 

Patterns  in  permutations,  762. 

Patterns  of  bits,  searching  for,  152-154,  193. 
Patterson,  Nicholas  James,  561. 

Paul,  Wolfgang  Jakob,  131. 

Payne,  William  Harris,  363,  382. 

PBDD,  204. 

Peczarski,  Marcin  Piotr,  718. 

Pehoushek,  Joseph  Daniel,  87. 
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Peirce,  Charles  Santiago  Sanders,  48, 

50,  53,  418,  542. 

triangle,  418,  434-436,  438,  769,  771,  779. 
Peled,  Uri  Natan  ("Dd  pi  DIN),  654. 
Pendant  vertex,  see  Endpoint  of  a graph. 
Pentagonal  numbers,  395,  409. 

Pentagons,  168-169,  475. 

Pentagrid,  168-171,  196,  608-609. 
Pepperdine,  Andrew  Howard,  721. 

Perelman,  Grigori  Yakovlevich  (nepejibiuaH, 
rpnropHft  HKOBJieBHu),  514. 

Perez,  Aram,  183. 

Perfect  combination  generation, 

369-371,  384. 

Perfect  Gray  code  for  nested  parentheses, 
446,  477,  787. 

Perfect  hash  functions,  588-589. 

Perfect  matchings,  252,  313-314,  686-687, 
768,  831. 

Perfect  parity  patterns,  199-200. 

Perfect  partitions,  415. 

Perfect  shuffles,  38,  see  also  Zip. 

Period  length,  194,  317,  530. 

Permanent  of  a matrix,  40,  119,  125. 
Permutahedron,  475. 

Permutation  digraphs,  40. 

Permutation  function  (Pm),  238,  272. 
Permutation  generation,  319-355. 

bypassing  blocks,  331-334,  348,  719-720. 
cyclic  shift  method,  336,  338,  341,  348. 
dual,  335-337,  348. 

Ehrlich  swap  method,  337-338,  349-350. 
fastest,  339-342. 

general,  328-331,  340-341,  347-348. 
lexicographic,  319,  330,  333,  334, 

344- 345,  508. 

lexicographic  with  restricted  prefixes, 

334,  348,  718-719. 

plain  changes,  322-325,  335,  341,  343, 

345- 346,  351,  719-720. 
when  to  use,  344,  722. 

Permutation  matrices,  20,  182,  238,  523. 
Permutation  networks,  145-147, 

188-190,  592. 

Permutation  of  variables,  216,  236, 

271,  279,  626. 

by  local  transformations,  240-246, 

273,  649. 

optimum,  239-240,  245,  267-269,  272,  644. 
Permutation  representation  of  binary 
trees,  476. 

Permutations:  Arrangements  in  a row, 

271,  319-355,  390,  432,  439,  448, 
490-492,  500. 
applying,  326-328. 
balanced,  354. 
conjugate,  330. 
cycles  of,  326,  330. 
cyclic,  353,  476. 


derangements,  353. 
descents  of,  479. 

even,  40,  323,  354,  492,  512,  812. 
flip,  789. 

Gray  codes  for,  349-350,  718-720. 
groups  of,  327-328,  338,  710. 
fc-ordered,  353. 
indecomposable,  353,  811. 
induced  by  index  digits,  188. 
inverse,  342-343,  346,  448,  717. 
inversions  of,  321,  323. 
involutions,  353-354,  718. 
multiplication  of,  326. 
notations  for,  326. 
null,  499. 

odd,  40,  323,  492,  512,  712,  713. 
of  a Latin  verse,  500-503,  512-513. 
of  a multiset,  319-321,  342,  345,  351, 
358,  368-369,  383,  384,  395,  491,  500, 
502,  507,  723,  726,  735,  760. 
of  bits  within  a word,  145-149,  157,  182. 
of  bytes  within  a word,  182. 
of  the  2-adic  integers,  185. 
Omega-routable,  188-189. 
order  of,  338,  710. 
partial,  345,  see  Variations, 
r-element,  345,  348. 
rank  of,  345,  352,  717. 
representation  of,  327,  335. 
restricted,  501-503,  512-513. 
sign  of,  323,  351. 
signed,  346. 

universal  cycles  of,  354-355. 
up-down,  353. 
well-balanced,  354. 

Permuted  2m-way  multiplexer,  235, 

239,  267,  272. 

Perpendicular  lines,  168. 

Perrin,  Frangois  Olivier  Raoul,  numbers, 
537,  623,  641. 

Perverse,  Rufus  Quentin,  315. 

Petersen,  Julius  Peter  Christian,  5,  14,  525. 

graph,  14,  15,  25,  39,  42,  44,  45,  525. 
Peterson,  William  Wesley,  183. 

Petrarca,  Francesco  (=  Petrarch),  436. 
Pfaff,  Johann  Friedrich,  719. 

Phi  196,  236,  246,  270,  514. 
as  source  of  “random”  data,  348, 

516,  818-819. 

Phillips,  John  Patrick  Norman,  702. 

Philo  of  Megara  (=  Philo  the  Dialectician, 
4>lXcov  6 Meyaprcri;),  48. 

Phylogenetic  trees,  774. 

Pi  (tt),  as  source  of  “random”  data,  76, 

98,  118,  128,  149,  205,  247,  272,  310, 
317,  345,  348,  354,  356,  367,  381-383, 
388,  434-435,  459,  479,  486,  516, 

641,  650,  681,  759,  794,  811,  818-819, 
see  also  Pi  function. 
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Pi  function,  52,  54,  80-82,  118,  125, 

130,  561. 

Pl(/):  The  prime  implicants  of  /, 

255-256,  277-278. 

Pingala,  Acarya  frfFT),  487-488. 

Piano,  364,  384. 

Pickover,  Clifford  Alan,  586. 

Pigeonhole  principle,  85,  390-391,  579,  616. 
Pincusian  mathematics,  79. 

Pipelined  machine,  180-181. 

Pippenger,  Nicholas  John,  558,  573. 

Pisot,  Charles,  number,  641. 

Pistols,  271,  659. 

Pitman,  James  William,  439,  765,  777. 

Pittel,  Boris  Gershon  (IJHTTejib,  Bopnc 
PopmoHOBU'i).  428. 

Pitteway,  Michael  Lloyd  Victor,  177. 

Pitts,  Walter  Harry,  75. 

Pixel  algebra,  172. 

Pixel  patterns,  136,  185. 

Pixels,  24,  31,  171-180,  196-200. 
gray,  191,  199. 

Plain  changes,  322-325,  335,  341, 

343,  345-346,  351,  364,  447,  491, 

498,  719-720. 

Planar  graphs,  14,  15,  17,  24,  39,  44, 

233,  521,  527. 

Planar  Langford  pairings,  36. 
planeJisa  graphs,  24,  31. 
plane-miles  graphs,  24,  31. 

PLAs,  53. 

Plass,  Michael  Frederick,  87. 

Plastic  constant,  236,  641. 

Plato  = Aristocles,  son  of  Ariston 

(nXoctcov  = ’ApictoxXfic  ’Apiaxcovo?),  440. 
Playing  cards,  3-4,  389,  440. 

Pleasants,  Peter  Arthur  Barry,  766. 
Pleszczynski,  Stefan,  717. 

Poetry,  436-437,  494,  505,  520. 
meters  for,  487-490,  492,  500-503, 

508,  512-513. 
rhyme  schemes,  505,  513. 

Pohl,  Ira  Sheldon,  3. 

Poincare,  Jules  Henri,  514. 

Poinsot,  Louis,  389,  512,  745. 

Poirot,  Hercule,  18-19,  816. 

Poisson,  Simeon  Denis,  751. 
distribution,  427,  434. 
summation  formula,  397,  410. 

Polignac,  Camille  Armand  Jules  Marie 
de,  15. 

Polish  prefix  notation,  63,  472,  511. 

Poliak,  Gyorgy,  687. 

Polya,  Gyorgy  (=  George),  xiii,  18, 

586,  748,  758. 

Polyhedra,  372,  387,  475. 

Polyhedral  combinatorics,  vii. 

Polynomial  ideal,  388. 


Polynomials,  see  Integer  multilinear 
representation,  Interpolating 
polynomials,  Multilinear  representation 
of  a Boolean  function,  Reliability 
polynomials. 

computed  from  BDDs,  211-212,  260. 
represented  by  ZDDs,  278. 

Polynomials  modulo  2,  183,  189,  675. 
multiplication  of,  202. 
remainders  of,  183,  189,  199-200. 
Polynomials  modulo  5,  192. 

Polyominoes,  252,  274. 

Polyphase  sorting,  488. 

Pomerance,  Carl,  777. 

Pool  of  available  memory,  220. 

Population  count,  143,  see  Sideways  sum. 
Portability,  139-140. 

Portable  programs,  22. 

Positive  Boolean  functions,  55,  537,  see 
Monotone  Boolean  functions. 

Positive  normal  form,  see  Multilinear 

representation  of  a Boolean  function. 
Positive  semidefinite  matrices,  805-806. 
Positive  threshold  functions,  75-76,  92. 
Posner,  Edward  Charles,  519. 

Possibility,  in  3-valued  logic,  195. 

Post,  Emil  Leon,  63,  68,  546. 

Post,  Ian  Thomas,  674. 

Postal  codes,  15,  40,  276. 

Postmultiplication,  327. 

Postorder  of  nodes,  381,  441,  444,  447, 

448,  471,  781,  782,  786. 

Postpreorder  of  nodes,  469-471,  483. 
Pournader,  Roozbeh  618. 

Power  set  (p),  275,  660-662,  666,  669. 
Powers  of  2,  division  by,  135-136. 
multiplication  by,  135,  588. 
partitions  into,  414. 

Powers  of  a graph,  470,  484. 

Prakrta  Pairigaia  (3TfTrf  488,  512. 

Pratt,  Vaughan  Ronald,  125,  186,  190, 

593,  596,  601,  615. 

Precedence  of  operators,  51. 

Preferential  arrangements,  see  Weak 
orderings. 

Prefix  problem,  127-128,  132. 

Prefixes  of  strings,  132,  305. 
Premultiplication,  327,  329-330,  332,  719. 
Preorder  degree  sequence,  472. 

Preorder  of  nodes,  165-167,  329,  332, 

361,  381,  441,  444,  448-449,  461, 

462,  471-475,  478,  480,  511,  731, 

748,  781,  786,  787,  804. 

Preparata,  Franco  Paolo,  567. 

Prepostorder  of  nodes,  469-471, 

483-484,  680. 

Preprime  strings,  306-308,  317. 

Prestet,  Jean,  502,  513. 

Presume,  Livingstone  Irving,  187. 
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Prime  clauses,  54,  95,  129,  277. 

Prime  forms,  54,  64,  71,  81. 

Prime  graphs,  28. 

Prime  implicants  of  Boolean  functions, 

54,  64,  71,  82,  89,  94,  95,  129,  195, 

258,  277-278,  577,  581. 
monotone,  255-256,  631-632, 

653-654,  674. 

of  a majority  function,  92,  553. 
Prime-number  function,  110,  129. 

Prime  numbers,  137,  186. 

Prime  strings,  305-308,  317. 

factorization  into,  317,  700. 

Primitive  polynomials  modulo  2,  626-627. 
Primitive  polynomials  modulo  p,  303,  683. 
Primitive  strings:  Not  a power  of  shorter 
strings,  204. 

Principal  subforests,  300-301. 

Prins,  Geert  Caleb  Ernst,  509. 

Printing  machines,  171. 

Priority  encoders,  127. 

Priority  queues,  167. 

Pritchard,  Paul  Andrew,  587. 

Probability  distribution  functions,  400, 

428,  434,  438. 

Prodinger,  Helmut,  589,  770,  796,  798. 
Product  of  binary  numbers,  228-229, 
247-249,  271-272,  278. 

Products  of  digraphs  and  multigraphs,  526. 
Products  of  graphs,  27-28,  42-44. 
Product-of-sums  expression,  see  Conjunctive 
normal  form. 

Profile  (bo, . . . , bn)  of  a function,  233-236, 
240,  262,  263,  266,  271,  279,  629, 

631,  663. 

Program  counter,  158. 

Programmable  logic  arrays,  53. 
Programming  languages,  58. 

Projection  functions  (x^),  49,  63,  80,  141, 
265,  272,  274,  534-535,  550. 

Projections  in  a median  algebra,  67. 
Projective  planes,  finite,  518,  529,  581. 
Prokop,  Harald,  142,  187. 

Prolog  language,  57. 

Promel,  Hans  Jurgen,  718. 

Proper  prefixes  or  suffixes,  305. 

Proportional  graphs,  46. 

Proskurowski,  Andrzej,  446,  477. 

Prosody,  487,  see  Poetry,  meters  for. 
Proteins,  511. 

Proteus  verses,  501,  502,  512. 

Provan,  John  Scott,  541. 

Pruesse,  Gara,  718. 

Prune-and-graft  algorithm,  449,  473. 
Pseudo-complement  in  a lattice,  789. 
Pseudorandom  numbers,  12,  25,  96,  317. 
Ptolemy,  Claudius,  of  Alexandria 

( flxo/.£|xaio^  KkauSto?  6 ’AXs$av8piv6<;), 
501. 


Pudlak,  Pavel,  763. 

Pulse  code  modulation,  284. 

Pun  resisted,  62,  470. 

Pure  alphametics,  325,  346-347,  432. 

Pure  majority  functions,  76,  550. 

Purkiss,  Henry  John,  308. 

PUSHJ  (push  registers  and  jump),  716. 
Pushkin,  Alexander  Sergeevich  (nyinKHH-b, 
AjieKcaHflp-b  CeprteBHH-b),  437. 
Puteanus,  Erycius  (=  de  Putte,  Eerrijk), 
500-502,  512,  814-815. 

Puttenham,  George  and/or  Richard, 
504-505,  513. 

Putzolu,  Gianfranco,  541. 

Puzzles,  xiv,  1,  3-4,  7-9,  15,  316,  515,  521, 
617,  680,  694,  see  also  Alphametics, 
Chinese  ring  puzzle,  Pan-digital  puzzles. 
Pyramids,  tetrahedral,  89. 

Pyrrhics,  489-490,  501,  512. 

(/-ballot  numbers,  476-477. 
g-Catalan  numbers,  476-477. 
q-multinomial  coefficients,  384. 
q-nomial  coefficients,  369,  384,  726, 

735,  769,  793. 

ij-nomial  theorem,  749,  753. 
g-Stirling  numbers,  436,  765. 

Q(f)  (the  QDD  size  of  /),  235,  248, 

272,  627. 

QDD:  A quasi-BDD,  234. 

Quadratic  forms,  177-179,  198-199. 
Quadtrees,  597. 

Quantified  formulas,  87,  123-124,  230-232, 
264-265,  579,  585,  601,  651. 
Quasi-BDDs,  234-235,  248,  268,  627, 

638,  642,  676,  677. 

Quasi-Gray  codes,  469-471. 

Quasi-profile  (qo, ...  ,q„)  of  a function, 

235,  237,  240,  250,  262,  266,  268, 

271,  629,  659. 

Quaternary  n-tuples,  309,  688. 

Quaternions  and  octonions,  312. 

Queen  graphs  (<?„),  277,  604. 

Queen  moves  on  a chessboard,  26,  44. 
Queues,  543,  717. 

Quick,  Jonathan  Horatio,  81,  185, 

190,  265,  543. 

Quilt,  136. 

Quine,  Willard  Van  Orman,  54,  55,  82,  539. 
Quorums,  88. 

Quotient  operation  on  families  of  sets 
(f/g),  273. 

R&D  method,  305,  317. 
r-closed  graphs,  133. 
r-families  of  edges,  133. 
r-uniform  hypergraphs,  32-33,  44. 

Rabbinic  script,  490. 

Rabin,  Michael  Oser  (pm  HIV  bND’O),  596. 
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Rademacher,  Hans,  288,  398,  399,  410,  411. 

functions,  288,  312,  386,  685. 

Radix  —2,  see  Negabinary  number  system. 
Radix-2  number  systems,  historic, 

487,  490,  499. 

Radix-3  number  system,  historic,  487. 

Radix  conversion,  192. 

Radix  exchange  sort,  603. 

Radix  sorting,  430-431,  677. 

Rado,  Richard,  389,  555. 

Radoicic,  Rados,  713. 

Rafaiani,  Luigi,  841. 

RainBones  puzzle,  515. 

RAM  (random-access  machine),  158-159, 
194,  602. 

Raman,  Rajeev,  597. 
raman  graphs,  24,  531. 

Ramanujan  Iyengar,  Srinivasa 

(udjsaflsurrsw/jTrLDn'i&ysDsar  eoujmjaniT), 
graphs,  24,  398,  399,  410,  411, 

531,  750,  751. 

Ramesh,  Hariharan  (rrGiosfy  syntflammshr) . 

804. 

Ramras,  Mark  Bernard,  688. 

Ramshaw,  Lyle  Harold,  153. 

Randall,  Keith  Harold,  142,  187. 

Random  binary  tree,  454-457,  478,  485. 
Random  bit  generation,  653. 

Random  Boolean  functions,  56,  83,  541. 
Random  forests,  453-454,  478. 
random-graph  graphs,  25,  41. 

Random  graphs,  25,  41,  46. 

Random  number  generation,  12,  25,  96,  317. 
Random  oriented  trees,  481. 

Random  partitions,  400-402. 
generating,  411. 

Random  Schroder  trees,  798-799. 

Random  set  partitions,  426-428. 
generating,  428. 

Random  solutions  to  Boolean  equations, 

206,  208-209,  233. 

Random  walks,  45. 

Randomization,  118,  130. 

Randomized  data  structures,  590. 
Randomness,  true,  516. 

Randrianarivony,  Arthur,  655. 

Raney,  George  Neal,  478. 

Range,  Niko,  647. 

Range  checking,  192. 

Range  minimum  query  problem,  196,  832. 
Rank  of  a binary  matrix,  200,  279. 

Rankin,  Robert  Alexander,  338,  351,  714. 
Ranking,  487-488,  500,  512. 

combinations,  360,  363,  373,  383, 

491,  727,  728,  735. 

n-tuples,  284,  299,  315,  487-488,  512. 
other  combinatorial  objects,  477,  479,  511. 
permutations,  345,  352,  500,  717. 


Rao,  Calyampudi  Radhakrishna 
ct4>),  518. 

Rapaport,  Elvira  Strasser,  713. 

Rashed,  Roshdi  (=  Rashid,  Rushdi) 

(■uil_,  493,  812. 

Rasters,  171,  see  Bitmaps. 

Rational  2-adic  numbers,  141,  193,  585, 
see  also  Magic  masks. 

Rational  arithmetic,  810. 

Raviv,  Josef  (Ml  qov),  677. 

Ray,  Louis  Charles,  172. 

Raynaud-Richard,  Pierre,  618. 

Razborov,  Alexander  Alexandrovich 
(Pa36opoB,  AjreKcaHflp 
AjieKcaHApoBuu),  125. 

Reachability  problem,  159-160,  165, 

217,  803,  832. 

Read,  Ronald  Cedric,  370,  734. 

Read-once  branching  programs,  seeFBDDs. 

Read-once  functions,  246-247,  270,  271. 
generalized,  653. 

Read-once  threshold  functions,  654. 

Reagan,  Ronald  Wilson,  437. 

Real  numbers,  91. 
extended,  63. 

Real  roots,  439. 

Rearrangeable  networks,  see  Permutation 
networks. 

Reciprocal  of  an  odd  integer  mod  2n . 621. 

Reckhow,  Robert  Allen,  542. 

Recorde,  Robert,  827. 

Recreations,  1,  3-4,  7-9,  see  Carroll,  Dice, 
Dominoes,  Dudeney,  Games,  Puzzles. 

Recurrence  relations,  71,  140,  142,  143, 

169,  183,  187,  199,  211,  224,  228, 

246,  266,  270,  303,  380,  396,  404, 

409,  537-538,  566,  567,  623,  641,  643, 
651,  654,  669,  673,  681,  683,  693, 
697-699,  703,  728-730,  778,  789-791, 
796,  798,  809,  814. 
binary,  108,  109,  126,  140,  142,  143, 

187,  549,  566,  681,  699. 

Recurrent  states,  482. 

Recursion,  364-366,  463-464. 
versus  iteration,  366-368,  383. 

Recursion  tree,  451-452. 

Recursive  algorithms,  147,  149,  164,  184, 
225-233,  256,  264-266,  272-273, 
276-277,  506-507,  511,  513,  583, 

591,  595,  623,  652,  678. 

Recursive  coroutines,  370. 

Recursive  principle  underlying  BDDs, 

225,  229. 

Recursive  principle  underlying  ZDDs, 
659-661. 

Recursive  procedures,  70,  82,  562, 

566,  764,  802. 

Recursive  structure,  443,  451,  781,  790. 

Redei,  Laszlo,  522. 
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Red’kin,  Nikolai  Petrovich 

(Pe/TbKHH,  HttKOJiaii  IleTpOBHM), 

107,  122,  131,  577. 

Reduced  BDDs,  202-203,  225,  226,  235, 

257,  259,  262,  637,  655. 

Reduced  median  sets,  72,  91. 

Reduction  to  a BDD,  216-218. 

Redundant  coordinates,  72. 

Redundant  implicants,  94,  538,  541. 
Redundant  representations,  616. 

REF  field,  see  Reference  counters. 

Reference  counters,  227-228,  264,  268, 

633,  637,  659,  661,  669-670. 

Refinement,  432. 

Reflected  Gray  codes  for  tuples,  299—300, 
315,  382,  447,  693,  759. 
decimal,  299. 
m- ary,  682,  693,  695. 
mixed-radix,  299-301,  321,  705,  710,  719. 
ternary,  299-300,  316. 

Reflection  of  a Boolean  function,  266,  647. 
Reflection  of  a forest,  448-449,  471-472, 

476,  483. 

Reflection  of  bits,  144-145,  157,  159,  187, 

188,  308,  311,  592,  608,  675. 

Registers,  101-103,  126. 

Regular  Boolean  functions,  93,  263, 

271,  559. 

enumeration  of,  631. 

Regular  graphs,  14,  24-25,  33,  40-44,  483. 
Regular  languages,  193,  279. 

Regular  polygons,  168-169,  475. 

Regular  solids,  387. 

Reingold,  Edward  Martin  (Ttnjm, 

0”n  p rwo  pns’),  289,  362,  510. 

Reiss,  Michel,  745. 

Reitwiesner,  George  Walter,  187. 

Relative  complement,  460. 

Relay-contact  networks,  257. 

Reliability  polynomials,  80,  81,  84,  93,  206, 
211-212,  260,  261,  267,  388,  535. 
Remainder  operation  on  families  of  sets 
(/mods),  273,  661,  666. 

Remainders  mod  2",  136. 

Remainders  mod  2”  — 1,  143. 

Remainders  mod  3,  634. 

Remainders  of  polynomials  mod  2,  183, 

189,  199-200. 

Remmel,  Jeffrey  Brian,  769. 

Remond  de  Montmort,  Pierre,  506,  513. 
Removal  of  bits,  140. 

Remy,  Jean-Luc,  456,  478,  798. 

Renaming  (selectively  complementing) 
Boolean  variables,  87,  536,  544. 
Reordering  of  variables,  216,  236,  271, 

279,  626. 

by  local  transformations,  240-246, 

273,  649. 

optimum,  239-240,  245,  267-269,  272,  644. 


Replacement  functions,  265. 

Replacement  of  variables  by  constants, 

218,  262,  634. 

Replacement  of  variables  by  functions,  263. 

Replacement  selection  sorting,  727. 

Replication  of  bits,  149,  190,  599. 

Representation  of  graphs  and  digraphs, 
19-22,  159,  194. 

Representation  of  permutations,  189, 

327,  335. 

Representation  of  sets  as  integers,  143,  150, 
160,  190,  194-195,  278,  585-586. 

Representation  of  three  states  with  two 
bits,  160-163,  195. 

Residue  theorem,  419,  422. 

Resolution  principle,  539. 

Restricted  growth  strings,  279,  416-418,  432, 
766,  767,  771,  778,  786,  815. 

Restricted-to  operation  (/  1)  g),  635. 

Restriction  of  a Boolean  function,  218, 

262,  630,  634,  653-654,  see  also 
Subfunctions. 

Restriction  of  a graph,  13. 

Resultants,  807. 

Retraction  mappings  and  retracts,  74,  91. 

Reusch,  Bernd,  674. 

Reversal  of  bits,  144-145,  157,  159,  187, 

188,  308,  311,  592,  608,  675. 

Reverse  colex  order,  326,  330,  333,  335, 

344,  500,  703,  814. 

Reverse  lexicographic  order,  494,  505, 

506,  816. 

Reversing  a string,  319,  349,  354,  703,  705, 
see  also  Flip  operation/permutation. 

Reversion  of  power  series,  727. 

Revolving-door  Gray  codes,  362,  383,  405, 
463,  467,  468,  481,  804. 
near-perfect,  365-371,  383,  446. 
scheme  rst)  362-364,  370-371,  381-383. 

Rhyme  schemes,  416,  436-437,  505,  513. 

Rhythms,  488-490,  498. 

Richards,  Dana  Scott,  316,  472,  476. 

Riemann,  Georg  Friedrich  Bernhard, 
surface,  773. 

Right-child/left-sibling  links,  445,  472, 

476,  783,  786-787. 

Right  complementation  (R),  49. 

Right  path  length,  795. 

Right  projection  (R),  49,  63,  534-535. 

Right-sibling/left-child  links,  441,  445,  447, 
470,  667,  678,  782,  783,  786-787,  809. 

Right  subcube,  310. 

Right-to-left  minima,  606. 

Rightmost  bits,  140-142,  186. 

Rim  representation,  394-395,  402, 

408,  412,  747. 

Ring  sum  expansion,  see  Multilinear 

representation  of  a Boolean  function. 

Ringel,  Gerhard,  315,  525. 
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Riordan,  John,  570,  793-794. 

RISC:  Reduced  Instruction  Set  Computer, 
32,  see  also  MMIX  computer. 
rise  graphs,  31-32. 

Ritchie,  Alistair  English,  680. 

River  flows,  811. 

Rivest,  Ronald  Linn,  56,  261. 

Robbins,  David  Peter,  771. 

ROBDD:  A reduced,  ordered  binary 
decision  diagram,  202. 

Robertson,  George  Neil,  17. 

Robinson,  Gilbert  de  Beauregard,  479. 
Robinson,  John  Alan,  539. 

Robinson,  John  Paul,  310,  687,  690. 
Robinson,  Robert  William,  745,  774. 
Rochdale,  Simon,  133. 

Rodrigues,  Benjamin  Olinde,  456. 

Roelants  van  Baronaigien,  Dominique, 

446,  447,  791. 

Roget,  John  Lewis,  23. 

Roget,  Peter  Mark,  9,  23. 
roget  graphs,  23,  41. 

Rokicki,  Tomas  Gerhard,  187,  590. 

Roman  numerals,  707,  814. 

Rook  moves  on  a chessboard,  26,  41. 
Rook-neighbors,  172,  611-612. 

Rook  polynomials,  434-435. 

Rooks,  nonattacking,  434-435,  767-768. 
Rookwise  connected  components,  24, 
173-175,  198,  252,  609. 

Root  of  a BDD,  202-204,  207,  215,  227, 
280,  638,  676. 

Rooted  unlabeled  trees,  see  Oriented  trees. 
Roots  of  a forest,  441. 

Roots  of  a polynomial,  439. 

Roots  of  unity,  384,  398,  752. 

Rosa,  Alexander,  516-517. 

Rosary  permutations,  346,  713. 
Rosenbaum,  Joseph,  693. 

Rosenfeld,  Azriel  (ibann  iwnty),  173. 
Rosenkrantz,  Daniel  Jay,  638. 

Rossin,  Dominique  Gilles,  805. 

Rota,  Gian-Carlo,  390,  557. 

Rotation  lattice,  see  Tamari  lattice. 
Rotation  of  square  bitmaps,  199. 

Rotations  in  a binary  tree,  446-449,  787. 
Rote,  Gunter  (=  Rothe,  Gunther  Alfred 
Heinrich),  198. 

Rotem,  Doron  (Dim  inn),  340,  343. 

Roth,  John  Paul,  539. 

Rothaus,  Oscar  Seymour,  561. 

Rothe,  Heinrich  August,  508,  703. 
Round-robin  tournaments,  413. 

Rounding,  165,  597. 

to  an  odd  number,  134,  191,  597. 
Row-echelon  form,  725. 

Row  sums,  414. 

Roy,  Mohit  Kumar  (CSTfeo  jads  3TXI),  706. 
Rucinski,  Andrzej,  533. 


Rucksack  filling,  361,  381. 

Rudeanu,  Sergiu,  258. 

Rudell,  Richard  Lyle,  230,  241,  243,  244, 
257,  268,  635,  650. 

Rudiger,  Christian  Friedrich,  319. 

Ruler  function  ( px ),  x,  140,  152,  153,  157, 
158,  160,  164,  165,  167,  185,  187, 

188,  193,  196,  268,  286,  288,  292, 

293,  585,  589,  612,  685. 
decimal,  299. 
factorial,  348. 
summed,  607. 

Run-length  coordinates  d fc,  442,  443, 

471,  477,  781,  784. 

Run  lengths,  295-297,  314,  688,  696. 

Runlength  encoding,  611-612,  see  also 
Edges  between  pixels. 

Runs  of  0s  or  Is,  549-550. 

Runs  of  Is,  140,  143,  154-155,  187,  193; 
see  also  Consecutive  Is. 

Ruskey,  Frank,  129,  300,  301,  308,  311, 

313,  339,  352,  384,  417,  433,  446,  447, 
452,  454,  477,  511,  664,  696,  700,  713, 
714,  717,  718,  722,  791,  802. 

Rutovitz,  Denis,  172. 

Ruzsa,  Imre  Zoltan,  740-741. 

Ryser,  Herbert  John,  37,  517,  757. 

S,  the  letter,  180. 

S-boxes,  129. 

S2  (sum  of  binary  digits),  589,  see 
Sideways  sum. 

Sm  (a  symmetric  Boolean  function),  262, 
272,  274,  626,  636,  642. 

S>m  (a  symmetric  threshold  function), 

262,  643,  659,  663-664. 

Ski,...,kt(x)’  77,  see  Symmetric  Boolean 
functions. 

SIS,  585. 

Saccheri,  Giovanni  Girolamo,  168. 

Sachkov,  Vladimir  Nikolaevich  (CauKOB, 
BjiaAHMHp  HnKOJiaeBHu),  428. 

Sachs,  Horst,  525,  806,  808. 

Sack,  Jorg-Riidiger  Wolfgang,  797. 

SADD  (sideways  addition),  141,  160,  587, 
589,  590,  620. 

Saddle  point  method,  398,  419-426, 
437-439,  779. 

Sagan,  Bruce  Eli,  723. 

Saka,  Masanobu  (SIE^R),  504. 

Saks,  Michael  Ezra,  549. 

Salzer,  Herbert  Ellis,  682. 

Samet,  Hanan  (DOO  jin),  597. 

Sampson,  John  Laurence,  686. 

Samson,  Edward  Walter,  536,  539. 

Sanders,  Daniel  Preston,  17. 

Sandpiles,  482,  808. 

Sangiovanni-Vincentelli,  Alberto  Luigi,  122. 

Sanity  check  routine,  659. 
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Sanskrit,  487-489,  491,  492. 
numerals,  491. 

Sarngadeva,  son  of  Sodhaladeva  (HI  1^44 , 
'JT:),  491,  500,  703,  814. 
Sartena,  Christian,  534. 

Sasaki,  Fukashi  (fe  V “Tlh ) , 77. 

Sasao,  Tsutomu  626. 

SAT-counting,  see  Enumeration  of  solutions. 
Satisfiability  problem,  55-62,  85-87,  830. 
for  Horn  clauses,  60,  85-86,  830. 
for  Krom  clauses,  57,  60-62,  72,  85-86. 
Saturating  addition,  604. 

Saturating  subtraction,  see  Monus  operation. 
Sauerhoff,  Martin  Paul,  238,  246,  533, 

635,  642,  653,  657. 

Sauveur,  Joseph,  516. 

Savage,  Carla  Diane,  297-298,  308,  313,  315, 
405,  510,  687,  692,  713,  720,  735. 

Savage,  John  Edmund,  570. 

Savicky,  Petr,  649. 

Sawada,  Joseph  James,  700. 

Sayers,  Dorothy  Leigh,  320. 

Scaligero,  Giulio  (—  Scaliger,  Julius 
Caesar),  501. 

Scandalous  fact,  26. 

Scatter-flip  operation,  595. 

Scattered  arithmetic,  150,  190. 
addition,  150,  189. 
shifting,  190. 
subtraction,  190. 

Scattering  bits,  594. 

Schaefer,  Thomas  Jerome,  72. 

Schaffler,  Theodor  Heinrich  Otto,  285. 
Scheduling,  60-62,  86. 

Schensted,  Craige  Eugene  (=  Ea  Ea), 

87-89,  479,  547. 

Schieber,  Baruch  Menachem 
(-n>\y  oroo  pra),  165. 

Schillinger,  Joseph  Moiseyevich 

( I lIujuiHni  ep.  HochcJ)  MonceeBHu), 
497-498,  512. 

Schlafli,  Ludwig,  609. 

Schmidt,  Erik  Meineche,  257. 

Schmitt,  Peter  Hans,  546. 

Schneider,  Bernadette,  693. 

Schnorr,  Claus-Peter,  131. 

Schooten,  Frans  van,  498-499,  512. 

Schott,  Rene  Pierre,  798. 

Schroder,  Friedrich  Wilhelm  Karl 
Ernst,  80,  535. 
numbers,  479,  810. 
trees,  479,  798-799. 
triangle,  798-799. 

Schroeppel,  Richard  Crabtree,  126, 

158,  184,  593. 

Schumacher,  Heinrich  Christian,  5,  17. 

Schur,  Issai,  758. 


Schiitzenberger,  Marcel  Paul,  373, 

699,  739,  744. 

Schwenk,  Allen  John,  810. 

Scoins,  Hubert  Ian,  461,  511. 

Scope  coordinates,  448,  474. 

SCOPE  links,  444. 

Score  vectors,  413. 

Scrabble®,  10. 

Scutella,  Maria  Grazia,  60. 

Search  trees,  6. 

Second-order  logic,  124,  585. 

Second-smallest  parts,  412. 

Security  holes,  201. 

Sedgewick,  Robert,  339,  510. 

Seed  value  for  pseudorandom  numbers, 

12,  25. 

Sefer  Yetzirah  (iTPfi'  130),  490. 

Segmented  broadcasting,  see  Stretching  bits. 
Segmented  sieves,  587. 

Seidel,  Philipp  Ludwig  von,  655. 

Seitz,  Richard,  337. 

Sekanina,  Milan,  470. 

Seki,  Takakazu  (Rfl#ftl),  492,  504,  512. 
Selasky,  Hans  Petter  William  Sirevaag,  202. 
Selection  function,  see  J(x\ /)  function, 
Multiplexer. 

Self-avoiding  walks,  254. 

Self-complementary  graphs,  42,  43,  533. 
Self-conjugate  forests,  476,  783. 
Self-conjugate  partitions,  408,  434,  758. 
Self-converse  graphs,  525. 

Self-dual  Boolean  functions,  63,  79, 

81,  92,  95. 

monotone,  63-64,  70,  79,  87-89,  92-93, 
133,  256,  263,  268,  550,  631,  647. 
threshold,  79,  92. 

Self-dual  forests,  476. 

Self-dualization,  92. 

Self-loops,  13,  18,  19,  41,  462,  465,  529. 
Self-organizing  data  structures,  549. 
Self-transpose  forests,  476. 

Self-reference,  124,  873. 

Semba,  Ichiro  (fliljfe— 6[S),  442,  627,  676. 
Semidefinite  programming,  vii. 
Semidistributive  laws,  475. 

Semilabeled  trees,  432. 

Semimodular  lattices,  763. 

Semimodular  law,  474. 

Separable  functions,  see  Threshold  functions. 
Separated  tilings,  274. 

Sequences,  totally  useless,  432. 

Sequency,  287. 

Sequential  algorithms,  vii. 

Sequential  allocation,  21,  220,  227,  603. 
Sequential  representation  of  BDDs, 

206-207,  259,  261,  262. 

Series-parallel  graphs,  465-468,  481,  798. 
Series-parallel  switching  networks,  570. 

Serra,  Micaela,  700. 
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Server  locations,  91. 

SET®,  the  game,  604. 

Set  partitions,  279,  391,  415-440,  473, 
503-505,  511,  513,  628. 
conjugate  of,  434. 
dual  of,  768. 

Gray  codes  for,  417-418,  433. 

indecomposable,  513. 

noncrossing,  473,  511,  513. 

order  of,  763. 

random,  426-428. 

shadow  of,  433. 

universal  sequences  for,  440. 

Set  systems,  32,  see  Hypergraphs. 

Seth,  Vikram  ([qsnq  ^r)j  iv,  437. 

Sets,  represented  as  integers,  143,  150,  160, 
190,  194-195,  278,  585-586. 
maximal  proper  subsets  of,  190. 

Sets  of  combinations,  see  Families  of  sets. 

Seven  deadly  sins,  495-496. 

Seven-segment  display,  112-114,  129,  262. 

Seymour,  Paul  Douglas,  17. 

SGB,  9,  see  Stanford  GraphBase. 

SGB  word:  A word  in  W0RDS(5757),  275. 

Sha‘ari  Tzedeq  (pis  813. 

Shades  of  gray,  199. 

Shadows,  combinatorial,  372-379,  385-388. 
of  binary  strings,  388. 
of  set  partitions,  433. 
of  subcubes,  388. 

Shadows  of  bit  codes,  84. 

Shakespeare  (=  Shakspere),  William,  1,  436. 

Shallit,  Jeffrey  Outlaw,  589,  771. 

Shannon,  Claude  Elwood,  Jr.,  47,  110, 

257,  560,  570. 

Shao  Yung  (®S),  487. 

Shape  of  a random  binary  tree,  454-455. 

Shape  of  a random  forest,  453-454. 

Shape  of  a random  partition,  402,  411. 

Shape  of  a random  set  partition,  426-427. 

Shapiro,  Harold  Seymour,  313. 

Shared  BDDs,  215,  257,  see  BDD  base. 

Shared  subtrees,  203,  257. 

Sheehan,  John,  531. 

Sheep,  8,  38. 

Sheep-and-goats  operation,  149-150,  190. 

Sheffer,  Henry  Maurice,  50,  80. 

Shelling  a monotone  Boolean  function, 
84-85,  537,  541,  552. 

Shen,  Vincent  Yun-Shen 
118,  120,  131. 

Shi,  Zhi-Jie  Jerry  ( , 595. 

Shields,  Ian  Beaumont,  735. 

Shift  instructions,  135,  151,  153,  184,  193. 
signed,  142,  181,  585,  589. 
table  lookup  via,  137,  155,  201,  588, 
599-600,  620. 

Shift  register  sequences,  302-308,  316-318. 

Shift  sets,  156-157. 


Shirakawa,  Isao  (E=|JI|Sl),  604. 

Shmulevich,  Ilya  Vladimir  (UlMyjieBHu, 
Hjiba  BjiaAHMHpoBnu),  559,  560. 
Sholander,  Marlow  Canon,  89,  548. 
Sholomov,  Lev  Abramovich  (UIojiomob, 

JleB  A6paMOBHu),  129. 

Shortest  normal  forms,  55,  82,  83,  95,  536. 
Shortest  paths  in  a graph,  viii,  12,  16, 

32,  66,  646,  831. 

Shorthand  universal  cycles,  see  Llniversal 
cycles  of  permutations. 

Shrikhande,  Sharadchandra  Shankar 
5PfiT  sfPsi-),  5. 

Shrinking  an  edge,  463-465,  521,  798. 
Shrinking  an  image,  174-175,  198. 

Shuffle  network  for  routing,  188. 

Sibling  links,  164,  195,  381. 

Sibson,  Robin,  604. 

SICOMP:  SIAM  Journal  on  Computing , 
published  by  the  Society  for  Industrial 
and  Applied  Mathematics  since  1972. 
Sideways  addition,  134,  143-144,  187, 

194,  590,  605. 
bytewise,  143,  599. 
function  ux,  x,  see  Sideways  sum. 
in  MMIX,  see  SADD. 
summed,  187,  593. 

Sideways  heaps,  164-167,  195-196,  727. 
Sideways  sum  (isx):  Sum  of  binary  digits, 
x,  143,  295,  374,  383,  682,  725. 

Siegel,  Carl  Ludwig,  641. 

Sieling,  Detlef  Hermann,  216,  639,  650. 

Sieve  method,  760. 

Sieve  of  Eratosthenes,  137,  186. 

Sifting,  243-246,  250,  251,  269,  641, 

643,  644,  663. 
automatic,  245,  649. 
partial,  245. 

Sign  of  a permutation,  323,  351. 

Signature  of  an  alphametic,  324. 

Signed  bits,  representation  of,  161,  187. 
Signed  permutations,  346,  559. 

Signed  right  shifts,  142,  181,  585,  589. 

Silver,  Alfred  Lindsey  Leigh,  711. 

Silverman,  Jerry,  686,  688. 

SIMD  (single  instruction,  multiple  data) 
architecture,  151. 

Simoes  Pereira,  Jose  Manuel  dos 
Santos,  726. 

Simon,  Imre,  625. 

Simple  digraphs,  18,  19,  40,  42,  43,  525,  526. 
Simple  games,  550. 

Simple  graphs,  see  Graphs. 

Simple  paths,  253-255,  275-276. 
simplex  graphs,  25. 

Simplexes,  372. 

Simplicial  complexes,  387-388,  744. 
Simplicial  multicomplexes,  388. 

Simply  connected  components,  175. 
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Simpson,  James  Edward,  514. 

Sims,  Charles  Coffin,  327. 

tables,  327-333,  335-336,  347-348,  812. 
Singh,  Parmanand  (h  < fijsr),  488,  499. 

Single-faced  functions,  see  Canalizing 
functions. 

Singleton,  Robert  Richmond,  523. 
Singmaster,  David  Breyer,  515,  617. 

Sink  nodes,  202-203,  207,  219,  241. 

□ , 202-208,  249,  250,  253-254,  259, 

272,  273,  676-677. 
more  than  two,  266. 

It)  , 202-209,  250,  259,  273,  676-677. 

Sink  vertices,  18,  62,  253,  621,  622,  626. 
Six-register  algorithm,  615. 

Size  of  a BDD  (B(/)),  205,  233-235. 

Size  of  a BDD  base  (B(/i, . . . , fm)), 

216,  229,  240. 

Size  of  a digraph,  18. 

Size  of  a graph,  13,  44. 

Size  vectors,  387,  388. 

Sjostrand,  Jonas  Erik,  618. 

Skarbek,  Wladyslaw  Kazimierz,  444,  793. 
Skinny  Boolean  functions,  270-271,  675. 
Skipping  blocks  of  permutations,  331-334, 
348,  719-720. 

Sklansky,  Jack,  568. 

Skolem,  Albert  Thoralf,  ix,  8,  36,  514,  515. 
Slanina,  Matteo,  585. 

Slates  of  options,  237,  266,  641. 

Sleator,  Daniel  Dominic  Kaplan,  136,  609. 
Sleep,  Michael  Ronan,  453. 

Slepian,  David,  145. 

Sloan,  Robert  Hal,  540. 

Sloane,  Neil  James  Alexander,  518,  681. 
Slobodova,  Anna  Miklasova,  639. 

Slocum,  Gerald  Kenneth  (=  Jerry),  694. 
Slot  in  a virtual  address,  263. 

SLU  (shift  left  unsigned),  137,  619. 

Slutzki,  Giora  (’piSlbo  NTPj),  69. 

Smallest  element  of  a set,  143. 

Smallest  parts,  411,  412. 

Smearing  bits  to  the  right,  140,  143,  589. 
smile,  11,  16,  24,  39. 

Smith,  Derek  Alan,  685. 

Smith,  Henry  John  Stephen,  normal 
form,  532. 

Smith,  John  Lynn,  568. 

Smith,  Malcolm  James,  463,  464,  467,  468. 
Smith,  Mark  Andrew,  96. 

Snir,  Marc  (TOW  pin),  127,  132. 

Socrates,  son  of  Sophroniscus  of 

Alopece  (Eaixpa-cris  Xuxppcuvtaxou 
’AXaj7t£xfj0ev),  440. 

SODA:  Proceedings  of  the  ACM-SIAM 
Symposia  on  Discrete  Algorithms, 
inaugurated  in  1990. 

Sole,  Patrick,  681. 

Solid  lines  in  BDD  diagrams,  202. 


Solid  lines  in  ZDD  diagrams,  249. 

Solitary  nodes,  240-242,  648. 

Solutions  to  Boolean  equations,  206, 

251,  278,  662. 
average  weight  of,  276. 
computing  the  generating  functions  for, 
206,  211,  255,  260,  261,  650. 
enumerating,  206-207,  251,  257,  259. 
generating  all,  206,  259. 
lexicographically  greatest,  259. 
lexicographically  least,  206,  257. 
minimal,  255. 

optimum,  206,  209-211,  251,  258, 

261,  663. 

random,  206,  208-209,  233. 
weighted,  209-211,  259-261. 

Somenzi,  Fabio,  258,  626,  633-634,  639. 
Sonnets,  436. 

Sorcerer’s  apprentice,  225. 

Sorted  data,  186. 

Sorting,  192,  242,  586,  595. 

networks  for,  91,  127,  189,  190,  566-567. 
Soule,  Stephen  Parke,  599. 

Source  vertices,  18,  253. 

Sources,  62. 

Space  complexity,  257. 

Space  versus  time,  220. 

Spanning  arborescences,  481,  see  Oriented 
spanning  trees. 

Spanning  subgraphs,  13,  15,  18,  39,  211. 
Spanning  tree  function,  277. 

Spanning  trees,  211,  256,  260,  462-469, 
481-483,  511,  832. 
enumeration  of,  482-483. 

Spark  plug,  651. 

Sparse  Boolean  functions,  251,  253. 

Sparse  graphs,  20,  23. 

Sparse  integers,  278. 

Spectrum  of  a graph,  806,  808. 

Spectrum  of  an  irrational  number,  514. 
Spenser,  Edmund,  436. 

Sperner,  Emanuel,  459. 

theory,  459,  479,  744,  801. 

Spheres,  vii. 

SpinOut  puzzle,  680. 

Spira,  Philip  Martin,  125. 

Spitkovsky,  Valentin  Ilyich  (CnHTKOBCKHii, 
BajieHTHH  Mjii>hu),  653. 

Spondees,  489-490,  501,  512. 

Sprague,  Roland  Percival,  582. 

Spread  set  in  a torus,  376-379,  387. 
Sprugnoli,  Renzo,  477. 

Square  of  a graph,  470,  484. 

Square  routes,  254,  276. 

Square  strings,  204-205. 

Squarefree  integers,  505. 

Squaring  a polynomial,  189. 

Squines,  180,  198,  615. 

Squire,  Matthew  Blaze,  696. 
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SR  (shift  right,  preserving  the  sign), 

142,  181,  587,  589. 

SRU  (shift  right  unsigned),  137,  141,  589. 
Stability  number  a(H)  of  a graph  or 
hypergraph,  35. 

Stable  sets,  34,  see  Independent  sets. 

Stable  sorting,  430-431,  759. 

Stable  states,  482. 

Stachowiak,  Grzegorz,  734. 

Stack  structures,  23,  41,  59,  220-222,  227, 
429,  538-539,  543,  see  also  AVAIL  stack. 
Stahnke,  Wayne  Lee,  303. 

Stam,  Aart  Johannes,  428,  439. 

Standard  deviation  (square  root  of  variance), 
189,  533,  638,  668. 

Standard  fields  in  SGB  format,  21. 

Standard  networks  of  comparators,  190. 
Standard  sequences,  306. 

Standard  sets  in  a torus,  376-378,  387. 
Stanford  Cardinal,  31. 

Stanford  GraphBase,  iv,  ix,  9-12,  20, 

23-26,  31,  62,  252,  291,  312,  318, 

432,  468,  569,  803. 
complete  guide  to,  iv,  32. 
format  for  digraphs  and  graphs,  21-22,  41. 
Stanford  University,  ii,  iv,  457,  510. 

Stanley,  Richard  Peter,  13,  368,  390,  393, 
439,  440,  476,  499,  554,  739,  759, 

761,  768,  787,  799. 
lattice,  474-476. 

Stanton,  Dennis  Warren,  749. 

Stappers,  Filip  Jan  Jos,  649. 

Star  graphs  (A"i,n),  17,  350,  522,  807. 

Star  transpositions,  337-338,  350, 

370-371,  384. 

State  capitols,  254-255,  276. 

Stedall,  Jacqueline  Anne,  505. 

Stedman,  Fabian,  322. 

Doubles,  323. 

Steele,  Guy  Lewis,  Jr.,  148,  189,  591,  594. 
Steger,  Angelika,  718. 

Steiglitz,  Kenneth,  683. 

Stein,  Sherman  Kopald,  573. 

Steiner,  Jacob,  17. 
trees,  vii,  17. 
triple  systems,  8. 

Stephens,  Nelson  Malcolm,  766. 

Stevens,  Brett,  314. 

Stevenson,  David  Ian,  113,  564,  572,  574. 
Stewart,  Ian  Nicholas,  318. 

Stibitz,  George  Robert,  284,  286. 

Still  Life,  270. 

Stinson,  Douglas  Robert,  511. 

Stirling,  James,  504. 

approximation,  421,  423,  425,  776. 
cycle  numbers,  777. 
strings,  763-764. 
subset  numbers,  504-505. 


subset  numbers,  asymptotic  value, 
424-426. 

subset  numbers,  generalized,  82,  765. 

STOC:  Proceedings  of  the  ACM 

Symposia  on  Theory  of  Computing, 
inaugurated  in  1969. 

Stockmeyer,  Larry  Joseph,  viii,  123,  124, 
132,  566,  578,  592,  596. 

Stockton,  Fred  Grant,  614. 

Stojmenovic,  Ivan  Danca  (CTojMeHOBnh, 
IlBaH  ^(aHua),  391. 

Stolfi,  Jorge,  616. 

Stone  representation,  88-89. 

Storage  access  function,  109,  see2m-way 
multiplexer. 

Storage  allocation,  147-148,  154,  186,  191. 

Strachey,  Christopher,  144. 

Strahler,  Arthur  Newell,  811,  see 
Horton-Strahler  numbers. 

Straight  insertion  sorting,  522. 

Straight-line  computations,  830,  see 
Boolean  chains. 

Straight  lines,  digitizing,  198. 

Strassen,  Volker,  580. 

Stretching  bits,  190,  600. 

Stringology,  204,  305-308,  317-318. 

Strings,  searching  for  special  bytes 
in,  152,  202. 

Strong  broadword  chains,  193. 

Strong  components:  Strongly  connected 
components,  40,  61-62,  86,  522, 

545,  575. 

Strong  product  of  graphs  (G  El  If),  28, 
42-44,  276,  483,  526,  531. 

Strong  revolving-door  order,  804. 

STTU  (store  tetra  unsigned),  616. 

Stufken,  John,  518. 

Subcubes,  54,  82-84,  129,  150,  195,  257, 
265,  277,  310-311,  385,  388,  535, 

541,  557,  570-571,  661. 
maximal,  54-55,  82-83. 

Subforests,  300-301,  316. 

Subfunctions,  204-205,  214,  215,  257. 

Subgraphs,  13,  17. 
of  a hypercube,  90. 

Subi,  Carlos  Samuel,  313. 

Subramani,  Krishnamurthy 

(S)0aj.«rargp/rg^)  muryw^),  546. 

Subramanian,  Ashok,  551. 

Subset  function,  239. 

Subset  sum,  first  missing,  202. 

Subsets,  143,  159-160,  194-195,  281, 

286,  585-586. 
generating  all,  150. 
maximal  proper,  190. 

subsets  graphs,  25. 

Substituting  an  expression  for  a 
variable,  259. 

Substituting  one  variable  for  another,  261. 
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Substitution  of  constants  for  variables, 

218,  262,  634. 

Substitution  of  functions  for  variables, 

259,  263. 

Subtables  of  a truth  table,  204-205, 

219,  234,  235,  240,  261,  627,  629, 

638,  658-659. 

Subtraction,  135,  184,  191. 
bytewise,  191,  599. 
modulo  5,  192. 
of  sparse  integers,  278. 
saturating,  see  Monus  operation, 
scattered,  190. 
unary,  192. 

Subword  parallelism,  151-155,  191-193. 
Subwords,  12. 

Sudborough,  Ivan  Hal,  722. 

Suffix  parity  function,  187,  201,  594,  603. 
Suffixes  of  strings,  132,  305. 

Sum  of  bits,  see  Sideways  sum. 
weighted,  187. 

Sum-of-products  expression,  Boolean,  see 
Disjunctive  normal  form. 

Sum  of  sparse  integers,  278. 

Summation  of  binomial  coefficients,  640-641. 
Sums  of  squares,  260,  312. 

Sums  over  all  partitions,  393,  419,  772,  775. 
Suparta,  I Nengah,  682. 

Super-root  of  a forest,  470,  483. 

Superedge  of  a graph,  465-468,  798. 

Supowit,  Kenneth  Jay,  646. 

Support  of  a family,  659. 

Surrogates,  618. 

Surroundedness  tree,  175,  198. 

Susruta  (U^d),  492. 

Sutcliffe,  Alan,  763. 

Sutherland,  Norman  Stuart,  ix. 

Sutner,  Klaus,  617. 

SWAC  computer,  5-6. 

Swap-in-place  algorithm,  240—242,  268,  273. 
Swapping  adjacent  levels,  240—246,  650. 
Swapping  bits,  144-147,  187-188,  619. 

between  variables,  581. 

Swapping  with  the  first  element,  337-338, 
350,  370-371,  384. 

SWAR  methods,  151-155,  191-193. 

SWARC  compiler,  597. 

Sweet  Boolean  functions,  256,  277. 

Swetz,  Frank  Joseph,  487. 

Swift,  Jonathan  (=  Lemuel  Gulliver), 

47,  620. 

Swinnerton-Dyer,  Henry  Peter  Francis,  751. 
Sylow,  Peter  Ludvig  Mejdell,  2-subgroup, 
584,  592. 

Sylvester,  James  Joseph,  312,  408, 

658,  685,  750. 

Symmetric  Boolean  functions,  77-79,  94-95, 
98-99,  104-106,  108,  109,  116,  126, 

131,  132,  194,  213,  219,  231,  257, 


261,  262,  266,  272,  274,  278,  564,  565, 
619,  626,  636,  642,  643. 
partially,  95,  269,  642,  650. 

Sm,  262,  272,  274,  626,  636,  642. 

5>m,  262,  643,  659,  663-664. 

Symmetric  difference  operation  (/  © <?),  273. 
Symmetric  functions,  393,  506,  536, 

757,  815. 

Symmetric  group,  584. 

Symmetric  matrices,  40,  44. 

Symmetric  order  (inorder)  of  nodes, 

165,  441,  447,  448,  455,  476,  477, 

782,  787,  788. 

Symmetric  polynomials,  393,  506, 

536,  757,  815. 

Symmetrical  mean  values,  414. 

Symmetries,  327,  see  Automorphisms, 
of  a Boolean  function,  269,  642. 
of  a chessboard,  650,  664,  673. 
of  a graph,  14-15,  39,  45,  528,  532-533. 
Symmetrizing  operation  (a  § k),  274. 
Symmetry  breaking,  265,  663. 

Syntax,  context-free,  58,  484. 

Synthesis  of  BDDs,  218-233,  257. 

breadth-first  versus  depth-first,  227-229. 
Synthesis  of  ZDDs,  251,  272,  273,  276,  677. 
Szabo,  Jozsef,  707. 

Szego,  Gabor,  xiii. 

Szekely,  Laszlo  Aladar,  763. 

Szele,  Tibor,  522. 

Szemeredi,  Endre,  91. 

Szily,  Koloman  von,  672. 

Szorenyi,  Balazs,  540. 

t-ary  trees,  472,  476,  478,  790,  795. 
complete,  484. 
random,  478. 

Table  lookup,  141,  142,  596. 
by  shifting,  137,  155,  201,  588, 

599-600,  620. 

Tableau  shapes,  394,  435,  see  Ferrers 
diagrams. 

Tableaux,  29-30,  43,  342-343,  476,  479, 

723,  790,  800. 

Tables  of  Boolean  function  statistics,  79. 
Tables  of  numerical  quantities,  818-819. 
ballot  numbers  ( Cmn ),  451. 

Bell  numbers  (ron),  418. 

Bernoulli  numbers  (Bn),  820. 

Catalan  numbers  (Cn),  450-451. 
Fibonacci  numbers,  820. 

Genocchi  numbers,  655. 
harmonic  numbers,  820. 
oriented  tree  numbers,  461. 
partition  numbers,  396,  400,  778. 

Schroder  numbers,  479. 

Tacquet,  Andre,  498. 

Tags,  82. 
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Tail  coefficients,  761. 

Tail  recursion,  635. 

Takagi,  Teiji  (i6;fcg}&),  374,  740. 

function,  374-375,  386-387. 

Takahasi,  Hidetosi  (iHilS^f'Sc),  552. 

Takasu,  Satoru  552. 

Takenaga,  Yasuhiko  (ftzkijl  jS> ) , 647. 
TAKE.RISC  program,  32. 

Tamari,  Dov  (’inn  31),  born  Bernhard 
Teitler,  787. 

lattice,  474-475,  788,  790. 

Tame  configurations  of  Life,  269-270. 

Tang,  Changjie  (JH'Jf'fc),  446. 

Tang,  Donald  Tao-Nan  (^}ii!^f),  362. 
Tangle  puzzle,  see  Loony  Loop  puzzle. 
Tangled  nodes,  240-242,  648. 

Tannenbaum,  Meyer,  556. 

Target  bits,  641,  650. 

Tarjan,  Robert  Endre,  1,  62,  87,  165, 

575,  607. 

Tarry,  Gaston,  5,  745. 

Tartaglia,  Niccolo  Fontana,  499. 

Tastes,  492. 

Tatami  tilings,  274. 

Tautology  function  (T),  49,  542,  660. 
Taylor,  Brook,  series,  13,  425,  773. 

Taylor,  Lloyd  William,  285. 

Tchuente,  Maurice,  713. 

Telephone,  285. 

Television,  284. 

Temperley,  Harold  Neville  Vazeille,  402. 
Templates,  220-223,  630. 

Tengbergen,  Cornelia  van  Ebbenhorst,  457. 
Tensor  product  of  graphs,  see 
Direct  product  of  graphs. 

Terminology,  13,  536-537. 

Ternary  Boolean  functions,  104-105. 
Ternary  Boolean  operations,  63,  536, 
580-581. 
table,  78. 

Ternary  n-tuples,  299,  306-307,  315, 

316,  685,  690,  702. 

Ternary  operations,  229-230,  263-264,  272. 
ANDAND,  229-230,  264,  663. 

MUX,  229,  232,  264,  272,  630,  660. 
OROR,  661. 

Ternary  strings,  382,  745. 

Ternary  trees,  472,  476,  487,  see  also 
t- ary  trees. 

Ternary  tries,  complete,  520. 

Ternary  vectors,  163. 

Terquem,  Olry,  389. 

Teslenko,  Maxim  Vasilyevich  (Tecjiemco, 
MaKCHM  Bacuji henna),  646. 
Tessellation,  168,  179,  196. 

Tetrabyte  or  tetra:  A 32-bit  quantity, 
139-140. 

Tetragrams,  487. 

Tetrahedra,  25. 


Tetrahedral  pyramids,  88-89. 

TfeX,  12,  883. 

Text  processing,  151-153,  191-192,  201-202. 
Theobald,  Thorsten,  639. 

Theorem  proving,  59,  548. 

Theory  meets  practice,  vii,  13,  153-154. 
Thimonier,  Loys,  813. 

Thin  BDDs,  267,  271. 

Thin  ZDDs,  272. 

Thinning  an  image,  172-173,  197. 

Thomas,  Herbert  Christopher  (=  Ivo),  49. 
Thomas,  Robin,  17. 

Thompson,  Kenneth  Lane,  200. 

Thoreau,  David  Henry  (=  Henry 
David),  96,  280. 

Thorup,  Mikkel,  599,  616. 

Three-in-a-row  function,  215,  262. 
Three-register  algorithm,  177-180,  198-199. 
Three-state  encodings,  160-163,  195. 
Three-valued  logic,  163,  195,  496. 
Three-variable  Boolean  functions, 

63,  104-105. 
table,  78. 

Threshold  functions,  75-77,  79,  92-95, 
213-214,  261,  262,  268,  461,  567, 

653,  801. 

of  threshold  functions,  77,  92,  553. 

Thue,  Axel,  ix,  623. 

sequence,  209-210,  260. 

Tic-tac-toe,  114-117,  129-130,  269,  347,  573. 
Tightly  colorable  graphs,  44. 

Tilings,  168,  179,  196,  see  also  Exact 
cover  problems, 
of  the  plane,  45. 

Time,  mixed-radix  representation  of,  192. 
Time  stamps,  636. 

Time  versus  space,  220. 

Tip  of  an  arc,  18. 

TlP(a)  (final  vertex  of  arc  a),  21. 

Tippett,  Leonard  Henry  Caleb,  753. 

Tiring  irons,  285. 

Tison,  Pierre  Louis  Joseph,  540. 

Tocher,  Keith  Douglas,  134,  191,  597. 

Toda,  Iwao  (J3  HSi),  552. 

Todorov,  Dobromir  Todorov  (ToaopoB, 
,Ho6poMnp  ToflopoB),  517. 

Tokushige,  Norihide  (fj5JtfBl5l),  740-741. 
Tolstoy,  Leo  Nikolaevich,  (Tojicroii,  JleB 
HmcojiaeBHu),  9. 

Tomlin,  Mary  Jean  (=  Lily),  60,  62,  86. 
Tompkins,  Charles  Brown,  5-7,  337,  510. 
Toolkits  for  BDD  operations,  220, 

224,  257,  677. 

Toolkits  for  ZDD  operations,  272,  273,  276, 
677,  see  also  Family  algebra. 

Tootill,  Geoffrey  Colin,  294,  679. 
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Top-down  algorithms,  228,  257,  663,  678. 
Top-down  synthesis,  103-105,  126,  564. 
Topological  sorting,  60,  73,  85,  97,  207,  253, 
342-344,  352-353,  415,  734,  762,  832. 
Toppling,  482. 

Topswops,  354. 

Torok,  Eva,  733. 

Torture  test,  315. 

Torus  graphs,  x,  468,  469,  483,  805,  808. 
directed,  808. 

Toruses,  28,  41,  197,  309,  318,  327, 

468-469,  680. 
directed,  352. 
generalized,  45-46. 
kingwise,  44. 

n-dimensional,  374-379,  387,  599. 
twisted,  350. 

Tot  tibi  . . . , 500-503,  512-513. 

Total  ordering,  342. 

Toth,  Zoltan,  701. 

Touchard,  Jacques,  765. 

Tournament  digraphs,  40,  413,  808. 

transitive,  18,  27,  40,  41. 

Trace  of  a partition,  394,  402,  408,  749. 
Trading  tails,  421,  776. 

Trailing  zeros,  140,  see  Ruler  function. 
Transaction  database,  see  Multifamily 
of  sets. 

Transdichotomous  methods,  see  Broadword 
computations. 

Transducers,  finite-state,  128. 

Transformed  BDDs,  639. 

Transition  counts,  294,  313. 

Transitive  closure,  159,  165. 

Transitive  laws,  518. 

Transitive  relations,  352,  354,  416. 
Transitive  tournaments,  808. 

Translation,  tiling  by,  45-46. 
Transmogrification,  240-241,  648,  660. 
Transpose  of  a forest,  471-472,  476. 
Transpose  of  a tiling,  663. 

Transposed  allocation,  202,  588. 

Transposing  a 0-1  matrix,  33,  147,  188, 

199,  201,  591-592. 

Transposing  adjacent  elements,  320-325, 
349,  353,  719,  720. 

Transpositions:  Cyclic  permutations 
of  order  2,  476. 

Transversals  of  a hypergraph,  see  Hitting 
sets  of  a family. 

Transversals  of  a latin  square,  6-7,  37,  516. 
Traveling  Salesrep  Problem,  viii,  254-255, 
276,  344,  832. 

Traversal  in  postorder,  606,  612-613. 
Traversal  in  preorder,  606,  612-613. 

Treaps,  590. 


Tree  function,  424,  773. 

Tree  of  losers,  727. 

Tree  representation  of  a series-parallel 
graph,  466,  481. 

Trees,  67,  81,  85,  91,  440-485,  508-511. 
binary,  see  Binary  trees, 
binomial,  360-361,  381,  484,  809. 
Fibonacci,  484,  485. 
free,  17,  44,  67,  91,  462,  480-481,  484, 

509,  577,  789,  807,  809. 

Greg,  774. 

Gray  codes  for,  446-449,  473. 
lattices  of,  473-476. 
of  partitions,  408. 

of  restricted  growth  strings,  417,  766. 
oriented,  432,  461-462,  480-482, 

509,  511,  774. 

random,  452-457,  478,  481,  485,  798-799. 
Schroder,  479,  798-799. 
spanning,  211,  256,  260,  462-469, 

481-483,  511,  832. 
t-ary,  472,  476,  478,  790,  795. 
traversal  of,  469-471,  606,  612-613. 

Trellis  of  a code,  677. 

Trend-free  Gray  codes,  296-297,  315. 
Triangle  function,  133. 

Triangle  inequality,  16,  19. 

Triangles  (3-cliques),  374. 

Triangular  grids,  25,  88,  525,  554,  787. 
Triangularizing  a matrix,  200,  725,  805. 
Tribonacci  sequence,  488,  814. 

Trick,  magic,  440. 

Tricks  versus  techniques,  134,  616. 
sneaky,  221. 

Tries,  38-39,  202,  253,  257,  310. 

traversal  of,  363-364. 

Trinomials,  189. 

Tripartite  subgraphs,  maximal  induced,  277. 
Tripartitions,  429. 

Triple  product  identity,  396,  410. 

Triple  systems,  8,  32,  44. 

Triple  zipper  function,  201,  596. 

Triply  linked  forests,  470—471,  480, 

606-607,  612,  785. 

Trivalent  graphs,  14,  15,  39,  531. 

Trivial  functions,  49,  63,  64,  565. 

Trivial  trees,  783. 

Trochees,  489,  501. 

Trominoes,  252,  274. 

Tropical  algebra,  625. 

Trost,  Ernst,  760. 

Trotter,  Hale  Freeman,  323. 

Trotter,  William  Thomas,  714-715. 
Trowbridge,  Terry  Jay,  707. 

TRUE,  202. 

TrueType,  615. 

Truth,  47,  63,  79. 
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Truth  tables,  47,  49,  51-53,  71,  94-96, 

98,  100,  102,  105,  141,  161-162,  202, 
204-206,  212,  215,  219-221,  223, 
233-235,  250,  257,  259,  262,  263,  278, 
534,  569,  621,  646,  647,  651,  676. 
extended,  241-242,  648. 
fully  elaborated,  212-213,  260-261,  274. 
in  hexadecimal  notation,  105,  132,  572. 
of  partial  functions,  114-116. 
two-dimensional,  110-111,  117-121. 
Tsuboi,  Teiichi  ),  553,  554,  559. 

Tsukiyama,  Shuji  604. 

Tukey,  John  Wilder,  47. 

Tuliani,  Jonathan  R.,  699. 

Tuma,  Jiff,  763. 

Tuple:  A sequence  containing  a given 

number  of  elements,  439,  486-490,  499. 
Turan,  Gyorgy,  540. 

Turing,  Alan  Mathison,  134. 

machines,  257,  609. 

Tweedledee,  48,  79. 

Twelvefold  Way,  390,  407,  439,  499. 

Twisted  binomial  trees,  484. 

Twisted  toruses,  350. 

Two-in-a-row  function,  208,  259,  263. 
Two-level  representations  of  Boolean 

functions,  see  Conjunctive  normal  form, 
Disjunctive  normal  form. 

Two-line  arrays,  767-768. 

Two-line  form  of  permutation,  326. 
Two-variable  Boolean  functions,  47-50, 
79-80,  259,  272,  279. 
table,  49. 

Two’s  complement  notation,  134,  158,  582. 
Tyler,  Douglas  Blaine,  476. 

Typesetting,  171,  883. 

U (universal  set),  276. 

UCS  (Universal  Character  Set),  201. 

Uhlig,  Dietmar,  131. 

Ulam,  Stanislaw  Marcin,  605. 

numbers,  195. 

Ultraparallel  lines,  168. 

Ultrasweet  Boolean  functions,  277. 

Ulyanov,  Vladimir  Ilyich  (YnbaHOB, 
BjiaAHMHp  IJjibHu),  86,  365. 

Umbral  notation,  765,  770,  771,  779. 

Unary  notation  (radix  1),  192. 

Unary  operator:  A function  of  one  variable. 
Unate  Boolean  functions,  265,  537. 

Unate  cascades,  654. 

Unate  cube  set  algebra,  see  Family  algebra. 
Unbiased  rounding,  191,  597. 

Uncoloring,  664. 

Uncompressing  bits,  189. 

Underflow  mask,  601. 

Undirected  cycles,  346. 

Undirected  graphs  as  a special  case  of 
directed  graphs,  19,  830. 


Undoing,  334,  719. 

Uneasy  nodes,  467,  481. 

Unger,  Stephen  Herbert,  151. 

Unicode,  201. 

Uniform  hypergraphs,  32-33,  44,  673. 
Union-find  algorithm,  69,  763. 

Union  of  graphs,  26,  see  also  Direct 
sum  of  graphs. 

Union  operation  (/  U g),  273,  see  also 
OR  subroutine. 

UNIQUE  subroutine,  226-227,  648. 

Unique  tables,  226-229,  264. 

Uniquely  thin  BDDs,  267. 

Unit  family  (e),  273,  658,  662,  669-670. 

Unit  vectors,  376,  480. 

United  States  of  America  graph,  15,  34, 

39-40,  210-211,  231-233,  244-246,  250, 
254-255,  265,  269,  276,  277,  636,  670. 
UNIVAC  1206  Military  Computer,  5. 
Universal  algebras,  535. 

Universal  Character  Set,  201. 

Universal  cycles  of  combinations,  389, 

498,  512,  833. 

Universal  cycles  of  n-tuples,  833,  see 
de  Bruijn  cycles. 

Universal  cycles  of  permutations, 

354-355,  833. 
modular,  722. 

Universal  cycles  of  set  partitions,  440. 
Universal  family  (p),  275,  660-662,  666,  669. 
Universal  hashing,  271. 

Universal  quantification  (V),  87,  230,  265. 
Universal  set  ( U ),  276. 

Universe,  protons  in  the,  124. 

Universities,  31. 

Unlabeled  free  trees,  462,  480. 

Unlabeled  graphs,  14,  532. 

Unlabeled  objects,  390,  432,  774. 

Unlabeled  rooted  trees,  see  Oriented  trees. 
Unlucky  nodes,  483. 

Unpacking  of  data,  134,  136-138,  189,  594. 
Unranking,  487-488,  500,  512. 
combinations,  381,  383. 
integer  partitions,  412. 
n-tuples,  283-284,  299,  308,  315, 

487  488,  512. 

other  combinatorial  objects,  452,  477,  511. 
permutations,  345,  352,  491,  500,  717. 
set  partitions,  432. 

Unrooted  trees,  see  Free  trees. 

Unsigned  2-adic  integers,  581. 

Unsolvable  problems,  545,  585. 

Unusual  correspondence,  762. 

Up-down  partitions,  414,  759. 

Up-down  permutations,  353. 

Up-down  sequences,  316. 

Upper  bounds  on  combinational  complexity, 
110-112,  129,  see  also  Footprints 
of  a Boolean  function. 
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Upper  halfplane,  608. 

Upper  shadow,  372. 

Uppercase  letters,  191. 

Urban,  Genevie  Hawkins,  172. 

Uri,  Dario,  390,  617,  841. 

Urns  and  balls,  390. 

Useless  sequences,  432. 

Ushijima,  Kazuo  446. 

UTF-8:  8-bit  UCS  Transformation 
Format,  201. 

UTF-16:  16-bit  UCS  Transformation 
Format,  201. 

Utilities  graph  (^3,3),  17,  39,  42,  521. 
Utility  fields  in  SGB  format,  21,  524. 

V field  in  a decision  diagram,  202-203, 

216,  220-221,  226,  259,  263. 

Vacillating  tableau  loops,  434,  815. 
Vacuum-tube  circuits,  104,  126. 

Vakhovsky,  Evgenii  Borisovich  (BaxoscKHH, 
EBreHHH  BopucoBHu),  806. 

Valency,  see  Degree  of  a vertex. 

Validity  function  (T),  49. 
van  Baronaigien,  see  Roelants  van 
Baronaigien. 

van  Ebbenhorst  Tengbergen,  Cornelia,  457. 
van  Emde  Boas,  Peter,  164. 
van  Leeuwen,  Marcus  Aurelius  Augustinus, 
ix,  750,  768. 

van  Nooten,  Barend  Adrian  Anske 
Johannes,  488. 

van  Schooten,  Frans,  498-499,  512. 

Van  Wyk,  Christopher  John,  611. 
van  Zanten,  Arend  Jan,  682,  728. 

Vardy,  Alexander  (’111  mtOpN),  677. 
Variance,  189,  533,  638,  668. 

Variations:  Permutations  of  combinations, 
345,  348,  499,  512,  707,  709,  717. 

Varol,  Yaakov  Leon  (inn  yiNp  JpV’), 

340,  343. 

Vatriquant,  Simon,  324. 

Vazsonyi,  Endre,  695. 

Veblen,  Oswald,  176. 

Vector  partitions,  429-431,  439. 

Vector  spaces,  380,  385,  480. 
binary,  677. 

basis  for,  200,  585,  618. 

Vector-valued  Boolean  functions,  107,  121. 
Vedic  chants,  487. 

Velthuis,  Frans  Jozef,  883. 

Venice,  doge  of,  495. 

Vergil  (=  Publius  Vergilius  Maro),  501. 
Veroff,  Robert  Louis,  548. 

Vershik,  Anatoly  Moiseevich  (BepniHK, 
AHaTOJiHH  MoHceeBHu),  402,  754. 
Vertex  connectivity,  528. 

Vertex  covers,  537,  831. 
minimal,  195,  259. 
minimum,  34-35,  44. 


Vertex  degree,  264. 

Vertex  variables,  21,  23. 

VERTICES (g)  (the  first  vertex  node),  22,  523. 
Vesztergombi,  Katalin,  530. 

Vices,  495-496. 

Vickers,  Virgil  Eugene,  686,  688. 

Viennot,  Gerard  Michel  Frangois 
Xavier,  811. 

Vikulin,  Anatoly  Petrovich  (Bhkvjtmh, 
AHaTOJiHH  1 IcTpoBH '! ) , 557. 

Vinci,  Leonardo  di  ser  Piero  da,  9,  24. 
Vinnicombe,  Robert  Ian  James,  346. 

Virgin,  500-501. 

Virtual  addresses,  263. 

Virtues,  493-496,  500-501,  512. 

Vishkin,  Uzi  Yehoshua  (ppvni  WlPP 
>!1V),  165. 

Visible  nodes,  240-242,  648. 

Visiting  an  object,  281,  623. 

Vitale,  Fabio,  167. 

Viterbi,  Andrew  (=  Andrea)  James,  677. 

Vo,  Kiem-Phong,  800. 
von  Christ,  Wilhelm,  490. 
von  Dyck,  Walther  Franz  Anton,  510. 
von  Ettingshausen,  Andreas,  508. 
von  Neumann,  John  (=  Margittai  Neumann 
Janos),  550,  622. 
von  Seidel,  Philipp  Ludwig,  655. 
von  Szily,  Koloman,  672. 

Voting,  451,  496,  802. 

Vowels,  38,  275. 

Vu,  Van  Ha,  556. 

Vuillemin,  Jean  Etienne,  261,  606,  675. 

Wn  (wheel  graph  of  order  n),  42,  46, 

482,  526,  533. 

Wada,  Eiiti  (ftlffl^— ),  587. 

Wagnalls,  Adam  Willis,  48. 

Wagner,  Eric  Gerhardt,  539. 

Wakerly,  John  Francis,  574. 

Walks  in  a graph,  19-20,  40,  45. 

Wallis,  John,  286,  491,  499,  502,  505, 

513,  679. 

Walsh,  Joseph  Leonard,  287,  288,  684. 
functions,  287-289,  312. 
transform,  288-289,  312. 

Walsh,  Timothy  Robert  Stephen,  363, 

719,  720,  736. 

Walter  of  Burley  (=  Burleigh  = Gualterus 
Burleus),  51. 

Wang,  Da-Lun  (I^jm),  374,  376,  528. 
Wang,  Ping  Yang  (jE^F,  nee  £§  *P),  374,  376. 
Wang,  Shinmin  Patrick  (3; 517. 

Wang,  Terry  Min  Yih  (jEJRijl),  308. 

Warren,  Henry  Stanley,  Jr.,  140,  143,  144, 
157,  183,  184,  581,  589,  594,  598,  619. 
Warren,  Jon,  454. 

Washburn,  Seth  Harwood,  680. 

Watanabe,  Hitoshi  (IlKoPW),  511. 
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Watanabe,  Masatoshi  (<ff;8?i$i),  883. 
Watkins,  John  Jaeger,  531. 

Watson,  George  Neville,  794. 

Wayne,  Alan,  347. 

Weak  orderings,  354,  509,  654,  720. 

Weak  second-order  logic,  123-124,  132. 
Weakley,  William  Douglas,  673. 

Weber,  Karl,  541. 

Wegener,  Ingo  Werner,  124,  216,  238,  243, 
246,  258,  269,  272,  533,  566,  627,  630, 
635,  638,  639,  642,  644,  647,  649,  653. 
Wegman,  Mark  N,  624. 

Wegner,  Gerd,  739. 

Wegner,  Peter  (=  Weiden,  Puttilo 
Leonovich  = Rcm/ich,  IlyTTHJio 
JleoHOBHn),  140,  144. 

Weichsel,  Paul  Morris,  526. 

Weight  enumeration,  681. 

Weighted  solutions,  209-211,  259-261,  279. 
Weighted  sum  of  bits,  187. 

Weinberger,  Arnold,  568. 

Weiner,  Peter  Gallegos,  118,  120,  131. 
Weisner,  Louis,  517. 

Well-balanced  Langford  pairings,  2-3,  36. 
Well-balanced  permutations,  354. 

Wells,  Mark  Brimhall,  510,  511,  709. 

Welter,  Cornelis  Petrus,  585. 

Wen,  King  of  Chou  (jfig  = 

486-487,  512. 

Werchner,  Ralph,  246,  653. 

Wermuth,  Udo  Wilhelm  Emil,  ix. 

Wernicke,  August  Ludwig  Paul,  5. 

Weste,  Neil  Harry  Earle,  176. 

Weston,  Andrew,  339. 

Wheatley,  Henry  Benjamin,  834. 

Wheel  graphs  (Wn),  42,  46,  482,  526,  533. 
Wheeler,  David  John,  143. 

Wheels,  concentric,  497. 

Whipple,  Francis  John  Welsh,  376. 

White,  Arthur  Thomas,  II,  323. 

White,  Dennis  Edward,  769. 

White  pixels,  136,  172,  199. 

Whitworth,  William  Allen,  419, 

502-503,  513. 

Wibold,  bishop  of  Cambrai  (=  Wiboldus, 
Cameracensis  episcopus),  493-495,  505. 
Width  of  a Langford  pairing,  3,  515. 
Wiedemann,  Douglas  Henry,  73,  384, 

561,  696. 

Wiener,  Norbert,  821. 

Wikipedia,  202. 

Wild  configurations  of  Life,  269. 

Wilf,  Herbert  Saul,  338,  362,  411,  481, 

510,  721,  726. 

Wilkes,  Maurice  Vincent,  143. 

Willard,  Dan  Edward,  154,  192. 

William  of  Ockham  (=  Guilielmus  ab 
Occam),  51. 


Williams,  Aaron  Michael,  355,  722-723, 
734-735. 

Williams,  Robin  McLaurim,  60,  62,  544. 
Williamson,  Stanley  Gill,  339,  350,  718,  761. 
Wilson,  David  Whitaker,  605. 

Wilson,  Richard  Michael,  517. 

Wilson,  Robin  James,  15,  42,  531. 

Wilson,  Wilfrid  George,  323. 

Winder,  Robert  Owen,  64,  554,  581. 

Window  optimization,  245-246. 

Winker,  Steven  Karl,  686. 

Winkler,  Peter  Mann,  297-298,  315, 

687,  692,  798. 

Wise,  David  Stephen,  597. 

Witness  bits,  519. 

Wolf,  Margarete  Caroline,  815. 

Wolfel  (=  Woelfel),  Peter  Philipp,  247. 
Wolfram,  Stephen,  618. 

Wong,  Chak-Kuen  (itiifll),  149,  190,  532. 
Wong,  Roderick  Sue-Chuen  426. 

Wood,  Frank  Washington,  112. 

Woodcock,  Jennifer  Roselynn,  664. 
Woodrum,  Luther  Jay,  142,  588. 

Woods,  Donald  Roy,  597. 

Wool,  Avishai  (1)11  555. 

Word  cubes,  11,  39. 

Word  ladders,  11-12,  32. 

Word  problems,  68. 

Word  squares,  11,  38. 

words  graphs,  12-13,  31,  39,  42. 

WORDS(n),  the  n most  common  five-letter 
words  of  English,  10-12. 

Wordsize  scalability,  see  Broadword 
computations. 

Working  units,  80. 

Worm’s  walk,  441,  452-454,  471,  781, 

787,  797,  800. 

Wraparound  parity  patterns,  199. 

Wrapping  around,  41,  299,  309,  318. 

Wright,  Edward  Maitland,  778,  779. 

WS1S,  124. 

Wunderlich,  Charles  Marvin,  605. 

Wyde:  A 16-bit  quantity,  139-140. 

Wyman,  Max,  425,  770,  772. 

Xiang,  Limin  446. 

XL25  game,  617. 

X0R  (bitwise  exclusive-or) , 716. 

XOR  (bitwise  exclusive-or,  ©),  90, 

134-136,  538. 

identities  involving,  135,  185,  187,  585. 
XOR  function  (exclusive  or,  ©),  49-52, 

77,  80,  81,  534-535,  553. 

XOR  gates  (©),  32,  33,  97. 

XOR  subroutine,  272,  273,  633. 
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Y functions,  88-89,  92,  256. 

Yablonsky,  Sergei  Vsevolodovich 

(R6jiohckhh,  Cepreii  BceBOJiOAOBnu), 
558. 

Yajima,  Shuzo  627,  647,  676. 

Yakubovich,  Yuri  Vladimirovich  (MKyfjOBH'i, 
lOpnii  BjianHMupoBHu),  402,  428. 

Yan,  Catherine  Huafei  768. 

Yang  Hsiung  (jfH  or  $g  $£),  487-488. 
Yannakakis,  Mihalis  (Fiavvaxocxnc, 

MtxaXr)?),  604. 

Yano,  Tamaki  (£I?il),  503,  504. 

Yates,  Frank,  289. 

Yee,  Ae  Ja  750. 

Yes/no  quantifiers  (A,  N),  231,  265,  635. 
Yijing , see  I Ching. 

Yin  and  yang,  486-487. 

Yoshida,  Mitsuyoshi  (cfEB3fc&),  274. 
Yoshigahara,  Nobuyuki  (=  Nob) 
(P'rMWZ),  347,  707. 

Young,  Alfred,  tableaux,  342-343,  476, 

479,  723,  790,  800. 

Yiicel,  Melek  Diker,  561. 

Yuen,  Chung  Kwong  (gni^yfc),  682. 

Z order,  see  Zip. 

z-nomial  coefficients,  369,  384,  726,  735,  769. 
2-nomial  theorem,  749,  753. 

2-profile  (20, . . . , 2n)  of  a function,  250, 

272,  275,  663. 

Z-transform  of  a function,  272,  674. 

Z(f)  (the  ZDD  size  of  /),  250,  272,  278. 
Zn(x-,y)  (bit  n of  xy),  247. 

Zn,a(x)  (middle  bit  of  ax),  247. 

Zaks,  Shmuel  (Dpt  tiNlOiy),  471,  472, 

476,  784,  789. 

Zanten,  Arend  Jan  van,  682,  728. 

ZDD:  A zero-suppressed  BDD,  204,  249. 
Zeads,  250,  638,  658,  665-666. 

Zehfuss,  Johann  Georg,  526. 


Zeilberger,  Doron  (in:D»S  inn),  409,  750. 
Zero-byte  test,  152-153,  191. 

Zero-one  principle,  68,  186. 

Zero-or-set  instructions,  141,  142,  600. 
Zero-suppressed  BDDs,  249-256,  258, 

272-  279,  547,  677. 
compared  to  BDDs,  250-253,  272, 

273,  663-665. 

family  algebra  for,  253,  255,  273, 

275-277,  660,  673. 
mixed  with  BDDs,  256. 
profiles  of,  250,  272,  275,  663. 
toolkits  for,  272,  273,  276,  677. 

Zeta  function,  396,  589,  751,  779. 

Zhang,  Linbo  (?£#$?),  883. 

Zhao,  Xishun  (iit^jllijl),  546. 

Zhegalkin  (=  Gegalkine),  Ivan  Ivanovich 
(JKerajiKHH,  HBaH  HBaHOBnu),  51,  535. 
Zhou  Wenwang,  see  King  Wen  of  Chou. 
Ziegler,  Gunter  Matthias,  788. 

Zigzag  paths,  558. 

Zijlstra,  Erik,  164. 

Zimmermann,  Paul  Vincent  Marie,  594. 

Zip:  The  zipper  function  (x  J y), 

147-148,  182,  188,  189,  198,  201, 

242,  588,  591,  594. 
triple  (three-way),  201,  596. 

Zip-fastener  method,  597. 

Zito,  Jennifer  Snyder,  532. 

Zoghbi,  Antoine  Chaiban 

jLyU,  391. 

ZSNZ  (zero  or  set  if  nonzero),  142,  600. 

ZSZ  (zero  or  set  if  zero),  141. 

Zuev,  Yuri  Anatol’evich 

(3yeB,  lOpnii  AuaTojn.cBH'i),  559. 
ZUNIQUE  subroutine,  659. 

Zuse,  Konrad,  570. 

Zykov,  Aleksander  Aleksandrovich  (3mkob, 
A.ieKcaiiap  AneKcaHupoBHu),  26. 
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The  Art  of  Computer  Programming 

DONALD  E.  KNUTH 

Knuth’s  multivolume  analysis  of  algorithms  is  widely  recognized  as  the  definitive  description  of 
classical  computer  science.  The  first  three  volumes  of  this  work  have  long  comprised  a unique 
and  invaluable  resource  in  programming  theory  and  practice.  Scientists  have  marveled  at  the 
beauty  and  elegance  of  Knuth’s  analysis,  while  practicing  programmers  have  successfully  applied 
his  “cookbook”  solutions  to  their  day-to-day  problems. 

The  level  of  these  first  three  volumes  has  remained  so  high,  and  they  have  displayed  so  wide 
and  deep  a familiarity  with  the  art  of  computer  programming,  that  a sufficient  “review" 
of  future  volumes  could  almost  be:  “Knuth,  Volume  n has  been  published." 

— Data  Processing  Digest 

Knuth,  Volume  n has  been  published,  where  n = 4A. 

In  this  long-awaited  new  volume,  the  old  master  turns  his  attention  to  some  of  his  favorite  topics 
in  broadword  computation  and  combinatorial  generation  (exhaustively  listing  fundamental 
combinatorial  objects,  such  as  permutations,  partitions,  and  trees),  as  well  as  his  more  recent 
interests,  such  as  binary  decision  diagrams. 

The  hallmark  qualities  that  distinguish  his  previous  volumes  are  manifest  here  anew:  detailed 
coverage  of  the  basics,  illustrated  with  well<hosen  examples;  occasional  forays  into  more  esoteric 
topics  and  problems  at  the  frontiers  of  research;  impeccable  writing  peppered  with  occasional  bits 
of  humor;  extensive  collections  of  exercises,  all  with  solutions  or  helpful  hints;  a careful  attention 
to  history;  implementations  of  many  of  the  algorithms  in  his  classic  step-by-step  form. 

There  is  an  amazing  amount  of  information  on  each  page.  Knuth  has  obviously  thought  long 
and  hard  about  which  topics  and  results  are  most  central  and  important,  and  then,  what  are  the  ] 
most  intuitive  and  succinct  ways  of  presenting  that  material.  Since  the  areas  that  he  covers  in  this  j 
volume  have  exploded  since  he  first  envisioned  writing  about  them,  it  is  wonderful  how  he  has 
managed  to  provide  such  thorough  treatment  in  so  few  pages. 

— Frank  Ruskey,  Department  of  Computer  Science,  University  of  Victoria 

The  book  is  Volume  4A,  because  Volume  4 has  itself  become  a multivolume  undertaking. 
Combinatorial  searching  is  a rich  and  important  topic,  and  Knuth  has  too  much  to  say  about  it 
that  is  new,  interesting,  and  useful  to  fit  into  a single  volume,  or  two,  or  maybe  even  three.  This 
book  alone  includes  approximately  1 500  exercises,  with  answers  for  self-study,  plus  hundreds  of 
useful  facts  that  cannot  be  found  in  any  other  publication.  Volume  4A  surely  belongs  beside  the  — j 
first  three  volumes  of  this  classic  work  in  every  serious  programmer’s  library. 
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